<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>A World About to Change... &#187; Uncategorized</title>
	<atom:link href="http://www.vinaysethmohta.com/blog/category/uncategorized/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.vinaysethmohta.com/blog</link>
	<description></description>
	<lastBuildDate>Tue, 29 Jun 2010 03:13:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Digital Marketing and BI integrated in Business Process</title>
		<link>http://www.vinaysethmohta.com/blog/2010/06/28/digital-marketing-and-in-process-bi/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/06/28/digital-marketing-and-in-process-bi/#comments</comments>
		<pubDate>Tue, 29 Jun 2010 03:13:38 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=70</guid>
		<description><![CDATA[With my evaluation of the major SEM platform vendors over the last few months as well as my time with demand-side platforms for display, I&#8217;ve finally experienced a productized version of business intelligence integrated with business process that scales across many different businesses. While evaluating (and now using) SEM platforms, I have been impressed by: [...]]]></description>
			<content:encoded><![CDATA[<p>With my evaluation of the major SEM platform vendors over the last few months as well as my time with demand-side platforms for display, I&#8217;ve finally experienced a productized version of business intelligence integrated with business process that scales across many different businesses.</p>
<p>While evaluating (and now using) SEM platforms, I have been impressed by:</p>
<p>1) how well-suited the tools are for the day-to-day activities of the paid search marketer<br />
2) the dramatic improvements in process and time-savings realized with having reporting and intelligence integrated with the ability to act</p>
<p>We are able to find outliers and immediately cut or increase spend.  Tasks that previously used to take several hours can now be completed in minutes.</p>
<p>Demand-side platforms (DSPs) for display advertising are poised to deliver similar operational improvements to display marketing.</p>
<p>Abstracting up one level, I find these tools fascinating in that they&#8217;ve been able to productize (and consequently scale) the integration of BI and business processes.  What is it about digital marketing that allows this approach to work, as opposed to say a tool built on top of Salesforce?  The data sources are fairly homogenous (the three major search engines) and the data itself is even more so (campaign structure, bids, clicks, and CPC&#8217;s constitute the majority of the data).</p>
<p>The major difference between users is how they define conversion and revenue; however, even here, every marketer has a revenue number and a fairly standard set of conversion events (registration, revenue event, login, etc). You specify the definition of conversion and then upload one or more related metrics to support analysis and decision-making in the tool.  Bidding is driven by revenue and cost and improving  CTR and revenue/click allows marketers to focus on improving ad copy and landing pages.</p>
<p>Thus, in this case, limited data and a fairly homogenous cross-vertical definition of success metrics allows a scalable, tight integration of business process and business intelligence.</p>
<p>With regards to the platforms, you do start hitting limits once you start getting very sophisticated e.g. custom bidding algorithms, complex keyword expansion, etc.  But even in this area, the vendors are increasingly offering solutions that are more sophisticated than what most in-house digital marketing and analytics operations will develop e.g. attribution models for bidding, automated <a href="http://adwords.google.com/support/aw/bin/answer.py?hl=en&#038;answer=68034">SQR</a>-driven keyword expansion, etc.</p>
<p>I&#8217;m looking forward to watching these products expand over the next year and simultaneously watching how digital marketing processes evolve.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/06/28/digital-marketing-and-in-process-bi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Small businesses and BI</title>
		<link>http://www.vinaysethmohta.com/blog/2010/06/28/small-businesses-and-bi/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/06/28/small-businesses-and-bi/#comments</comments>
		<pubDate>Tue, 29 Jun 2010 02:44:52 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=68</guid>
		<description><![CDATA[I saw Dayna Grayson&#8217;s post about the winners of NBVP&#8217;s Seed Competition. One of the winners was Profitably, which is providing analytics and BI on data in Quickbooks. They are joining a growing list of companies that are selling SaaS BI on standard data schemas to SMB&#8217;s (small and medium businesses). Another recent one that [...]]]></description>
			<content:encoded><![CDATA[<p>I saw <a href="http://primaryentrepreneur.com/">Dayna Grayson&#8217;s</a> <a href="http://primaryentrepreneur.com/?p=98">post</a> about the winners of <a href="http://nbvp.northbridge.com">NBVP&#8217;s</a> Seed Competition.  One of the winners was Profitably, which is providing analytics and BI on data in Quickbooks.  They are joining a growing list of companies that are selling SaaS BI on standard data schemas to SMB&#8217;s (small and medium businesses).  Another recent one that a colleague pointed out is <a href="http://metricly.com/">Metricly</a>.  I&#8217;ve signed up for the betas on each site, so will post more when I get to use them.</p>
<p>In most cases, the vendors have taken a horizontal platform (whether Quickbooks or Salesforce) and developed a reporting / analytics offering on top of it.  The real question for me goes back to some of the posts that I wrote when I started this blog &#8211; these plays are interesting, but they don&#8217;t get to the heart of the highest value problems:</p>
<ul>
<li>understanding a business (i.e. vertical) and offering BI / reporting that factors in specific challenges</li>
<li>integrating BI as part of the business process of an organization (e.g. within Quickbooks usage or within Salesforce)</li>
</ul>
<p>Some of the major players (e.g. Siebel Analytics) have offered a &#8220;matrix&#8221; (i.e. functional area x vertical) suite of applications for several years (e.g. sales for manufacturing or finance for retail). More accurately, at least they offered one several years back; however, it doesn&#8217;t seem like these got much traction in their target market as I don&#8217;t hear about them any more.  I also haven&#8217;t seen other enterprise software vendors that sell to large companies universally adopt this packaging for their go-to-market.  Presumably, they would have if the market were responding strongly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/06/28/small-businesses-and-bi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summing a column of numbers</title>
		<link>http://www.vinaysethmohta.com/blog/2010/06/25/summing-a-column-of-numbers/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/06/25/summing-a-column-of-numbers/#comments</comments>
		<pubDate>Sat, 26 Jun 2010 01:10:00 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=63</guid>
		<description><![CDATA[Often, when I&#8217;m doing data validation with the output from Hive and comparing it to another system, it&#8217;s useful to get Unix to do some summing for me.  awk, cut, sort, and uniq are quite handy in these cases, and often much faster than modifying and re-running Hive queries.  Here&#8217;s my bag of tricks: - [...]]]></description>
			<content:encoded><![CDATA[<p>Often, when I&#8217;m doing data validation with the output from Hive and comparing it to another system, it&#8217;s useful to get Unix to do some summing for me.  awk, cut, sort, and uniq are quite handy in these cases, and often much faster than modifying and re-running Hive queries.  Here&#8217;s my bag of tricks:</p>
<p>- Summing the 5th column of numbers:</p>
<pre>cat tmp.txt | awk '{t+=$5}END{print t}'</pre>
<p>You may also find the above command useful for processing the output of <span style="font-family: courier;">ls -l</span></p>
<p>- cut, uniq, sort, grep, and wc are great for filtering and computing aggregates for a column of values e.g. for filtering a list of session ids in the 17th field, looking only for the duplicate ones:</p>
<pre>cut -f17 | sort -n | uniq -c | grep -v ' 1 '</pre>
<p>- And finally, a random aside that I learned about last week &#8211; bash treats the output of the following two commands differently:</p>
<p><span style="font-family: courier;">echo $output</span></p>
<p><span style="font-family: courier;">echo &#8220;$output&#8221;</span></p>
<p>If <span style="font-family: courier;">$output</span> has no line breaks, then the output will be identical.  If it does have line breaks, then the former will flatten the multiple lines into a single line, while the latter will print out <span style="font-family: courier;">$output</span> exactly as it was captured.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/06/25/summing-a-column-of-numbers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Datameer</title>
		<link>http://www.vinaysethmohta.com/blog/2010/06/21/datameer/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/06/21/datameer/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 01:52:00 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=60</guid>
		<description><![CDATA[A colleague recently pointed me at Datameer, an analytics front-end for Hadoop.  As their website and datasheet mention, they use a familiar spreadsheet interface for large data.  I recently saw a demo of the product, and I thought they had done a nice implementation of joins through a graphical user interface targeted at non-ETL experts. [...]]]></description>
			<content:encoded><![CDATA[<p>A colleague recently pointed me at <a href="http://datameer.com">Datameer</a>, an analytics front-end for Hadoop.  As their website and datasheet mention, they use a familiar spreadsheet interface for large data.  I recently saw a demo of the product, and I thought they had done a nice implementation of joins through a graphical user interface targeted at non-ETL experts.  At least based on the demo, I thought anyone who has decent experience with Excel would be able to effectively use it.  Note that it is not a tool targeting &#8220;BI for the masses&#8221;; it is definitely more of an analyst&#8217;s or an IT expert&#8217;s tool.</p>
<p>Added bonus that it easily integrates all the traditional data sources as well e.g. it&#8217;s easy to join a MySQL table against a &#8220;table&#8221; in Hadoop / Hive.  Would be cool if they could automatically discover the schema of your Hive tables and your traditional DB tables. They may already be able to; I didn&#8217;t see anything specific on their website indicating that though.</p>
<p>Tools like Datameer are a great addition to any BI practitioner&#8217;s toolset.  As datasets get larger, Hadoop allows ready access to large amounts of data at a reasonable price.  Datameer will now allow us to make that data more broadly accessible to a larger group of analysts.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/06/21/datameer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hive Annoyances</title>
		<link>http://www.vinaysethmohta.com/blog/2010/06/18/hive-annoyances/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/06/18/hive-annoyances/#comments</comments>
		<pubDate>Sat, 19 Jun 2010 02:16:34 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=54</guid>
		<description><![CDATA[As I mentioned in my prior post, I&#8217;ve been using Hadoop / Hive for six months now.  My top three frustrations with Hive v0.4.0: 1) a decent quality CLI (command line interface) for Hive.  Editing of a query in Hive is very limited. You can&#8217;t use custom keybindings &#8211; ideally, you&#8217;d want the CLI to [...]]]></description>
			<content:encoded><![CDATA[<p>As I mentioned in my prior post, I&#8217;ve been using Hadoop / Hive for six months now.  My top three frustrations with Hive v0.4.0:</p>
<p>1) a decent quality CLI (command line interface) for Hive.  Editing of a query in Hive is very limited. You can&#8217;t use custom keybindings &#8211; ideally, you&#8217;d want the CLI to get editing mode (emacs or vi) from your inputrc.  History and history search is poor. My workaround to date has been to instead use the shell&#8217;s command line and execute each query using the command line</p>
<pre>hive -e "... query ..."'</pre>
<p>Another benefit of using the Bash shell as your Hive CLI is that you can include variables in your SQL statement e.g. in my script, I can do something like:</p>
<pre>TS=$1</pre>
<pre>hive -e "select * from foo where ts='$TS'"</pre>
<p>2) Hive&#8217;s inability to map columns correctly.  I am unable to systematically reproduce this behavior, but when you do complex queries with multiple selects, Hive gets confused about the order of columns.  In particular, if the order of the columns in your hive query does not match the order of the columns on disk in the file backing the table, then you get lots of junk back.</p>
<p>3) Lack of documentation and confusing documentation.  The <a href="http://wiki.apache.org/hadoop/Hive/LanguageManual">Hive Language Manual</a> looks to be the definitive reference for Hive; however, it is missing quite a bit of documentation and mixes documentation from various versions of Hive, including the most recent, unreleased version, v0.6.0.</p>
<p>Gripes aside, Hadoop / Hive have been a great platform, and we&#8217;re continuing to expand our usage of them daily.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/06/18/hive-annoyances/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update</title>
		<link>http://www.vinaysethmohta.com/blog/2010/05/13/update/</link>
		<comments>http://www.vinaysethmohta.com/blog/2010/05/13/update/#comments</comments>
		<pubDate>Fri, 14 May 2010 03:42:30 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=46</guid>
		<description><![CDATA[It&#8217;s been over one year since I last posted.  A second child and a new job (focused on digital marketing at KAYAK) are the major updates; a new non-professional blog and a lot more time on digital photography have been amongst the other changes in the last one year. In the context of digital marketing, I&#8217;ve [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s been over one year since I last posted.  A second child and a new job (focused on digital marketing at KAYAK) are the major updates; a new <a href="http://tenthousandmistakes.com">non-professional blog</a> and a lot more time on digital photography have been amongst the other changes in the last one year.</p>
<p>In the context of digital marketing, I&#8217;ve spent additional time looking into BI and analytics tools, so more on that shortly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2010/05/13/update/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A capital framework for evaluating career choices</title>
		<link>http://www.vinaysethmohta.com/blog/2007/12/14/a-capital-framework-for-evaluating-career-choices/</link>
		<comments>http://www.vinaysethmohta.com/blog/2007/12/14/a-capital-framework-for-evaluating-career-choices/#comments</comments>
		<pubDate>Fri, 14 Dec 2007 21:44:47 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=13</guid>
		<description><![CDATA[My brother recently proposed the following framework for thinking about career choices or rather, choices &#8220;for your next gig.&#8221; It is a capital-oriented framework where traditional capital (money, equity) is just one part of it. He defines five different types of capital: Financial capital: traditional capital falls into this category e.g. How much money will [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.vivekmohta.com/">My brother</a> recently proposed the following framework for thinking about career choices or rather, choices &#8220;for your next gig.&#8221;  It is a capital-oriented framework where traditional capital (money, equity) is just one part of it.  He defines five different types of capital:</p>
<ul>
<li><strong>Financial capital</strong>: traditional capital falls into this category e.g. How much money will you make? How much equity will you get? What will the equity be worth?</li>
<li><strong>Knowledge capital</strong>: what you are going to learn / what will you know when you leave this position e.g. People management? Project management? Ruby on Rails? Running a consumer website? Consumer marketing?</li>
<li><strong>Social capital</strong>: who are you going to build relationships as a result of your work e.g. Do you meet other marketing folks? Strong technologists? CIO&#8217;s in financial services companies?  VC&#8217;s who could be potential future investors?</li>
<li><strong>Brand capital</strong>: how does this opportunity fit into your larger narrative and how does it develop your personal brand e.g. I have built deep infrastructure companies all my life and with this opportunity, I did it in the consumer space. I had spent a lot of time selling traditional enterprise software and now moved over to selling SaaS.</li>
<li><strong>Happiness</strong>: No capital suffix attached to this one e.g. Will you be happy when you come home everyday?  Will you like the people you work with? Will you find what you do fulfilling? etc.</li>
</ul>
<p>Rating my various opportunities against these categories and how much I would grow in each has been helpful as I think about what&#8217;s next for me.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2007/12/14/a-capital-framework-for-evaluating-career-choices/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Revenue &#8211; cost = profit</title>
		<link>http://www.vinaysethmohta.com/blog/2007/12/14/revenue-cost-profit/</link>
		<comments>http://www.vinaysethmohta.com/blog/2007/12/14/revenue-cost-profit/#comments</comments>
		<pubDate>Fri, 14 Dec 2007 21:37:51 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=12</guid>
		<description><![CDATA[One of the simplest and yet striking ideas that I&#8217;ve heard for thinking about case interviews has been &#8211; always start with the simple framework that you are trying to maximize profit, which is the difference between revenue and cost. Everything else should be done either to increase revenue or decrease cost. Similarly, one of [...]]]></description>
			<content:encoded><![CDATA[<p>One of the simplest and yet striking ideas that I&#8217;ve heard for thinking about case interviews has been &#8211; always start with the simple framework that you are trying to maximize profit, which is the difference between revenue and cost.  Everything else should be done either to increase revenue or decrease cost.  Similarly, one of the dead simple ideas I realized in the context of consumer applications on the web has been that there are really only two things that matter for a consumer application company to be successful:</p>
<ul>
<li>build a great product that people want</li>
<li>tell the world about it</li>
</ul>
<p>If you do the former and not the latter, you have the &#8220;tree falling in a forest&#8221; problem &#8211; no one knows you exist, so who really cares what you have built.  If you do the latter without the former, people will come and will leave just<br />
as quickly.  For those who buy traffic, the former is what will ultimately get the company out of the business of buying traffic.</p>
<p>I didn&#8217;t mention making money in the above. My general take is that if you build something people want and you have a bunch of people using it, you&#8217;ll find a way to make money from it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2007/12/14/revenue-cost-profit/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Who has time for this?</title>
		<link>http://www.vinaysethmohta.com/blog/2007/06/19/who-has-time-for-this/</link>
		<comments>http://www.vinaysethmohta.com/blog/2007/06/19/who-has-time-for-this/#comments</comments>
		<pubDate>Tue, 19 Jun 2007 21:19:41 +0000</pubDate>
		<dc:creator>vinaysethmohta</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.vinaysethmohta.com/blog/?p=4</guid>
		<description><![CDATA[I finally decided to start my own blog. I always wondered how people manage to have time to write up blog entries. I will shortly find out. The title of this post, apart from being my own rhetorical question, is also the title of David Cowan&#8217;s blog.]]></description>
			<content:encoded><![CDATA[<p>I finally decided to start my own blog.  I always wondered how people manage to have time to write up blog entries. I will shortly find out.  The title of this post, apart from being my own rhetorical question, is also the title of <a title="David Cowan's blog" href="http://whohastimeforthis.blogspot.com">David Cowan&#8217;s blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.vinaysethmohta.com/blog/2007/06/19/who-has-time-for-this/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
