<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Whither Hadoop?</title>
	<atom:link href="http://www.brentozar.com/archive/2012/12/whither-hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/</link>
	<description></description>
	<lastBuildDate>Wed, 19 Jun 2013 14:19:21 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>By: Jeremiah Peschka</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-117854</link>
		<dc:creator>Jeremiah Peschka</dc:creator>
		<pubDate>Wed, 19 Dec 2012 18:53:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-117854</guid>
		<description><![CDATA[You&#039;re absolutely correct! 

There is a lot that could be said in a much more specific way in this post. If I were talking to a non-Microsoft audience (or assumed that they had more exposure to the ecosystem), I&#039;d be much more specific. Inside the Microsoft community, the entire ecosystem of Hadoopery is still frequently referred to as Hadoop. Sometimes you have to start vague and bring in specifics over time.

Thanks much for contributing, though, the differentiation is key.]]></description>
		<content:encoded><![CDATA[<p>You&#8217;re absolutely correct! </p>
<p>There is a lot that could be said in a much more specific way in this post. If I were talking to a non-Microsoft audience (or assumed that they had more exposure to the ecosystem), I&#8217;d be much more specific. Inside the Microsoft community, the entire ecosystem of Hadoopery is still frequently referred to as Hadoop. Sometimes you have to start vague and bring in specifics over time.</p>
<p>Thanks much for contributing, though, the differentiation is key.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Josh Reuben</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-117731</link>
		<dc:creator>Josh Reuben</dc:creator>
		<pubDate>Wed, 19 Dec 2012 17:20:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-117731</guid>
		<description><![CDATA[This post comes across wrong to me.
Hadoop is not an ETL tool, its a MapReduce stack for scatter-gather-aggregate scaleout of compute jobs. If you want to compare it to something on the MS stack, it is more akin to HPC than SSIS !]]></description>
		<content:encoded><![CDATA[<p>This post comes across wrong to me.<br />
Hadoop is not an ETL tool, its a MapReduce stack for scatter-gather-aggregate scaleout of compute jobs. If you want to compare it to something on the MS stack, it is more akin to HPC than SSIS !</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Links</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-117393</link>
		<dc:creator>Links</dc:creator>
		<pubDate>Wed, 19 Dec 2012 10:43:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-117393</guid>
		<description><![CDATA[Huh, I&#039;ve been trying HD-Insight the last few weeks, which is MSFT &amp; Horton&#039;s Hadoop distribution for Windows Server (local not cloud based), and you could submit Map/Reduce jobs using C# rather than Java (I think u could do the same with the Azure based Hadoop distribution too).

To be honest I don&#039;t think we should limit ourselves because we haven&#039;t played with a particular technology before, Java (or generally any popular enough programming language) is based on sound programming principles that can easily be picked up... Am having to learn Python these days for some social network data mining, and it is soo awesome, sometimes its good to step out of your comfort zone.

My only beef with the MSFT local Hadoop installation is that it does not yet support Mahout, which is only supported in the Azure distribution.]]></description>
		<content:encoded><![CDATA[<p>Huh, I&#8217;ve been trying HD-Insight the last few weeks, which is MSFT &amp; Horton&#8217;s Hadoop distribution for Windows Server (local not cloud based), and you could submit Map/Reduce jobs using C# rather than Java (I think u could do the same with the Azure based Hadoop distribution too).</p>
<p>To be honest I don&#8217;t think we should limit ourselves because we haven&#8217;t played with a particular technology before, Java (or generally any popular enough programming language) is based on sound programming principles that can easily be picked up&#8230; Am having to learn Python these days for some social network data mining, and it is soo awesome, sometimes its good to step out of your comfort zone.</p>
<p>My only beef with the MSFT local Hadoop installation is that it does not yet support Mahout, which is only supported in the Azure distribution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremiah Peschka</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-116829</link>
		<dc:creator>Jeremiah Peschka</dc:creator>
		<pubDate>Tue, 18 Dec 2012 22:22:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-116829</guid>
		<description><![CDATA[Hi Jay,

While some Hadoop development is done in Java, much of the day-to-day business data analysis happens in tools like Pig, Hive, and Impala. These tools make it possible for non-Java devs to pull meaningful data out of massive data systems and, very frequently, push it in a distilled form back into something like SQL Server or SSAS. Developers are typically only dropping down into Java when it&#039;s absolutely necessary or to implement custom user defined functions.

If you&#039;re interested in seeing more of the Java side of development, I suggest you check out one of the many Apache hosted mailing lists (http://hadoop.apache.org/mailing_lists.html) to see how other people are working with Hadoop.]]></description>
		<content:encoded><![CDATA[<p>Hi Jay,</p>
<p>While some Hadoop development is done in Java, much of the day-to-day business data analysis happens in tools like Pig, Hive, and Impala. These tools make it possible for non-Java devs to pull meaningful data out of massive data systems and, very frequently, push it in a distilled form back into something like SQL Server or SSAS. Developers are typically only dropping down into Java when it&#8217;s absolutely necessary or to implement custom user defined functions.</p>
<p>If you&#8217;re interested in seeing more of the Java side of development, I suggest you check out one of the many Apache hosted mailing lists (<a href="http://hadoop.apache.org/mailing_lists.html" rel="nofollow">http://hadoop.apache.org/mailing_lists.html</a>) to see how other people are working with Hadoop.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jay</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-116809</link>
		<dc:creator>Jay</dc:creator>
		<pubDate>Tue, 18 Dec 2012 22:04:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-116809</guid>
		<description><![CDATA[Hadoop is very good product but and not useful to DBA or any other person unless he or she is TRUE JAVA developer. Hadoop is developed using 100% Java

So i would like to know &amp; learn more about Hadoop been used currently. what i see that SQL Server will be front end for Hadoop &amp; you need developer who can develope good code using java to develop bridge between Hadoop &amp; SQL Server. 

Thanks
Jay-]]></description>
		<content:encoded><![CDATA[<p>Hadoop is very good product but and not useful to DBA or any other person unless he or she is TRUE JAVA developer. Hadoop is developed using 100% Java</p>
<p>So i would like to know &amp; learn more about Hadoop been used currently. what i see that SQL Server will be front end for Hadoop &amp; you need developer who can develope good code using java to develop bridge between Hadoop &amp; SQL Server. </p>
<p>Thanks<br />
Jay-</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Harris</title>
		<link>http://www.brentozar.com/archive/2012/12/whither-hadoop/#comment-108234</link>
		<dc:creator>Joe Harris</dc:creator>
		<pubDate>Wed, 12 Dec 2012 16:17:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.brentozar.com/?p=17056#comment-108234</guid>
		<description><![CDATA[At core Hadoop is a way to execute  logic in the language of your choice over raw data on multiple machines. It&#039;s all just files - no rows, no tables, no records, no tuples - what you do from there is entirely your responsibility.

With all respect to Jeremiah, I&#039;d suggest those interested in Hadoop should also read &quot;What Hadoop Is. What Hadoop Isn&#039;t&quot; by Mark Madsen at http://www.insideanalysis.com/2012/12/what-hadoop-is-what-is-isnt/.

A quote: &quot;The Hadoop stack is a data processing platform. It combines elements of databases, data integration tools and parallel coding environments into a new and interesting mix.&quot; &quot;It combines data storage, retrieval and programming into a single highly scalable package.&quot;]]></description>
		<content:encoded><![CDATA[<p>At core Hadoop is a way to execute  logic in the language of your choice over raw data on multiple machines. It&#8217;s all just files &#8211; no rows, no tables, no records, no tuples &#8211; what you do from there is entirely your responsibility.</p>
<p>With all respect to Jeremiah, I&#8217;d suggest those interested in Hadoop should also read &#8220;What Hadoop Is. What Hadoop Isn&#8217;t&#8221; by Mark Madsen at <a href="http://www.insideanalysis.com/2012/12/what-hadoop-is-what-is-isnt/" rel="nofollow">http://www.insideanalysis.com/2012/12/what-hadoop-is-what-is-isnt/</a>.</p>
<p>A quote: &#8220;The Hadoop stack is a data processing platform. It combines elements of databases, data integration tools and parallel coding environments into a new and interesting mix.&#8221; &#8220;It combines data storage, retrieval and programming into a single highly scalable package.&#8221;</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching using memcached
Object Caching 293/295 objects using memcached
Content Delivery Network via Amazon Web Services: S3: cdn.prod.brentozar.com
Application Monitoring using New Relic

 Served from: www.brentozar.com @ 2013-06-19 12:37:39 by W3 Total Cache -->