Microsoft have made announcements about bringing Hadoop to Windows, but what does that mean for SQL Server? In this talk, Jeremiah Peschka will cover a the Hadoop ecosystem at a high level and discuss specific use cases for Hadoop. By the end of this talk, you should have a general idea of how different parts of Hadoop can be used to augment SQL Server’s rich functionality. This talk is for database professionals who are looking for ways to extend the capability of SQL Server with different pieces of the Hadoop ecosystem.
Getting Started with Hadoop
- Getting Started withHive
- Cloudera Hadoop Downloads (includes virtual machines)
- Apache Hadoop
- Hadoop World Follow-Up- I was lucky enough to attend Hadoop World in 2010, here’s some of the knowledge I came away with.
- Hadoop Pig Reference Card
Subhash Pant November 26, 2011 | 11:48 pm
Awesome intro about Hadoop to SQL community. I am interested to see how Microsoft will integrate this with SQL Server.
Pingback: Something for the Weekend – SQL Server Links 02/12/11
Pingback: Introduction to Hadoop « Click-Rex
Pingback: Introduction to Hadoop « Click-Rex | Programmer Solution
masud taheri February 13, 2012 | 2:58 am
hello my Freand
does hadoop support CRUD and Load balance for SQL Server
HadoopServer1 HadoopServer2 … HadoopServerN
Hadoop Provider(For SQL Server 2012)
User1 User2 User3 ….UserN (Send CRUD Command )
Does Support This Solution
Jeremiah Peschka February 13, 2012 | 8:28 am
I believe that you’ve confused Hadoop with an ORM. Hadoop is a family of products based around the Hadoop Distributed File System (HDFS) and Hadoop MapReduce – a job scheduling and execution framework.
For CRUD procedure creation, you should look something like into Entity Framework, nHibernate, or LLBLGen.
For load balancing SQL Server, you should probably talk to an architect or consultant.
masud taheri February 13, 2012 | 10:29 pm
Thanks.
I need Solution for Loadbalance SQL Server in
Insert Update Delete Select
many user ,…
Oracle :RAC
SQL: ? (Hadoop or ?)
Jeremiah Peschka February 13, 2012 | 11:11 pm
There are a lot of ways to do this. Head on over to my contact page and drop me an email and we can chat a bit more about this.
Matan Yungman March 25, 2012 | 6:55 am
Thanks for the great presentation Jeremiah.
Do you mind sharing your thoughts about what type of queries will perform better on Hadoop as oppose to relational database and vice versa?
Thanks,
Matan
Jeremiah Peschka March 25, 2012 | 8:48 am
Hi Matan, great question. Honestly, the question is too complex for even a blog post comment, but I’ll give it a shot anyway.
If you’re thinking about warehousing, any query that can easily be split among multiple nodes is likely to perform better on a Hadoop based system. Likewise, anything that you can break down into a series of Map and Reduce phases will perform well.
If you are thinking about OLTP, things get tricky because there is no OLTP alternative in the Hadoop world that speaks a SQL dialect. Instead you have to re-arrange the way you query to make sense in HBase’s data model.
Pingback: Hadoop in a Microsoft environment | Data story
Pingback: Hadoop « SQLAnidma
Pingback: Hadoop for SQL DBA « SQLAnidma