Leaked: SQL Server 2019 Big Data Clusters Introduction Video

By Brent Ozar · September 22, 2018 · 12 comments

Psst – you’re probably not supposed to see this yet, but look what @WalkingCat found:

https://x.com/h0x0d/status/1042979511074086913

What the video says

Growing volumes of data create deep pools of opportunity for those who can navigate it. SQL Server 2019 helps you stay ahead of the changing tide by making data integration, management, and intelligence easier and more intuitive than ever before.

Yep, that’s a Microsoft video alright.

With SQL Server 2019 you can create a single virtual data layer that’s accessible to nearly every application. Polybase data virtualization handles the complexity of integrating all your data sources and formats without requiring you to replicate or move it. You can streamline data management using SQL Server 2019 Big Data Clusters deployed in Kubernetes. Every node of a Big Data Cluster includes SQL Server’s relational engine, HDFS storage, and Spark, which allow you to store and manage your data using the tools of your choice.

SQL Server 2019 makes it easier to build intelligent apps with big data. Now you can run Spark jobs to analyze structured and unstructured data, train models over data from anywhere with SQL Server Machine Learning Services or Spark ML, and query data from anywhere using a rich notebook experience embedded in Azure Data Studio. The torrent of data isn’t slowing down, but it doesn’t have to sink your business. Sail through with SQL Server 2019, and shorten the distance between data and action.

My take on the Big Data Clusters thing

<sarcasm> It’s like linked servers, but since they don’t perform well, we need to scale out across containers. </sarcasm>

Today, Polybase is a rare and interesting animal. You’ve probably never used it – here’s a quick introduction from James Serra – but it wasn’t really targeted at the mainstream database professional. It first shipped in PDW/APS to let data warehouses run queries against Hadoop, and then it was later added to the boxed product in SQL Server 2016.

Polybase is for data warehouse builders who want to run near-real-time reports against data without doing ETL projects. That’s really compelling to me – report on data where it’s at. That seems like a smart investment as the sizes of data grow, and our willingness to move it decreases.

I like that Microsoft is making a risky bet, planting a flag where nobody else is, saying, “We’re going to be at the center of the new modern data warehouse.” What they’re proposing is hard work – we all know first-hand the terrible performance and security complexities of running linked server queries, and this is next-level-harder. It’s going to take a lot of development investment to make this work well, and this is where the licensing revenues of a closed-source database make sense.

If you want to hitch your career caboose to this train, there are all kinds of technologies you could specialize in: machine learning, Hadoop, Spark, Kubernetes, or…just plain SQL. See, here’s the thing: there’s a whole lot of SQL Server in this image:

If you’re good at performance tuning the engine, and this feature takes off, you’re going to have a lot of work to do, and the licensing costs of this image make consulting look inexpensive. This feature’s primary use case isn’t folks with Standard Edition running on an 8-core VM. (I can almost hear the marketers wailing, “But you COULD do it with that,” hahaha.)

12 comments

greg

September 24, 2018 at 5:14 am

I find all this stuff interesting. Nevertheless, are there any hints yet on when the previews and/or the RC versions will be publicly available? Do you maybe know if there’s also an insider program like the one for Windows?
Best regards

1. Arshad
  
  September 24, 2018 at 10:38 am
  
  You can download SQL Server 2019 from https://www.microsoft.com/en-us/sql-server/sql-server-2019
  Early adoption program https://sqlservervnexteap.azurewebsites.net/
  Regards
  
2. Brent Ozar
  
  September 24, 2018 at 10:42 am
  
  Greg – today you have your answer. 😉
  
  1. Greg
    
    September 25, 2018 at 11:19 am
    
    Yup
    Already installed a new environment with Windows Server 2019. Will test all the stuff tomorrow.
    
BB

September 24, 2018 at 1:46 pm

Nothing new here that Denodo hasn’t already done…but better. Seriously. Check it out. Google “Denodo Logical Data Warehouse”.

1. Lars Blank
  
  September 25, 2018 at 4:23 am
  
  Yes, indeed, but what about the cost of a Denodo license compared to SQL Server?
  
Maria Bermudes

September 25, 2018 at 10:29 am

I like this. I’m in!

Pei Zhu

September 28, 2018 at 9:30 am

Is it on Windows or Linux?

1. Brent Ozar
  
  September 28, 2018 at 11:56 pm
  
  Yes
  
Tom

October 2, 2018 at 2:32 pm

From a management perspective, I think it’s important to temper the urge to jump on new technology that’s fired up by marketing campaigns. I saw lots of good stuff at Ignite and I like to keep up (especially for certification purposes). That said, Microsoft wants to make as much money as possible. They are constantly dumping out new features, and a good management perspective helps sort out what to grab onto and when. I like how Ozar takes a grounded approach and points out MS’s “risky bet” when it comes to Big Data Clusters (and dare I suggest what the guys at Ignite called “Modern Data Warehousing”).

Bottom line…be smart, know the technology, but know when to say when – don’t consume too much too fast 😉

Greg

March 1, 2022 at 6:02 pm

Your thoughts on Polybase now, since the announcement that BDC is going away?

1. Brent Ozar
  
  March 2, 2022 at 10:20 am
  
  I don’t understand why companies would adopt Polybase.
  
