Launch week: the Season Pass & Fundamentals Week are 50% off — ends in 18d 06h 43mSee the sale

Category: Clients and Case Studies

10 Signs It Was Time to Hire Me

You've been reading my blog, watching my videos, and maybe even taking some of my training classes. You've heard me say things like "my clients" from time to time, and you're wondering... why do companies actually hire me? What problem are they trying to solve?

Well, when clients meet with me for the first time, I have a series of questions I put up on the screen for them to talk through:

Read more about 10 Signs It Was Time to Hire Me Be the first to comment

Updated, Larger Stack Overflow Demo Database

Stack Overflow publishes a data dump with all user-contributed content, and it's a fun set of data to use for demos. I took the 2024-April data dump, and imported it into a Microsoft SQL Server database.

It's an 31GB torrent (magnet) that expands to a ~202GB database. I used Microsoft SQL Server 2016, so you can attach this to anything 2016 or newer. If that's too big, no worries - for smaller versions and past versions, check out my How to Download the Stack Overflow Database page.

Read more about Updated, Larger Stack Overflow Demo Database 9 comments — Join the discussion

Download the Current Stack Overflow Database for Free (2022-06)

Stack Overflow, the place where most of your production code comes from, shares a version of their data in XML format from time to time, and then I import it into SQL Server format.

Stack Overflow's database makes for great blog post examples because it's real-world data: real data distributions, lots of different data types, easy to understand tables, simple joins. Some of the tables include:

Read more about Download the Current Stack Overflow Database for Free (2022-06) 5 comments — Join the discussion

Download the Current Stack Overflow Database for Free (2021-06)

Stack Overflow, the place where most of your production code comes from, publicly exports their data every couple/few months. @TarynPivots (their DBA) tweets about it, and then I pull some levers and import the XML data dump into SQL Server format.

Stack Overflow's database makes for great blog post examples because it's real-world data: real data distributions, lots of different data types, easy to understand tables, simple joins. Some of the tables include:

Read more about Download the Current Stack Overflow Database for Free (2021-06) Be the first to comment

Download the Current Stack Overflow Database for Free (2021-02)

Stack Overflow, the place where most of your production code comes from, publicly exports their data every couple/few months. @TarynPivots (their DBA) tweets about it, and then I pull some levers and import the XML data dump into SQL Server format.

Stack Overflow's database makes for great blog post examples because it's real-world data: real data distributions, lots of different data types, easy to understand tables, simple joins. Some of the tables include:

Read more about Download the Current Stack Overflow Database for Free (2021-02) 16 comments — Join the discussion

Updated Stack Overflow Public Data Set for June 2019

Taryn and the kind folks at Stack Overflow have updated their public XML data dump for June, so I've imported that into an updated sample database for your blogging and presenting satisfaction.

You can download the 40GB torrent (magnet) and it expands to a ~350GB SQL Server 2008 database. Because it’s so large, we only distribute it with BitTorrent – if you’re new to that, here are more detailed instructions.

Read more about Updated Stack Overflow Public Data Set for June 2019 5 comments — Join the discussion

A Presenter’s Guide to the Stack Overflow Database

You present on SQL Server topics at user groups and conferences, and you've been wondering how to get started with the Stack Overflow public database. Here's a quick list of things to know.

For stable demos, use StackOverflow2010. This smaller 10GB database has data from the first years of Stack Overflow's history. It doesn't change, so you don't have to worry about updating your screenshots and metrics every time Stack releases a new data dump.

Read more about A Presenter’s Guide to the Stack Overflow Database 4 comments — Join the discussion

New Stack Overflow Public Database Available (2018-06)

@Taryn and the kind folks at Stack Overflow publish their data export periodically with your questions, answers, comments, user info, and more. It's available as an XML data dump, which I then take and import into SQL Server for teaching performance tuning. You can download the 38GB torrent (magnet), which gives you a series of 7Zip files…

Read more about New Stack Overflow Public Database Available (2018-06) 5 comments — Join the discussion
Performance Tuning

Creating Basic Indexes on the Stack Overflow Public Database

My public SQL Server database copy of the Stack Overflow data dump only includes clustered indexes by default. I want to keep your database size as small as possible for quick downloading.

But like any database - when you create it, should you add your own indexes to make queries go faster? The answer lies in knowing your workloads, but we don't usually know our database workloads until they start.

Read more about Creating Basic Indexes on the Stack Overflow Public Database Be the first to comment

New Stack Overflow Public Database Available (2017-06)

Nick Craver and the kind folks at Stack Overflow publish their data export periodically with your questions, answers, comments, user info, and more. It's available as an XML data dump, which I then take and import into SQL Server for teaching performance tuning.

You can download the 16GB torrent (magnet), which gives you a series of 7Zip files that you can extract to produce a 118GB SQL Server 2008 database. You can then attach it to any 2008-2017 SQL Server.

Read more about New Stack Overflow Public Database Available (2017-06) 5 comments — Join the discussion
Production DBA

Building a Faux PaaS, Part 3: What the Ideal Engineering Team Looks Like

Background: I'm working with kCura to build a Faux PaaS: something akin to Microsoft's Azure SQL DB, but internally managed. You can catch up with what we've discussed so far in Part 1 and Part 2 of the series.

In the last post, I talked about measuring backup and restore throughputs across different instance types, regions, storage configs, and backup locations. It's a lot of work to answer questions like "How should we configure our new SQL Server VMs?"

Read more about Building a Faux PaaS, Part 3: What the Ideal Engineering Team Looks Like 4 comments — Join the discussion
Production DBA

Building a Faux PaaS, Part 1: The SQL Server DevOps Scene in 2017

In the cloud, treat your servers like cattle, not like pets.

In the cloud, systems administration is very different than the on-premises stuff you're used to. When you build VMs in the cloud with Infrastructure-as-a-Service (IaaS, meaning AWS EC2, GCE, or Azure VMs), you expect them to die. It's just a matter of time. If you're lucky, it'll be years from now, but if you're unlucky, it'll be tomorrow.

Read more about Building a Faux PaaS, Part 1: The SQL Server DevOps Scene in 2017 13 comments — Join the discussion