In our last episode, we found out what your server was waiting on. In the closing remarks, I mentioned that you should find out which queries are causing the problem, and focus your mitigation efforts there. You might have circled back to the episode where we covered sp_BlitzCache, used a @SortOrder parameter that lines up with your top wait type, and…
Now you’re frustrated.
Because you’re looking at all these bad queries and thinking, “Isn’t there an easier way? Can’t I just throw hardware at this?”
Scaling out reads with replicas:
splitting queries between servers
When you want to copy the exact same production data to a second server to offload reporting queries, some of the costs are immediately obvious:
- Hardware and storage – even if you’re running it in a virtual machine, you need to account for the costs of, say, 4 cores and 32GB RAM. Not only will you need storage for the databases, but you’ll also need to decide whether this server gets backed up and copied to a disaster recovery data center.
- Software licensing – Standard Edition is ~$2k per core, and Enterprise Edition is ~$7k per core. Toss in Windows (especially now that it’s licensed per-core), your management/backup/antivirus tools, and your monitoring software.
- Project planning – you’ll need to design how to get the data from production to the reporting server, like with Always On Availability Groups, log shipping, or transactional replication.
- App modifications – the app running reporting queries will need a new connection string. Even with Always On Availability Groups, reads aren’t automatically offloaded to readable replicas – you have to use the connection string parameter ApplicationIntent=ReadOnly to tell SQL Server that you promise not to try to write anything. If you have a single app that does both reads and writes, and you only want to offload some of the queries, you’ll need to go through the code and switch those queries over to the new connection string.
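For example, a typical ADO.NET-style connection string pointing at an Availability Group listener might look like this (the listener and database names here are made up for illustration):

```
Server=tcp:SalesAGListener,1433;Database=Sales;Integrated Security=SSPI;ApplicationIntent=ReadOnly;
```

With ApplicationIntent=ReadOnly set, the listener can route the connection to a readable secondary – without it, everything lands on the primary.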
But some of the costs come as cruel surprises:
- Adding a troubleshooting process – sooner or later, the data replication process will break. Depending on the method (AGs, log shipping, replication) and the failure type, it’ll fail in different ways – maybe all of the data is old, maybe just some of it is, or maybe the reports aren’t accessible at all. You’ll want to list out the failure modes and explain what the symptoms will look like. This helps business users recognize when their reports are wrong and react appropriately. If you skip this step, then after the first failure, people are always going to suspect there’s a bug in the report data.
- Prepare for failure – for each of those failure modes, decide how you’re going to react. For example, if AG replication breaks and reports are out of date, will you point reports at the primary until the problem is resolved, or will users just have to deal with unavailable reports while you troubleshoot or resync the replicas? If you skip this step, you’ll be winging it every time, and you’ll look unprepared while reports are wrong or down.
- Set realistic expectations for RPO and RTO – based on your process and preparation, make sure the business users understand how much data they could lose (recovery point objective) and how long their reports will be down (recovery time objective) when things break.
- Measure the overhead of replication – AGs and transactional replication can add performance slowdowns beyond what the reports used to cost. For example, if you were only running a few reports an hour, and only hitting a subset of the data, then suddenly replicating every individual delete/update/insert operation can have a huge overhead.
- Add monitoring – you need to start monitoring how far behind the reporting server is, and how performance is doing on both. Performance troubleshooting becomes a lot harder, too – for example, when you’re doing index tuning, you have to combine data across both the primary and the reporting servers in order to find the right mix of indexes across the board.
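One common way to measure how far behind the reporting server is – regardless of replication method – is a heartbeat table: a job on the primary writes the current time to a one-row table on a schedule, replication carries that row over, and you compare it to the clock on the replica. Here’s a minimal sketch of the comparison logic (all names and thresholds are illustrative, not part of any product):

```python
from datetime import datetime, timezone

# Hypothetical heartbeat-table sketch: the primary writes a timestamp row,
# replication copies it to the reporting server, and monitoring compares
# that timestamp to the current time to estimate lag.

def replication_lag_seconds(now: datetime, replica_heartbeat: datetime) -> float:
    """Seconds between now and the newest heartbeat visible on the replica."""
    return (now - replica_heartbeat).total_seconds()

def lag_alert(now: datetime, replica_heartbeat: datetime,
              threshold_seconds: float = 300) -> bool:
    """True when the reporting server is further behind than the agreed RPO."""
    return replication_lag_seconds(now, replica_heartbeat) > threshold_seconds
```

If the lag exceeds the RPO you agreed on with the business, that’s your cue to fire the alert and start the troubleshooting process you documented above.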
And at the end of all this, all you’ve got is a way to split out the read-only queries. Those might not even be your biggest problem – you might need to split up writes between servers, and that gets REALLY hard.
Scaling out writes with sharding:
splitting data between servers
Sharding is the technique of splitting your data across multiple servers, called shards. Each shard has the exact same database structure, but the customers are divided between them – for example, maybe the US customers are on one shard, and the European customers are on another shard.
Sharding gets trickier when you have to keep data in sync between shards. For example, you might run a web site and want to keep all of the product and pricing data exactly the same across shards. You may also need to combine data from all of the shards back into one place, like for reporting purposes.
Other techniques for splitting load across servers involve replication with multiple masters – multiple servers that can accept writes at any given time. If you’re interested in that topic, check out Many Masters, One Truth. (Spoiler: it’s a lot harder than it sounds.)
But the real bottom line with sharding, and the reason implementation is so hard, is that your application has to know which shard to query. If you’re connecting all of the apps to a single SQL Server and routing requests from there, you’re right back where you started: bottlenecked on a single SQL Server.
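To make that routing problem concrete, here’s a minimal sketch (all shard names and connection strings are made up) of the lookup an application has to consult before it ever opens a connection, plus the fan-out step you’d need to combine per-shard results for reporting:

```python
# Hypothetical region-based shard map; server and database names are illustrative.
SHARD_MAP = {
    "US": "Server=sql-shard-us;Database=Sales;",
    "EU": "Server=sql-shard-eu;Database=Sales;",
}

def connection_string_for(region: str) -> str:
    """The app must pick the right shard *before* connecting - if a central
    SQL Server did this routing, it would become the bottleneck again."""
    if region not in SHARD_MAP:
        raise ValueError(f"No shard configured for region {region!r}")
    return SHARD_MAP[region]

def combined_total(per_shard_counts: dict) -> int:
    """Reporting across shards means querying each shard separately and
    merging the results in the app - e.g. summing per-shard order counts."""
    return sum(per_shard_counts.values())
```

Every application that touches the data needs this routing knowledge baked in – that’s a big part of why sharding rewrites are so expensive.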
This whole post was a trick.
None of the scaling methods are really easy. Whether you’re convincing the boss to buy more memory, taking an outage to upgrade SQL Server, or rewriting the app’s back end to accommodate sharding, you’re going to be putting in serious work.
That’s why I put so much emphasis on tuning indexes and queries. It’s hard to make your server 100x faster – but it’s entirely possible to make your database design and queries 100x more efficient. Fixing queries and tables and indexes is indeed hard work – but it’s totally worth it. The better you get at these techniques, the farther you can go in your career.
You should update the prices to reflect SQL 2017. $3.7 K for Standard and $14k+ for Enterprise.
PBrander – that’s incorrect. Those are 2-core packages that you’re quoting, not per-core licenses.
Reading is hard.
Great post. Tuning queries can be complicated when the queries are linked to an application where making changes to the table and column structure is very difficult (sometimes impossible). You often have to account for all these changes every single time you patch or upgrade the application. The alternative is to yell, scream, beg the vendor to fix these queries but that can also be very time consuming and/or futile.
Peter – thanks!
Always scale out, I think – if for no other reason than to make it easier to do OS and SQL maintenance. It’s a lot easier to schedule maintenance when you have three copies in an AAG: if everything goes smoothly, your outage is only a few moments per availability group, and if it goes badly, treat the failed node like sickly cattle and just get rid of it.
I like this better than scheduling a window to shut down the VM, snapshot it, bring it back up, block anyone from connecting, hope the maintenance goes well, revert the snapshot if it goes badly, and then hope the machine password didn’t change during your maintenance – which is a total PITA when your organization uses RADIUS authentication.
Most of the analysts I’ve worked with also tend to run wild with their queries – freer than the most libertarian cowboy of the old Wild West – and keeping apps spread across multiple nodes at least helps isolate the impact of a bad query to only the things on that server.
I really have only been doing multi-application deployment of AAGs, not connecting one big application to a scale-out deployment.
As Brent mentions, scaling out adds a lot of maintenance, overhead and more to monitor so you’re complicating things too. A VM snapshot works quite nicely if you want to avoid that, you usually don’t need to shut down the VM?
What is the best way to determine how many cores you need for a replication server for offloading reports? Or is it just like every other SQL answer – it depends? Let’s say our prod environment has 12 cores – would we cut the amount in half for the reporting server?
pbrander – the answer is, “How many cores do you need for your reports?” That’s kinda like asking how long a piece of string is.
Just a newbie with AAGs here. We have the Standard 2016 version in prod, which only supports Basic AGs. That means the secondary can’t be used for reporting, backups, etc. Would it be beneficial to run both as primaries and just do transactional replication between them, so that we can use them for backups and reporting? Currently we have 300-plus databases on one instance.
Mundaa – reread the takeaway section.