Production DBA

How to Patch SQL Server

By Brent Ozar · June 8, 2021 · 54 comments

Microsoft releases SQL Server Cumulative Updates about every 60 days. This is a part of your job that you’re going to be doing a lot, so let’s get good at it! Here’s the strategy I like to use.

Pick what patch you’re going to apply. Generally speaking, you should be on the most recent Cumulative Update available for your version. (Years ago, folks only applied Service Packs, but starting with SQL Server 2017, Service Packs are gone. Microsoft only ships Cumulative Updates now.)

Decide how you’re going to detect problems. Every now and then, an update breaks something. For example, SQL Server 2019 CU7 broke snapshots, SQL Server 2019 CU2 broke Agent, and so many more, but my personal favorite was when SQL Server 2014 SP1 CU6 broke NOLOCK. Sure, sometimes the update installer will just outright fail – but sometimes the installer succeeds, but your SQL Server installation is broken anyway, and it may take hours or days to detect the problem. You need to monitor for new and unusual failures or performance problems.

Design your ideal rollout strategy. Here’s the order I like to use, but I understand that not everyone has all of these environments. More on that in a second. Roll out the patch in this order:

Development servers – you want your developers seeing any failures or behavior changes first so they can feel confident that the eventual production patch will produce behavior they’re used to.
QA/test servers
Disaster recovery servers – often, these are the least-critical servers in the list that are actually being monitored with monitoring software, and leverage SQL Server’s high availability and disaster recovery features like clustering, and also have to keep up with the level of writes happening in production. New problems will show up here, and hopefully monitoring will detect them before you apply patches to subsequent levels.
Read-only replicas – servers where some end user activity happens, but it’s less critical than your primary servers. This advice applies whether we’re talking replication, log shipping, or Always On Availability Groups.
Failover servers – now we’re getting really close. The idea here is to patch these without taking a production outage – but that’s not always possible depending on the HA/DR features you’re using, and the way you’re using them.
Production primary servers – and the way you patch these is to actually not patch them at all. On “patch day”, simply fail over to your failover server, which has already been patched. This way, if you experience any surprise issues with the patch within the first few days, you can fail back over to your unpatched production server. (This also means you need to hold off patching that server for a few days to give yourself a safety net.)

Design your actual rollout strategy. Having read that above rosy-world scenario, now you’re looking at your own environment going, “Brent, I don’t have a bunch of those servers.” Okay, no problem: scratch out the lines you don’t have, but understand that you’re also scratching out possible safety nets. This is something to think about when you’re designing your next SQL Server architecture.

Design your rollback strategy. In the event that you do detect problems – and it’ll happen sooner or later – you want to have a rough idea of what you’re going to do. In dev/QA/test, you might just choose to uninstall the update and wait it out, giving other SQL Server customers time to troubleshoot the problem with Microsoft on their mission-critical servers, then apply the next fixed update instead. If the update made it all the way to your DR or failover tier without you catching the problem, you might not have the luxury of cleanly uninstalling the update, and your rollback strategy may be to open a support case with Microsoft to troubleshoot the problem – hopefully before applying the failed patch to your production primary servers.

What we’ve done so far seems like a lot of designing, but remember, you only have to do this once, and you can reuse it for every update you apply to this environment.

When applying the actual patch, here’s what I like to do, in order:

Verify that you have backups. Ideally, do a test restore, too: backup success messages don’t mean you have working backup files.
Stop or shut down client apps. You don’t want folks starting a transaction as your update begins.
Make sure there’s no activity happening on the server, especially long-running jobs like backups.
Apply the update – if you’re using PowerShell, check out how to automate patching with DBAtools.
Apply Windows updates since you’re down anyway. (Sometimes I find folks have been applying SQL updates, but not Windows updates – they’re both important.)
Confirm the SQL Server service is started, and check your monitoring tools for any unexpected failures.
Confirm the SQL Server Agent service is started again, and kick off your next log backup job.
Start client apps back up and make sure they function.

Over the coming days, keep a much closer eye than normal on monitoring tools looking for unexpected failures. Then, it’s time to hop back on the hamster wheel again, and start planning your next round of updates.

Free, 3× a week

Get my new posts by email

Three posts a week, plus a Monday roundup of the best database news from around the web.

54 comments

Thomas Franz

June 8, 2021 at 4:22 pm

Very nice to-do-list.

It would be nice, if you could place a link on https://sqlserverupdates.com/ to this article – it makes it easier to find when you really need it 🙂

Reply
1. Brent Ozar
  
  June 9, 2021 at 5:56 am
  
  Thomas – great idea, done!
  
  Reply
Heywood

June 8, 2021 at 4:31 pm

Are there any ramifications of applying updates on a DR server when all the databases are in Restoring mode?

Reply
1. Brent Ozar
  
  June 8, 2021 at 6:29 pm
  
  Yep. The specifics are beyond the scope of this post though – that’s where your design steps come in as I discussed in the post. The good news is you still have job security. 😉 The bad news is that I’m not doing ALL of your job, heh.
  
  Reply
James Fogel

June 8, 2021 at 6:00 pm

I got a little sick when I saw the mention of opening a ticket with MS. I haven’t had to open one for an SQL Server incident since last year, but I’ve open several in the last few months related to AD issues. We have either figured it out on our own or given up. Weeks go by with nothing to show for it. It seems that only tickets submitted via the Azure Portal are handled by people with a clue.

Reply
Markus

June 8, 2021 at 6:13 pm

As always nice post @BrentO.
Totally agree in the list.

Number 2. and number 8. are the toughest ones! Stop/Start manually or automated!

And-And-And your App-Admins won’t [always] know what Services they need to stopp 😉 you got to tell them.

for one server one instance that’s all ok but for a thousand servers or several th… ha ha

Reply
Dba locoCrazy

June 8, 2021 at 6:13 pm

what “monitoring tools” you guys sugest ? I’m used to create my owns but for sure you guys know better ones.

Reply
1. Brent Ozar
  
  June 8, 2021 at 6:27 pm
  
  That’s outside of the scope of this blog post. Let’s not turn this into a shopping comparison – just don’t Want to go there here. Thanks for understanding.
  
  Reply
Chad Miller Student since 2021

June 8, 2021 at 6:25 pm

If you work in a large enterprise, your company will have already invested in patch automation software which is fully capable of also patching SQL Server. Two which I’ve used: SCCM and Tanium. The patching team which manages all the Microsoft patches can include MSSQL CU’s and service packs as part of monthly patch updates and rollout out at same schedule as MS monthly patching staggering dev, qa, test and prod environment. This requires working silos, but that’s always a good thing.

Reply
1. Brent Ozar
  
  June 8, 2021 at 6:26 pm
  
  Chad – can you take a quick look again through the post and verify that the patching software can do those things? May want to double check those.
  
  Reply
  1. Chad Miller Student since 2021
    
    June 10, 2021 at 2:25 pm
    
    On the applying the patch, yes it does. Things like shutting down apps, verifying backups and verifying SQL Server after patching are additional processes/procedures as you pointed out.
    
    Reply
    1. Brent Ozar
      
      June 10, 2021 at 2:30 pm
      
      Right – applying the patch is the easy part. 😉
      
      Reply
    2. Valerie Mortensen
      
      October 13, 2021 at 10:52 pm
      
      yes applying the patch is the easy part… wondering how involved your automated process is and how you leverage SCCM for SQL patching.. especially if you are patching clustered instances
      
      Reply
ASP

June 8, 2021 at 7:44 pm

Should we stop sql services as well before the patching? I think that step 2 should include it as well, right?

Reply
1. Brent Ozar
  
  June 8, 2021 at 8:10 pm
  
  I’ve never done that myself, but I’ve heard other folks do it. If you want to do it, that’s completely fine.
  
  Reply
Thierry Van Durme

June 9, 2021 at 6:27 am

Thanks for your insights Brent! Adding a couple to our list 🙂
I like to separate Windows and SQL updates – if something fails it makes it a little easier to narrow down the possible causes.
Also, when D/R (log shipping for example) is in standby mode, that may not always work if your production server is not patched yet because your databases might need to be upgraded before SQL can bring them online.

Reply
Juanita Drobish Student since 2018

June 10, 2021 at 3:22 pm

Question on patching 2 servers involved in transactional replication between each other. Both servers are on SQL 2014 sp2 cu17. To apply the most current updates, I need to apply 2014 sp3, then cu4 then GDR. Do both servers have to be patched at the same time or can one server be patched then wait a few days to patch the following server?

Reply
1. Brent Ozar
  
  June 10, 2021 at 4:35 pm
  
  Howdy ma’am! You can usually patch them out of order.
  
  Reply
  1. Juanita Drobish Student since 2018
    
    June 10, 2021 at 4:42 pm
    
    Thanks so much, Brent!
    
    Reply
2. Andy
  
  June 11, 2021 at 6:55 am
  
  Just a quickie note: the GDR releases are separate to the CU releases; you cannot apply both.
  Once you have applied a CU, you must remain on the CU releases; same for GDR, once you start on GDR, you stick on GDR. The only time you can swap between them, is when applying a new SP (you could be waiting a long time these days!) or reverting back to the original RTM.
  For many of the Instances I look after, we stick on the GDR release schedule; far fewer updates to apply, most for security patching; sure, we don’t get the latest-and-greatest features and improvements; but for these systems, stability and reliability are more important.
  
  Reply
Hany Helmy

June 12, 2021 at 8:33 am

Hello Brent,

Nice article, for me as working with SQL Server since more than 17 years, I learnt the hard way, Never ever try Microsoft first patch release, wait for some time until “Someone else try it first”, then apply it in your environment in the same order you provided in the article.

As now MS is releasing a patch every 2 month, I am patching my server once per year, that`s more than enough for my current stable environment.

Reply
Dev Bethanasamy

June 12, 2021 at 2:35 pm

We upgrade passive node on FCI and failover to that node, after the upgrade and keep the upgrade other node on lower CU, so we can rollback to that node when something goes wrong. We do this for a week and then upgrade other node.
If we failover, SQL Server nicely rollbacks any Changes it made.

Reply
1. Valerie Mortensen
  
  October 13, 2021 at 10:51 pm
  
  How many sql servers are you doing manually like this? I’m trying to find if people are widely using SCCM to automate .. especially where SQL clusters are concerned
  
  Reply
Rick Harderwijk

June 29, 2021 at 6:38 am

Thierry Van Durme already mentioned it… there might be an issue with holding of updating the previous-version active node after moving the database to the formerly-standby node of a cluster. The internal database version might change and I am not 100% sure you can always re-active the old node(s) with the updated databases. I do recall having issues with that, although that was almost a lifetime ago, so that issue might no longer be a thing. If anyone could confirm that either way that would be nice.

Reply
1. Brent Ozar
  
  June 29, 2021 at 8:28 am
  
  We can never really confirm that permanently – only Microsoft could. I’m gonna be honest here: they’re not putting that level of testing into Cumulative Updates, and it’s not fair to ask them to do it, either. They would have to test upgrading from every possible CU – and that’s just not realistic.
  
  The fact that they’re not putting this level of testing in is exactly why we have to not upgrade every production server at the same time. You simply wouldn’t be able to handle the aftermath when these bugs strike, and they’ve certainly been striking with a vengeance in the last couple of years.
  
  Reply
  1. Vlad Kirov
    
    November 22, 2021 at 7:21 pm
    
    Hi Brent,
    I just stepped in to this trap while was updating my cluster environments of SQLs 2017 (CU19 to CU24) and 2019(CU11 to CU13).
    in our test all worked well, but on prod 50% failed to failover. now we are investigating what is the problem.
    this is only to share the info (and my frustration).
    never expect the patches to magically work:-)
    
    Reply
Steve Pearson

June 30, 2021 at 6:10 pm

Are security updates cumulative? For example, SQL 2014 SP3 has 2 consecutive security updates listed:
12.0.6433.1 and 12.0.6372.1.

Does 12.0.6372.1 contain what was included in 12.0.6433.1? or do I need to apply both?

Reply
Mattia Nocerino VIP Student since 2021

January 20, 2022 at 8:59 am

Hi Brent! What do you think about the note in the official documentation saying:

“Mixing versions of SQL Server instances in the same AG is not supported outside of a rolling upgrade and should not exist in that state for extended periods of time as the upgrade should take place quickly. The other option for upgrading SQL Server 2016 and later is through the use of a distributed availability group.”

https://docs.microsoft.com/en-us/sql/database-engine/availability-groups/windows/upgrading-always-on-availability-group-replica-instances?view=sql-server-2017

If I understand correctly, I shouldn’t leave the one server with the old CU update, right?

Reply
1. Brent Ozar
  
  January 20, 2022 at 12:05 pm
  
  They’re talking about versions, not patch levels.
  
  Besides, even if they were talking about patch levels, the whole reason we’re doing this is that Microsoft doesn’t really support broken CUs. All they do is tell you to uninstall it from everywhere, which involves serious outages. It’s easier and safer to leave a node unpatched.
  
  Reply
  1. Mattia Nocerino VIP Student since 2021
    
    January 20, 2022 at 1:21 pm
    
    Facepalming myself right now! Sorry for the dumb comment and as always thanks for the quick feedback!
    
    Reply
Carl VIP Student since 2017

February 12, 2022 at 3:19 pm

Thanks!

Reply
Kanchana

September 30, 2022 at 7:28 am

Can we split Windows and OS patching in to 2 different cycles instead of applying both patches on same day? Is it risky to apply both together?

Reply
1. Kanchana
  
  September 30, 2022 at 7:29 am
  
  Correction: It would be Windows and SQL patching
  
  Reply
2. Brent Ozar
  
  September 30, 2022 at 2:28 pm
  
  That’s outside of the scope of this blog post.
  
  Reply
Kal

January 10, 2023 at 4:27 am

Nice article as always Brent. I was asked to test patching SQL server using SCCM. However while checking online it seems people have had trouble with patching SQL utilizing SCCM. In addition i have tested using DBATools and that works great but my organization is quite secure and wont allow me to patch remote servers using DBATools.
My question is what have you heard about patching SCCM? Is it recommended or stick to PowerShell scripts?

Reply
1. Brent Ozar
  
  January 10, 2023 at 1:19 pm
  
  Thanks, glad you liked the post.
  
  This post is about how to patch SQL Server.
  
  If I wanted to write a post about how *not* to patch SQL Server, I’d have written about SCCM, heh.
  
  Reply
SM

January 25, 2023 at 10:40 pm

Hi Brent,

Can we apply SQL Server 2014 SP3 CU4 directly on top of SQL Server 2014 CU11 or it should be to apply SQL Server 2014 SP2 then SQL Server 2014 SP3?

Thank you

Reply
1. SM
  
  January 25, 2023 at 10:42 pm
  
  correction – on top of SQL Serve 2014 SP1 CU11
  
  Reply
2. Brent Ozar
  
  January 25, 2023 at 10:49 pm
  
  Just apply the most recent service pack, and then the most recent cumulative update for that service pack.
  
  Reply
  1. SM
    
    February 2, 2023 at 2:12 pm
    
    Thank you very much Brent!
    
    Reply
Tara Chandra

February 23, 2024 at 11:38 am

Hi brent , I am having sql server 2017 enterprise edition in our environment running on RTM. Here we are having DAG configured, and it’s critical environment since exchange is running on database. and the thing is i have been assigned to apply patches on my DAG servers. here the thing is patch will be applied first time on the servers. so which CU should i apply. I am thinking to go with CU30. and a little afraid of any unexpected behavior. so could you please advise, what patch i can apply first time?

Reply
1. Brent Ozar
  
  February 23, 2024 at 12:59 pm
  
  Tara – I’m a little confused because there’s nothing called a DAG in SQL Server, and Exchange doesn’t run atop SQL Server. You probably want to hire a consultant to help.
  
  Reply
Tara chandra

February 23, 2024 at 1:15 pm

I was talking about distributed availability group, exchange means exchange based applications are running. U please confirm can I apply CU30 directly on RTM version first time patching.

Reply
1. Brent Ozar
  
  February 23, 2024 at 1:17 pm
  
  Given what we’re talking about here, I’m not comfortable giving you production advice without digging more deeply into what’s happening in the environment. If you’d like my personalized help, feel free to click Consulting at the top of the screen. Hope that’s fair. Thanks!
  
  Reply
Cosmo

April 2, 2024 at 6:23 pm

Just a note about AG taken from the horse’s mouth.

Note

When you perform an in-place upgrade of a secondary replica to a newer version of SQL Server, the database inside the availability group remains in the Synchronizing / In recovery or Synchronized / In Recovery state until the availability group is manually failed over, which finishes the recovery and upgrades the database. An upgraded primary replica can no longer ship logs to any lower version secondary replica and data movement stops and no automatic failover can occur for that replica, and your availability databases are vulnerable to data loss. After you upgrade the old primary, you may need to resume synchronization. It is recommended to upgrade all secondary replicas before failing over to a replica with the new version. That way you have the option of doing a failover after the database(es) are upgraded to the new format.

This means that when working with AG that Mr. Ozar point 6. is not valid and once you failover your on new upgraded patch and muse continue the work to standup AG synch again.

All the more reason to do DEV/QA/UAT servers 1st. I personally give one month in those environments before going to PROD.

Reply
1. Brent Ozar
  
  April 2, 2024 at 6:39 pm
  
  Cosmo – this article is about patching, not upgrading. Patching refers to cumulative updates. Upgrading refers to different versions.
  
  Reply
  1. Cosmo
    
    April 3, 2024 at 2:24 pm
    
    Ok so Microsoft confuses Me by saying the following.
    
    “When upgrading a SQL Server instance that hosts an Always On availability group (AG) to a new SQL Server version, to a new SQL Server service pack or cumulative update, or when installing to a new Windows service pack or cumulative update, you can reduce downtime for the primary replica to only a single manual failover by performing a rolling upgrade”
    
    https://learn.microsoft.com/en-us/sql/database-engine/availability-groups/windows/upgrading-always-on-availability-group-replica-instances?view=sql-server-ver16
    
    I agree with you an upgrade to a new version of SQL server is not the same as a CU of an existing version. Yet it looks like MS says if the current version increments forward the build that an upgrade ensues on failover to the secondary that had the CU and build applied.
    
    I have never done this out of fear of all hell breaking lose in AG but what would happen if I applied a CU to secondary failover to it so DB now is upgraded to that build. Then not apply CU to old primary now an offline AG secondary and try to fail back to it? My guess is a big fat error and cant be done. so how do I then deal with that? outside of calling MS support I can’t see a way to back out the CU from My now primary to fail back to existing build and rolling back.
    
    I hope not to confuse terms and hopefully you gather the threads of my point.
    
    Cosmo
    Meet Mr. OZAR year he released/demoed Blitz in Seattle at PASS
    
    Reply
    1. Brent Ozar
      
      April 3, 2024 at 2:33 pm
      
      Cosmo – I’m sorry that Microsoft’s confusing you, but for questions on what Microsoft wrote, your best bet would be to talk to them. Fair? Thanks!
      
      Reply
Sanjeev Kumar

July 3, 2024 at 9:54 pm

Does CU update in 2019 contains all the previous CU and GDR updates?

Reply
1. Brent Ozar
  
  July 4, 2024 at 10:52 am
  
  Yes, that’s why they’re called Cumulative Updates. Cheers!
  
  Reply
Laurie Britten VIP Student since 2019

August 6, 2024 at 4:58 pm

We have SQL Server 2019 RTM. In July our patch team applied a GDR patch and our version is now 15.0.2116.2. We had planned to install CU25. Can this still be done now that a GDR patch was applied ?

Reply
1. Brent Ozar
  
  August 7, 2024 at 11:56 am
  
  Yep!
  
  Reply
Mike Button

February 26, 2025 at 9:09 pm

Hi Brent, we have several clusters, each cluster comprised of three nodes with Active/Passive/Quorum nodes.
Our Security team have requested monthly patching.
So, when having patched the quorum & passive nodes, is there a recommended minimum time period (hours, days, weeks ?) before failing over the patched Passive node to be the Active node ?

Reply
1. Brent Ozar
  
  February 26, 2025 at 9:19 pm
  
  Hi! If you need personal advice on production systems, feel free to click Consulting at the top of the page.
  
  Reply

How to Patch SQL Server

Get my new posts by email

Keep digging

54 comments

Leave a comment Cancel reply