Stats Week: Only Updating Statistics With Ola Hallengren’s Scripts

Last Updated June 17, 2017

Index Maintenance, Ola Hallengren's Database Maintenance Scripts

I hate rebuilding indexes

There. I said it. It’s not fun. I don’t care all that much for reorgs, either. They’re less intrusive, but man, that LOB compaction stuff can really be time consuming. What I do like is updating statistics. Doing that can be the kick in the bad plan pants that you need to get things running smoothly again.

I also really like Ola Hallengren’s free scripts for all your DBA broom and dustpan needs. Backups, DBCC CHECKDB, and Index and Statistics maintenance. Recently I was trying to only update statistics, and I found it a little trickier than I first imagined. So tricky, in fact, that I emailed Ola, and got a response that I printed and framed. Yes, the frame is made out of hearts. So what?

What was tricky about it?

Well, the IndexOptimize stored procedure has default values built in for index maintenance. This isn’t a bad thing, and I could have altered the stored procedure, but that would be mean. I set about trying to figure out how to get it to work on my own.

First, I tried only passing in statistics parameters.

EXEC [master].[dbo].[IndexOptimize]
    @Databases = N'USER_DATABASES' ,
    @UpdateStatistics = N'ALL' ,
    @OnlyModifiedStatistics = N'Y' ,
    @LogToTable = N'Y';

EXEC [master].[dbo].[IndexOptimize]

@Databases = N'USER_DATABASES' ,

@UpdateStatistics = N'ALL' ,

@OnlyModifiedStatistics = N'Y' ,

@LogToTable = N'Y';

But because of the default values, it would also perform index maintenance. Sad face. So I tried being clever. Being clever gets you nowhere. What are the odds any index would be 100% fragmented? I mean, not even GUIDs… Okay, maybe GUIDs.

EXEC [master].[dbo].[IndexOptimize]
    @Databases = N'USER_DATABASES' ,
    @FragmentationMedium = N'INDEX_REORGANIZE' ,
    @FragmentationHigh = N'INDEX_REORGANIZE' ,
    @FragmentationLevel1 = 100 ,
    @FragmentationLevel2 = 100 ,
    @UpdateStatistics = N'ALL' ,
    @OnlyModifiedStatistics = N'Y' ,
    @LogToTable = N'Y';

EXEC [master].[dbo].[IndexOptimize]

@Databases = N'USER_DATABASES' ,

@FragmentationMedium = N'INDEX_REORGANIZE' ,

@FragmentationHigh = N'INDEX_REORGANIZE' ,

@FragmentationLevel1 = 100 ,

@FragmentationLevel2 = 100 ,

@UpdateStatistics = N'ALL' ,

@OnlyModifiedStatistics = N'Y' ,

@LogToTable = N'Y';

But This throws an error. Why? Well, two reasons. First, 100 isn’t valid here, and second, you can’t have the same fragmentation level twice. It would screw up how commands get processed, and the routine wouldn’t know whether to use @FragmentationMedium, or @FragmentationHigh. This makes sense.

Okay, so I can’t use 100, and I can’t set them both to 99. What to do? Let’s bring another parameter in: @PageCountLevel.

EXEC [master].[dbo].[IndexOptimize]
    @Databases = N'USER_DATABASES' ,
    @FragmentationMedium = N'INDEX_REORGANIZE' ,
    @FragmentationHigh = N'INDEX_REORGANIZE' ,
    @FragmentationLevel1 = 98 ,
    @FragmentationLevel2 = 99 ,
    @UpdateStatistics = N'ALL' ,
    @OnlyModifiedStatistics = N'Y' ,
    @PageCountLevel = 2147483647,
    @LogToTable = N'Y';

EXEC [master].[dbo].[IndexOptimize]

@Databases = N'USER_DATABASES' ,

@FragmentationMedium = N'INDEX_REORGANIZE' ,

@FragmentationHigh = N'INDEX_REORGANIZE' ,

@FragmentationLevel1 = 98 ,

@FragmentationLevel2 = 99 ,

@UpdateStatistics = N'ALL' ,

@OnlyModifiedStatistics = N'Y' ,

@PageCountLevel = 2147483647,

@LogToTable = N'Y';

This seems safe, but it’s still not 100%. Even with the integer maximum passed in for the page count, it still felt hacky. Hackish. Higgity hack. The other part of the equation is that I don’t even want this thing THINKING about indexes. It will still look for indexes that meet these requirements. If your tables are big, you know, sys.dm_db_index_physical_stats can take foreeeeeeeeeeeeeeeeeeeeeeeeeeeever to run. That seems wasteful, if I’m not going to actually do anything with the information.

Hola, Ola

This is where I emailed Ola for advice. He responded pretty quickly, and here’s how you run stats only updates.

EXECUTE [dbo].[IndexOptimize]
    @Databases = 'USER_DATABASES' ,
    @FragmentationLow = NULL ,
    @FragmentationMedium = NULL ,
    @FragmentationHigh = NULL ,
    @UpdateStatistics = 'ALL' ,
    @OnlyModifiedStatistics = N'Y' ,
    @LogToTable = N'Y';

EXECUTE [dbo].[IndexOptimize]

@Databases = 'USER_DATABASES' ,

@FragmentationLow = NULL ,

@FragmentationMedium = NULL ,

@FragmentationHigh = NULL ,

@UpdateStatistics = 'ALL' ,

@OnlyModifiedStatistics = N'Y' ,

@LogToTable = N'Y';

Moral of the story

NULLs aren’t all bad! Sometimes they can be helpful. Other times, developers.

Thanks for reading!

Brent says: Subtitle: How many DBAs does it take to think of NULL as a usable option? Seriously, we all banged our heads against this one in the company chat room.

Breaking News: Query Store in All Editions of SQL Server 2016

[Video] Office Hours 2016/04/20 – Now With Transcriptions

46 Comments. Leave new

Ron Dameron
April 22, 2016 10:05 am

Oh, geez. This deserves a bazillion tweets just for the code to update stats only using Ola’s scripts.

Reply
- Erik Darling
  April 22, 2016 11:47 am
  
  I’M WAITING
  
  Reply
Zhunya
April 22, 2016 10:11 am

I can’t unseen his post =)

Reply
Bill Hughes
April 22, 2016 10:26 am

I’ve added this to my growing list of scripts that I hope someday to make use of. Thanks Erik!

Reply
- Erik Darling
  April 22, 2016 11:47 am
  
  Don’t thank me, thank Ola!
  
  Reply
Isaac Koss
April 22, 2016 10:49 am

Erik. Can you comment on some of the common cases where you would want to update Stats much more often than reorganize and/or rebuild indexes? Thanks!

Reply
- Erik Darling
  April 22, 2016 11:49 am
  
  It would be easier to comment on the only two scenarios that I might prefer to reorganize or rebuild indexes: at 40% and 80% fragmentation, respectively.
  
  Reply
Scott Caldwell
April 22, 2016 3:16 pm

Ola’s documentation is really good. It’s very cut n’ paste worthy. This may even be one of the examples 😉

Reply
- Erik Darling
  April 22, 2016 3:25 pm
  
  But that makes for a terrible blog post 😉
  
  Reply
James
April 23, 2016 5:32 pm

Why not use setstatiscs=100, this will use a full 100% sample on the table

Reply
- Bill
  April 23, 2016 7:50 pm
  
  A fullscan sampling is good for testing just how low you make production’s sampling, without changing query plans. Beyond that, I think a fullscan sampling is overkill. Usually, SQL Server does a good job at determine an appropriate sampling for you. But in order for SQL Server to pick a good sampling rate, the column that is in a statistics/index’s first ordinal position should be the least dense (the most selective) column.
  
  Reply
  - Marcin S
    April 25, 2016 7:10 am
    
    Yeah. There were rants from some mathematical statistician about that too (on Stack Overflow iirc). Speaking shortly – 100% sample is not needed most of time. It might be useful only in some few rare situations.
    
    Reply
    - Arthur
      April 25, 2016 1:33 pm
      
      Kimberly Tripp has some scripts on determining the skew of the data inside the indexes. In my experience, a higher sample size helps deal with large amounts of skew. I would use default sample size and pick a specific larger sample size for your worst skewed indexes.
      
      Reply
Denis
April 24, 2016 11:47 am

Hello. May you suggest how to exclude the single index from performance? I have the single index but it will be rebuilt and take a lot of time. I will rebuild it later.

Reply
- Brent Ozar
  April 24, 2016 11:49 am
  
  Sure, check out the documentation, specifically the Indexes parameter:
  
  https://ola.hallengren.com/sql-server-index-and-statistics-maintenance.html
  
  Reply
Matanda
April 25, 2016 1:20 pm

I’m laughing at myself here. I use Ola’s script and when I encountered this dilemma, I came up with a separate script to just handle statistics. This solution is much straight forward.

Reply
Alin Selicean
April 27, 2016 5:37 am

Hi Brent

If I knew that this trick would have saved your tables (and your heads, of course), I would have make it a blog post 🙂

I am using Ola’s scripts this way for more than 2 years. Due to a limited time window, I also split the jobs into multiple ones based on the duration of the largest tables. This way, the largest ones have their own job and can run concurrently with the rest of them. Because of this, I also extended the info that’s stored in the XML column (ExtendedInfo) to include SPIDs so I can differentiate between the what I call threads.

It’s all in the parameters 🙂

Reply
Mark Freeman
April 27, 2016 12:31 pm

One thing I wish Ola would add to the statistics part of his code would be to update all stats on a table if he’s going to update any of them. It doesn’t take much longer, and it a lot better than updating many stats on a very large table individually.

I’ve used his MaintenanceSolution for 6 years, and it has made my work much simpler.

Reply
Dale
July 14, 2016 11:46 am

I scripted out and added an UpdateStatistics job to Ola’s script from code on his website. I’ve been using it for over a year now on all of my production servers, and it works like a charm.

TL;dr RTFM

Reply
Mark Freeman
July 14, 2016 1:22 pm

Add @StatisticsResample = N’Y’, @StatisticsSample = NULL to do RESAMPLE instead of FULL.

Reply
Santa
August 5, 2016 6:30 am

“Restoring tempdb since GETDATE()”

Oh man, this cracks me up every time. And yea, great information, thanks :D…Ola

Reply
Jake
September 2, 2016 10:42 am

Noob question. Can maintenance such as this, or DBCC be run against remote db, and log the results to a local table? The Ola’s scripts log to the command table in the target db. However, I’d like to designate the server/table to log into. You know… Centralize.

Reply
- Erik Darling
  September 2, 2016 11:04 am
  
  You’d want to look into something like Multi Server Administration, I think. Some monitoring tools, like SQL Sentry, do job scheduling, though I’m not sure you’d get the centralized results you’re looking for.
  
  Another sort of clumsy option would be to set up linked servers, and use a view to union all your CommandLog tables together. Just be careful if there are XML columns — they have to be cast to NVARCHAR(MAX).
  
  Someone out there may mention PowerShell. I don’t object to that; it’s just not my thing.
  
  Reply
  - Jake
    September 2, 2016 11:17 am
    
    Based on your answer I guess this isn’t a typical approach. If we just run Ola type scripting on each server independently. What would you suggest we do to centralize the viewing of the results of all the maintenance?
    
    Reply
    - Erik Darling
      September 2, 2016 11:28 am
      
      Let’s start by figuring out what you’re trying to get out of this!
      
      Do you just want success/failure, or are you trying to do some analysis? CommandLog only gives you start and end times for each command, which is valuable, but not necessarily actionable information.
      
      So like, I can’t imagine that knowing msdb took 2 seconds to back up is good information, but I can see knowing that it got backed up is good information.
      
      Reply
      - Jake
        September 2, 2016 11:44 am
        
        We want to implement best practice for maintenance. Then centralize notification of success. If there are any problems, I want to set off the klaxon.
        
        We don’t have a dedicated DBA resource, so the tools/help I’ve received from you guys, and many around the interwebs have been much appreciated. Keep up the good work!
      - Erik Darling
        September 2, 2016 11:59 am
        
        The best practice here would be to set up job failure alerts via email for all critical jobs, and have them sent to a distribution list email address. That way multiple people are notified and can react if one person is on vacation or whatever.
Jake
September 2, 2016 12:29 pm

So, how would you know if DBCC reports any corruption? The emails from agent would only send if the job fails, not if the job reported issues with the db, right? My guess is you would need to review the output files for reports of issues. Unless I’m missing something huge. That is where something like an SQL Sentry makes their case no?

Reply
- Erik Darling
  September 2, 2016 12:41 pm
  
  Great question!
  
  No, Ola’s job will fail with (some of) the error text if you hit a database with corruption. You will want to make sure your job logs the output to a file for you to review (this is set up under the Advanced tab of the Job Step in the Agent job properties).
  
  I would encourage you to test stuff like this out. It’s easy enough to find example corrupt databases to download.
  
  Reply
  - Jake
    September 2, 2016 12:51 pm
    
    Thanks a bunch Erik!
    
    Reply
Aravind
August 31, 2017 4:21 pm

how can we use ola’s script for update statistics for a table ? what parameter we need to add to his command..
EXECUTE dbo.IndexOptimize
@Databases = ‘USER_DATABASES’,
@FragmentationLow = NULL,
@FragmentationMedium = NULL,
@FragmentationHigh = NULL,
@UpdateStatistics = ‘ALL’

generally we use “update statistics .; ” . If you can help me out to run update statistics on a table.
thanks
AT

Reply
- Erik Darling
  August 31, 2017 4:34 pm
  
  Aravind — you really should read through the documentation for questions like that.
  
  Reply
Syed Ali
October 16, 2017 11:21 am

I think LOB Compaction is really not fragmentation. We could skip that when reorganizing indexes on large tables with 3 Billion some rows.

Reply
Mahesh Babu
October 20, 2017 4:02 pm

Any idea how the OLA program prioritizes which Tables to do/run Stats updates first?
Also it seems it run in sequence one table at a time.

Reply
- Erik Darling
  October 20, 2017 4:24 pm
  
  I’d be doing you a disservice if I told you. You should read through Ola’s code and figure it out.
  
  Reply
Duane
January 14, 2019 1:04 pm

Brent: In my opinion, your honesty in this post gives you more credability.

Reply
- Brent Ozar
  January 14, 2019 1:39 pm
  
  Thanks sir!
  
  Reply
David
June 5, 2019 10:46 am

Nothing better than reading the “tried this… didn’t fly… tried that… didn’t fly… here’s what worked”. It is powerful to see we all go through trials before finding the solution. And thank you for the solution!

Reply
Gregory Loeffler
August 22, 2019 7:16 am

I know this is an old post, but its one I recently came across and implemented on a number of servers. I’ve dug into the commandlog and came across something that I am not sure how to handle. Any ideas on best practices on the below?

Currently I have a job that will do just the index maintenance and no updating of statistics. I handle the update on stats separately based on this article. Since index reorganizations do not update stats like rebuilds do, I am left with <1% sample rates on many indexes until my update stats jobs run later in the week. If I add the @updateStatistics = 'index' and @statisticssample = 25 to my index maintenance job, will this just update the stats on the reorganized indexes and skip the rebuilt ones since they get the equivalent of a fullscan when rebuilt? i don't want to override a fullscan with a 25% sample if possible. I also don't really want to leave <1% sample on indexes until my update stats job runs. my update stats job is set to update ALL stats, not just column or index stats.

Reply
- Brent Ozar
  August 22, 2019 7:18 am
  
  Greg – for general questions, head to a Q&A site like https://dba.stackexchange.com.
  
  Reply
Niels Broertjes
May 4, 2022 12:50 pm

Great stuff! You saved me some hours trial and error. Thanks for posting this!

Reply
Shibasish Jena
August 16, 2024 8:24 pm

How could we make the stats run in parallel for multiple table at a time

Reply
- Brent Ozar
  August 16, 2024 9:27 pm
  
  By creating multiple statistics jobs that work on different tables.
  
  Reply
Peter Riis
November 11, 2024 10:00 am

When updating Statistics why not use WITH FULLSCAN?

EXECUTE dbo.IndexOptimize
@Databases = N’USER_DATABASES’
, @LogToTable = N’Y’
, @UpdateStatistics = N’ALL’
, @StatisticsSample = 100 /* UPDATE STATISTICS WITH FULLSCAN */
, @OnlyModifiedStatistics = N’Y’

Reply
- Brent Ozar
  November 13, 2024 12:04 pm
  
  You certainly can – but SQL Server scans the entire object once per statistic, so it tends to take a prohibitively long time in production environments.
  
  Reply
Hui Shi
February 11, 2025 9:26 pm

sorry but these two lines

@UpdateStatistics = ‘ALL’ ,
@OnlyModifiedStatistics = N’Y’

I found you’ve included these two in the same script. Is this correct as it was pointed out in another blog before.
Thanks

https://www.sqlskills.com/blogs/erin/updating-statistics-with-ola-hallengrens-script/

Reply