How to Think Like the SQL Server Engine: So Index Seeks are Great, Right?

In our last episode, we added a nonclustered index, and now even though this query has a filter and an ORDER BY, it hardly did any work at all – just a lightweight index seek:

Ah, index seeks – nothing better than that, right?

Well, not exactly. Let’s make one tweak to the WHERE clause and find all of the people who accessed Stack Overflow going all the way back to the days of programming your ox team:

We still get an index “seek”:

Yes, it’s a seek, even though it’s reading every single row in the index. You probably thought that would be a scan, but it’s not.

Index seek doesn’t mean lightweight.

Index seek only means SQL Server is going to jump to a specific point in the index and start reading.

It has absolutely nothing to do with how few or how many rows you’re going to read. You can do an index seek and read just one row, or read all of the rows in the table.

If you’ve been looking at an execution plan and going, “seek seek seek, all seeks here, guess it can’t get any better than that,” then you’ve been missing out on something pretty big.

Index scan doesn’t mean reading the entire table, either.

Let’s get any 10 users:

We get a clustered index scan – meaning, it’s reading the entire table, right?

Nope – SQL Server starts scanning the table, but it can bail as soon as it’s found enough rows to deliver your query results. Hover your mouse over the clustered index scan, and it says “Number of Rows Read: 10.” If your number of rows read equals the number of rows your query outputted, that’s great! I can’t possibly design a better index to support this query. A clustered index scan is absolutely perfect here.

Here’s what seeks and scans really mean:

  • Index seek: “I’m going to jump to a specific point and start reading.”
  • Index scan: “I’m going to start at either end of the table and start reading.”

Here’s what they don’t mean:

  • How many rows we’re going to read
  • Whether we’re seeking on all of the columns in the index, or just the first one (more on that later)
  • Whether the index is good or bad for this query

I can’t emphasize this enough: you can’t make index judgements based purely on seeks or scans. This is especially true when you start looking at index usage metrics from tools like sp_BlitzIndex: we’re simply reporting how the index is being accessed. You can’t say, “This index is getting seeks, so it’s perfect,” nor can you say, “This index is getting scans, so the keys must be in the wrong order.” You’ve gotta dig deeper than that to see how much data is moving around – and that just isn’t something the DMVs track. You’ve gotta look at query plans closely to decipher what’s going on.

Next, let’s add some columns to our query and force SQL Server to make a tough decision.

Previous Post
SQL Server 2019 Standard Edition Feature Limitations Are Out
Next Post
How to Think Like the SQL Server Engine: What’s a Key Lookup?

5 Comments. Leave new

  • Hi Brent, One problem I always have and I think this article highlights it, is reading the info the plan pops up(the big yellowy box). Do you have another article that goes over what we should be looking for in them? I read and have read lots on Pref tuning, but when I try to do it myself I usually go “Ahhhh! too much info!”.

    Reply
  • Did this change between SQL 2012 and 2016? I always remember seeks looking at less rows/pages than scans

    Reply
    • Alen – nope.

      Reply
    • ITRW, seeks *often* but not always read fewer rows than scans do. Seeks are usually used when you do a lookup against indexed fields (as long as your values include the first item in the index), and queries with seeks are often limited in scope, such as selecting a page’s worth of Order Detail rows from a table.

      Index scans, OTOH, are most often seen with TOP, or when SELECTing based on non-indexed fields. For example, when looking up the most expensive things ordered in your Order Detail table, it is unlikely that you have an index where the net cost is the first item in the index. In this case, you will frequently see a clustered index scan performed.

      Also, TOP selects on indexed fields are index scans. For example, you might need to know the last 100 orders approved. For this, your query might be SELECT TOP 100 Orders.OrderId FROM Orders ORDER BY Orders.ApprovalDate DESC. Assuming you have an index by ApprovalDate , this would go to the maximum value of ApprovalDate and count down 100 rows based on the index, just like Brent’s query in his examples.

      And its been this way for as long as I can recall… not as long as Brent, perhaps, but since SQL 7.0 at least.

      Reply

Leave a Reply

Your email address will not be published. Required fields are marked *

Fill out this field
Fill out this field
Please enter a valid email address.

Menu
{"cart_token":"","hash":"","cart_data":""}