How Computed Columns Can Cause Blocking

Last Updated May 1, 2021

The short story: when you add a computed column that references another table, like with a scalar user-defined function, you can end up causing concurrency problems even when people didn’t really want to go see that other table, and that table is locked by someone else.

Here’s my query:

SELECT Id, u.DisplayName, u.BadgeCount
FROM dbo.Users AS u
WHERE u.BadgeCount >= 500;

1

2

3

SELECT Id, u.DisplayName, u.BadgeCount

FROM dbo.Users AS u

WHERE u.BadgeCount >= 500;

Here’s what I occasionally see when the query runs, using sp_BlitzWho:

Block Party

I’m selecting data from users.

It’s being blocked by an insert into Badges.

There’s weird code running preventing my query from finishing.

What’s The Problem?

Someone had tried to be clever. Looking at the code running, if you’ve been practicing SQL Server for a while, usually means one thing.

A Scalar Valued Function was running!

In this case, here’s what it looked like:

CREATE OR ALTER FUNCTION dbo.BadIdea ( @uid INT )
RETURNS BIGINT
WITH RETURNS NULL ON NULL INPUT, SCHEMABINDING
AS
    BEGIN
        DECLARE @BCount BIGINT;
        SELECT   @BCount = COUNT_BIG(*)
        FROM     dbo.Badges AS b
        WHERE    b.UserId = @uid
        GROUP BY b.UserId;
        RETURN @BCount;
    END;

1

2

3

4

5

6

7

8

9

10

11

12

CREATE OR ALTER FUNCTION dbo.BadIdea ( @uid INT )

RETURNS BIGINT

WITH RETURNS NULL ON NULL INPUT, SCHEMABINDING

AS

BEGIN

DECLARE @BCount BIGINT;

SELECT @BCount = COUNT_BIG(*)

FROM dbo.Badges AS b

WHERE b.UserId = @uid

GROUP BY b.UserId;

RETURN @BCount;

END;

Someone had added that function as a computed column to the Users table:

ALTER TABLE dbo.Users ADD BadgeCount AS dbo.BadIdea(Id);

1	ALTER TABLE dbo.Users ADD BadgeCount AS dbo.BadIdea(Id);

Abhorrent

I know, I know. Bad, right? Who would ever do this, right?

We get some funny emails.

Now, in the past, I’ve written about Scalar Functions from a performance and parallelism angle here, here, and here.

I can feel the question coming: Can’t I just persist that computed column?

Nope.

Msg 4934, Level 16, State 3, Line 19
Computed column ‘BadgeCount’ in table ‘Users’ cannot be persisted because the column does user or system data access.

Or surely we can index it.

Nope.

Msg 2709, Level 16, State 1, Line 21
Column ‘BadgeCount’ in table ‘dbo.Users’ cannot be used in an index or statistics or as a partition key because it does user or system data access.

Without being able to do either of those things, our function does a couple nasty things to our query

Executes for each row returned to grab the count of badges
Forces it to run serial

Unfortunately, workarounds for the parallelism issue aren’t applicable here, since we can’t persist it.

You Can Guess…

This made all sorts of things like concurrency, locking, and blocking very tricky to figure out.

There’s an expectation that when someone writes a single-table query, their only concern should be that table. If we were to add additional joins, or additional aggregating functions as computed columns to other tables, we’d be making things even worse.

Hiding code like that in a function, and then hiding the function in a computed column may seem like a nice trick, but it doesn’t help performance, and it doesn’t make issues any more clear when you take all these new found SQL skills and run off to your Brand! New! Job! Leaving them as an exercise to the [next person].

And the workarounds weren’t any more pleasant.

We could have added a trigger on the Badges table to update the Users table whenever someone got a Badge added.
We could have had a process run every so often to recalculate Badge counts
We could have made an indexed view to pre-aggregate the data

There’s nothing wrong with any of those in theory, but they’d require a lot of extra development and testing.

Something that hadn’t been done with the computed column.

Thanks for reading!

Typoman
April 27, 2018 4:15 am

Tiny Typo in the last section: “run off to a your Brand! New! Job!”. “a” or “your”?
Maybe both cause you’re excited about BRAND NEW JOB!1!

Reply
- Erik Darling
  April 27, 2018 6:01 am
  
  GET THEE BEHIND ME, TYPOMAN
  
  Reply
Scalar Function Blocking – Curated SQL
April 27, 2018 7:05 am

[…] Erik Darling notes that scalar functions can cause multi-table blocking: […]

Reply
Darren
October 9, 2019 12:15 pm

I’m troubleshooting a case of “ghost blocking”, and don’t see any of the usual suspects, or even the _un_usual suspects. However, one of the tables involved is _used_ in a scalar function…would that have the same effect? I can’t see how, but I’m grasping at straws…
Also, I realize that Erik has since moved on, but desperate times, etc.

Reply
- Brent Ozar
  October 9, 2019 1:14 pm
  
  Hi – for questions, your best bet is a Q&A site like https://dba.stackexchange.com.
  
  Reply