Another Hidden Parallelism Killer: Scalar UDFs In Check Constraints

Last Updated March 12, 2018

Every single time

Really. Every single time. It started off kind of funny. Scalar functions in queries: no parallelism. Scalar functions in computed columns: no parallelism, even if you’re not selecting the computed column. Every time I think of a place where someone could stick a scalar function into some SQL, it ends up killing parallelism. Now it’s just sad.

This is (hopefully. HOPEFULLY.) a less common scenario, since uh… I know most of you aren’t actually using any constraints. So there’s that! Developer laziness might be a saving grace here. But if you read the title, you know what’s coming. Here’s a quick example.

USE tempdb;
SET NOCOUNT ON;

SELECT TOP 10000
       ROW_NUMBER() OVER ( ORDER BY ( SELECT NULL )) AS ID, DATEADD(MINUTE, m.message_id, SYSDATETIME()) AS SomeDate
INTO   dbo.constraint_test
FROM   sys.messages AS m, sys.messages AS m2;
GO

CREATE FUNCTION dbo.DateCheck ( @d DATETIME2(7))
RETURNS BIT
WITH RETURNS NULL ON NULL INPUT
AS
    BEGIN
        DECLARE @Out BIT;
        SELECT @Out = CASE WHEN @d < DATEADD(DAY, 30, SYSDATETIME()) THEN 1 ELSE 0 END;
        RETURN @Out;
    END;
GO

ALTER TABLE dbo.constraint_test
ADD CONSTRAINT ck_cc_dt CHECK ( dbo.DateCheck(SomeDate) = 1 );

SELECT *
FROM   dbo.constraint_test
OPTION ( QUERYTRACEON 8649, MAXDOP 0, RECOMPILE );

USE tempdb;

SET NOCOUNT ON;

SELECT TOP 10000

ROW_NUMBER() OVER ( ORDER BY ( SELECT NULL )) AS ID, DATEADD(MINUTE, m.message_id, SYSDATETIME()) AS SomeDate

INTO dbo.constraint_test

FROM sys.messages AS m, sys.messages AS m2;

CREATE FUNCTION dbo.DateCheck ( @d DATETIME2(7))

RETURNS BIT

WITH RETURNS NULL ON NULL INPUT

BEGIN

DECLARE @Out BIT;

SELECT @Out = CASE WHEN @d < DATEADD(DAY, 30, SYSDATETIME()) THEN 1 ELSE 0 END;

RETURN @Out;

END;

ALTER TABLE dbo.constraint_test

ADD CONSTRAINT ck_cc_dt CHECK ( dbo.DateCheck(SomeDate) = 1 );

SELECT *

FROM dbo.constraint_test

OPTION ( QUERYTRACEON 8649, MAXDOP 0, RECOMPILE );

Parallelism appears to be rejected for maintenance operations as well as queries, just like with computed columns.

Interestingly, if we look in the plan XML (the execution plan itself just confirms that the query didn’t go parallel) we can see SQL tried to get a parallel plan, but couldn’t.

There’s a short list of possible reasons for plans not going parallel here from a while back. A quick search didn’t turn up a newer or more complete list.

Check yourself, etc. and so forth

How do you know if this is happening to you? Here’s a simple query to look at constraint definitions and search them for function names. This query is dumb and ugly, but my wife is staring at me because it’s 5:30 on a Saturday and I’m supposed to be getting ready. If you have a better idea, feel free to share in the comments.

WITH c1
    AS
     (
         SELECT name, definition
         FROM   sys.check_constraints
         UNION ALL
         SELECT name, definition
         FROM   sys.default_constraints
     )
SELECT *
FROM   c1, sys.objects AS o
WHERE  o.type IN ( 'FN', 'TF' )
AND    c1.definition LIKE '%' + o.name + '%';

WITH c1

(

SELECT name, definition

FROM sys.check_constraints

UNION ALL

SELECT name, definition

FROM sys.default_constraints

)

SELECT *

FROM c1, sys.objects AS o

WHERE o.type IN ( 'FN', 'TF' )

AND c1.definition LIKE '%' + o.name + '%';

Thanks for reading!

I’m on the RunAsRadio Podcast

Breaking News, Literally: SQL CLR Support Removed from Azure SQL DB

15 Comments. Leave new

Chris F
April 7, 2016 2:03 pm

This will get a list of all objects (and their types) that reference a function:

SELECT [ObjectName] = [obj_constraint].[name],
[ObjectType] = [obj_constraint].[type_desc],
[ReferencedFunction] = [obj_function].[name],
[ReferencedType] = [obj_function].[type_desc]
FROM [sys].[objects] [obj_constraint]
INNER JOIN [sys].[sql_expression_dependencies] [sed] ON [sed].[referencing_id] = [obj_constraint].[object_id]
INNER JOIN [sys].[objects] [obj_function] ON [obj_function].[object_id] = [sed].[referenced_id]
WHERE [obj_function].[type] IN (‘FN’, ‘TF’);

This will also catch computed columns, views, and stored procedures that reference functions. It baffles me that a default constraint screws with the query plan on a select. That smells like a bug to me.

Reply
- Isaac Koss
  April 8, 2016 9:54 am
  
  I was also wondering about the constraint on SELECT. Can anyone share some information on this? Thanks!
  
  Reply
Dylan Bacon
April 8, 2016 3:29 pm

“We couldn’t make the plan parallel because we couldn’t make the plan parallel.”

Brilliant.

Reply
Robert
April 9, 2016 9:24 pm

This article does a good job of demonstrating something to look for in a single query plan. It would be very beneficial it it also provided a means to identify if the condition exists in the plan cache. The query below will provide the query plans from the plan cache where this exist. Unfortunately, it also shows query plans that will should not run parallel, such as simple inserts or updates. It would be grate to be able to tweak the query below to remove such query plans.

WITH XMLNAMESPACES
(DEFAULT ‘http://schemas.microsoft.com/sqlserver/2004/07/showplan’)
SELECT
query_plan AS CompleteQueryPlan
, n.value(‘(@StatementText)[1]’, ‘varchar(4000)’) AS StatementText
, n.value(‘(@StatementOptmLevel)[1]’, ‘varchar(25)’) AS StatementOptimizationLevel
, n.value(‘(QueryPlan/@NonParallelPlanReason)[1]’, ‘nvarchar(128)’) AS NonParallelPlanReason
, ecp.usecounts
, ecp.size_in_bytes
FROM sys.dm_exec_cached_plans AS ecp
CROSS APPLY sys.dm_exec_query_plan(plan_handle) AS eqp
CROSS APPLY query_plan.nodes(‘/ShowPlanXML/BatchSequence/Batch/Statements/StmtSimple’) AS qn(n)
WHERE n.query(‘.’).exist(‘//QueryPlan[@NonParallelPlanReason]’) = 1

Reply
- Brent Ozar
  April 10, 2016 4:13 am
  
  Robert – yeah, just be careful because this query can be pretty slow to run. For bonus points, you might want to filter above a certain query cost, or logical reads, or duration, or execution count. You may not want to bother with smaller queries.
  
  Reply
  - Robert
    April 10, 2016 12:00 pm
    
    I missed the obvious. I need to filter out all the queries where the cost is below the cost threshold for parallelism. Plan cache queries are generally slow. I’ve found that the better you filter the XML, the more likely you will be able to execute the query within an acceptable execution time.
    
    Below is a modified query that returns the plans where the cost is greater that the cost threshold for parallelism.
    
    DECLARE @Cost float
    
    SELECT @Cost = CAST([value_in_use] AS float)
    FROM [sys].[configurations]
    WHERE [name] = ‘cost threshold for parallelism’;
    
    WITH XMLNAMESPACES
    (DEFAULT ‘http://schemas.microsoft.com/sqlserver/2004/07/showplan’)
    SELECT
    query_plan AS CompleteQueryPlan
    , n.value(‘(@StatementText)[1]’, ‘varchar(4000)’) AS StatementText
    , n.value(‘(@StatementOptmLevel)[1]’, ‘varchar(25)’) AS StatementOptimizationLevel
    , n.value(‘(QueryPlan/@NonParallelPlanReason)[1]’, ‘nvarchar(128)’) AS NonParallelPlanReason
    , n.value(‘@StatementSubTreeCost’,’float’) AS StatementSubTreeCost
    , ecp.usecounts
    , ecp.size_in_bytes
    FROM sys.dm_exec_cached_plans AS ecp
    CROSS APPLY sys.dm_exec_query_plan(plan_handle) AS eqp
    CROSS APPLY query_plan.nodes(‘/ShowPlanXML/BatchSequence/Batch/Statements/StmtSimple’) AS qn(n)
    WHERE n.query(‘.’).exist(‘//QueryPlan[@NonParallelPlanReason]’) = 1
    AND n.value(‘@StatementSubTreeCost’,’float’) > @Cost
    
    Reply
    - JD
      May 10, 2022 2:04 pm
      
      Note, it also includes scalar CLR functions, which aren’t necessarily an issue, because they are an exception and do not prevent parallelism from occurring.
      
      Reply
      - JD
        May 10, 2022 2:31 pm
        
        Scratch that, it does appear to filter out CLR functions.
Bryan Rebok
June 20, 2017 8:24 am

Eric,

Some more info on this – it appears that having a column with an untrusted check constraint that calls a scalar UDF does NOT force a plan for a query that references that column to run serially.

Reply
- Erik Darling
  June 20, 2017 10:23 am
  
  Weird, Brian. I haven’t come across that.
  
  Reply
krishnaroopa
September 11, 2018 11:37 pm

Hi Brent,

Its a nice article. I understand that when using a udf in a check constraint it affects the query performance.
But why check constraint comes in when SELECT?
Also, we have scenario like this, In the master table, two columns , code char(4), location int, both form the primary key whereas in the child table, we have only code column used. In this scenario, i am forced to use UDF with check constraint.. is there any other means to enforce integrity?

Thanks for your time.
KRS

Reply
- Zddn?k Novák
  August 22, 2022 11:20 am
  
  Vhen building a plan, SQL server considers also check constraints (if column is, for example “CHECK Value>0” and you have “LEFT JOIN table ON Value<0", the plan can count on the fact, that this join will return 0 rows. IF the constraint is WITH CHECK, of course.
  
  So it seems that if the constraint is WITH CHECK, SQL Server will consider it and that is how it encounters the scalar function.
  
  Reply
ANIL K RAGHAVAN
February 2, 2022 1:56 pm

I tested this on a SQL Server 2019 instance, with compat level 150 and 130
Details of the test can be shared if anyone is interested

Long story short ..
CouldNotGenerateValidParallelPlan was the result when selecting from the table alone
If the table had a computed column calling a Scalar UDF or a check constraint calling a Scalar UDF

However, when this table was joined to other tables in a different query, only when there was a computed column calling a Scalar UDF did the query not go parallel. It consistently went parallel even with the Check Constraint in this test (table joined to other tables)

#confusedasheck

Reply
- Brent Ozar
  February 2, 2022 1:57 pm
  
  If you have questions about code or execution plans, put together a repro query and share it as a question at https://dba.stackexchange.com. Include enough details that other people can reproduce what you’re seeing on their servers.
  
  Reply
  - ANIL K RAGHAVAN
    February 2, 2022 2:11 pm
    
    Done
    
    Reply