SQL Server Why is UNION faster than an OR statement

9avjhtql  于 2023-10-15  发布在  其他
关注(0)|答案(2)|浏览(130)

I have a problem where I need to find records that either have a measurement that matches a value, or do not have that measurement at all. I solved that problem with three or four different approaches, using JOIN s, using NOT IN and using NOT EXISTS . However, the query ended up being extremely slow every time. I then tried splitting the query in two, and they both run very fast (three seconds). But combining the queries using OR takes more than five minutes.

Reading on SO I tried UNION , which is very fast, but very inconvenient for the script I am using.

So two questions:

  1. Why is UNION so much faster? (Or why is OR so slow)?
  2. Is there any way I can force SQL Server to use a different approach for the OR statement that is fast?
1hdlvixo

1hdlvixo1#

The reason is that using OR in a query will often cause the Query Optimizer to abandon use of index seeks and revert to scans. If you look at the execution plans for your two queries, you'll most likely see scans where you are using the OR and seeks where you are using the UNION . Without seeing your query it's not really possible to give you any ideas on how you might be able to restructure the OR condition. But you may find that inserting the rows into a temporary table and joining on to it may yield a positive result.

Also, it is generally best to use UNION ALL rather than UNION if you want all results, as you remove the cost of row-matching.

e5njpo68

e5njpo682#

There is currently no way in SQL Server to force a UNION execution plan if no UNION statement was used. If the only difference between the two parts is the WHERE clause, create a view with the complex query. The UNION query then becomes very simple:

SELECT * FROM dbo.MyView WHERE <cond1>
UNION ALL
SELECT * FROM dbo.MyView WHERE <cond2>

It is important to use UNION ALL in this context when ever possible. If you just use UNION SQL Server has to filter out duplicate rows, which requires an expensive sort operation in most cases.

相关问题