Speed Tips For SQL Server SELECT Statements

You can code a SQL SELECT statement in a number of ways to get the same results, but some versions of the same query may perform better than others. In this article we will look at ways to exploit this.

Here is a query that I cut-and-pasted straight out of SQL 7 Books Online. The query runs in the Northwind database and is designed to pull out the maximum unit price for each order in the database.

SELECT Ord.OrderID, Ord.OrderDate,
       (SELECT MAX(OrdDet.UnitPrice)
        FROM Northwind.dbo.[Order Details] AS OrdDet
        WHERE Ord.OrderID = OrdDet.OrderID) AS MaxUnitPrice
FROM Northwind.dbo.Orders AS Ord


This type of query is called a correlated subquery. You can see that there are two SELECT statements: the ‘main’ one (SELECT Ord.OrderID, Ord.OrderDate), which selects the order ID and date from the Orders table, and the ‘sub’ query (the parenthesized SELECT MAX(OrdDet.UnitPrice) block), which selects the maximum unit price for each order.

The ‘sub’ query is, at least logically, run once for each row that the main query returns, and this repeated access to the [Order Details] table can be fairly inefficient.
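
To picture what that per-row evaluation involves, the sketch below shows the inner query as it would be evaluated for a single row of the main query. The order ID 10248 is only an illustrative value (it is assumed to exist in your copy of Northwind); logically, the correlated version repeats this lookup for every row in Orders.

```sql
-- The subquery as evaluated for ONE outer row.
-- 10248 is an example OrderID chosen for this sketch, not taken from the article.
SELECT MAX(OrdDet.UnitPrice) AS MaxUnitPrice
FROM Northwind.dbo.[Order Details] AS OrdDet
WHERE OrdDet.OrderID = 10248
```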

Books Online goes on to say that queries like the one above can usually be rewritten as a straightforward join. The example below uses an INNER JOIN between the Orders and [Order Details] tables, together with the MAX() function and a GROUP BY clause, to produce exactly the same results, but more efficiently.

SELECT Ord.OrderID, Ord.OrderDate,
       MAX(OrdDet.UnitPrice) AS maxUnitPrice
FROM Northwind.dbo.[Order Details] AS OrdDet
INNER JOIN Northwind.dbo.Orders AS Ord
    ON Ord.OrderID = OrdDet.OrderID
GROUP BY Ord.OrderID, Ord.OrderDate

Although both queries return the same data, Query Analyzer indicated that the second version used around 40% fewer SQL Server resources than the first, so no prizes for guessing which is the preferred option. However, in some cases we can use a third method to gain an even greater performance improvement.
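
One way to check a comparison like this yourself is to switch on SQL Server's I/O and timing statistics, run both versions, and compare the reads and CPU time reported in the messages output. This is only a minimal sketch; it may not be how the figure above was obtained, and the numbers you see will depend on your server, version, and data.

```sql
-- Report reads and CPU/elapsed time for each statement that follows
SET STATISTICS IO ON
SET STATISTICS TIME ON

-- Version 1: correlated subquery
SELECT Ord.OrderID, Ord.OrderDate,
       (SELECT MAX(OrdDet.UnitPrice)
        FROM Northwind.dbo.[Order Details] AS OrdDet
        WHERE Ord.OrderID = OrdDet.OrderID) AS MaxUnitPrice
FROM Northwind.dbo.Orders AS Ord

-- Version 2: INNER JOIN with GROUP BY
SELECT Ord.OrderID, Ord.OrderDate,
       MAX(OrdDet.UnitPrice) AS maxUnitPrice
FROM Northwind.dbo.[Order Details] AS OrdDet
INNER JOIN Northwind.dbo.Orders AS Ord
    ON Ord.OrderID = OrdDet.OrderID
GROUP BY Ord.OrderID, Ord.OrderDate

-- Switch the statistics output back off
SET STATISTICS IO OFF
SET STATISTICS TIME OFF
```

Comparing the logical reads against [Order Details] for the two statements gives a rough idea of how much work each version does.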

Derived Tables

Using a derived table is in effect like using a temporary table, without the hassle of specifically creating and referring to one. I have re-coded the above BOL query to use derived tables in the example below:

SELECT Ord.OrderID, Ord.OrderDate, maxUnitPrice
FROM Northwind.dbo.Orders AS Ord
INNER JOIN
(
    SELECT OrderID,
           MAX(UnitPrice) AS maxUnitPrice
    FROM Northwind.dbo.[Order Details]
    GROUP BY OrderID
) AS OrdDet
    ON OrdDet.OrderID = Ord.OrderID
ORDER BY Ord.OrderID DESC, Ord.OrderDate, maxUnitPrice


The parenthesized SELECT causes the SQL Server optimizer to generate a notional (or derived) table called OrdDet for the duration of the query. The derived table takes up much less space than the original [Order Details] table, because it contains only two columns and only one row for each order. Because of this, I expected my ‘derived table’ version of the query to run even faster than the INNER JOIN version. When I checked the execution plan for the derived table version against that of the ‘join’ version to see what sort of improvement I got, the results came out … exactly the same!
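
For reference, the explicit temporary-table approach that a derived table saves you from writing might look like the sketch below. The temp table name #MaxPrices and the alias Tmp are hypothetical, chosen only for this illustration; no claim is made here about how this version performs.

```sql
-- Step 1: materialize one row per order with its maximum unit price.
-- #MaxPrices is a hypothetical temp table name used only in this sketch.
SELECT OrderID,
       MAX(UnitPrice) AS maxUnitPrice
INTO #MaxPrices
FROM Northwind.dbo.[Order Details]
GROUP BY OrderID

-- Step 2: join Orders to the temp table instead of a derived table
SELECT Ord.OrderID, Ord.OrderDate, Tmp.maxUnitPrice
FROM Northwind.dbo.Orders AS Ord
INNER JOIN #MaxPrices AS Tmp
    ON Tmp.OrderID = Ord.OrderID
ORDER BY Ord.OrderID DESC, Ord.OrderDate, Tmp.maxUnitPrice

-- Step 3: clean up explicitly, which the derived-table version never needs
DROP TABLE #MaxPrices
```

The derived-table query does the same job in a single statement, with nothing to create or drop.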

Continues…

One Response to “Speed Tips For SQL Server SELECT Statements”

  1. Hi Neil,

    Nice article, as it gave me some more ideas about writing queries. But I am surprised to see that in the first query, GROUP BY OrderID, OrderDate works fine.
    Can you explain how this works, given that there could be more than one date for one ID?
