Home
TFH
Special OS/400 V5R2 Edition
Volume 11, Number 36 -- August 29, 2002

New Statistics Optimize V5R2 Database Performance


by David Morris

One of the not readily apparent features of V5R2 is a new query optimizer that uses file statistics gathered in a background process to improve SQL performance. Initially, use of those statistics by a rewritten query optimizer is limited. In most cases, the old query uses the optimizer that existed on releases before V5R2. However, IBM is phasing in use of the file statistics and new query optimizer on subsequent releases and possibly through PTFs.


A new system value--QDBFSTCCOL--identifies the types of statistics requests that a background system job--QDBFSTCCOL--will process. You can change this system value using the Work with System Value (WRKSYSVAL) and Change System Value (CHGSYSVAL) commands or you can use iSeries Navigator (formerly Operations Navigator. With iSeries Navigator, modify this system value by selecting your system, configuration and service, system values, performance, and database.

The QDBFSTCCOL system value supports four values:

  • *NONE--No statistics are gathered
  • *USER--Only user requested database file statistics collections are allowed to be processed
  • *SYSTEM--Only system requested database file statistics collections are allowed to be processed
  • *ALL--Both user and system statistics are gathered

In most cases, you will want to keep the system default of *ALL. This will tell the system to gather statistics in the background and allow processing of user-requested collections. To better control collections, you can call APIs to start, update, list, delete, or cancel statistics gathering. The following list describes the statistics-gathering APIs:

  • QDBSTCRS cancels statistics collections that have been requested, but are not yet completed or not successfully completed
  • QDBSTDS deletes existing, completed statistics collections immediately
  • QDBSTLRS lists details for not yet completed or not successfully completed statistics collection requests
  • QDBSTLDS lists additional statistics data for a single statistics collection not returned by the List Statistics Collections (QDBSTLS, QdbstListStatistics) API
  • QDBSTLS lists all of the columns and combination of columns for a given file member for which statistics are available
  • QDBSTRS requests that one or more statistics collections for a given set of columns of a database file member be created
  • QDBSTUS updates the attributes and refreshes the data of an existing single statistics collection

The main reason why you would force statistics to be built is to gather statistics for new files. One scenario might be to build statistics when new objects are moved into a production library and to delete statistics when an object is removed from production. You might also decide to force statistics to be built when data is replicated between systems.

The actual statistics gathered keep track of cardinality (the number of unique values in a field), estimated value ranges, and counts for the most common values. Initially, statistics are gathered for single columns. Future releases or PTFs may enable statistics collection for combined columns. The following table gives you a rough idea of what is collected.

Column Value Code First Row Last Row Value Count
Adams 1 1 2 2
Jones 12 37 42 6
Morris 23 68 69 2
Smith 34 200 214 15

The query optimizer uses these statistics in several ways. The most common use is for the query optimizer to use statistics to better estimate the cost associated with a particular way of implementing a query. For example, in the example above, a query for records with a name less than or equal to Adams will only return the first two rows in the underlying file. Without statistics or an index, the query optimizer often assumes a query involving less than or greater than comparisons will return half of the records in a file. In some cases, statistics can replace the need for a specialized index. In other cases, the query optimizer uses statistics to build a temporary index more quickly.

The file statistics that V5R2 gathers lay the foundation for future performance enhancements. Those enhancements have the potential to dramatically improve query performance and to reduce the amount of optimization work done by application developers. Initially, use of these statistics is so limited that their value is harder to justify. In the future, the value of these statistics will far outweigh the cost of gathering and storing this information on your system.

David Morris is a software architect at Plum Creek Timber Company and started the iSeries-toolkit open source project. He can be reached at dmorris@itjungle.com.


Sponsored By
WORKSRIGHT SOFTWARE

On June 30, 2002,
$$$$$$$$    Postal Rates went UP!    $$$$$$$$

On July 1, 2002,
$$$$$    you wanted your postage bill to go down.    $$$$$

We have the solution! CASS certify your mailing names and addresses and presort your outgoing mail and save. Our CASS certification software ensures that your address files have valid ZIP Code and address information. Our presort software ensures that you can properly prepare you mail for delivery to your Post Office.

WorksRight Software, Inc. is the number-one source for iSeries and AS/400 CASS, presort, ZIP Code, and area code software and data.

Visit our Web site - www.worksright.com - to learn more about our CASS and presorting software, or contact WorksRight Software, Inc., phone 601-856-8337,
e-mail software@worksright.com .


THIS ISSUE
SPONSORED BY:

BCD Int'l
COMMON
Infinium Software
SoftLanding Systems
Coglin Mill
WorksRight Software


BACK ISSUES

TABLE OF CONTENTS
A New 5250 Interface in V5R2 for Those on the Go

iASPs Provide Storage Options With V5R2

IPv6: Internet Protocol of the Future, Currently in V5R2

V5R2 Opens the Door Wider for Windows

Get Ready to Boost Your IFS Performance with V5R2

New Statistics Optimize V5R2 Database Performance


Editor
Timothy Prickett Morgan

Managing Editor
Shannon Pastore

Contributing Editors:
Dan Burger
Joe Hertvik
Kevin Vandever
Shannon O'Donnell
Victor Rozek
Hesh Wiener
Alex Woodie

Publisher and
Advertising Director:

Jenny Thomas

Contact the Editors
Do you have a gripe, inside dope or an opinion?
Email the editors:
editors@itjungle.com



Last Updated: 8/28/02
Copyright © 1996-2008 Guild Companies, Inc. All Rights Reserved.