fhg
Volume 11, Number 26 -- September 14, 2011

Admin Alert: Three Common Problems with CBUs

Published: September 14, 2011

by Joe Hertvik

Power i Capacity BackUp (CBU) systems are complicated critters. Aside from the basic tasks of configuring, testing, and enabling a CBU for taking over processing from a production box, there are a lot of different problems that can occur with CBU configuration, and even when the CBU is running in replication mode. This week, let's look at a few not-so-obvious CBU configuration issues that can hurt your CBU or production system setup.

Issue #1: Problems with libraries that start with "Q"

By default, many Power i replication packages do not replicate objects in libraries whose names start with "Q", such as QGPL, QSYS, and QUSRSYS. This is because many of these libraries contain system objects that are specific to a certain machine, and replication may damage the machine setup.

However, there may be certain times when you need to replicate objects from a production system Q library to your CBU system. In particular, you may find that the following items need to be replicated on an exception basis from your QGPL or QUSRSYS libraries in order for your CBU to work correctly when it is running production processing:

  • Modified subsystem descriptions for popular subsystems such as QBATCH and QPGMR
  • Job queues that are attached to various subsystems
  • Output queues for iSeries Access for Windows printers and other system printers. Many of these queues are automatically created in QUSRSYS
  • Class objects that specify run time priorities for Power i jobs
  • Application files that don't start with the letter Q. Programmers and administrators like to keep general administrative files in the QGPL library
  • Text files containing SQL runtime commands
  • Job descriptions that contain job runtime parameters for individual users or groups of users
  • Specialized commands and programs that programmers or administrators stow in QGPL because they don't fit anywhere else
  • The system's startup program (QSTRUP)

Regarding that last item, be careful with replicating the QSTRUP program from one machine to another. By default, Power i system startup program names are named QSTRUP, and people generally locate modified versions of these programs in QGPL. You can easily clobber your CBU startup program if you're replicating the entire QGPL library to the CBU and both machines use QSTRUP as their startup program name. For that reason, I usually recommend that you use different startup program names for your production and CBU machines, and the CBU startup program should be located in a library that isn't replicated.

Recommended solution: To avoid conflicts and to ensure you correctly replicate all production objects between your source and target machines, the best situation is to not keep application objects like these in your QGPL and QUSRSYS libraries or any other library whose name starts with the letter Q. If that's not possible, I'd recommend adding exceptions to your HA replication software data groups so that you only replicate relevant objects from QGPL and QUSRSYS, instead of replicating the entire library. For all future objects, place those objects in a special production library whose name does not start with a Q.

Issue #2: System objects that need to be manually replicated

There are specific Power i OS objects and functions that are difficult to replicate and may need to be manually adjusted so that the CBU versions are in sync with the production. These objects may include:

System host tables that contain host names and associated IP addresses of the hosts your production and CBU boxes contact. On the green screen, you can check your host names by running the Configure TCP/IP (GO CFGTCP) command and then taking Option 10, Work with TCP/IP host table entries. These entries should match to ensure that inter-system communication works correctly on your CBU when it is impersonating production.

Shared storage pools that allocate system memory to different subsystems. Use the Work with Shared Storage Pools (WRKSHRPOOL) command to determine whether your storage pool setup is the same on both systems. Storage pool configuration is a part of your production system's work management setup. If your production subsystem descriptions are replicated to the CBU, they will be configured to use the same storage pools as the production machine. Because of that, your CBU storage pools should be configured roughly the same as your production system storage pools.

Recommended solution: As a matter of regular CBU maintenance, check to make sure non-replicated system objects such as these are the same between your production machine and your CBU.

Issue #3: Spooled file replication gone wrong

Similar to what can happen when users mistake development system output for production data, a CBU can also send fake output to your production system users. Here's a real-life example that I recently encountered.

One of my warehouse facilities complained that they were printing duplicate shipping labels for a big bulk shipment that was scheduled to go out the next day. There were 14,000 labels to print, but to our surprise, 25,000 to 28,000 labels printed, which included several duplicates of the original 14,000 labels.

We scratched our heads for three days trying to figure out what had happened while our users dutifully separated the first set of labels from the duplicates in their bulk run. Finally, it hit us.

For our CBU and HA solution, we were replicating spooled files from the production machine to the CBU. We did this to ensure that our users could still access their spooled files if we had to switch over to the CBU in an emergency. This meant that for every production output queue, the system tried to replicate any available spooled files over to the same output queue on the CBU.

On the CBU side, it turned out that the QSPL subsystem, which controls all operating system print jobs, was turned on. Further, it also turned out that the same printer output queue that sent labels to our warehouse label printer on the production system was also turned on inside the CBU system. Since the output queue sent production labels directly to our warehouse label printer, the following sequence of events occurred with this particular printer.

On the production system: While spooled files were waiting in the source output queue to print, they were being replicated over to the same output queue on the CBU.

On the CBU: Since the same production printer was active on the production box and the CBU and the remote output queue attached to this CBU printer device was active, any new spooled files that reached the CBU's production printer output queue were also sent to the warehouse's label printer.

At the warehouse label printer: Labels were received at the printer both from our production system (good labels) and our CBU (replicated labels). The end result was that the staff received duplicated sets of shipping labels interspersed within the same print run.

Once we knew the problem was a second active printer on our HA box, it was easy to fix this problem by turning off the QSPL subsystem that controls printing.

Recommended solution: Be careful running CBU subsystems that communicate with other devices or users outside of the CBU. Many of these subsystems, including QSPL (for printers) and QHTTPSVR (for Web sites), may be configured with the same parameters as those of the production system that it is meant to replicate. If these subsystems are active when the CBU is in replication mode, they could easily send fake data out to your users, the same way a development partition sometimes sends out fake data when programmers are testing various functions.




                     Post this story to del.icio.us
               Post this story to Digg
    Post this story to Slashdot


Sponsored By
SYSTEM i DEVELOPER

Join the Gurus at the Summit!

Rise to the i can . . . can you? challenge
at the RPG & DB2 Summit this Oct 17-19 in St. Louis.

Upgrade your skills - and your career - with the latest on SQL, RPG & the Web,
RPG IV, PHP, mobile apps, DB2 for i, SQL tuning, modern tools
and more!

Get 1-on-1 advice and learn practical, use-it-today tips and techniques from
top experts Susan Gantner, Jon Paris, Paul Tuohy,
Skip Marchesani, Scott Klement, Mike Cain, Kent Milligan

in a highly interactive, fun environment.

See the SESSION GRID here and register today!


Senior Technical Editor: Ted Holt
Technical Editor: Joe Hertvik
Contributing Technical Editors: Edwin Earley, Brian Kelly, Michael Sansoterra
Publisher and Advertising Director: Jenny Thomas
Advertising Sales Representative: Kim Reed
Contact the Editors: To contact anyone on the IT Jungle Team
Go to our contacts page and send us a message.

Sponsored Links

PowerTech:  2011 Security Event of the Year. September 22–23 in Las Vegas. RVSP today!
Botz & Associates, Inc.:  FREE Single Sign-On video tutorial and ROI Calculator
ProData Computer Services:  Learn how to access remote data -- RDB Connect On-Demand Webinar


 

IT Jungle Store Top Book Picks

BACK IN STOCK: Easy Steps to Internet Programming for System i: List Price, $49.95

The iSeries Express Web Implementer's Guide: List Price, $49.95
The iSeries Pocket Database Guide: List Price, $59
The iSeries Pocket SQL Guide: List Price, $59
The iSeries Pocket WebFacing Primer: List Price, $39
Migrating to WebSphere Express for iSeries: List Price, $49
Getting Started with WebSphere Express for iSeries: List Price, $49
The All-Everything Operating System: List Price, $35
The Best Joomla! Tutorial Ever!: List Price, $19.95


 
The Four Hundred
Appreciation Rising for Web Serving on IBM i

Aussies And Kiwis Get Killer Power 720 Bundle

Second Quarter Server Sales Humming Right Along

As I See It: Going Silent

A Big Data Hungry IBM Buys i2 and Algorithmics

Four Hundred Stuff
IBM Patches 'Apache Killer' DOS Vulnerability in IBM i

SAP Talks Up HANA In-Memory Database at TechEd 2011

iQ for Business Looks to Grow BI Market Share

Infor Hooks Up with Salesforce.com

Linoma Updates GoAnywhere Products

Four Hundred Monitor
Four Hundred Monitor's
Full iSeries Events Calendar

System i PTF Guide
September 10, 2011: Volume 13, Number 1

September 25, 2010: Volume 12, Number 39

September 18, 2010: Volume 12, Number 38

September 11, 2010: Volume 12, Number 37

September 4, 2010: Volume 12, Number 36

August 28, 2010: Volume 12, Number 35

TPM at The Register
Broadcom breaks piggy bank for $3.7bn NetLogic buy

PostgreSQL revs to 9.1, aims for enterprise

Sparc T4 chips: Oracle makes stealthy spec change

Netflix: Not interested in cloud 'heavy lifting'

Unisys gets 'stealthy' with secure virtual terminal

PC sales tank on fears of Meltdown, Part Deux

Intel kicks dough to cloud, analytics startups

IBM loans SMBs $1bn to buy stuff

IBM, 3M glue chips into silicon skyscrapers

AMD ships 'Interlagos' Opteron 6200 chips

Server sales up, but Great Recession lingers

HP-UX stretches over new Superdome 2

THIS ISSUE SPONSORED BY:

WorksRight Software
Bytware
System i Developer


Printer Friendly Version


TABLE OF CONTENTS
Learn To Debug Authority Failures, Part 1

Now Where Did I Leave That Source Member?

Admin Alert: Three Common Problems with CBUs

Four Hundred Guru

BACK ISSUES




 
Subscription Information:
You can unsubscribe, change your email address, or sign up for any of IT Jungle's free e-newsletters through our Web site at http://www.itjungle.com/sub/subscribe.html.

Copyright © 1996-2011 Guild Companies, Inc. All Rights Reserved.
Guild Companies, Inc., 50 Park Terrace East, Suite 8F, New York, NY 10034

Privacy Statement