• The Four Hundred
  • Subscribe
  • Media Kit
  • Contributors
  • About Us
  • Contact
Menu
  • The Four Hundred
  • Subscribe
  • Media Kit
  • Contributors
  • About Us
  • Contact
  • What Should I Monitor For On My IBM i Partition?

    March 12, 2014 Hey, Joe

    I want to shore up my IBM i monitoring plan but I need ideas on what to monitor for. I know a problem when it hits me, but I’m not always sure what to look for. What should I be monitoring?

    –Victoria

    IBM i and Power i system monitoring is an important topic because like a shipping manager, your number one priority is to keep the production lines running. You need to find problems both before and after they occur. Here are the general areas I believe every IBM i administrator should be watching on a daily basis. In the Related Stories section following these tips, I’ve also listed several other IT Jungle articles I’ve written on the subject for further research. I’ve posted a more comprehensive index of system monitoring articles on my joehertvik.com website that I’ll be adding to as time goes on. All these collected resources should provide a good idea on how set up and configure IBM i partitions for system monitoring.

    Check to see if your partition is available–This sounds obvious, but would you know if your Power i partition went offline? There are a number of good networking packages you can use to monitor whether your partition is up and running. In a high availability environment, you can configure ping tests between your high availability box and your production system/development systems. Then set up your monitoring software to issue an alert if three or more pings are lost between the two partitions. That way, you would have two systems in different physical locations watching each other.

    Environmental issues–Did something happen to the physical environment outside of the IBM i partition? Was there a power outage, network failure, or other outside issue that will affect your business processing? This isn’t strictly a Power i issue but you should check with your network team to ensure someone is watching the network your partition is connected to.

    Power i hardware problems–Did a Power i component just fail, such as a disk drive, processor unit, power supply, backplane, Storage Area Network, etc.? When a partition element fails, IBM i always logs a system problem in the Work with Problems (WRKPRB) command screen, and it also sends a message to the QSYSOPR message queue. Set up your monitoring software to look for the following QSYSOPR hardware attention message IDs from message file QCPFMSG in library QSYS. These messages will find the majority of hardware issues on your system.

    • Any message ID that begins with CPPEA0*.
    • CPPEA11 – *Attention* Hardware service may be required.
    • CPPEA12 – *Attention* Contact your hardware service provider.
    • CPPEA13 – *Attention* Contact your hardware service provider.
    • CPPEA14 – *Attention* Contact your hardware service provider.
    • CPPEA23 – *Attention* Resource &28 removed or failed.
    • CPPEA28 – *Attention* Hardware service may be required.
    • Any message that begins with CPPEA3*.
    • CPPEA40 – *Attention* Hardware service may be required.

    Maintenance messages–Did your Power system just send off a message that a battery-powered item, such as a disk cache battery or a UPS, is nearing end of life and should be replaced? Power and battery message are also included in the QCPFMSG message file in QSYS. You will want to check QCPFMSG for more power messages you may want to monitor for, but some power messages you can monitor for in QSYSOPR, include:

    • CPP951B – Battery Power Unit fault.
    • CPP0292 – Internal Battery Power Unit fault.
    • CPP9522 – Battery Power Unit fault.
    • CPP9617 – A Battery Power Unit needs to be replaced.
    • CPP9618 – A Battery Power Unit needs to be replaced.

    Operating system issues–Did the IBM i OS just throw off an operating system error? Sometimes these errors will be caught under the hardware problem messages listed above. They will also show up in the Work with Problems screen and in the QSYSOPR message queue. Check both of these locations every day.

    Losing connectivity with key communication partners or servers–Are all your co-processing servers still available? Can you still reach your critical customers? Did a data transfer just fail? You can also set up ping testing for critical co-servers. Communication failures with co-servers will many times show up in application errors.

    Applications issues–Is there a programming inquiry message in QSYSOPR that needs attention? Did a key batch process run longer or shorter than expected? Did a critical job fail to run? Application errors fall into two categories: 1) Hard errors where the system either cancels a job or puts up an inquiry message that needs answering; or 2) Silent problems where an application is malfunctioning but it’s not sending out an error messages. Check out the articles in the Related Stories section below for tips on how to monitor for application issues.

    Backup failures–Is the system hung up because of a media failure or something as simple as waiting for a new tape? For IBM i full systems backups (GO SAVE, option 21), the system is restricted before backing up. If a full system backup hangs and the system doesn’t restart, you can check for system availability again with a ping test from another server to make sure the system comes back up at the right time. For nightly backups where the system isn’t restricted but processing may be limited, these errors can generally be trapped through the hardware messages above.

    –Joe

    RELATED STORIES

    Admin Alert: A Checklist For Monitoring Your IBM i Environment

    Admin Alert: Seven Things You Should Be Monitoring On Your System

    Admin Alert: Basic i/OS Error Monitoring and Response, Part 2

    Admin Alert: Basic i/OS Error Monitoring and Response, Part 1

    Admin Alert: Protecting Your System from Critical Storage Errors

    Community Post: List of IBM i System Monitoring Products



                         Post this story to del.icio.us
                   Post this story to Digg
        Post this story to Slashdot

    Share this:

    • Reddit
    • Facebook
    • LinkedIn
    • Twitter
    • Email

    Tags:

    Sponsored by
    Midrange Dynamics North America

    Git up to speed with MDChange!

    Git can be lightning-fast when dealing with just a few hundred items in a repository. But when dealing with tens of thousands of items, transaction wait times can take minutes.

    MDChange offers an elegant solution that enables you to work efficiently any size Git repository while making your Git experience seamless and highly responsive.

    Learn more.

    Share this:

    • Reddit
    • Facebook
    • LinkedIn
    • Twitter
    • Email

    Sponsored Links

    BCD:  View our Webinar: How Prototypes Help Your Web Development Projects Succeed
    System i Developer:  Upgrade your skills at the RPG & DB2 Summit in Dallas, March 18-20.
    Northeast User Groups Conference:  24th Annual Conference, April 7 - 9, Framingham, MA

    More IT Jungle Resources:

    System i PTF Guide: Weekly PTF Updates
    IBM i Events Calendar: National Conferences, Local Events, and Webinars
    Breaking News: News Hot Off The Press
    TPM @ EnterpriseTech: High Performance Computing Industry News From ITJ EIC Timothy Prickett Morgan

    Moving Apps To The Cloud Saves Money, Users Say Systems Of Engagement

    Leave a Reply Cancel reply

Volume 14, Number 6 -- March 12, 2014
THIS ISSUE SPONSORED BY:

SEQUEL Software
WorksRight Software
Northeast User Groups Conference

Table of Contents

  • Who Needs Custom Perspectives In RSE?
  • Quick And Handy RPG Output
  • What Should I Monitor For On My IBM i Partition?

Content archive

  • The Four Hundred
  • Four Hundred Stuff
  • Four Hundred Guru

Recent Posts

  • Public Preview For Watson Code Assistant for i Available Soon
  • COMMON Youth Movement Continues at POWERUp 2025
  • IBM Preserves Memory Investments Across Power10 And Power11
  • Eradani Uses AI For New EDI And API Service
  • Picking Apart IBM’s $150 Billion In US Manufacturing And R&D
  • FAX/400 And CICS For i Are Dead. What Will IBM Kill Next?
  • Fresche Overhauls X-Analysis With Web UI, AI Smarts
  • Is It Time To Add The Rust Programming Language To IBM i?
  • Is IBM Going To Raise Prices On Power10 Expert Care?
  • IBM i PTF Guide, Volume 27, Number 20

Subscribe

To get news from IT Jungle sent to your inbox every week, subscribe to our newsletter.

Pages

  • About Us
  • Contact
  • Contributors
  • Four Hundred Monitor
  • IBM i PTF Guide
  • Media Kit
  • Subscribe

Search

Copyright © 2025 IT Jungle