IBM Support

PI13410: WHEN ENDPOINT SERVER SERVANT REGION CANCELLED, JOB MANAGEMENT CONSOLE SHOWS JOBS REMAINING IN SUBMITTED STATE

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • Occasionally, when endpoint server servant region is
    cancelled, the Job Management Console shows a job that was
    submitted to that endpoint remaining in submitted state.
    

Local fix

Problem summary

  • ****************************************************************
    * USERS AFFECTED:  WebSphere Compute Grid version 8.0 on z/OS  *
    ****************************************************************
    * PROBLEM DESCRIPTION: Occasionally, when endpoint server      *
    *                      servant region is cancelled, the Job    *
    *                      Management Console shows jobs that      *
    *                      were submitted to that endpoint         *
    *                      remaining in submitted state.           *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    The job scheduler dispatch process was completed successfully
    and the job was marked as owned by the endpoint.  However, on
    the endpoint server, there is a timing window where the servant
    region was cancelled after it picked up the job but before the
    record keeping was updated.  Because of this, the control
    region did not know about this job so no clean up was done for
    this job when the servant region was down.
    

Problem conclusion

  • Currently, when about to execute a job (calling the
    BatchJobController.submitJob()),the endpoint server adds the
    job to the control region job cache.  However, if
    the endpoint servant region is canceled at any point
    before the job is added to the job cache, no status update will
    be sent out.
    The runtime code has been updated to add the jobid to the
    control region job cache as soon as the runtime gets the job
    submission.
    This way, if the servant region goes down before it processes
    the job submission, the control region will have a record of
    this job and it will cancel the job and send update status to
    the endpoint.
    The fix for this APAR is currently targeted for inclusion in
    fixpack 8.0.0.4 Please refer to the Recommended Updates page
    for delivery information:
    http://www.ibm.com/support/docview.wss?uid=swg27022998
    

Temporary fix

Comments

APAR Information

  • APAR number

    PI13410

  • Reported component name

    WXD COMPUTE GRI

  • Reported component ID

    5725C9301

  • Reported release

    800

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2014-03-11

  • Closed date

    2014-06-04

  • Last modified date

    2014-07-21

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    WXD COMPUTE GRI

  • Fixed component ID

    5725C9301

Applicable component levels

  • R800 PSY

       UP

[{"Business Unit":{"code":"BU029","label":"Software"},"Product":{"code":"SSFVRM","label":"WebSphere Extended Deployment Compute Grid"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"8.0"}]

Document Information

Modified date:
27 April 2022