AIX

  • 1.  ERROR LOGGING TURNED OFF and user sessions are dropped

    Posted Fri July 21, 2006 12:00 PM

    Originally posted by: SystemAdmin


    I have experienced a serious problem on a p5-550
    running AIX 5.3 and wonder if anyone else has had a similar
    experience or knows what might be causing it.

    The errdemon stops and the following is
    in the output of errpt:
    ERROR LOGGING TURNED OFF

    The Console station (CDE) goes blank with just a gray screen and X cursor.

    Users that are logged in get their sessions dropped.

    As root, I restart the errdemon: '/usr/lib/errdemon'

    More errors get logged like the following:
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
    BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
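
    A listing like this can be tallied by identifier to see which errors dominate. The following is a minimal Python sketch, not part of the original post. It assumes the usual errpt summary column order (IDENTIFIER, TIMESTAMP, TYPE, CLASS, RESOURCE_NAME, DESCRIPTION) and a TIMESTAMP in MMDDhhmmYY form; the function names are hypothetical.

```python
from collections import Counter
from datetime import datetime

# Summary lines copied from the errpt output quoted in the post.
ERRPT_LINES = """\
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BC3BE5A3 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
BA431EB7 0720160206 P S SRC SOFTWARE PROGRAM ERROR
"""


def parse_errpt_line(line):
    """Split one errpt summary line into named fields.

    Assumption: six whitespace-separated columns, the last of which
    (the description) may itself contain spaces.
    """
    ident, stamp, err_type, err_class, resource, description = line.split(None, 5)
    return {
        "identifier": ident,
        # Assumption: the timestamp is MMDDhhmmYY.
        "when": datetime.strptime(stamp, "%m%d%H%M%y"),
        "type": err_type,
        "class": err_class,
        "resource": resource,
        "description": description.strip(),
    }


def tally_identifiers(lines):
    """Count how many times each error identifier appears."""
    return Counter(
        parse_errpt_line(line)["identifier"] for line in lines if line.strip()
    )


counts = tally_identifiers(ERRPT_LINES.splitlines())
for ident, n in counts.most_common():
    print(ident, n)
```

    For the ten lines above this reports seven BC3BE5A3 entries and three BA431EB7 entries, all stamped within the same minute.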

    All of the errors have similar details, like the following,
    with a different FAILING MODULE for each error.

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    i4lmd

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    i4gdb

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    i4llmd

    DETECTING MODULE
    'srchevn.c'@line:'217'
    FAILING MODULE
    rpc.lockd

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    hostmibd

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    snmpd

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    aixmibd

    DETECTING MODULE
    'srchevn.c'@line:'350'
    FAILING MODULE
    muxatmd

    DETECTING MODULE
    'srchevn.c'@line:'217'
    FAILING MODULE
    rpc.statd

    DETECTING MODULE
    'srchevn.c'@line:'217'
    FAILING MODULE
    biod

    The errpt entry for the errdemon logging off
    is as follows:


    LABEL: ERRLOG_OFF
    IDENTIFIER: 192AC071

    Date/Time: Thu Jul 20 16:02:30 CDT 2006
    Sequence Number: 730
    Machine Id: 00CE395F4C00
    Node Id: aims
    Class: O
    Type: TEMP
    Resource Name: errdemon

    Description
    ERROR LOGGING TURNED OFF

    Probable Causes
    ERRSTOP COMMAND

    User Causes
    ERRSTOP COMMAND

    Recommended Actions
    RUN ERRDEAD COMMAND
    TURN ERROR LOGGING ON

    Has anyone seen this problem, or does anyone have any idea what
    might be causing it?

    Thanks,

    Denny Watkins
    Morningside College
    Sioux City, Iowa
    712-274-5250


  • 2.  Couple of ideas

    Posted Mon July 24, 2006 06:39 AM

    Originally posted by: nagger


    Firstly, mayhem like this can be caused by a lack of memory and filling up your paging space.
    Run "topas" to check that you have free memory, and run "lsps -a" to check that your paging space is large enough.
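
    The paging space check can be scripted. The following is a rough Python sketch that parses text in the shape of "lsps -a" output and flags any paging space above a threshold. The column layout (name first, size fourth, %Used fifth) is an assumption about the output format, the sample text is made up for illustration and is not real output, and the function names and the 80% threshold are hypothetical.

```python
# Illustrative sample only; NOT real output from any system.
# Assumed column order: name, physical volume, volume group, size, %Used, ...
SAMPLE_LSPS_A = """\
Page Space      Physical Volume   Volume Group    Size %Used Active  Auto  Type
hd6             hdisk0            rootvg         512MB    86   yes   yes    lv
paging00        hdisk1            rootvg         512MB    12   yes   yes    lv
"""


def parse_lsps_a(text):
    """Return a list of (name, size, percent_used) tuples, one per paging space."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and the header row, whose fifth word is not a number.
        if len(fields) < 5 or not fields[4].isdigit():
            continue
        rows.append((fields[0], fields[3], int(fields[4])))
    return rows


def over_threshold(rows, threshold=80):
    """Return the rows whose %Used is at or above the (hypothetical) threshold."""
    return [row for row in rows if row[2] >= threshold]


for name, size, used in over_threshold(parse_lsps_a(SAMPLE_LSPS_A)):
    print(f"{name} ({size}) is {used}% used")
```

    On a live system the text could come from subprocess.run(["lsps", "-a"], capture_output=True, text=True).stdout instead of the sample string, provided the real column layout matches the assumption above.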

    Secondly, make sure your file systems are NOT 100% full.
    All sorts of strange things can happen if /tmp or /var gets full.
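
    A quick way to watch for this is sketched below in Python, using the standard library's shutil.disk_usage. The function name and the 90% threshold are hypothetical, and the percentage computed here may differ slightly from what "df" reports, so treat it as a rough check only.

```python
import shutil


def nearly_full(paths, threshold=90.0, usage=shutil.disk_usage):
    """Return {path: percent_used} for each path at or above the threshold.

    `usage` is any callable returning (total, used, free) in bytes; it
    defaults to shutil.disk_usage and can be swapped out for testing.
    """
    flagged = {}
    for path in paths:
        try:
            total, used, _free = usage(path)
        except OSError:
            # The path does not exist or cannot be queried; skip it.
            continue
        percent = 100.0 * used / total if total else 0.0
        if percent >= threshold:
            flagged[path] = round(percent, 1)
    return flagged


if __name__ == "__main__":
    # /tmp and /var are the two file systems named in the post.
    print(nearly_full(["/tmp", "/var"]))
```

    An empty result means neither file system has reached the threshold.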

    Third, I would make sure you are on the latest AIX version and ML level.
    "oslevel -r" will tell you the current level and you should
    • NOT be on the GA level i.e. ML 00
    • be on ML03 or TL03 or higher.

    You might have an underlying hardware problem generating lots of errors and flooding the error reporting. Do you get these problems on first booting the machine or later on?

    Save your full errpt output to a file and then clear the current log with (from memory) errclear 0,
    then watch for new errors.

    AIX support can analyse error logs and find out exactly the cause and the fix.
    If you have AIX support then get them working on it - that is what you pay for :-)

    Is the machine in a very hot room? (We are having a heat wave here in the UK and my machine room cooling failed; some older machines decided to power off but my p5-5050 was OK.) Might be worth checking if you are not near the machine.

    Hope this helps, N


  • 3.  Re: ERROR LOGGING TURNED OFF and user sessions are dropped

    Posted Fri March 05, 2010 12:22 PM

    Originally posted by: wirojas


    We are experiencing the same problem right now in our environment. Have you solved this issue?
    AIX:5.3

    Any help will be appreciated.

    S,WR


  • 4.  Re: ERROR LOGGING TURNED OFF and user sessions are dropped

    Posted Fri March 05, 2010 12:43 PM

    Originally posted by: Juredd1


    Have you checked everything that nagger suggested in the previous post? I had a customer with the same problem for a few weeks. He would call saying all of his users were getting kicked out of the system. It got to the point where it was happening every day; then he rebooted the server and it was about a week before it started again. I looked at paging and it was 86% full. I suggested increasing it, but by the next day he had not, and at 97% full it kicked them off again. He increased paging and I have not heard from him since. Just my 2 cents.


  • 5.  Re: ERROR LOGGING TURNED OFF and user sessions are dropped

    Posted Tue March 09, 2010 03:16 AM

    Originally posted by: SystemAdmin


    I had the same prob when paging filled up on one of our servers.

    r/
    R