AIX

  • 1.  Weird IO Wait

    Posted Mon May 11, 2009 12:41 PM

    Originally posted by: SystemAdmin


    I've spent a fair amount of time on AIX, but I have never seen this one. I have an LPAR, AIX 5.2.10 on a 9119-595, that went to almost 100% IO wait between 0230 and 0245 a little over 3 weeks ago. This is an IHS server that does very little disk IO.

    iostat 1 5

    System configuration: lcpu=1 disk=12

    tty: tin tout avg-cpu: % user % sys % idle % iowait
    0.4 18.4 3.0 3.0 54.2 39.8

    Disks: % tm_act Kbps tps Kb_read Kb_wrtn
    hdisk5 0.0 0.0 0.0 10368 51572
    hdisk3 0.0 0.0 0.0 61216 12204
    hdisk0 0.0 0.0 0.0 0 0
    hdisk1 0.0 0.0 0.0 11157 79242
    hdisk2 0.0 0.0 0.0 6320 2384
    hdisk11 0.0 0.0 0.0 0 0
    hdisk12 0.0 0.0 0.0 0 0
    hdisk4 0.0 0.0 0.0 40432 14784
    hdisk6 0.0 0.0 0.0 43192 41340
    hdisk10 0.0 0.0 0.0 0 1580
    hdisk8 0.0 0.0 0.0 0 1056
    hdisk7 0.0 0.0 0.0 88 184660

    tty: tin tout avg-cpu: % user % sys % idle % iowait
    1.0 1769.0 4.0 4.0 0.0 92.0

    Disks: % tm_act Kbps tps Kb_read Kb_wrtn
    hdisk5 1.0 0.0 0.0 0 0
    hdisk3 1.0 0.0 0.0 0 0
    hdisk0 0.0 0.0 0.0 0 0
    hdisk1 0.0 0.0 0.0 0 0
    hdisk2 1.0 0.0 0.0 0 0
    hdisk11 0.0 0.0 0.0 0 0
    hdisk12 0.0 0.0 0.0 0 0
    hdisk4 1.0 0.0 0.0 0 0
    hdisk6 1.0 0.0 0.0 0 0
    hdisk10 1.0 0.0 0.0 0 0
    hdisk8 1.0 0.0 0.0 0 0
    hdisk7 1.0 0.0 0.0 0 0

    tty: tin tout avg-cpu: % user % sys % idle % iowait
    0.0 984.0 1.0 6.0 0.0 93.0

    Disks: % tm_act Kbps tps Kb_read Kb_wrtn
    hdisk5 0.0 0.0 0.0 0 0
    hdisk3 0.0 0.0 0.0 0 0
    hdisk0 0.0 0.0 0.0 0 0
    hdisk1 0.0 0.0 0.0 0 0
    hdisk2 0.0 0.0 0.0 0 0
    hdisk11 0.0 0.0 0.0 0 0
    hdisk12 0.0 0.0 0.0 0 0
    hdisk4 0.0 0.0 0.0 0 0
    hdisk6 0.0 0.0 0.0 0 0
    hdisk10 0.0 0.0 0.0 0 0
    hdisk8 0.0 0.0 0.0 0 0
    hdisk7 0.0 0.0 0.0 0 0

    tty: tin tout avg-cpu: % user % sys % idle % iowait
    0.0 1701.0 2.0 2.0 0.0 96.0

    Disks: % tm_act Kbps tps Kb_read Kb_wrtn
    hdisk5 0.0 0.0 0.0 0 0
    hdisk3 0.0 0.0 0.0 0 0
    hdisk0 0.0 0.0 0.0 0 0
    hdisk1 0.0 0.0 0.0 0 0
    hdisk2 0.0 0.0 0.0 0 0
    hdisk11 0.0 0.0 0.0 0 0
    hdisk12 0.0 0.0 0.0 0 0
    hdisk4 0.0 0.0 0.0 0 0
    hdisk6 0.0 0.0 0.0 0 0
    hdisk10 0.0 0.0 0.0 0 0
    hdisk8 0.0 0.0 0.0 0 0
    hdisk7 0.0 0.0 0.0 0 0

    tty: tin tout avg-cpu: % user % sys % idle % iowait
    0.0 984.0 6.0 10.0 0.0 84.0

    Disks: % tm_act Kbps tps Kb_read Kb_wrtn
    hdisk5 0.0 0.0 0.0 0 0
    hdisk3 0.0 0.0 0.0 0 0
    hdisk0 0.0 0.0 0.0 0 0
    hdisk1 0.0 0.0 0.0 0 0
    hdisk2 0.0 0.0 0.0 0 0
    hdisk11 0.0 0.0 0.0 0 0
    hdisk12 0.0 0.0 0.0 0 0
    hdisk4 0.0 0.0 0.0 0 0
    hdisk6 0.0 0.0 0.0 0 0
    hdisk10 0.0 0.0 0.0 0 0
    hdisk8 0.0 0.0 0.0 0 0
    hdisk7 0.0 116.0 2.0 0 116

    So, why is the system flagging what is probably idle time as IO wait?
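    One way to make the mismatch above explicit is to scan the iostat text for intervals where iowait is high while the disks report almost no transfers. The sketch below is only an illustration in Python: it assumes the layout shown in this post (a `tty:` header followed by one line of values, then a `Disks:` table with six columns), and the function names and thresholds are made up for the example. The first report is skipped because iostat's first report covers the time since boot rather than the last interval.

```python
# Abbreviated copy of the iostat output from this post (two disks, three reports).
SAMPLE = """\
tty: tin tout avg-cpu: % user % sys % idle % iowait
0.4 18.4 3.0 3.0 54.2 39.8

Disks: % tm_act Kbps tps Kb_read Kb_wrtn
hdisk5 0.0 0.0 0.0 10368 51572
hdisk7 0.0 0.0 0.0 88 184660

tty: tin tout avg-cpu: % user % sys % idle % iowait
1.0 1769.0 4.0 4.0 0.0 92.0

Disks: % tm_act Kbps tps Kb_read Kb_wrtn
hdisk5 1.0 0.0 0.0 0 0
hdisk7 1.0 0.0 0.0 0 0

tty: tin tout avg-cpu: % user % sys % idle % iowait
0.0 984.0 6.0 10.0 0.0 84.0

Disks: % tm_act Kbps tps Kb_read Kb_wrtn
hdisk5 0.0 0.0 0.0 0 0
hdisk7 0.0 116.0 2.0 0 116
"""


def parse_iostat(text):
    """Return one dict per report: {'iowait': float, 'tps': float (sum over disks)}."""
    samples = []
    current = None
    expect_cpu = False   # True right after a 'tty:' header line
    in_disks = False     # True while reading rows under a 'Disks:' header
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("tty:"):
            current = {"iowait": None, "tps": 0.0}
            samples.append(current)
            expect_cpu, in_disks = True, False
        elif expect_cpu:
            # Last column of the values line is % iowait.
            current["iowait"] = float(line.split()[-1])
            expect_cpu = False
        elif line.startswith("Disks:"):
            in_disks = True
        elif in_disks and current is not None:
            fields = line.split()
            # Expected row: name, % tm_act, Kbps, tps, Kb_read, Kb_wrtn
            if len(fields) == 6:
                current["tps"] += float(fields[3])
    return samples


def suspicious(samples, iowait_min=80.0, tps_max=5.0):
    """Indexes of interval reports with high iowait but almost no disk transfers."""
    return [
        i for i, s in enumerate(samples)
        if i > 0 and s["iowait"] >= iowait_min and s["tps"] <= tps_max
    ]


if __name__ == "__main__":
    reports = parse_iostat(SAMPLE)
    for i in suspicious(reports):
        print(f"report {i}: iowait={reports[i]['iowait']}% total tps={reports[i]['tps']}")
```

    Run against the full output, every interval report would be flagged, which is the point of the question: the wait is being charged without matching disk traffic.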

    Is a reboot the only way to make this host come to its senses?

    There are no processes that jump out, and no processes that were started during the 15 minute window when the wait time went haywire.

    Attached is a Ganglia graph of the server going crazy.

    Scott
    #AIX-Forum


  • 2.  Re: Weird IO Wait

    Posted Mon May 11, 2009 07:55 PM

    Originally posted by: dukessd


    Looks like a backend storage (hdisk7) problem to me, not an AIX problem.

    It looks like AIX queued a load of I/O to hdisk7 and then had to wait while hdisk7 had a problem writing it out to disk.

    If AIX is waiting on a write and has nothing else to do - as in this case - the CPU sits idle with that I/O still outstanding, and all of that idle time gets 'booked' as I/O wait rather than as plain idle.

    what type of connection and disk is hdisk7?
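
    The accounting described above can be sketched as a toy model. This is not AIX kernel code, and the real rules differ in detail between AIX levels; it only assumes the usual description that an idle clock tick is charged to iowait when an I/O request is still outstanding, and to idle otherwise. The function name and the tick counts are invented for the example.

```python
from collections import Counter


def account_ticks(ticks):
    """Classify clock ticks into busy / iowait / idle percentages.

    ticks: iterable of (cpu_busy, io_outstanding) booleans, one pair per tick.
    """
    counts = Counter()
    for cpu_busy, io_outstanding in ticks:
        if cpu_busy:
            counts["busy"] += 1       # CPU was running something
        elif io_outstanding:
            counts["iowait"] += 1     # idle, but an I/O request has not completed
        else:
            counts["idle"] += 1       # idle with nothing pending
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no ticks to account")
    return {k: 100.0 * counts[k] / total for k in ("busy", "iowait", "idle")}


# 100 ticks on a single logical CPU that is busy for only 6 of them.
# Case 1: one I/O request is stuck and never completes.
stuck = account_ticks((i < 6, True) for i in range(100))
# Case 2: the same workload with nothing outstanding.
clean = account_ticks((i < 6, False) for i in range(100))

if __name__ == "__main__":
    print("stuck request:", stuck)
    print("no request:   ", clean)
```

    In this model a single request that never completes is enough to turn all idle time into iowait on a one-CPU LPAR, even though no data is moving, which would match both the 90%+ figure and the empty disk columns.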
    #AIX-Forum


  • 3.  Re: Weird IO Wait

    Posted Tue May 12, 2009 08:47 AM

    Originally posted by: SystemAdmin


    All backend storage is EMC Symmetrix, but during the 5 second iostat period there was only one slice with IO to hdisk7. Also, IO Wait has been greater than 90% for over three weeks. I'm about convinced something has gone haywire in the kernel and a reboot is the only way to clear it up.
    #AIX-Forum


  • 4.  Re: Weird IO Wait

    Posted Tue May 12, 2009 10:11 AM

    Originally posted by: flodstrom


    Could it be a stray or failed process that is causing the apparent I/O wait? I would look for old user processes that should not normally be there and kill them. However, if there's a lot of activity on the system the culprit might be hard to find, so a reboot (if you can do that) would probably be the quickest fix.

    That said, do you notice any read/write performance issues on the file system that is using hdisk7?
    #AIX-Forum


  • 5.  Re: Weird IO Wait

    Posted Tue May 12, 2009 02:33 PM

    Originally posted by: SystemAdmin


    No performance issues on any drives. Very little disk activity, basically just writing Apache log files. I'll just schedule a reboot and chalk this one up as a mystery. Been working with AIX for over 20 years and I have never seen this before.

    Thanks for the replies.
    #AIX-Forum


  • 6.  Re: Weird IO Wait

    Posted Tue May 12, 2009 04:45 PM

    Originally posted by: KentPerrier


    Have you bounced IHS? If you have, are there any old httpd processes still out there from before the bounce?
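
    A rough way to check for leftovers is to compare each process's elapsed time with how long ago the bounce (or the start of the problem) was. The Python sketch below parses text in the shape produced by `ps -e -o pid,etime,args`, where etime is `[[dd-]hh:]mm:ss`. The sample listing, including the PIDs and the httpd path, is made up for illustration, and the helper names are hypothetical.

```python
# Made-up listing in the shape of `ps -e -o pid,etime,args` output.
PS_SAMPLE = """\
  PID     ELAPSED COMMAND
 4130 23-10:15:42 /usr/IBMIHS/bin/httpd -d /usr/IBMIHS -k start
 5210    02:11:07 /usr/IBMIHS/bin/httpd -d /usr/IBMIHS -k start
 6022       45:10 sshd: scott
"""


def etime_to_seconds(etime):
    """Convert a ps etime value ([[dd-]hh:]mm:ss) to seconds."""
    days = 0
    if "-" in etime:
        day_part, etime = etime.split("-", 1)
        days = int(day_part)
    parts = [int(p) for p in etime.split(":")]
    while len(parts) < 3:          # pad missing hours with zero
        parts.insert(0, 0)
    hours, minutes, seconds = parts
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def older_than(ps_output, min_age_seconds, name="httpd"):
    """Return (pid, etime, args) for matching processes at least min_age_seconds old."""
    hits = []
    for line in ps_output.splitlines()[1:]:   # skip the header line
        fields = line.split(None, 2)
        if len(fields) < 3:
            continue
        pid, etime, args = fields
        if name in args and etime_to_seconds(etime) >= min_age_seconds:
            hits.append((int(pid), etime, args))
    return hits


if __name__ == "__main__":
    three_weeks = 21 * 24 * 3600
    for pid, etime, args in older_than(PS_SAMPLE, three_weeks):
        print(pid, etime, args)
```

    Anything httpd-related that is older than the last restart would be a candidate for a closer look before resorting to a reboot.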
    #AIX-Forum