AIX


My program is hanging; what does the following stack trace mean?

  • 1.  My program is hanging; what does the following stack trace mean?

    Posted Thu December 14, 2006 02:01 AM

    Originally posted by: SystemAdmin


    Hi,
    Our product has a distributed architecture built on CORBA (TAO).
    A customer is hitting a problem when disconnecting from our server processes (all pure C++ code): the process goes into a hung state.
    He is running AIX 5.2 with
    thread library version: 5.2.0.75

    We could not find any technical or code-level problem in our product, so I am seeking help here. After drilling down to this level, I still cannot tell what is going on. When the process hangs, this is the state of its threads. The process has around 50 threads (most of them in wait mode).
    ********************************
    (dbx) thread
    thread state-k wchan state-u k-tid mode held scope function
    $t1 wait 0xf100008790029140 blocked 2691109 k no sys _event_sleep
    >$t2 run blocked 2490517 k no sys _event_sleep
    $t3 wait 0xf100008790009c40 blocked 639021 k no sys _event_sleep
    $t4 wait 0xf100008790029f40 blocked 2748553 k no sys _event_sleep
    $t5 wait 0xf10000879001ee40 blocked 2023521 k no sys _event_sleep
    $t6 wait 0xf100008790042140 blocked 4329711 k no sys _event_sleep
    $t7 wait 0xf100008790041940 blocked 4296815 k no sys _event_sleep
    $t8 wait 0xf100008790058a40 blocked 5808155 k no sys _event_sleep
    $t9 wait 0xf100008790041b40 blocked 4304991 k no sys _event_sleep
    $t10 wait 0xf100008790032040 blocked 3276951 k no sys _event_sleep
    $t11 wait 0xf100008790056f40 blocked 5697751 k no sys _event_sleep
    $t12 wait 0xf100008790029d40 blocked 2740321 k no sys _event_sleep
    $t13 wait 0xf100008790041e40 blocked 4317299 k no sys _event_sleep
    $t14 wait 0xf100008790054040 blocked 5505113 k no sys _event_sleep
    $t15 wait 0xf100008790029940 blocked 2724005 k no sys _event_sleep
    $t16 wait 0xf100008790055040 blocked 5570763 k no sys _event_sleep
    $t17 wait 0xf100008790009f40 blocked 651343 k no sys _event_sleep
    $t18 wait 0xf100008790032340 blocked 3289171 k no sys _event_sleep
    $t19 wait 0xf100008790056a40 blocked 5677133 k no sys _event_sleep
    $t20 wait 0xf100008790042040 blocked 4325457 k no sys _event_sleep
    $t21 wait 0xf100008790041f40 blocked 4321361 k no sys _event_sleep
    $t22 wait 0xf100008790031f40 blocked 3272883 k no sys _event_sleep
    $t23 wait 0xf10000879002a240 blocked 2760779 k no sys _event_sleep
    $t24 wait 0xf100008790029640 blocked 2711625 k no sys _event_sleep
    $t25 wait 0xf100008790041a40 blocked 4300879 k no sys _event_sleep
    $t26 wait 0xf100008790041140 blocked 4264001 k no sys _event_sleep
    $t27 wait 0xf100008790015240 blocked 1384659 k no sys _event_sleep
    $t28 wait 0xf100008790025840 blocked 2457615 k no sys _event_sleep
    $t29 wait 0xf100008790041c40 blocked 4309067 k no sys _event_sleep
    $t30 wait 0xf100008790032440 blocked 3293265 k no sys _event_sleep
    $t31 wait 0xf100008790041d40 blocked 4313163 k no sys _event_sleep
    $t32 wait 0xf100008790057940 blocked 5738709 k no sys _event_sleep
    $t33 wait 0xf100008790022c40 blocked 2277535 k no sys _event_sleep
    $t34 wait 0xf100008790029840 blocked 2719849 k no sys _event_sleep
    $t35 wait 0xf10000879002a840 blocked 2785419 k no sys _event_sleep
    $t36 wait 0xf100008790032540 blocked 3297383 k no sys _event_sleep
    $t37 wait 0xf100008790029440 blocked 2703447 k no sys _event_sleep
    $t38 wait 0xf100008790042540 blocked 4345933 k no sys _event_sleep
    $t39 wait 0xf100008790042640 blocked 4350031 k no sys _event_sleep
    $t40 wait 0xf100008790042740 blocked 4354129 k no sys _event_sleep
    $t41 wait 0xf100008790042840 blocked 4358227 k no sys _event_sleep
    $t42 wait 0xf100008790042940 blocked 4362325 k no sys _event_sleep
    $t43 wait 0xf100008790042a40 blocked 4366423 k no sys _event_sleep
    $t44 wait 0xf100008790042b40 blocked 4370521 k no sys _event_sleep
    $t45 wait 0xf100008790042c40 blocked 4374619 k no sys _event_sleep
    $t46 wait 0xf100008790042d40 blocked 4378717 k no sys _event_sleep
    $t47 wait 0xf100008790042e40 blocked 4382815 k no sys _event_sleep
    $t48 wait 0xf100008790042f40 blocked 4386913 k no sys _event_sleep
    $t49 wait 0xf100008790043040 blocked 4391011 k no sys _event_sleep
    $t50 wait 0xf100008790043140 blocked 4395109 k no sys _event_sleep
    $t51 wait running 4399207 k no sys select
    (dbx)

    (dbx) thread current 51
    warning: Thread is in kernel mode, not all registers can be accessed.
    (dbx) where
    glink64.select() at 0x900000001db6474
    OS.select(int,fd_set*,fd_set*,fd_set*,const ACE_Time_Value*)() at 0x900000001dace54
    OS.select(int,fd_set*,fd_set*,fd_set*,const ACE_Time_Value*)() at 0x900000001dace54
    wait_for_multiple_events__20ACE_Select_Reactor_TXT26ACE_Select_Reactor_Token_TXT9ACE_Token__FR29ACE_Select_Reactor_Handle_SetP14ACE_Time_Value() at 0x9000000018d5c90
    get_event_for_dispatching__14ACE_TP_ReactorFP14ACE_Time_Value() at 0x900000001e88db8
    dispatch_i__14ACE_TP_ReactorFP14ACE_Time_ValueR18ACE_TP_Token_Guard() at 0x900000001e88c48
    handle_events__14ACE_TP_ReactorFP14ACE_Time_Value() at 0x900000001e88b78
    handle_events__11ACE_ReactorFP14ACE_Time_Value() at 0x900000001e1a400
    run__12TAO_ORB_CoreFP14ACE_Time_Valuei() at 0x90000000173fec0
    run__9CORBA_ORBFP14ACE_Time_Value() at 0x9000000017ac314
    svc__6WorkerFv() at 0x100131938
    svc_run__13ACE_Task_BaseFPv() at 0x900000001e31b5c
    invoke_i__18ACE_Thread_AdapterFv() at 0x900000001e10e1c
    invoke__18ACE_Thread_AdapterFv() at 0x900000001e10cfc
    ace_thread_adapter() at 0x900000001db9d44

    **********************************************

    What does this glink64.select() mean?
    The process hangs here for a long time. Sometimes it comes out of the hang, and sometimes it does not.
    Please help me resolve this issue.

    Thanks
    Krishna

    #AIX-Forum


  • 2.  Sorry but I think you need to call UNIX Support

    Posted Thu December 14, 2006 07:12 AM

    Originally posted by: nagger


    After taking a look, I have not found any answers to your question.

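    As far as I know, glink64 is the 64-bit global linkage (glue) code that the AIX linker inserts for calls into shared libraries, so glink64.select() is most likely just the stub through which your code reaches select(); it is not specific to any one compiler.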

    Deep, internals-level problem determination like this is very hard on a forum unless someone says "ah, I hit that exact problem myself".

    I can only suggest that you raise a UNIX Support call and have them take a look. They may ask for all sorts of extra information and command output to investigate further.

    Sorry I don't have a better answer, N
    #AIX-Forum
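
    A note for readers following the stack above: thread $t51 is sitting in select(), called from the ACE reactor. As a rough illustration of why such a call can block for an arbitrarily long time, here is a minimal sketch using plain POSIX select(). It is not the ACE or TAO code. wait_readable is a hypothetical helper made up for this example, and whether the reactor in the original post was actually called without a timeout is an assumption, not something the trace confirms.

```cpp
#include <sys/select.h>
#include <sys/time.h>
#include <unistd.h>

// Hypothetical helper (not part of ACE or TAO): wait until fd is readable.
// timeout_ms < 0 means "no timeout": select() gets a null timeout pointer
// and blocks until the descriptor is ready or a signal interrupts the call.
// Returns 1 if fd is readable, 0 on timeout, -1 on error.
int wait_readable(int fd, long timeout_ms) {
    fd_set read_fds;
    FD_ZERO(&read_fds);
    FD_SET(fd, &read_fds);

    timeval tv;
    timeval* tv_ptr = nullptr;  // null pointer => block indefinitely
    if (timeout_ms >= 0) {
        tv.tv_sec = timeout_ms / 1000;
        tv.tv_usec = (timeout_ms % 1000) * 1000;
        tv_ptr = &tv;
    }

    int rc = select(fd + 1, &read_fds, nullptr, nullptr, tv_ptr);
    if (rc > 0) {
        return FD_ISSET(fd, &read_fds) ? 1 : 0;
    }
    return rc;  // 0 on timeout, -1 on error (errno is set)
}
```

    With a bounded timeout the caller regains control even if no descriptor ever becomes ready. With a negative timeout_ms the call waits until a peer sends data or closes the connection, which from the outside looks like a thread parked in select().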