AIX

AIX

Connect with fellow AIX users and experts to gain knowledge, share insights, and solve problems.


#Power
 View Only
  • 1.  Issue with WPAR mobility

    Posted Wed September 18, 2013 03:31 AM

    Originally posted by: 29X4_Hiroyuki_Tanaka


    Hi,
    I have problem with WPAR mobility on AIX 7.1 + WPAR Manager 2.3. I'm trying checkpoint / restart by CLI on same server. Filesystem is on NFS, and the NFS server is running on another LPAR. I can checkpoint the WPAR, but I cannot restart the WPAR with following error.

    >0973-004 Error reading checkpoint file errno = 14 (9000000000003) [10158290 18:9:2013 14:52:18]

    Is there any clue to solve this problem ? Any advice is appreciated.

    Here is what I tried. First, I created WPAR like this and succeed.

    • eaix71[/]# mkwpar -c -M directory=/ vfs=nfs dev=/export host=aaix71 -M vfs=directory directory=/var -M vfs=directory directory=/tmp -M vfs=directory directory=/home -M vfs=directory directory=/usr -M vfs=directory directory=/opt -n wpar1 -N interface=en0 address=192.168.93.33 netmask=255.255.255.0

    Then confirmed with lswpar.

    • eaix71[/]# lswpar
    • Name   State  Type  Hostname  Directory     RootVG WPAR
    • --------------------------------------------------------
    • wpar1  D      S     wpar1     /wpars/wpar1  no


    I can clogin to the WPAR.

    • eaix71[/]# startwpar: 0960-229 ATTENTION: Previous workload partition operation chkptwpar did not complete.
    • Starting workload partition wpar1.
    • Mounting all workload partition file systems.
    • Loading workload partition.
    • 1020-285 The MCR kernel extension sucessfully loaded
    • Exporting workload partition devices.
    • Exporting workload partition kernel extensions.
    • Starting workload partition subsystem cor_wpar1.
    • 0513-059 The cor_wpar1 Subsystem has been started. Subsystem PID is 7864362.
    • Verifying workload partition startup.
    • eaix71[/]# clogin wpar1
    • *******************************************************************************
    • *                                                                             *
    • *                                                                             *
    • *  Welcome to AIX Version 7.1!                                                *
    • *                                                                             *
    • *                                                                             *
    • *  Please see the README file in /usr/lpp/bos for information pertinent to    *
    • *  this release of the AIX Operating System.                                  *
    • *                                                                             *
    • *                                                                             *
    • *******************************************************************************
    • Last login: Wed Sep 18 14:36:58 JST 2013 on /dev/Global from eaix71
    •  
    • # exit


    Then, I checkpoint the WPAR, and succeeded
     

    • eaix71[/]# chkptwpar -k -d /wpars/wpar1/chkptdir -o /wpars/wpar1/chkpt.log -l debug wpar1
    • 1020-191 WPAR wpar1 was checkpointed in /wpars/wpar1/chkptdir.
    • Stopping workload partition wpar1.
    • Stopping workload partition subsystem cor_wpar1.
    • 0513-004 The Subsystem or Group, cor_wpar1, is currently inoperative.
    • Shutting down all workload partition processes.
    • Unmounting all workload partition file systems.
    • 1020-186 chkptwpar command succeeded


    But when I restart this WPAR on same server, I got the error.

    • eaix71[/]# restartwpar -d /wpars/wpar1/chkptdir -o /wpars/wpar1/restart.log
    • Starting workload partition wpar1.
    • Mounting all workload partition file systems.
    • Loading workload partition.
    • Exporting workload partition devices.
    • Exporting workload partition kernel extensions.
    • Starting workload partition subsystem cor_wpar1.
    • 0513-059 The cor_wpar1 Subsystem has been started. Subsystem PID is 7274568.
    • 0973-004 Error reading checkpoint file errno = 14 (9000000000003) [10158290 18:9:2013 14:52:18]
    • 1020-252 Acting thread TID:28967065 of process PID:10158290 has died unexpectedly [00.459.1016] [10158290 18:9:2013 14:52:18]
    • Stopping workload partition wpar1.
    • Stopping workload partition subsystem cor_wpar1.
    • 0513-004 The Subsystem or Group, cor_wpar1, is currently inoperative.
    • Shutting down all workload partition processes.
    • Unmounting all workload partition file systems.
    • 1020-187 restartwpar command failed.
    • eaix71[/]#


    Following is information about my environment.

    eaix71[/]# oslevel -s
    7100-02-02-1316

    eaix71[/]# lslpp -L | grep -i mcr
      mcr.rte                   7.1.2.15    C     F    Metacluster Checkpoint and

    eaix71[/]# lslpp -L | grep -i wpar
      bos.wpars                 7.1.2.15    C     F    AIX Workload Partitions
      wparmgt.agent.rte          2.3.1.1    A     F    Workload Partitions Manager

    eaix71[/]# lswpar -L wpar1
    =================================================================
    wpar1 - Defined
    =================================================================
    GENERAL
    Type:                    S
    RootVG WPAR:             no
    Owner:                   root
    Hostname:                wpar1
    WPAR-Specific Routing:   no
    Virtual IP WPAR:
    Directory:               /wpars/wpar1
    Start/Stop Script:
    Auto:                    no
    Private /usr:            yes
    Checkpointable:          yes
    Application:

    OStype:                  0
    Cross-WPAR IPC:          no
    Architecture:            none
    UUID:                    162e2172-11fc-4dfe-a7b8-a8851dd7bdee

    NETWORK
    Interface     Address(6)        Mask/Prefix       Broadcast
    -----------------------------------------------------------------
    en0           192.168.93.33     255.255.255.0     192.168.93.255

    USER-SPECIFIED ROUTES
    Type    Destination          Gateway           Interface     Family
    -----------------------------------------------------------------

    FILE SYSTEMS
    MountPoint               Device           Vfs     Nodename   Options
    -----------------------------------------------------------------
    /wpars/wpar1             /export          nfs     aaix71     bg,intr
    /wpars/wpar1/proc        /proc            namefs             rw

    RESOURCE CONTROLS
    Active:                             yes
    Resource Set:
    CPU Shares:
    CPU Limits:
    Memory Shares:
    Memory Limits:
    Per-Process Virtual Memory Limit:   unlimited
    Total Virtual Memory Limit:         unlimited
    Total Processes:
    Total Threads:
    Total PTYs:
    Total Large Pages:
    Max Message Queue IDs:
    Max Semaphore IDs:
    Max Shared Memory IDs:
    Max Pinned Memory:

    OPERATION
    Operation:    none
    Process ID:
    Start Time:

    SECURITY SETTINGS
    Privileges:   PV_AU_,PV_AU_ADD,PV_AU_ADMIN,PV_AU_PROC,PV_AU_READ,
                  PV_AU_WRITE,PV_AZ_ADMIN,PV_AZ_CHECK,PV_AZ_READ,PV_AZ_ROOT,
                  PV_DAC_,PV_DAC_GID,PV_DAC_O,PV_DAC_R,PV_DAC_RID,PV_DAC_UID,
                  PV_DAC_W,PV_DAC_X,PV_DEV_CONFIG,PV_DEV_QUERY,PV_FS_CHOWN,
                  PV_FS_CHROOT,PV_FS_CNTL,PV_FS_LINKDIR,PV_FS_MKNOD,
                  PV_FS_MOUNT,PV_FS_PDMODE,PV_FS_QUOTA,PV_KER_ACCT,
                  PV_KER_CONF,PV_KER_DR,PV_KER_EWLM,PV_KER_EXTCONF,
                  PV_KER_IPC,PV_KER_IPC_O,PV_KER_IPC_R,PV_KER_IPC_W,
                  PV_KER_NFS,PV_KER_RAC,PV_KER_RAS_ERR,PV_KER_REBOOT,
                  PV_NET_PORT,PV_PROC_CKPT,PV_PROC_CORE,PV_PROC_CRED,
                  PV_PROC_ENV,PV_PROC_PRIO,PV_PROC_PDMODE,PV_PROC_RAC,
                  PV_PROC_RTCLK,PV_PROC_SIG,PV_PROC_TIMER,PV_PROC_VARS,
                  PV_PROC_PRIV,PV_SU_UID,PV_TCB,PV_TP,PV_TP_SET,PV_MIC,
                  PV_MIC_CL,PV_LAB_,PV_LAB_CL,PV_LAB_CLTL,PV_LAB_LEF,
                  PV_LAB_SLDG,PV_LAB_SLDG_STR,PV_LAB_SL_FILE,PV_LAB_SL_PROC,
                  PV_LAB_SL_SELF,PV_LAB_SLUG,PV_LAB_SLUG_STR,PV_LAB_TL,
                  PV_MAC_,PV_MAC_CL,PV_MAC_R,PV_MAC_R_CL,PV_MAC_R_STR,
                  PV_MAC_R_PROC,PV_MAC_W,PV_MAC_W_CL,PV_MAC_W_DN,PV_MAC_W_UP,
                  PV_MAC_W_PROC,PV_MAC_OVRRD,PV_KER_SECCONFIG,
                  PV_PROBEVUE_TRC_USER,PV_PROBEVUE_TRC_USER_SELF,PV_KER_LVM,
                  PV_WPAR_DEV_LOAD

    DEVICE EXPORTS
    Name               Type     Virtual Device     RootVG   Status
    -----------------------------------------------------------------
    /dev/null          pseudo                               ALLOCATED
    /dev/tty           pseudo                               ALLOCATED
    /dev/console       pseudo                               ALLOCATED
    /dev/zero          pseudo                               ALLOCATED
    /dev/clone         pseudo                               ALLOCATED
    /dev/sad           clone                                ALLOCATED
    /dev/xti/tcp       clone                                ALLOCATED
    /dev/xti/tcp6      clone                                ALLOCATED
    /dev/xti/udp       clone                                ALLOCATED
    /dev/xti/udp6      clone                                ALLOCATED
    /dev/xti/unixdg    clone                                ALLOCATED
    /dev/xti/unixst    clone                                ALLOCATED
    /dev/error         pseudo                               ALLOCATED
    /dev/errorctl      pseudo                               ALLOCATED
    /dev/audit         pseudo                               ALLOCATED
    /dev/nvram         pseudo                               ALLOCATED
    /dev/kmem          pseudo                               ALLOCATED

    KERNEL EXTENSIONS
    EXTENSION NAME                         Local   Major   Status
    -----------------------------------------------------------------

    eaix71[/]#


    #AIX-Forum