View cluster information

About this task

LSF provides commands for users to access information about the cluster.

Procedure

Cluster information includes the cluster master host, cluster name, cluster resource definitions, cluster administrator, and so on.

To view the ...

Run ...

Version of LSF

lsid

Cluster name

lsid

Current master host

lsid

Cluster administrators

lsclusters

Configuration parameters

bparams

LSF system runtime information

badmin showstatus

View LSF version, cluster name, and current master host

Procedure

Run lsid to display the version of LSF, the name of your cluster, and the current master host. For example:
lsid
IBM Platform LSF Standard 9.1.2, May 5 2013 
© Copyright IBM Corporation 1992, 2013. 
US Governmant Users Restricted Rights - Use, duplication or disclosure restricted
  by GSA ADP Schedule Contract with IBM Corp.
My cluster name is lsf91_bw3 
My master name is delpe04.lsf.ibm.com

View cluster administrators

Procedure

Run lsclusters to find out who your cluster administrator is and see a summary of your cluster:
lsclusters
CLUSTER_NAME   STATUS   MASTER_HOST    ADMIN        HOSTS     SERVERS
cluster1       ok       hostA          lsfadmin     6         6

If you are using the LSF MultiCluster product, you can see one line for each of the clusters that your local cluster is connected to in the output of lsclusters.

View configuration parameters

Procedure

  1. Run bparams to display the generic configuration parameters of LSF. These include default queues, job dispatch interval, job checking interval, and job accepting interval.
    bparams
    Default Queues:  normal idle
    MBD_SLEEP_TIME used for calculations: 20 seconds
    Job Checking Interval:  15 seconds
    Job Accepting Interval:  20 seconds
  2. Run bparams -l to display the information in long format, which gives a brief description of each parameter and the name of the parameter as it appears in lsb.params.
    bparams -l
    System default queues for automatic queue selection:
        DEFAULT_QUEUE = normal idle
    Amount of time in seconds used for calculating parameter values:
        MBD_SLEEP_TIME = 20 (seconds)
    The interval for checking jobs by slave batch daemon:
        SBD_SLEEP_TIME = 15 (seconds)
    The interval for a host to accept two batch jobs subsequently:
        JOB_ACCEPT_INTERVAL = 1 (* MBD_SLEEP_TIME)
    The idle time of a host for resuming pg suspended jobs:
        PG_SUSP_IT = 180 (seconds)
    The amount of time during which finished jobs are kept in core:
        CLEAN_PERIOD = 3600 (seconds)
    The maximum number of finished jobs that are logged in current event file:
        MAX_JOB_NUM = 2000
    The maximum number of retries for reaching a slave batch daemon:
        MAX_SBD_FAIL = 3
    The number of hours of resource consumption history:
        HIST_HOURS = 5
    The default project assigned to jobs.
        DEFAULT_PROJECT = default
    Sync up host status with master LIM is enabled:
    LSB_SYNC_HOST_STAT_LIM = Y
    MBD child query processes will only run on the following CPUs:
    MBD_QUERY_CPUS=1 2 3
  3. Run bparams -a to display all configuration parameters and their values in lsb.params.
    For example:
    bparams -a
         MBD_SLEEP_TIME = 20
         SBD_SLEEP_TIME = 15
         JOB_ACCEPT_INTERVAL = 1
         SUB_TRY_INTERVAL = 60
         LSB_SYNC_HOST_STAT_LIM =  N
         MAX_JOBINFO_QUERY_PERIOD = 2147483647
         PEND_REASON_UPDATE_INTERVAL = 30
    			. . .

View daemon parameter configuration

Before you begin

Log on to a server host.

Procedure

  1. Display all configuration settings for running LSF daemons.
    • Run lsadmin showconf to display all configured parameters and their values in lsf.conf or ego.conf for LIM.

    • Run badmin showconf mbd or badmin showconf sbd to display all configured parameters and their values in lsf.conf or ego.conf for mbatchd and sbatchd.

    In a MultiCluster environment, the parameters apply to the local cluster only.

  2. Display mbatchd and root sbatchd configuration.
    • Run badmin showconf mbd to display the parameters configured in lsf.conf or ego.conf that apply to mbatchd.

    • Run badmin showconf sbd to display the parameters configured in lsf.conf or ego.conf that apply to root sbatchd.

Examples

  • Show mbatchd configuration:

    badmin showconf mbd
    MBD configuration at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
  • Show sbatchd configuration on a specific host:

    badmin showconf sbd hosta
    SBD configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ,,,
  • Show sbatchd configuration for all hosts:

    badmin showconf sbd all
    SBD configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    SBD configuration for host <hostb> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
  • Show lim configuration:

    lsadmin showconf lim
    LIM configuration at Fri Jun 8 10:27:52 CST 2010
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
  • Show lim configuration for a specific host:

    lsadmin showconf lim hosta
    LIM configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
  • Show lim configuration for all hosts:

    lsadmin showconf lim all
    LIM configuration for host <hosta> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...
    LIM configuration for host <hostb> at Fri Jun 8 10:27:52 CST 2011
         LSB_SHAREDIR=/scratch/dev/lsf/user1/0604/work
         LSF_CONFDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_LOG_MASK=LOG_WARNING
         LSF_ENVDIR=/scratch/dev/lsf/user1/0604/conf
         LSF_EGO_DAEMON_CONTROL=N
         ...

View runtime cluster summary information

Procedure

Run badmin showstatus to display a summary of the current LSF runtime information about the whole cluster, including information about hosts, jobs, users, user groups, and mbatchd startup and reconfiguration:
% badmin showstatus

LSF runtime mbatchd information
    Available local hosts (current/peak):
        Clients:                0/0
        Servers:                8/8
           CPUs:                14/14
           Cores:               50/50
           Slots:               50/50

    Number of servers:          8
        Ok:                     8
        Closed:                 0
        Unreachable:            0
        Unavailable:            0

    Number of jobs:             7
        Running:                0
        Suspended:              0
        Pending:                0
        Finished:               7

    Number of users:            3
    Number of user groups:      1
    Number of active users:     0

    Latest mbatchd start:       Thu Nov 22 21:17:01 2012
    Active mbatchd PID:         26283
 
    Latest mbatchd reconfig:    Thu Nov 22 21:18:06 2012
 
    mbatchd restart information
        New mbatchd started:    Thu Nov 22 21:18:21 2012
        New mbatchd PID:        27474