Hp XC System 2.x Software Manuel d'utilisateur Page 85

  • Télécharger
  • Ajouter à mon manuel
  • Imprimer
  • Page
    / 154
  • Table des matières
  • MARQUE LIVRES
  • Noté. / 5. Basé sur avis des utilisateurs
Vue de la page 84
7
Using LSF
The Load Sharing Facility (LSF) from Platform Computing Corporation is a batch system
resource manager used on the HP XC system. LSF is included with HP XC, and is an integral
part of the HP XC environment. On an HP XC system, a job is s ubm itted to LSF, which
places the job in a queue and allows it to run when the necessary resources become available.
In addition to launching jobs, LSF provides extensive job m anagemen t and information
capabilities. LSF schedules, launches, controls, and tracks jobs that are submitted to it according
to the polic ies established by the HP XC s it e administrator.
This chapter describes the functionality ofLSFinanHPXCsystem,anddiscusseshow
to use some basic LSF commands to submit jobs, manage jobs, and access job information.
The following topics are discussed:
Introduction to LSF on HP XC (Section 7.1)
Determining the LSF execution host (Section 7.2)
Determining available LSF resources (Section 7.3)
SubmittingjobstoLSF(Section7.4)
Getting information about LSF jobs (S ection 7.5)
Working interactively within an LSF-HPC allocation (Section 7.6)
LSF Equivalents of SLURM options (Section 7.7)
For full infor m a tio n about L SF, refer t o the stan dar d LSF docu m en tati on set, which is describ ed
in the Related Information section of this manual. LSF m anpages are also available online on
the HP XC system.
7.1 Introduction to LSF in the HP XC Environment
This section introd uces you to LSF in the HP XC environment. It provides an overview of how
LSF works, and discusses some of the features and differences of standard LSF compared to
LSFonanHPXCsystem.Thissectionalsocontains an important discussion of how LSF and
SLURM wo rk together to prov ide the HP XC job manageme nt environment. A description of
SLURM is provided in C hapter 6.
7.1.1 Overview of LSF
LSF is a batch system resource manager. In the HP XC environment, LSF manages just one
resource the total number of HP XC processors designated for batch processing. The HP
XC system is based on dedicating processors to jobs, and LS F is implemented to use these
processors in the most efficient manner.
As jobs are submitted to LSF, LSF places the jobs in queues and determines an overall priority
for launching the jobs. W hen the required number of HP XC processors become available to
launch the next job, LSF reserves t hem and launch es t he job on these processors. When a job is
completed, LSF returns job output, job information, and any errors.
A standard LSF installation on an HP XC system would consist of LSF daemons running
on every node and providing activity and resource information for each node. LSF-HPC for
SLURM on an HP XC system consists of one node running LSF-HPC d aemo ns, and these
daemons commu nicate with SLURM for resou rce information about the other nodes. LSF-HPC
consolidates this resource information i nto one "virtual" n ode. Thus LSF-HPC integrated with
Using LSF 7-1
Vue de la page 84
1 2 ... 80 81 82 83 84 85 86 87 88 89 90 ... 153 154

Commentaires sur ces manuels

Pas de commentaire