HWRF/scripts/run_8py_source.html

 """!A shell-like syntax for running serial, MPI and OpenMP programs.


 This module implements a shell-like syntax for launching MPI and

 non-MPI programs from Python.  It recognizes three types of

 executables: mpi, "small serial" (safe for running on a batch node)

 and "big serial" (which should be run via aprun if applicable).  There

 is no difference between "small serial" and "big serial" programs

 except on certain architectures (like Cray) where the job script runs

 on a heavily-loaded batch node and has compute nodes assigned for

 running other programs.


 @section progtype Program Types


 There are three types of programs: mpi, serial and "big non-MPI."  A

 "big" executable is one that is either OpenMP, or is a serial program

 that cannot safely be run on heavily loaded batch nodes.  On Cray

 architecture machines, the job script runs on a heavily-populated

 "batch" node, with some compute nodes assigned for "large" programs.

 In such environments, the "big" executables are run on compute nodes

 and the small ones on the batch node.


 * mpi('exename') = an executable "exename" that calls MPI_Init and

     MPI_Finalize exactly once each, in that order.

 * exe('exename') = a small non-MPI program safe to run on a batch node

 * bigexe('exename') = a big non-MPI program that must be run on a

     compute node it may or may not use other forms of parallelism


 You can also make reusable aliases to avoid having to call those

 functions over and over (more on that later).  Examples:


 * Python:   wrf=mpi('./wrf.exe')

 * Python:   lsl=alias(exe('/bin/ls')['-l'].env(LANG='C',LS_COLORS='never'))


 Those can then be reused later on as if the code is pasted in, similar

 to a shell alias.


 @section serexs Serial Execution Syntax


 Select your serial programs by exe('name') for small serial programs

 and bigexe('name') for big serial programs.  The return value of those

 functions can then be used with a shell-like syntax to specify

 redirection and piping.  Example:


 *  shell version: ls -l / | wc -l

 *  Python version: run(exe('ls')['-l','/'] | exe('wc')['-l'])


 Redirection syntax similar to the shell (< > and << operators):

 @code

   run( ( exe('myprogram')['arg1','arg2','...'] < 'infile' ) > 'outfile')

 @endcode


 Note the extra set of parentheses: you cannot do "exe('prog') < infile

 > outfile" because of the order of precedence of Python operators


 Append also works:

 @code

   run(exe('myprogram')['arg1','arg2','...'] >> 'appendfile')

 @endcode


 You can also send strings as input with <<

 @code

   run(exe('myprogram')['arg1','arg2','...'] << 'some input string')

 @endcode


 One difference from shells is that < and << always modify the

 beginning of the pipeline:


 * shell: cat < infile | wc -l

 * Python #1: ( exe('cat') < 'infile' ) | exe('wc')['-l']

 * Python #2: exe('cat') | ( exe('wc')['-l'] < 'infile' )


 Note that the last second one, equivalent to `cat|wc -l<infile`, would

 NOT work in a shell since you would be giving wc -l two inputs.


 @section parexs Parallel Execution Syntax


 Use mpi('exename') to select your executable, use [] to set arguments,

 use multiplication to set the number of ranks and use addition to

 combine different executables together into a multiple program

 multiple data (MPMD) MPI program.


 Run ten copies of ls -l:

 @code

   run(mpirun(mpiserial(('ls')['-l'])*10))

 @endcode


 Run HyCOM coupled HWRF: one wm3c.exe, 30 hycom.exe and 204 wrf.exe:

 @code

   run(mpirun(mpi('wm3c.exe') + mpi('hycom.exe')*30 + mpi('wrf.exe')*204))

 @endcode


 You can set environment variables, pipe MPI output and handle

 redirection using the mpirun() function, which converts MPI programs

 into an bigexe()-style object (Runner):


 Shell version:

 @code{.unformatted}

     result=$( mpirun -n 30 hostname | sort -u | wc -l )

 @endcode


 Python version:

 @code

     result=runstr( mpirun(mpi('hostname')*30) | exe['sort']['-u'] | exe['wc']['-l'] )

 @endcode


 @section aliases Aliases


 If you find yourself frequently needing the same command, or you need

 to store a command for multiple uses, then then you should define an

 alias.  Let's say you want "long output" format Japanese language "ls"

 output:


 @code

   exe('ls')['-l','/path/to/dir'].env(LANG='JP')

 @endcode


 but you find yourself running that on many different directories.

 Then you may want to make an alias:


 @code

   jplsl=alias(exe('ls')['-l'].env(LANG='JP'))

 @endcode


 The return value jplsl can be treated as an exe()-like return value

 since it was from exe() originally, but any new arguments will be

 appended to the original set:


 @code

   run(jplsl['/path/to/dir'])

 @endcode


 Note that if we did this:

 @code

   badlsl=exe('ls')['-l'].env(LANG='JP')  # Bad! No alias!

   run(badlsl['/'])  # will list /

   run(badlsl['/home'])  # will list / and /home

   run(badlsl['/usr/bin']) # will list / /home and /usr/bin


   goodlsl=alias(exe('ls')['-l'].env(LANG='JP')

   run(goodlsl['/'])  # will list /

   run(goodlsl['/home'])  # will list /home

   run(goodlsl['/usr/bin']) # will list /usr/bin

 @endcode


 Then the run(badlsl['/home']) would list /home AND / which is NOT what

 we want.  Why does it do that?  It is because badlsl is not an alias

 --- it is a regular output from exe(), so every time we call its []

 operator, we add an argument to the original command.  When we call

 alias() it returns a copy-on-write version (goodlsl), where every call

 to [] creates a new object.


 Note that alias() also works with pipelines, but most operations will

 only modify the last the command in the pipeline (or the first, for

 operations that change stdin).

 """


 import time, logging

 import produtil.mpi_impl as mpiimpl

 import produtil.sigsafety

 import produtil.prog as prog

 import produtil.mpiprog as mpiprog

 import produtil.pipeline as pipeline


 ##@var __all__

 # List of symbols exported by "from produtil.run import *"

 __all__=['alias','exe','run','runstr','mpi','mpiserial','mpirun',

          'runbg','prog','mpiprog','mpiimpl','waitprocs','runsync',

          'InvalidRunArgument','ExitStatusException','checkrun',

          'batchexe','bigexe']


 ##@var module_logger

 # Default logger used by some functions if no logger is given

 module_logger=logging.getLogger('produtil.run')


 class InvalidRunArgument(prog.ProgSyntaxError):

     """!Raised to indicate that an invalid argument was sent into one

     of the run module functions."""


 class ExitStatusException(Exception):

     """!Raised to indicate that a program generated an invalid return

     code.


     Examine the "returncode" member variable for the returncode value.

     Negative values indicate the program was terminated by a signal

     while zero and positive values indicate the program exited.  The

     highest exit status of the pipeline is returned when a pipeline is

     used.


     For MPI programs, the exit status is generally unreliable due to

     implementation-dependent issues, but this package attempts to

     return the highest exit status seen.  Generally, you can count on

     MPI implementations to return zero if you call MPI_Finalize() and

     exit normally, and non-zero if you call MPI_Abort with a non-zero

     argument.  Any other situation will produce unpredictable results."""

     ##@var message

     # A string description for what went wrong


     ##@var returncode

     # The return code, including signal information.


     def __init__(self,message,status):

         """!ExitStatusException constructor

         @param message a description of what went wrong

         @param status the exit status"""

         self.message=message

         self.returncode=status


     @property

     def status(self):

         """!An alias for self.returncode: the exit status."""

         return self.returncode


     def __str__(self):

         """!A string description of the error."""

         return '%s (returncode=%d)'%(str(self.message),int(self.returncode))

     def __repr__(self):

         """!A pythonic description of the error for debugging."""

         return 'NonZeroExit(%s,%s)'%(repr(self.message),repr(self.returncode))


 def alias(arg):

     """!Attempts to generate an unmodifiable "copy on write" version

     of the argument.  The returned copy will generate a modifiable

     duplicate of itself if you attempt to change it.

     @returns a produtil.prog.ImmutableRunner

     @param arg a produtil.prog.Runner or produtil.prog.ImmutableRunner"""

     if isinstance(arg,prog.Runner):

         return prog.ImmutableRunner(arg)

     elif isinstance(arg,mpiprog.MPIRanksBase):

         arg.make_runners_immutable()

         return arg

     else:

         raise InvalidRunArgument('Arguments to alias() must be Runner objects (such as from exe()) or MPIRanksBase objects (such as from mpi() or mpiserial()).  Got: %s'%(repr(arg),))


 def batchexe(name,**kwargs):

     """!Returns a prog.ImmutableRunner object that represents a small

     serial program that can be safely run on a busy batch node.

     @param name the executable name or path

     @param kwargs passed to produtil.prog.Runner.__init__

     @returns a new produtil.prog.ImmutableRunner"""

     return prog.ImmutableRunner([str(name)],**kwargs)


 def exe(name,**kwargs):

     """!Returns a prog.ImmutableRunner object that represents a large

     serial program that must be run on a compute node.

     @note This function does NOT search $PATH on Cray.  That ensures

       the $PATH will be expanded on the compute node instead.  Use

       produtil.fileop.find_exe() if you want to explicitly search the

       PATH before execution.

     @param name the executable name or path

     @param kwargs passed to produtil.prog.Runner.__init__

     @returns a new produtil.prog.ImmutableRunner"""

     return mpiimpl.make_bigexe(str(name),**kwargs)


 def bigexe(name,**kwargs):

     """!Alias for exe() for backward compatibility.  Use exe() instead."""

     return exe(name,**kwargs)


 def mpirun(arg,**kwargs):

     """!Converts an MPI program specification into a runnable shell

     program suitable for run(), runstr() or checkrun().


     Options for kwargs:

     * allranks=True --- to run on all available MPI ranks.  This cannot be

       used if a specific number of ranks (other than 1) was requested in

       the arg.

     * logger=L --- a logging.Logger for log messages

     * Other platform-specific arguments.  See produtil.mpi_impl for details.


     @param arg the mpiprog.MPIRanksBase describing the MPI program to

     run.  This is the output of the mpi() or mpiserial() function.

     @param kwargs additional arguments to control output.

     @returns a prog.Runner object for the specified

     mpiprog.MPIRanksBase object."""

     return mpiimpl.mpirunner(arg,**kwargs)


 def make_pipeline(arg,capture,**kwargs):

     """!This internal implementation function generates a

     prog.PopenCommand object for the specified input, which may be a

     prog.Runner or mpiprog.MPIRanksBase.

     @param arg the produtil.prog.Runner to convert.  This is the

       output of exe(), bigexe() or mpirun()

     @param capture if True, capture the stdout into a string

     @param kwargs additional keyword arguments, same as for  mpirun()"""

     if isinstance(arg,prog.Runner):

         runner=arg

     elif isinstance(arg, mpiprog.MPIRanksBase):

         runner=mpiimpl.mpirunner(arg,**kwargs)

     else:

         raise InvalidRunArgument(

             'Can only run a Runner object (such as from exe()) or an '

             'MPIRanksBase object (such as from mpi() or mpiserial()).  '

             'Got: %s'%(repr(arg),))

     logger=None

     if 'logger' in kwargs: logger=kwargs['logger']

     if logger is not None:

         logger.info('Starting: %s'%(repr(arg),))

         if capture: logger.info('  - and will capture output.')

     pl=pipeline.Pipeline(runner,capture=capture,logger=logger)

     if logger is not None:

         logger.debug('Pipeline is %s'%(repr(pl),))

     return pl


 def runbg(arg,capture=False,**kwargs):

     """!Not implemented: background execution


     Runs the specified process in the background.  Specify

     capture=True to capture the command's output.  Returns a

     produtil.prog.PopenCommand.  Call poll() to determine process

     completion, and use the stdout_data property to get the output

     after completion, if capture=True was specified.


     @bug produtil.run.runbg() is not implemented


     @warning this is not implemented


     @param arg the produtil.prog.Runner to execute (output of

       exe(), bigexe() or mpirun()

     @param capture if True, capture output

     @param kwargs same as for mpirun()"""

     p=make_pipeline(arg,capture,**kwargs)

     p.background()

     return p


 def waitprocs(procs,logger=None,timeout=None,usleep=1000):

     """!Not implemented: background process monitoring


     Waits for one or more backgrounded processes to complete.  Logs to

     the specified logger while doing so.  If a timeout is specified,

     returns False after the given time if some processes have not

     returned.  The usleep argument is the number of microseconds to

     sleep between checks (can be a fraction).  The first argument,

     procs specifies the processes to check.  It must be a

     produtil.prog.Pipeline (return value from runbg) or an iterable

     (list or tuple) of such.


     @bug produtil.run.waitprocs() is untested


     @warning This is not tested and probably does not work.


     @param procs the processes to watch

     @param logger the logging.Logger for log messages

     @param timeout how long to wait before giving up

     @param usleep sleep time between checks"""

     p=set()

     if isinstance(procs,produtil.prog.PopenCommand):

         p.add(procs)

     else:

         for pp in procs:

             p.add(pp)

     if logger is not None: logger.info("Wait for: %s",repr(p))

     while p: # keep looping as long as there are unfinished processes

         p2=set()

         for proc in p:

             ret=proc.poll()

             if ret is not None:

                 if logger is not None:

                     logger.info("%s returned %s"%(repr(proc),repr(ret)))

             elif logger is not None and usleep>4.99999e6:

                 # babble about running processes if the sleep time is long.

                 logger.info("%s is still running"%(repr(proc),))

                 p2.add(proc)

         p=p2


         if not p: break # done! no need to sleep...


         if usleep>4.99999e6 and logger is not None:

             # babble about sleeping if the sleep time is 5sec or longer:

             logger.info("... sleep %f ..."%(float(usleep/1.e6),))

         time.sleep(usleep/1.e6)

     return False if(p) else True


 def runsync(logger=None):

     """!Runs the "sync" command as an exe()."""

     return mpiimpl.runsync(logger=logger)


 def run(arg,logger=None,sleeptime=None,**kwargs):

     """!Executes the specified program and attempts to return its exit

     status.  In the case of a pipeline, the highest exit status seen

     is returned.  For MPI programs, exit statuses are unreliable and

     generally implementation-dependent, but it is usually safe to

     assume that a program that runs MPI_Finalize() and exits normally

     will return 0, and anything that runs MPI_Abort(MPI_COMM_WORLD)

     will return non-zero.  Programs that exit due to a signal will

     return statuses >255 and can be interpreted with WTERMSIG,

     WIFSIGNALLED, etc.

     @param arg the produtil.prog.Runner to execute (output of

       exe(), bigexe() or mpirun()

     @param logger a logging.Logger to log messages

     @param sleeptime time to sleep between checks of child process

     @param kwargs ignored"""

     p=make_pipeline(arg,False,logger=logger)

     p.communicate(sleeptime=sleeptime)

     result=p.poll()

     if logger is not None:

         logger.info('  - exit status %d'%(int(result),))

     return result


 def checkrun(arg,logger=None,**kwargs):

     """!This is a simple wrapper round run that raises

     ExitStatusException if the program exit status is non-zero.


     @param arg the produtil.prog.Runner to execute (output of

       exe(), bigexe() or mpirun()

     @param logger a logging.Logger to log messages

     @param kwargs The optional run=[] argument can provide a different

     list of acceptable exit statuses."""

     r=run(arg,logger=logger)

     if kwargs is not None and 'ret' in kwargs:

         if not r in kwargs['ret']:

             raise ExitStatusException('%s: unexpected exit status'%(repr(arg),),r)

     elif not r==0:

         raise ExitStatusException('%s: non-zero exit status'%(repr(arg),),r)

     return r


 def openmp(arg,threads=None):

     """!Sets the number of OpenMP threads for the specified program.


     @warning Generally, when using MPI with OpenMP, the batch system

     must be configured correctly to handle this or unexpected errors

     will result.


     @param arg The "arg" argument must be from mpiserial, mpi, exe or

     bigexe.


     @param threads The optional "threads" argument is an integer number of

     threads.  If it is not specified, the maximum possible number of

     threads will be used.  Note that using threads=None with

     mpirun(...,allranks=True) will generally not work unless the batch

     system has already configured the environment correctly for an

     MPI+OpenMP task with default maximum threads and ranks.

     @returns see run()"""

     return mpiimpl.openmp(arg,threads)


 def runstr(arg,logger=None,**kwargs):

     """!Executes the specified program or pipeline, capturing its

     stdout and returning that as a string.


     If the exit status is non-zero, then NonZeroExit is thrown.


     Example:

     @code

       runstr(exe('false'),ret=(1))

     @endcode


     succeeds if "false" returns 1, and raises ExitStatusError otherwise.


     @param arg The "arg" argument must be from mpiserial, mpi, exe or

     bigexe.

     @param logger a logging.Logger for logging messages

     @param kwargs You can specify an optional list or tuple "ret" that

     contains an alternative list of valid return codes.  All return

     codes are zero or positive: negative values represent

     signal-terminated programs (ie.: SIGTERM produces -15, SIGKILL

     produces -9, etc.) """

     p=make_pipeline(arg,True,logger=logger)

     s=p.to_string()

     r=p.poll()

     if kwargs is not None and 'ret' in kwargs:

         if not r in kwargs['ret']:

             raise ExitStatusException('%s: unexpected exit status'%(repr(arg),),r)

     elif not r==0:

         raise ExitStatusException('%s: non-zero exit status'%(repr(arg),),r)

     return s


 def mpi(arg,**kwargs):

     """!Returns an MPIRank object that represents the specified MPI

     executable.

     @param arg the MPI program to run

     @param kwargs logger=L for a logging.Logger to log messages"""

     return mpiprog.MPIRank(arg,**kwargs)


 def mpiserial(arg,**kwargs):

     """!Generates an mpiprog.MPISerial object that represents an MPI

     rank that executes a serial (non-MPI) program.  The given value

     MUST be from bigexe() or exe(), NOT from mpi().

     @param arg the MPI program to run

     @param kwargs logger=L for a logging.Logger to log messages"""

     return mpiprog.MPISerial(arg,**kwargs)

produtil.run.waitprocs
def waitprocs
Not implemented: background process monitoring.
Definition: run.py:324

produtil.run.mpirun
def mpirun(arg, kwargs)
Converts an MPI program specification into a runnable shell program suitable for run(), runstr() or checkrun().
Definition: run.py:258

produtil.sigsafety
Sets up signal handlers to ensure a clean exit.
Definition: sigsafety.py:1

produtil.pipeline.Pipeline
This class is a wrapper around launch and manage.
Definition: pipeline.py:564

produtil.run.checkrun
def checkrun(arg, logger=None, kwargs)
This is a simple wrapper round run that raises ExitStatusException if the program exit status is non-...
Definition: run.py:398

produtil.run.run
def run(arg, logger=None, sleeptime=None, kwargs)
Executes the specified program and attempts to return its exit status.
Definition: run.py:376

produtil.run.ExitStatusException.__init__
def __init__(self, message, status)
ExitStatusException constructor.
Definition: run.py:201

produtil.run.openmp
def openmp
Sets the number of OpenMP threads for the specified program.
Definition: run.py:415

produtil.prog
Implements the produtil.run: provides the object tree for representing shell commands.
Definition: prog.py:1

produtil.run.ExitStatusException.__str__
def __str__(self)
A string description of the error.
Definition: run.py:213

produtil.mpiprog.MPIRank
Represents a single MPI rank.
Definition: mpiprog.py:469

produtil.run.ExitStatusException.__repr__
def __repr__(self)
A pythonic description of the error for debugging.
Definition: run.py:216

produtil.mpi_impl
Converts a group of MPI ranks to a runnable command.
Definition: __init__.py:1

produtil.prog.ProgSyntaxError
Base class of exceptions raised when a Runner is given arguments that make no sense.
Definition: prog.py:46

produtil.run.alias
def alias(arg)
Attempts to generate an unmodifiable "copy on write" version of the argument.
Definition: run.py:220

produtil.run.ExitStatusException.returncode
returncode
The return code, including signal information.
Definition: run.py:206

produtil.run.InvalidRunArgument
Raised to indicate that an invalid argument was sent into one of the run module functions.
Definition: run.py:175

produtil.run.mpiserial
def mpiserial(arg, kwargs)
Generates an mpiprog.MPISerial object that represents an MPI rank that executes a serial (non-MPI) pr...
Definition: run.py:472

produtil.run.ExitStatusException
Raised to indicate that a program generated an invalid return code.
Definition: run.py:179

produtil.mpiprog.MPISerial
Represents a single rank of an MPI program that is actually running a serial program.
Definition: mpiprog.py:604

produtil.mpiprog
Object structure for describing MPI programs.
Definition: mpiprog.py:1

produtil.run.runbg
def runbg(arg, capture=False, kwargs)
Not implemented: background execution.
Definition: run.py:303

produtil.run.runstr
def runstr(arg, logger=None, kwargs)
Executes the specified program or pipeline, capturing its stdout and returning that as a string...
Definition: run.py:434

produtil.run.make_pipeline
def make_pipeline(arg, capture, kwargs)
This internal implementation function generates a prog.PopenCommand object for the specified input...
Definition: run.py:276

produtil.mpiprog.MPIRanksBase
This is the abstract superclass of all classes that represent one or more MPI ranks, including MPI ranks that are actually serial programs.
Definition: mpiprog.py:68

produtil.run.ExitStatusException.status
def status(self)
An alias for self.returncode: the exit status.
Definition: run.py:209

produtil.run.exe
def exe(name, kwargs)
Returns a prog.ImmutableRunner object that represents a large serial program that must be run on a co...
Definition: run.py:242

produtil.prog.Runner
Represents a single stage of a pipeline to execute.
Definition: prog.py:299

produtil.run.runsync
def runsync
Runs the "sync" command as an exe().
Definition: run.py:372

produtil.run.ExitStatusException.message
message
A string description for what went wrong.
Definition: run.py:205

produtil.pipeline
Internal module that launches and monitors processes.
Definition: pipeline.py:1

produtil.run.mpi
def mpi(arg, kwargs)
Returns an MPIRank object that represents the specified MPI executable.
Definition: run.py:465

produtil.prog.ImmutableRunner
An copy-on-write version of Runner.
Definition: prog.py:884

produtil.run.bigexe
def bigexe(name, kwargs)
Alias for exe() for backward compatibility.
Definition: run.py:254

produtil.run.batchexe
def batchexe(name, kwargs)
Returns a prog.ImmutableRunner object that represents a small serial program that can be safely run o...
Definition: run.py:234