Multithreading and MPI#
As typical operations on time-ordered data can be quite consuming, the LiteBIRD Simulation Framework provides a number of tools to exploit the presence of multiple CPU cores and even multiple computing nodes. This section details how to take advantage of these facilities and is split in two parts:
We will first present the ability of the framework to use multiple CPU threads; in this context, the data samples are kept in a chunk of memory that is shared between several processes. The framework uses Numba, which can take advantage either of the Intel Threading Building Blocks library or of OpenMP.
Then, we will discuss the possibility to run the code on multiple computing nodes, where the memory of each node is not shared with the others. The framework is able to use any MPI library, through the Python package mpi4py.
Multithreading#
Some parts of the LiteBIRD Simulation Framework are able to exploit multiple cores because several of its modules rely on the Numba library.
If you are running your code on your multi-core laptop, you do not have to do anything fancy in order to use all the CPUs on your machine: in its default configuration, the Framework should be able to take advantage all the available CPU cores.
However, if you want to tune the way the Framework uses the CPUs,
you can either set the environment variable OMP_NUM_THREADS
to the number of CPUs to use, or use two parameters in
the constructor of the class Simulation:
numba_num_of_threads: this is the number of CPUs that Numba will use for parallel calculations. The parameter defaults to
None, which means that Numba will check how many CPUs are available and will use all of them.numba_threading_layer: this parameter is a string that specifies which threading library should be used by Numba. The value depends both on the version of Numba you are running and on the availability of these libraries, as they are not installed together with the LiteBIRD Simulation Framework. Numba 0.53 provides the following choices:
tbb: Intel Threading Building Blocks. You should pick this if you are running your code on Intel machines and the Tbb library is available.omp: OpenMP. If you pick this one, be sure that the OpenPM library is available.workqueue: this is an internal threading library provided by Numba. It’s probably the least efficient of the three; its main advantage is that it is always available.
These parameters can be passed through a TOML parameter file (see Parameter files) as well:
# This is file "my_conf.toml"
[simulation]
random_seed = 12345
numba_num_of_threads = 32
numba_threading_layer = "tbb"
Both tbb and omp require that the relevant library be available on
your system, as the command pip install litebird_sim does not install
them. If you are running your code on a HPC cluster, it is probably a matter
of running a command like the following:
# This might change depending on how the environment on your cluster
# is configured; the following commands are just examples.
$ module load tbb # Intel Threading Building Blocks
$ module load openmp # OpenMP
MPI#
The LiteBIRD Simulation Framework lists mpi4py as an optional dependency. This means that simulation codes should be able to cope with the lack of MPI.
The framework can be forced to use MPI or not using the variable
LITEBIRD_SIM_MPI:
Set the variable to 1 or an empty value to force importing mpi4py;
Set the variable to 0 to avoid importing mpi4py;
If the variable is not set, the code will try to import mpi4py, but in case of error it will not complain and will silently shift to serial execution.
The framework provides a global variable, MPI_COMM_WORLD,
which is the same as mpi4py.MPI.COMM_WORLD if MPI is being used.
Otherwise, if MPI is not being used, it still contains the
following members:
rank (set to
0);size (set to
1).
Thus, the following code works regardless whether MPI is present or not:
import litebird_sim as lbs
if lbs.MPI_COMM_WORLD.rank == 0:
print("Hello, world!")
However, you can use MPI_COMM_WORLD to call MPI functions
only if MPI was actually enabled. You can check this using the Boolean
variable MPI_ENABLED:
import litebird_sim as lbs
comm = lbs.MPI_COMM_WORLD
if lbs.MPI_ENABLED:
comm.barrier()
To ensure that your code uses MPI in the proper way, you should always
use MPI_COMM_WORLD instead of importing mpi4py directly.
The simulation framework also provides a global object
MPI_COMM_GRID. It has two attributes:
COMM_OBS_GRID: This is an MPI communicator that contains all the MPI processes with the global rank less thann_blocks_time * n_blocks_det. It provides a safety net to the operations and MPI communications that are needed to be performed only on the partition ofMPI_COMM_WORLDthat contain non-zero number of pointings and TODs. By default,COMM_OBS_GRIDpoints to the global MPI communicatorMPI_COMM_WORLD. It is updated onceObservationare defined. For example, consider the case when a user runs the simulation with 10 MPI processes but due some specificdet_blocks_attributesargument inObservationclass, the number of detector and time blocks are determined to be 2 and 4 respectively. Then the simulation framework will store the pointings and TODs only on \(2\times4=8\) MPI processes and the last two ranks ofMPI_COMM_WORLDwill be left unused. Once this happens,COMM_OBS_GRIDon first 8 ranks (rank 0 to 7) will point to the local sub-communicator containing the processes with global rank 0 to 7. On the unused ranks, it will simply point to the NULL communicator.COMM_NULL: IfMPI_ENABLEDisTrue, this object points to a NULL MPI communicator (mpi4py.MPI.COMM_NULL). Otherwise it is set toNone. The user should compareCOMM_OBS_GRIDwithCOMM_NULLon every MPI process in order to avoid running a piece of code on unused MPI processes.
Enabling/disabling MPI#
The user can control whether MPI must be used or not in a script,
through the environment variable LITEBIRD_SIM_MPI (ENABLE_MPI
is accepted as well):
If the variable is set to the empty string or to
1,true,on,yes, thenmpi4pyis imported, and an exception is raised if this cannot be done (e.g., because it was not installed using the flag--extra mpiwhenuv syncwas called).If the variable is set to
0,false,offorno, thenmpi4pyis not imported, even if it is installed.If the variable is not set, then
mpi4pywill be imported, but any failure will be accepted and the framework will silently switch to serial mode.
Grasping how MPI is being used#
You will typically use MPI to spread TODs among many MPI processes, so that the simulation can span several detectors and a longer time scale. Unfortunately, this means that it’s often complicated to understand how data is being kept in memory.
If you use the Simulation object (and you should, you really
should!), you can call Simulation.describe_mpi_distribution() after
you have allocated the TODs via Simulation.create_observations(); it
will return an instance of the class MpiDistributionDescr, which
can be inspected and printed to the terminal. See Section Simulations
for more information about this.
API reference#
- litebird_sim.mpi.MPI_COMM_GRID = <litebird_sim.mpi._GridCommClass object>#
Global object with two attributes:
COMM_OBS_GRID: It is a partition ofMPI_COMM_WORLDthat includes all the MPI processes with global rank less thann_blocks_time * n_blocks_det. On MPI processes with higher ranks, it points to NULL MPI communicatormpi4py.MPI.COMM_NULL.COMM_NULL: IfMPI_ENABLEDisTrue, this object points to a NULL MPI communicator (mpi4py.MPI.COMM_NULL). Otherwise it isNone.
- litebird_sim.mpi.MPI_COMM_WORLD = <litebird_sim.mpi._SerialMpiCommunicator object>#
Global variable equal either to mpi4py.MPI.COMM_WORLD or a object that defines the member variables rank = 0 and size = 1.
- litebird_sim.mpi.MPI_CONFIGURATION = {}#
If
MPI_ENABLEDis True, this is a dictionary containing information about the MPI configuration. Otherwise, it is an empty dictionary
- litebird_sim.mpi.MPI_ENABLED = False#
True if MPI should be used by the application. The value of this variable is set according to the following rules:
If the environment variable
LITEBIRD_SIM_MPIis set to 1, use MPI and fail if mpi4py cannot be imported;If the environment variable
LITEBIRD_SIM_MPIis set to 0, avoid using MPI even if mpi4py is present;If the environment variable
LITEBIRD_SIM_MPIis not set, try to use MPI and gracefully revert to a serial mode of execution if mpi4py cannot be imported.