.. _observations:

Observations
============

.. contents:: Table of Contents
   :depth: 2
   :local:

The :class:`.Observation` class is the container for the data acquired by the
telescope during a scanning period (and the relevant information about it).

Serial applications
-------------------

In a serial code :class:`.Observation` is equivalent to an empty class in which you
can put anything::

  import litebird_sim as lbs
  import numpy as np

  obs = lbs.Observation(
      detectors=2,
      start_time_global=0.0,
      sampling_rate_hz=5.0,
      n_samples_global=5,
  )

  obs.my_new_attr = 'value'
  setattr(obs, 'another_new_attr', 'another value')

Across the framework, the coherence in the names and content of the
attributes is guaranteed **by convention** (no check is done by the
:class:`.Observation` class). At the moment, the most common field
that you can find in the class are the following, assuming that it is
meant to collect :math:`N` samples for :math:`n_d` detectors:

1. ``Observation.tod`` is initialized when you call
   :meth:`.Simulation.create_observations`. It's a 2D array
   of shape :math:`(n_d, N)`. This means that ``Observation.tod[0]`` is
   the time stream of the first detector, and ``obs.tod[:, 0]`` are the
   first time samples of all the detectors.

   You can create other TOD-like arrays through the parameter ``tods``;
   it accepts a list of :class:`.TodDescription` objects that specify
   the name of the field used to store the 2D array, the physical units
   of the TOD expressed as an item of :class:.Units (e.g. Units.K_CMB);
   a textual description, and the value for ``dtype``. (By default, the ``tod``
   field uses 32-bit floating-point numbers.) Here is an example::

    sim.create_observations(
        detectors=[det1, det2, det3],
        tods=[
            lbs.TodDescription(
                name="tod",
                units=Units.K_CMB,
                description="TOD",
                dtype=np.float64,
            ),
            lbs.TodDescription(
                name="noise", 
                units=Units.K_CMB,
                description="1/f+white noise", 
                dtype=np.float32
            ),
        ],
    )

    for cur_obs in sim.observations:
        print("Shape of 'tod': ", cur_obs.tod.shape)
        print("Shape of 'noise': ", cur_obs.noise.shape)

   The default TOD named "tod" uses units Units.K_CMB and 32-bit
   floating-point numbers (numpy.float32), which are adequate for most
   LiteBIRD simulations.

   The parameter ``allocate_tod`` controls whether these TOD arrays are effectivelly
   allocated or not. By default TODs are allocated, but in some special situations,
   e.g. you need just to deal with pointing, you can set it to ``False`` and save 
   memory.

2. If you called :func:`.prepare_pointings()` and then
   :func:`.precompute_pointings()`, the field ``Observation.pointing_matrix``
   is a :math:`(n_d, N, 3)` matrix containing the pointing information in
   Ecliptic coordinates for each detector: colatitude θ, longitude φ,
   orientation ψ. If you specified a HWP in the call to
   :func:`.prepare_pointings()`, the field ``Observation.hwp_angle`` will
   be a :math:`(N,)` vector containing the angle of the HWP in radians.

3. ``Observation.local_flags`` is a :math:`(n_d, N)` matrix containing
   flags for the :math:`n_d` detectors. These flags are typically
   associated to peculiarities in the single detectors, like
   saturations or mis-calibrations.

4. ``Observation.global_flags`` is a vector of :math:`N` elements
   containing flags that must be associated with *all* the detectors
   in the observation.

Keep in mind that the general rule is that detector-specific
attributes are collected in arrays. Thus, ``obs.calibration_factors``
should be a 1-D array of :math:`n_d` elements (one per each detector).

With this memory layout, typical operations look like this::

  # Collect detector properties in arrays
  obs.calibration_factors = np.array([1.1, 1.2])
  obs.wn_levels = np.array([2.1, 2.2])

  # Apply to each detector its own calibration factor
  obs.tod *=  obs.calibration_factors[:, None]

  # Add white noise at a different level for each detector
  obs.tod += (np.random.normal(size=obs.tod.shape)
              * obs.wn_levels[:, None])


Parallel applications
---------------------

The :class:`.Observation` class allows the distribution of ``obs.tod`` over multiple MPI
processes to enable the parallelization of computations. The distribution of ``obs.tod``
can be achieved in two different ways:

1. Uniform distribution of detectors along the detector axis
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

With ``n_blocks_det`` and ``n_blocks_time`` arguments of :class:`.Observation` class,
the ``obs.tod`` is evenly distributed over a
``n_blocks_det`` by ``n_blocks_time`` grid of MPI ranks. The blocks can be
changed at run-time.

The coherence between the serial and parallel operations is achieved by
distributing also the arrays of detector properties:
each rank keeps in memory only an interval of ``calibration_factors`` or
``wn_levels``, the same detector interval of ``tod``.

The main advantage is that the example operations in the Serial section are
achieved with the same lines of code.
The price to pay is that you have to set detector properties with special methods.

.. code-block:: python

  import litebird_sim as lbs
  from mpi4py import MPI

  comm = MPI.COMM_WORLD

  obs = lbs.Observation(
      detectors=2,
      start_time_global=0.0,
      sampling_rate_hz=5.0,
      n_samples_global=5,
      n_blocks_det=2, # Split the detector axis in 2
      comm=comm  # across the processes of this communicator
  )

  # Add detector properties either with a global array that contains
  # all the detectors (which will be scattered across the processor grid)
  obs.setattr_det_global('calibration_factors', np.array([1.1, 1.2]))

  # Or with the local array, if somehow you have it already
  if comm.rank == 0:
      wn_level_local = np.array([2.1])
  elif comm.rank == 1:
      wn_level_local = np.array([2.2])
  else:
      wn_level_local = np.array([])
  obs.setattr_det('wn_levels', wn_level_local)

  # Operate on the local portion of the data just like in serial code

  # Apply to each detector its own calibration factor
  obs.tod *=  obs.calibration_factors[:, None]

  # Add white noise at a different level for each detector
  obs.tod += (np.random.normal(size=obs.tod.shape)
              * obs.wn_levels[:, None])

  # Change the data distribution
  obs.set_blocks(n_blocks_det=1, n_blocks_time=1)
  # Now the rank 0 has exactly the data of the serial obs object

For clarity, here is a visualization of how data (a detector attribute and the
TOD) gets distributed.

.. image:: ./images/observation_data_distribution.png

2. Custom grouping of detectors along the detector axis
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

While uniform distribution of detectors along the detector axis optimizes load
balancing, it is less suitable for simulating some effects, like crosstalk and
noise correlation between the detectors. For these effects, uniform distribution
across MPI processes necessitates the transfer of large TOD arrays across multiple
MPI processes, which complicates the code implementation and may potentially lead
to significant performance overhead. To save us from this situation, the
:class:`.Observation` class
accepts an argument ``det_blocks_attributes`` that is a list of string objects
specifying the detector attributes to create the group of detectors. Once the
detector groups are made, the detectors are distributed to the MPI processes in such
a way that all the detectors of a group reside on the same set of MPI processes.

If a valid ``det_blocks_attributes`` argument is passed to the :class:`.Observation`
class, the arguments ``n_blocks_det`` and ``n_blocks_time`` are ignored. Since the
``det_blocks_attributes`` creates the detector blocks dynamically, the
``n_blocks_time`` is computed during runtime using the size of MPI communicator and
the number of detector blocks (``n_blocks_time = comm.size // n_blocks_det``).

The detector blocks made in this way can be accessed with
``Observation.detector_blocks``. It is a dictionary object that has the tuple of
``det_blocks_attributes`` values as dictionary keys and the list of detectors
corresponding to the key as dictionary values. This dictionary is sorted so that the
group with the largest number of detectors comes first and the one with
the fewest detectors comes last.

The following example illustrates the distribution of ``obs.tod`` matrix across the
MPI processes when ``det_blocks_attributes`` is specified.

.. code-block:: python

  import litebird_sim as lbs

  comm = lbs.MPI_COMM_WORLD

  start_time = 456
  duration_s = 100
  sampling_freq_Hz = 1

  # Creating a list of detectors.
  dets = [
      lbs.DetectorInfo(
          name="channel1_w9_detA",
          wafer="wafer_9",
          channel="channel1",
          sampling_rate_hz=sampling_freq_Hz,
      ),
      lbs.DetectorInfo(
          name="channel1_w3_detB",
          wafer="wafer_3",
          channel="channel1",
          sampling_rate_hz=sampling_freq_Hz,
      ),
      lbs.DetectorInfo(
          name="channel1_w1_detC",
          wafer="wafer_1",
          channel="channel1",
          sampling_rate_hz=sampling_freq_Hz,
      ),
      lbs.DetectorInfo(
          name="channel1_w1_detD",
          wafer="wafer_1",
          channel="channel1",
          sampling_rate_hz=sampling_freq_Hz,
      ),
      lbs.DetectorInfo(
          name="channel2_w4_detA",
          wafer="wafer_4",
          channel="channel2",
          sampling_rate_hz=sampling_freq_Hz,
      ),
      lbs.DetectorInfo(
          name="channel2_w4_detB",
          wafer="wafer_4",
          channel="channel2",
          sampling_rate_hz=sampling_freq_Hz,
      ),
  ]

  # Initializing a simulation
  sim = lbs.Simulation(
      start_time=start_time,
      duration_s=duration_s,
      random_seed=12345,
      mpi_comm=comm,
  )

  # Creating the observations with detector blocks
  sim.create_observations(
      detectors=dets,
      split_list_over_processes=False,
      num_of_obs_per_detector=3,
      det_blocks_attributes=["channel"], # case 1 and 2
      # det_blocks_attributes=["channel", "wafer"] # case 3
  )

With the list of detectors defined in the code snippet above, we can see how the
detectors axis and time axis is divided depending on the size of MPI communicator and
``det_blocks_attributes``.

**Case 1**

*Size of global MPI communicator = 3*, ``det_blocks_attributes=["channel"]``

.. image:: ./images/detector_groups_case1.png

**Case 2**

*Size of global MPI communicator = 4*, ``det_blocks_attributes=["channel"]``

.. image:: ./images/detector_groups_case2.png

**Case 3**

*Size of global MPI communicator = 10*, ``det_blocks_attributes=["channel", "wafer"]``

.. image:: ./images/detector_groups_case3.png

.. note::
  When ``n_blocks_det != 1``,  keep in mind that ``obs.tod[0]`` or
  ``obs.wn_levels[0]`` are quantities of the first *local* detector, not global.
  This should not be a problem as the only thing that matters is that the two
  quantities refer to the same detector. If you need the global detector index,
  you can get it with ``obs.det_idx[0]``, which is created at construction time.
  ``obs.det_idx`` stores the detector indices of the detectors available to an
  :class:`.Observation` class, with respect to the list of detectors stored in
  ``obs.detectors_global`` variable.

.. note::
  To get a better understanding of how observations are being used in a
  MPI simulation, use the method :meth:`.Simulation.describe_mpi_distribution`.
  This method must be called *after* the observations have been allocated using
  :meth:`.Simulation.create_observations`; it will return an instance of the
  class :class:`.MpiDistributionDescr`, which can be inspected to determine
  which detectors and time spans are covered by each observation in all the
  MPI processes that are being used. For more information, refer to the Section
  :ref:`simulations`.

MPI communicators
^^^^^^^^^^^^^^^^^

The simulation framework exposes MPI communicators at different
levels. The first one is the global MPI communicator. It can be
accessed with a global variable :data:`.MPI_COMM_WORLD`, and it is
same as ``mpi4py.MPI.COMM_WORLD``. It contains all the MPI processes
used by the script. The other MPI communicators defined in the
simulation framework are the partitions of global MPI communicator and
they contain the MPI processes that have certain properties as we
explain below. For all the examples in this sub-section, we consider
the distribution of TODs across 10 MPI processes with ``n_blocks_time = 2``
and ``n_blocks_det = 4``.

To distribute the TODs across
``n_blocks_det`` :math:`\times` ``n_blocks_time`` :math:`=\, N`
MPI ranks, it is necessary that the script is executed with at least
:math:`N` MPI processes. There is, however, no upper limit on the
number of MPI processes to be used. When the number of MPI processes
is higher than :math:`N`, it leaves all the MPI processes with
``rank`` :math:`\geq N` with no detector (and TOD). In many cases, it
is useful to identify the unused ranks so that they can be avoided
while performing some computations. The :class:`.Observation` class
makes this happen by making a sub-communicator containing only the
processes that contain a non-zero number of detectors (and TODs).

However, once the detectors and TODs are distributed across several
processes, it is not trivial to find out all the ranks that contain
the TOD chunks of a given detector (or detector block). Likewise, it
is hard to find all the ranks that contain the TOD chunks
corresponding to the same time interval. To solve this issue, :class:`.Observation`
class provides the MPI sub-communicators corresponding to different
groups of MPI processes.

The three sub-communicators provided by the framework - each generated
by splitting the global MPI communicator - are listed below. The
process ranks in the sub-communicators are based on the order of their
global ranks:

- **Grid communicator**: Once the number of detector blocks and the
  number of time blocks are available, the Observation class splits the
  global communicator into a grid communicator and a null communicator.
  Grid communicator consist all the mpi processes whose rank is less
  than :math:`N`. The null communicator contains all other MPI
  processes. On the MPI processes with global rank less than
  :math:`N`, the global variable ``MPI_COMM_GRID.COMM_OBS_GRID``
  points to the grid communicator. On other MPI processes, it points
  to the null MPI communicator (``MPI_COMM_GRID.COMM_NULL`` which is
  same as ``mpi4py.MPI.COMM_NULL``). In the example below, the
  processes with global rank from 0 to 7 belong to the grid
  sub-communicator. Since the processes with rank 8 and 9 are unused,
  they are excluded from the grid communicator.

  .. image:: ./images/grid_communicator.png

  Note that, to exclude the MPI processes belonging to the null
  communicator from the computations, one should enclose the
  computations under an if condition comparing
  ``MPI_COMM_GRID.COMM_OBS_GRID`` against ``MPI_COMM_GRID.COMM_NULL``:

  ::

    if MPI_COMM_GRID.COMM_OBS_GRID != MPI_COMM_GRID.COMM_NULL:
        # proceed with the following computations when
        # MPI_COMM_GRID.COMM_OBS_GRID is not null
        ...

- **Detector-block communicator**: The detector-block communicator is
  made by splitting the grid communicator into ``n_blocks_det``
  sub-communicators, such that each sub-communicator contains the
  processes where TODs of the same set of detectors reside. This
  sub-communicator can be accessed with ``comm_det_blocks`` attribute
  of the :class:`.Observation` class. In the following example, the
  grid communicator is split into 4 detector-block communicators
  containing the processes with global rank 0-1, 2-3, 4-5, and 6-7
  respectively.

  .. image:: ./images/detector_block_communicator.png

- **Time-block communicator**: The time-block communicator is made by
  splitting the grid communicator into ``n_blocks_time``
  sub-communicators, such that each sub-communicator contains the
  processes where TOD chunks of same time interval reside. This
  sub-communicator can be accessed with ``comm_time_block`` attribute
  of the :class:`.Observation` class. In the example below, the grid
  communicator is split into 2 time-block communicators containing the
  processes with global rank 0-2-4-6 and 1-3-5-7 respectively.

  .. image:: ./images/time_block_communicator.png

Other notable functionalities
-----------------------------

The starting time can be represented either as floating-point values
(appropriate in 99% of the cases) or MJD; in the latter case, it
is handled through the `AstroPy <https://www.astropy.org/>`_ library.

Instead of adding detector attributes after construction, you can pass a list of
dictionaries (one entry for each detectors). One attribute is created for every
key.

::

  import litebird_sim as lbs
  from astropy.time import Time

  # Second case: use MJD to track the time
  obs_mjd = lbs.Observation(
      detectors=[{"name": "A"}, {"name": "B"}],
      start_time_global=Time("2020-02-20", format="iso"),
      sampling_rate_hz=5.0,
      n_samples_global=5,
  )

  obs.name == np.array(["A", "B"])  # True


Obtain pointings
----------------
The pointing information for one or more detectors in an observation are
accessible through the :func:`.get_pointings` which returns a tuple containing
the pointing matrix of the detectors and the HWP angle. This call must be prepared
calling either the method :meth:`.Observation.prepare_pointings` of this class or
:meth:`.Simulation.prepare_pointings`, or the function :func:`litebird_sim.prepare_pointings`

The structure of the pointing matrix returned depends on the parameter `detector_idx`
that specifies which detectors should be included in the computation.
The following calls are all legitimate.

::

  # All the detectors are included
  pointings, hwp_angle = cur_obs.get_pointings("all")
  # This returns ``(N_det, N_samples, 3)`` array

  # Only the first two detectors are included
  pointings, hwp_angle = cur_obs.get_pointings([0, 1])
  # This returns ``(2, N_samples, 3)`` array

  # Only the first detector is used
  pointings, hwp_angle = cur_obs.get_pointings(0)
  # This returns ``(N_samples, 3)`` array

  # NB if a list of indices is passed an array of dimension ``(N_det, N_samples, 3)``
  # is always returned
  # For example this
  pointings, hwp_angle = cur_obs.get_pointings([0])
  # returns ``(1, N_samples, 3)`` array

The last dimension spans the three angles θ (colatitude, in radians), φ (longitude, in
radians), and ψ (orientation angle, in radians).
The HWP angle is always a vector with shape ``(N_samples,)``, as it does not depend on
the list of detectors.

The method :func:`.get_hwp_angle` allows to obtain only the HWP angle.

The function :func:`litebird_sim.precompute_pointings()` precomputes all the pointings
for a set of observations storing them in ``Observation.pointing_matrix`` and
``Observation.hwp_angle``. See above for more details. The same result is achieved
calling the method :meth:`.Observation.precompute_pointings()`.

Reading/writing observations to disk
------------------------------------

The framework implements a couple of functions to write/read
:class:`.Observation` objects to disk, using the HDF5 file format. By
default, each observation is saved in a separate HDF5 file; the
following information are saved and restored:

- Whether times are tracked as floating-point numbers or proper
  AstroPy dates;
- The TOD matrix (in ``.tod``);
- The quaternions used to create the pointings.
- Optionally, full pointings can be computed on the fly and stored
  in the files; this is useful if the TOD is supposed to be read by
  some other program.
- Global and local flags saved in ``.global_flags`` and
  ``.local_flags`` (see below).

The function used to save observations is :func:`.Simulation.write_observations`,
which acts on a :class:`.Simulation` object; if you prefer to
operate without a :class:`.Simulation` object, you can call
:func:`.write_list_of_observations`.

To read observations, you can use :func:`.Simulation.read_observations` and
:func:`.read_list_of_observations`.

The framework writes one HDF5 file for each :class:`.Observation`; each
file contains the following datasets:

- One dataset per each TOD; each dataset has the same name as the ones passed to
  ``tods=`` in the call to ``create_observations``. It has the following attributes:

  - ``use_mjd`` (Boolean): ``True`` if ``start_time`` is a MJD, ``False`` if it is
    a plain floating-point value

  - ``start_time`` (Float): the time of the first sample in the TOD, see also
    ``use_mjd`` to correctly interpret this

  - ``sampling_rate_hz`` (Float)

  - ``detectors`` (string): a JSON record containing basic information about the
    detectors

  - ``description`` (string): a human-readable string describing what's in this TOD

- ``global_flags``: the matrix of the global flags for the observation

- ``flags_NNNN`: the local flags for detector ``NNNN`` (starting from ``0000``).
  There are as many datasets of this kind as the number of detectors in this
  :class:`.Observation` object.

- ``pointing_provider_rot_quaternion``: the rotation quaternion that
  converts the boresight direction of the focal plane of the instrument
  into ecliptic coordinates. It is a matrix with shape ``(N, 4)``, and it
  has the attributes ``start_time`` (either a floating-point value or a string,
  the latter being used for ``astropy.time.Time`` types) and ``sampling_rate_hz``.

- ``pointing_provider_hwp``: a dataset containing the details of the Half-Wave
  Plate. Its interprentation depends on the kind of HWP; for instances of the
  class :class:`.IdealHWP`, the dataset is empty and the only fields are the
  attributes ``class_name`` (always equal to ``IdealHWP``), ``ang_speed_radpsec``,
  and ``start_angle_rad`` (two floating-point numbers).

- ``rot_quaternion_NNNN``: the rotation quaternion for detector
  ``NNNN`` (starting from ``0000``). It has the same structure as
  ``pointing_provider_rot_quaternion`` (see above), and there are as many datasets of
  this kind as the number of detectors in this :class:`.Observation` object.


Flags
-----

The LiteBIRD Simulation Framework permits to associate flags with the
scientific samples in a TOD. These flags are usually unsigned integer
numbers that are treated like bitmasks that signal peculiarities in
the data. They are especially useful when dealing with data that have
been acquired by some real instrument to signal if the hardware was
malfunctioning or if some disturbance was recorded by the detectors.
Other possible applications are possible:

1. Marking samples that should be left out of map-making (e.g.,
   because a planet or some other transient source was being observed
   by the detector).

2. Flagging potentially interesting observations that should be used
   in data analysis (e.g., observations of the Crab nebula that are
   considered good enough for polarization angle calibration).

Similarly to other frameworks like TOAST, the LiteBIRD Simulation
Framework lets to store both «global» and «local» flags associated
with the scientific samples in TODs. Global flags are associated with
all the samples in an Observation, and they must be a vector of ``M``
elements, where ``M`` is the number of samples in the TOD. Local
samples are stored in a matrix of shape ``N × M``, where ``N`` is the
number of detectors in the observation.

The framework encourages to store these flags in the fields
``local_flags`` and ``global_flags``: in this way, they can be saved
correctly in HDF5 files by functions like :func:`.write_observations`.


API reference
-------------

.. automodule:: litebird_sim.observations
    :members:
    :undoc-members:
    :show-inheritance:

.. automodule:: litebird_sim.io
    :members:
    :undoc-members:
    :show-inheritance: