Quickstart

Installation

To install qp, you can run the following commands:

git clone https://github.com/LSSTDESC/qp.git
cd qp
pip install .

For more information on alternate installation methods see Installation.

How to use `qp`

qp is a library that allows you to store and manipulate tables of probability distribution data. These data can be represented, or “parameterized”, in different ways, including:

Histogram Bins and Edges
Multiple Gaussian Components
Sampled X, Y Points
Sampled Quantiles

The different parameterizations allow users to store both analytically and data-based parameterized distributions and derive standard statistical quantities from them through a common interface. Some of the quantities include:

Probability Distribution Functions
Cumulative Distribution Functions
Quantiles
Generating Random Samples

It also allows for use of any of the scipy.stats.rv_continuous distributions as parameterization types under the qp.stats module. For a full list of supported parameterizations and more detailed explanations, see Parameterizations.

The main object of qp that stores these distributions is the qp.Ensemble. It can store one or more distributions of the same parameterization. It has three components:

Metadata (qp.Ensemble.metadata): Shared parameters of all distributions in the qp object, including the parameterization type
Data values (qp.Ensemble.objdata): underlying data of each distribution, where one row = one distribution
(optional) Ancillary data table (qp.Ensemble.ancil): Additional data for each of the distributions, where there is one row per distribution

The printed representation of an Ensemble tells you the parameterization type and the shape of the arrays in the objdata: (\(n_{pdf}\), \(n_{vals}\)), where \(n_{pdf}\) is the number of distributions and \(n_{vals}\) is the number of values or data points for each distribution in the Ensemble.

Creating an Ensemble

To create an Ensemble of any number of distributions, you can use the create_ensemble method of any of the existing parameterization classes. For example, to create an Ensemble of interpolations use qp.interp:

>>> import qp
>>> import numpy as np
>>> xvals = np.array([0,0.5,1,1.5,2])
>>> yvals = np.array([[0.01,0.2,0.3,0.2,0.01],
... [0.1,0.3,0.5,0.2,0.05]])
>>> ens = qp.interp.create_ensemble(xvals,yvals)
>>> ens
Ensemble(the_class=interp,shape=(2,5))

You can also read an Ensemble from a file using qp.read. For example:

>>> ens_r = qp.read("ensemble.hdf5")
>>> ens_r
Ensemble(the_class=interp,shape=(3, 50))

Working with an Ensemble

Now that you have created an Ensemble, you can get a sense of what’s in it by using some useful attributes:

qp.Ensemble.npdf: Number of distributions in the Ensemble
qp.Ensemble.shape: Shape of the data, (\(n_{pdf}\), \(n_{vals}\))
qp.Ensemble.metadata : The metadata dictionary
qp.Ensemble.objdata : The data dictionary
qp.Ensemble.ancil : The ancillary data (only works if there is an ancillary data table)

You can also use the available Ensemble methods. These allow you to manipulate your Ensemble, or get statistical information about your Ensembles. For example, here are some useful methods:

qp.Ensemble.pdf(): Get the PDF values at specified \(x\) values
qp.Ensemble.cdf(): Get the CDF values at specified \(x\) values
qp.Ensemble.convert(): Convert the Ensemble to a different parameterization

Writing an Ensemble to file

To write an Ensemble to file, simply use the qp.Ensemble.write_to() method. The only required argument is the full path to the file you would like to write. For example, we can write out the Ensemble we created like so:

>>> ens.write_to("ensemble-test.hdf5")

The available file formats can be found in Writing an Ensemble to file.

Where to find more information

For more detailed explanations, take a look at the user guide:

Installation explains installation for parallel use
qp Primer covers some of the statistics basics necessary for using qp
Basic Usage provides more detailed explanations of topics discussed here
Parameterizations contains more details on specific parameterizations
Troubleshooting for common pitfalls and errors

Or for quick reference pages:

Cookbook contains detailed examples for specific use cases, i.e. conversion, plotting, iteration
Ensemble Methods lists all the available Ensemble methods