# Quickstart ## Installation To install `qp`, you can run the following commands: ```bash git clone https://github.com/LSSTDESC/qp.git cd qp pip install . ``` For more information on alternate installation methods see . ## How to use `qp` `qp` is a library that allows you to store and manipulate tables of probability distribution data. These data can be represented, or **"parameterized"**, in different ways, including: - Histogram Bins and Edges - Multiple Gaussian Components - Sampled X, Y Points - Sampled Quantiles The different parameterizations allow users to store both analytically and data-based parameterized distributions and derive standard statistical quantities from them through a common interface. Some of the quantities include: - Probability Distribution Functions - Cumulative Distribution Functions - Quantiles - Generating Random Samples It also allows for use of any of the distributions as parameterization types under the `qp.stats` module. For a full list of supported parameterizations and more detailed explanations, see . The main object of `qp` that stores these distributions is the {py:class}`qp.Ensemble`. It can store one or more distributions of the same parameterization. It has three components: **Metadata** ({py:attr}`qp.Ensemble.metadata`) : Shared parameters of all distributions in the qp object, including the parameterization type **Data values** ({py:attr}`qp.Ensemble.objdata`) : underlying data of each distribution, where one row = one distribution _(optional)_ **Ancillary data table** ({py:attr}`qp.Ensemble.ancil`) : Additional data for each of the distributions, where there is one row per distribution The printed representation of an Ensemble tells you the parameterization type and the shape of the arrays in the `objdata`: ($n_{pdf}$, $n_{vals}$), where $n_{pdf}$ is the number of distributions and $n_{vals}$ is the number of values or data points for each distribution in the Ensemble. ### Creating an Ensemble To create an Ensemble of any number of distributions, you can use the `create_ensemble` method of any of the existing parameterization classes. For example, to create an Ensemble of interpolations use {py:class}`qp.interp `: ```{doctest} >>> import qp >>> import numpy as np >>> xvals = np.array([0,0.5,1,1.5,2]) >>> yvals = np.array([[0.01,0.2,0.3,0.2,0.01], ... [0.1,0.3,0.5,0.2,0.05]]) >>> ens = qp.interp.create_ensemble(xvals,yvals) >>> ens Ensemble(the_class=interp,shape=(2,5)) ``` You can also read an Ensemble from a file using {py:meth}`qp.read `. For example: ```{doctest} >>> ens_r = qp.read("ensemble.hdf5") >>> ens_r Ensemble(the_class=interp,shape=(3, 50)) ``` ### Working with an Ensemble Now that you have created an Ensemble, you can get a sense of what's in it by using some useful attributes: - {py:attr}`qp.Ensemble.npdf`: Number of distributions in the Ensemble - {py:attr}`qp.Ensemble.shape`: Shape of the data, ($n_{pdf}$, $n_{vals}$) - {py:attr}`qp.Ensemble.metadata` : The metadata dictionary - {py:attr}`qp.Ensemble.objdata` : The data dictionary - {py:attr}`qp.Ensemble.ancil` : The ancillary data (only works if there is an ancillary data table) You can also use the available [Ensemble methods](methods.md). These allow you to manipulate your Ensemble, or get statistical information about your Ensembles. For example, here are some useful methods: - {py:meth}`qp.Ensemble.pdf`: Get the PDF values at specified $x$ values - {py:meth}`qp.Ensemble.cdf`: Get the CDF values at specified $x$ values - {py:meth}`qp.Ensemble.convert`: Convert the Ensemble to a different parameterization ### Writing an Ensemble to file To write an Ensemble to file, simply use the {py:meth}`qp.Ensemble.write_to` method. The only required argument is the full path to the file you would like to write. For example, we can write out the Ensemble we created like so: ```{doctest} >>> ens.write_to("ensemble-test.hdf5") ``` The available file formats can be found in . ## Where to find more information For more detailed explanations, take a look at the user guide: - explains installation for parallel use - covers some of the statistics basics necessary for using `qp` - provides more detailed explanations of topics discussed here - contains more details on specific parameterizations - for common pitfalls and errors Or for quick reference pages: - contains detailed examples for specific use cases, i.e. conversion, plotting, iteration - lists all the available Ensemble methods