Inference with parameter dependent signal yields#

In this tutorial we will look into signal yields that depend on certain functional structure, e.g. depend on parameters \(c_1\) and \(c_2\). For this tutorial we will assume the following functional structure for signal yields:

\[ s(c_1, c_2) = 2.5 c_1^2 + 3.7 c_2^2 \]

Let us first install all the dependencies and functions that we will use in this tutorial.

Lets assume a bin with 3 observed yields, 3.6 background events and signal yields that depend on the function \(s(c_1,c_2)\). One can include more than one bin but lets keep it simple.

data = [3]
background = [3.6]


def signal(c1, c2):
    return [2.5 * c1**2 + 3.7 * c2**2]

For simplisity we will use Poisson based likelihood which does not include background uncertainties

\[ \mathcal{L}(\mu) = \prod_{i\in{\rm bins}}{\rm Poiss}(n^i|\mu n_s^i + n_b^i) \]

where \(n\), \(n_s\) and \(n_b\) are observed, signal and background yields. Details on usage can be found in this link.

pdf_wrapper = spey.get_backend("default_pdf.poisson")

Let’s compute \(\chi^2\)

\[ \chi^2 = -2\log\left(\frac{\mathcal{L}(\mu=1)}{\mathcal{L}(\hat\mu)}\right) \]

\(\hat\mu\) is the signal strength that maximises the likelihood. Note that since our signal is based on two variables, we can compute \(\chi^2(c_1,c_2)\) as follows

chi2 = []
for c1 in np.linspace(-1, 1, 10):
    for c2 in np.linspace(-1, 1, 10):
        stat_model = pdf_wrapper(
            signal_yields=signal(c1, c2),
            background_yields=background,
            data=data,
            analysis="poisson",
        )
        chi2.append([c1, c2, stat_model.chi2()])
chi2 = np.array(chi2)

chi2 contains the \(\chi^2(c_1,c_2)\) values where \(c_1, c_2 \in [-1,1]\). Note that we rather did a sparse scan over \(c_1\) and \(c_2\) values, in the following we will interpolate over these values using griddata function. The smoothness can be increased by increasing the number of points that we scaned through. For this example we only used 10 points for both parameters.

../_images/8d18950594f5970f521e59f6dee6e68b031cb6de2465522da4b7456e49da9d1b.png

Including uncertainties#

Instead of using simple Poisson likelihood, which does not include any uncertainty definition, one can also use uncorrelated background function which includes the uncertainties through serries of Gausian terms in the likelihood. For details see this link.

Here we will define 6 observed yields and background is \(10.6\pm4.8\).

data = [6]
background = [10.6]
bkg_unc = [4.8]

In the following we will create our PDF wrapper which includes the definition for uncorrelated backgrounds:

pdf_wrapper = spey.get_backend("default_pdf.uncorrelated_background")

As before, let us compute \(\chi^2(c_1, c_2)\) in a similar fashion.

chi2 = []
for c1 in np.linspace(-1, 1, 10):
    for c2 in np.linspace(-1, 1, 10):
        stat_model_with_unc = pdf_wrapper(
            signal_yields=signal(c1, c2),
            background_yields=background,
            data=data,
            absolute_uncertainties=bkg_unc,
            analysis="with uncertainty",
        )
        chi2.append([c1, c2, stat_model_with_unc.chi2()])
chi2 = np.array(chi2)

../_images/02d20dcd5d06094f45e7dcc09778a3a75f9ddda32b635917bfd2f2939d559878.png

Combining likelihoods#

Assuming these likelihoods are part of an uncorrelated histogram we can combine two likelihoods to build a more generic likelihood. For this, we will use UnCorrStatisticsCombiner. This class can combine any likelihood that is written as a spey plug-in as follows

\[ \mathcal{L}^\prime_{\rm indep}(\mu) = \prod_{i\in {\rm models}} \mathcal{L}_i(\mu, \theta_i) \]

As before, let us compute \(\chi^2(c_1, c_2)\) but this time lets get this value for the combined likelihood.

chi2 = []
for c1 in np.linspace(-1, 1, 10):
    for c2 in np.linspace(-1, 1, 10):
        stat_model_with_unc = spey.get_backend("default_pdf.uncorrelated_background")(
            signal_yields=signal(c1, c2),
            background_yields=background,
            data=data,
            absolute_uncertainties=bkg_unc,
            analysis="with uncertainty",
        )
        stat_model = spey.get_backend("default_pdf.poisson")(
            signal_yields=signal(c1, c2),
            background_yields=background,
            data=data,
            analysis="poisson",
        )
        combined = spey.UnCorrStatisticsCombiner(stat_model, stat_model_with_unc)
        chi2.append([c1, c2, combined.chi2()])
chi2 = np.array(chi2)

../_images/980704e5797f566d522fdbc1dda79700dd577874320b3acd3a8b2dd391cd4e42.png

Inference with parameter dependent signal yields

Contents

Inference with parameter dependent signal yields#

Including uncertainties#

Combining likelihoods#