Under-coverage in high-statistics counting experiments with finite MC samples

Alexe, Cristina Andreea; Bendavid, J.; Bianchini, Lorenzo; Bruschini, Davide

doi:10.1016/j.nima.2026.171360

We consider the problem of setting confidence intervals on a parameter of interest from the maximum-likelihood fit of a physics model to a binned data set with a large number of bins, large event-counts per bin, and in the presence of systematic uncertainties modeled as nuisance parameters. We use the profile-likelihood ratio for statistical inference and focus on the case in which the model is determined from Monte Carlo simulated samples of finite size. We start by presenting a toy model in which the properties of widely used approximations of the profile-likelihood ratio in the asymptotic limit, which are commonly expected to hold in the high-statistics regime, are manifestly broken even if the numbers of events per bin in both the data and simulated samples are seemingly large enough to warrant their validity. We then move to the general setting to show how statistical uncertainties in the Monte Carlo predictions can affect the coverage of confidence intervals constructed in the asymptotic approximation always in the same direction, namely they lead to systematic under-coverage.

Under-coverage in high-statistics counting experiments with finite MC samples

Alexe, Cristina Andreea;Bendavid, J.;Bianchini, Lorenzo;Bruschini, Davide

2026

Abstract

We consider the problem of setting confidence intervals on a parameter of interest from the maximum-likelihood fit of a physics model to a binned data set with a large number of bins, large event-counts per bin, and in the presence of systematic uncertainties modeled as nuisance parameters. We use the profile-likelihood ratio for statistical inference and focus on the case in which the model is determined from Monte Carlo simulated samples of finite size. We start by presenting a toy model in which the properties of widely used approximations of the profile-likelihood ratio in the asymptotic limit, which are commonly expected to hold in the high-statistics regime, are manifestly broken even if the numbers of events per bin in both the data and simulated samples are seemingly large enough to warrant their validity. We then move to the general setting to show how statistical uncertainties in the Monte Carlo predictions can affect the coverage of confidence intervals constructed in the asymptotic approximation always in the same direction, namely they lead to systematic under-coverage.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2026
			
	Settore Scientifico Disciplinare (validi dal 09/05/2024)
	
				Settore PHYS-01/A - Fisica sperimentale delle interazioni fondamentali e applicazioni
			
	Titolo Rivista
	
				NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH. SECTION A, ACCELERATORS, SPECTROMETERS, DETECTORS AND ASSOCIATED EQUIPMENT
			
	DOI
	
				https://dx.doi.org/10.1016/j.nima.2026.171360
			
	Parole chiave
	
				Barlow–Beeston; Confidence interval; Coverage; HEP
			
	Dataset relativi alla pubblicazione
	
	DOI
	
									https://dx.doi.org/10.1016/j.nima.2026.171360
								
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0168900226000860-main.pdf accesso aperto Tipologia: Published version Licenza: Creative Commons Dimensione 2.35 MB Formato Adobe PDF	2.35 MB	Adobe PDF