Bayesian networks are general, well-studied probabilistic models that capture dependencies among a set of variables. Variable Elimination is a fundamental algorithm for probabilistic inference over Bayesian networks. In this paper, we propose a novel materialization method, which can lead to significant efficiency gains when processing inference queries using the Variable Elimination algorithm. In particular, we address the problem of choosing a set of intermediate results to precompute and materialize, so as to maximize the expected efficiency gain over a given query workload. For the problem we consider, we provide an optimal polynomial-time algorithm and discuss alternative methods. We validate our technique using real-world Bayesian networks. Our experimental results confirm that a modest amount of materialization can lead to significant improvements in the running time of queries, with an average gain of 70%, and reaching up to a gain of 99%, for a uniform workload of queries. Moreover, in comparison with existing junction tree methods that also rely on materialization, our approach achieves competitive efficiency during inference using significantly lighter materialization.

Workload-aware materialization for efficient variable elimination on bayesian networks

Ciaperoni, Martino;
2021

Abstract

Bayesian networks are general, well-studied probabilistic models that capture dependencies among a set of variables. Variable Elimination is a fundamental algorithm for probabilistic inference over Bayesian networks. In this paper, we propose a novel materialization method, which can lead to significant efficiency gains when processing inference queries using the Variable Elimination algorithm. In particular, we address the problem of choosing a set of intermediate results to precompute and materialize, so as to maximize the expected efficiency gain over a given query workload. For the problem we consider, we provide an optimal polynomial-time algorithm and discuss alternative methods. We validate our technique using real-world Bayesian networks. Our experimental results confirm that a modest amount of materialization can lead to significant improvements in the running time of queries, with an average gain of 70%, and reaching up to a gain of 99%, for a uniform workload of queries. Moreover, in comparison with existing junction tree methods that also rely on materialization, our approach achieves competitive efficiency during inference using significantly lighter materialization.
2021
Settore INFO-01/A - Informatica
37th IEEE International Conference on Data Engineering, ICDE 2021
Chania, Greece
19-22 aprile 2021
2021 IEEE 37th International Conference on Data Engineering : ICDE 2021 : Chania, Greece, 19-22 April 2021 : proceedings
Institute of Electrical and Electronics Engineers
9781728191843
172819184X
Materialization; Probabilistic inference
File in questo prodotto:
File Dimensione Formato  
query_the_model.pdf

Accesso chiuso

Tipologia: Published version
Licenza: Tutti i diritti riservati
Dimensione 1.9 MB
Formato Adobe PDF
1.9 MB Adobe PDF   Richiedi una copia
Workload_aware_materialization_for_efficient_variable_elimination_on_Bayesian_networks_2021.pdf

accesso aperto

Tipologia: Accepted version (post-print)
Licenza: Licenza OA dell'editore
Dimensione 1.23 MB
Formato Adobe PDF
1.23 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/167294
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
  • OpenAlex 4
social impact