The computation of accurate geometric parameters at density functional theory cost for large molecules in the gas phase is addressed through a novel strategy that combines quantum chemical models with machine learning techniques. The first key step is the expansion of a database of accurate semi-experimental equilibrium structures with additional molecular geometries optimized by version 2 of the Pisa composite scheme. Then, the templating synthon approach is used to improve the accuracy of structures optimized by a hybrid density functional paired with a double zeta basis set, leveraging chemical similarity to cluster different molecular environments and refine bond lengths and valence angles. A set of prototypical biomolecular building blocks is used to demonstrate that it is possible to achieve spectroscopic accuracy for molecular systems too large to be treated by state-of-the-art composite wavefunction methods. In addition, a freely accessible web-based tool has been developed to facilitate the post-processing of geometries optimized using standard electronic structure codes, thereby providing an accurate and efficient tool for the computational study of medium- to large-sized molecules, also accessible to experiment-oriented researchers.

Molecular structures with spectroscopic accuracy at DFT cost by the templating synthon approach and the PCS141 database

Crisci, Luigi;Mendolicchio, Marco;Barone, Vincenzo.
2025

Abstract

The computation of accurate geometric parameters at density functional theory cost for large molecules in the gas phase is addressed through a novel strategy that combines quantum chemical models with machine learning techniques. The first key step is the expansion of a database of accurate semi-experimental equilibrium structures with additional molecular geometries optimized by version 2 of the Pisa composite scheme. Then, the templating synthon approach is used to improve the accuracy of structures optimized by a hybrid density functional paired with a double zeta basis set, leveraging chemical similarity to cluster different molecular environments and refine bond lengths and valence angles. A set of prototypical biomolecular building blocks is used to demonstrate that it is possible to achieve spectroscopic accuracy for molecular systems too large to be treated by state-of-the-art composite wavefunction methods. In addition, a freely accessible web-based tool has been developed to facilitate the post-processing of geometries optimized using standard electronic structure codes, thereby providing an accurate and efficient tool for the computational study of medium- to large-sized molecules, also accessible to experiment-oriented researchers.
2025
Settore CHEM-02/A - Chimica fisica
File in questo prodotto:
File Dimensione Formato  
114310_1_5.0255564.pdf

Open Access dal 04/07/2025

Tipologia: Published version
Licenza: Creative Commons
Dimensione 7.24 MB
Formato Adobe PDF
7.24 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11384/153785
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 10
  • OpenAlex 11
social impact