The computation of accurate geometric parameters at density functional theory cost for large molecules in the gas phase is addressed through a novel strategy that combines quantum chemical models with machine learning techniques. The first key step is the expansion of a database of accurate semi-experimental equilibrium structures with additional molecular geometries optimized by version 2 of the Pisa composite scheme. Then, the templating synthon approach is used to improve the accuracy of structures optimized by a hybrid density functional paired with a double zeta basis set, leveraging chemical similarity to cluster different molecular environments and refine bond lengths and valence angles. A set of prototypical biomolecular building blocks is used to demonstrate that it is possible to achieve spectroscopic accuracy for molecular systems too large to be treated by state-of-the-art composite wavefunction methods. In addition, a freely accessible web-based tool has been developed to facilitate the post-processing of geometries optimized using standard electronic structure codes, thereby providing an accurate and efficient tool for the computational study of medium- to large-sized molecules, also accessible to experiment-oriented researchers.
Molecular structures with spectroscopic accuracy at DFT cost by the templating synthon approach and the PCS141 database
Crisci, Luigi;Mendolicchio, Marco;Barone, Vincenzo.
2025
Abstract
The computation of accurate geometric parameters at density functional theory cost for large molecules in the gas phase is addressed through a novel strategy that combines quantum chemical models with machine learning techniques. The first key step is the expansion of a database of accurate semi-experimental equilibrium structures with additional molecular geometries optimized by version 2 of the Pisa composite scheme. Then, the templating synthon approach is used to improve the accuracy of structures optimized by a hybrid density functional paired with a double zeta basis set, leveraging chemical similarity to cluster different molecular environments and refine bond lengths and valence angles. A set of prototypical biomolecular building blocks is used to demonstrate that it is possible to achieve spectroscopic accuracy for molecular systems too large to be treated by state-of-the-art composite wavefunction methods. In addition, a freely accessible web-based tool has been developed to facilitate the post-processing of geometries optimized using standard electronic structure codes, thereby providing an accurate and efficient tool for the computational study of medium- to large-sized molecules, also accessible to experiment-oriented researchers.| File | Dimensione | Formato | |
|---|---|---|---|
|
114310_1_5.0255564.pdf
Open Access dal 04/07/2025
Tipologia:
Published version
Licenza:
Creative Commons
Dimensione
7.24 MB
Formato
Adobe PDF
|
7.24 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



