Prediction of pKa values for druglike molecules using semiempirical quantum chemical methods
Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Dokumenter
- Paper.R2
Indsendt manuskript, 1,23 MB, PDF-dokument
Rapid yet accurate pKa prediction for druglike molecules is a key challenge in computational chemistry. This study uses PM6-DH+/COSMO, PM6/COSMO, PM7/COSMO, PM3/COSMO, AM1/COSMO, PM3/SMD, AM1/SMD, and DFTB3/SMD to predict the pKa values of 53 amine groups in 48 druglike compounds. The approach uses an isodesmic reaction where the pKa value is computed relative to a chemically related reference compound for which the pKa value has been measured experimentally or estimated using a standard empirical approach. The AM1- and PM3-based methods perform best with RMSE values of 1.4-1.6 pH units that have uncertainties of ±0.2-0.3 pH units, which make them statistically equivalent. However, for all but PM3/SMD and AM1/SMD the RMSEs are dominated by a single outlier, cefadroxil, caused by proton transfer in the zwitterionic protonation state. If this outlier is removed, the RMSE values for PM3/COSMO and AM1/COSMO drop to 1.0 ± 0.2 and 1.1 ± 0.3, whereas PM3/SMD and AM1/SMD remain at 1.5 ± 0.3 and 1.6 ± 0.3/0.4 pH units, making the COSMO-based predictions statistically better than the SMD-based predictions. For pKa calculations where a zwitterionic state is not involved or proton transfer in a zwitterionic state is not observed, PM3/COSMO or AM1/COSMO is the best pKa prediction method; otherwise PM3/SMD or AM1/SMD should be used. Thus, fast and relatively accurate pKa prediction for 100-1000s of druglike amines is feasible with the current setup and relatively modest computational resources.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Journal of Physical Chemistry Part A: Molecules, Spectroscopy, Kinetics, Environment and General Theory |
Vol/bind | 121 |
Udgave nummer | 3 |
Sider (fra-til) | 699-707 |
Antal sider | 9 |
ISSN | 1089-5639 |
DOI | |
Status | Udgivet - 2017 |
Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk
ID: 173557431