Explainable Graph Neural Networks with Data Augmentation for Predicting pKa of C–H Acids
An et al., 2023
- Document ID
- 8323965169177089887
- Author
- An H
- Liu X
- Cai W
- Shao X
- Publication year
- 2023
- Publication venue
- Journal of Chemical Information and Modeling
Snippet
The pKa of C–H acids is an important parameter in the fields of organic synthesis, drug discovery, and materials science. However, the prediction of pKa is still a great challenge due to the limit of experimental data and the lack of chemical insight. Here, a new model for …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/12—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for modelling or simulation in systems biology, e.g. probabilistic or dynamic models, gene-regulatory networks, protein interaction networks or metabolic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/16—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for molecular structure, e.g. structure alignment, structural or functional relations, protein folding, domain topologies, drug targeting using structure data, involving two-dimensional or three-dimensional structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/704—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for prediction of properties of compounds, e.g. calculating and selecting molecular descriptors, details related to the development of SAR/QSAR/QSPR models, ADME/Tox models or PK/PD models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/708—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for data visualisation, e.g. molecular structure representations, graphics generation, display of maps or networks or other visual representations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/20—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for hybridisation or gene expression, e.g. microarrays, sequencing by hybridisation, normalisation, profiling, noise correction models, expression ratio estimation, probe design or probe optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/30—Medical informatics, i.e. computer-based analysis or dissemination of patient or disease data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Wei et al. | Rapid prediction of electron–ionization mass spectrometry using neural networks | |
Balcells et al. | tmQM dataset—quantum geometries and properties of 86k transition metal complexes | |
Pan et al. | MolGpka: A web server for small molecule pKa prediction using a graph-convolutional neural network | |
Ji et al. | Predicting a molecular fingerprint from an electron ionization mass spectrum with deep neural networks | |
Grambow et al. | Deep learning of activation energies | |
Williams et al. | The evolution of data-driven modeling in organic chemistry | |
Coley et al. | Prediction of organic reaction outcomes using machine learning | |
Colby et al. | Deep learning to generate in silico chemical property libraries and candidate molecules for small molecule identification in complex samples | |
Doan et al. | Quantum chemistry-informed active learning to accelerate the design and discovery of sustainable energy storage materials | |
Ji et al. | Deep MS/MS-aided structural-similarity scoring for unknown metabolite identification | |
Sheridan | Using random forest to model the domain applicability of another random forest model | |
Hong et al. | Mold2, molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics | |
Gensch et al. | Design and application of a screening set for monophosphine ligands in cross-coupling | |
Zhu et al. | Rapid approximate subset-based spectra prediction for electron ionization–mass spectrometry | |
Low et al. | Explainable solvation free energy prediction combining graph neural networks with chemical intuition | |
Wu et al. | Machine learning methods for pKa prediction of small molecules: Advances and challenges | |
Fitzner et al. | Machine learning C–N couplings: Obstacles for a general-purpose reaction yield prediction | |
Celma et al. | Prediction of retention time and collision cross section (CCSH+, CCSH–, and CCSNa+) of emerging contaminants using multiple adaptive regression splines | |
An et al. | Explainable Graph Neural Networks with Data Augmentation for Predicting pKa of C–H Acids | |
Kammeraad et al. | What does the machine learn? Knowledge representations of chemical reactivity | |
Aouichaoui et al. | Combining Group-Contribution concept and graph neural networks toward interpretable molecular property models | |
Krzyzanowski et al. | Spacial Score: A Comprehensive Topological Indicator for Small-Molecule Complexity | |
Zhang et al. | AllCCS2: Curation of ion mobility collision cross-section atlas for small molecules using comprehensive molecular representations | |
Xue et al. | Advances in the Application of Artificial Intelligence-Based Spectral Data Interpretation: A Perspective | |
Kleinekorte et al. | APPROPRIATE life cycle assessment: a PROcess-specific, PRedictive Impact AssessmenT method for emerging chemical processes |