Autonomous chemical research with large language models
Boiko et al., 2023
- Document ID
- 8097577445064259203
- Author
- Boiko D
- MacKnight R
- Kline B
- Gomes G
- Publication year
- 2023
- Publication venue
- Nature
Snippet
Transformer-based large language models are making significant strides in various fields, such as natural language processing, biology, chemistry and computer programming. Here, we show the development and capabilities of Coscientist, an artificial intelligence …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30389—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30522—Query processing with adaptation to user needs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
- G06F17/30867—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems with filtering and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/702—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for analysis and planning of chemical reactions and syntheses, e.g. synthesis design, reaction prediction, mechanism elucidation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/708—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for data visualisation, e.g. molecular structure representations, graphics generation, display of maps or networks or other visual representations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/16—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for molecular structure, e.g. structure alignment, structural or functional relations, protein folding, domain topologies, drug targeting using structure data, involving two-dimensional or three-dimensional structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Boiko et al. | Autonomous chemical research with large language models | |
Kearnes et al. | The open reaction database | |
De Almeida et al. | Synthetic organic chemistry driven by artificial intelligence | |
Wilbraham et al. | Digitizing chemistry using the chemical processing unit: from synthesis to discovery | |
Seifrid et al. | Autonomous chemical experiments: Challenges and perspectives on establishing a self-driving lab | |
Coley et al. | SCScore: synthetic complexity learned from a reaction corpus | |
Lu et al. | Unified deep learning model for multitask reaction predictions with explanation | |
Coley | Defining and exploring chemical spaces | |
Dimitrov et al. | Autonomous molecular design: then and now | |
Maser et al. | Multilabel classification models for the prediction of cross-coupling reaction conditions | |
Coley et al. | Machine learning in computer-aided synthesis planning | |
Coley et al. | Prediction of organic reaction outcomes using machine learning | |
Nugmanov et al. | CGRtools: Python library for molecule, reaction, and condensed graph of reaction processing | |
Thakkar et al. | Artificial intelligence and automation in computer aided synthesis planning | |
Law et al. | Route designer: a retrosynthetic analysis tool utilizing automated retrosynthetic rule generation | |
Ewald et al. | Web-based multi-omics integration using the analyst software suite | |
Gao et al. | Autonomous platforms for data-driven organic synthesis | |
Haywood et al. | Kernel methods for predicting yields of chemical reactions | |
Wang et al. | Identifying general reaction conditions by bandit optimization | |
Nicolaou et al. | Context aware data-driven retrosynthetic analysis | |
Susanto | Cheminformatics—The promising future: Managing change of approach through ICT emerging technology | |
Wang et al. | ChemistGA: a chemical synthesizable accessible molecular generation algorithm for real-world drug discovery | |
Seifrid et al. | Routescore: punching the ticket to more efficient materials development | |
Tugizimana et al. | The disruptive 4IR in the life sciences: Metabolomics | |
Genheden et al. | Clustering of synthetic routes using tree edit distance |