Lin et al., 2016 - Google Patents
Deep structured scene parsing by learning with image descriptionsLin et al., 2016
View PDF- Document ID
- 1611003948026832545
- Author
- Lin L
- Wang G
- Zhang R
- Zhang R
- Liang X
- Zuo W
- Publication year
- Publication venue
- Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
External Links
Snippet
This paper addresses the problem of structured scene parsing, ie, parsing the input scene into a configuration including hierarchical semantic objects with their interaction relations. We propose a deep architecture consisting of two networks: i) a convolutional neural …
- 230000001537 neural 0 abstract description 21
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Lin et al. | Deep structured scene parsing by learning with image descriptions | |
CN112288091B (en) | Knowledge inference method based on multi-mode knowledge graph | |
CN109918489B (en) | Multi-strategy fused knowledge question answering method and system | |
WO2018207723A1 (en) | Abstract generation device, abstract generation method, and computer program | |
Tran et al. | Neural metric learning for fast end-to-end relation extraction | |
Yu et al. | Dual attention on pyramid feature maps for image captioning | |
CN109783666A (en) | A kind of image scene map generation method based on iteration fining | |
Zhang et al. | Hierarchical scene parsing by weakly supervised learning with image descriptions | |
CN113051914A (en) | Enterprise hidden label extraction method and device based on multi-feature dynamic portrait | |
CN108846000A (en) | A kind of common sense semanteme map construction method and device based on supernode and the common sense complementing method based on connection prediction | |
Miao et al. | A dynamic financial knowledge graph based on reinforcement learning and transfer learning | |
Jandial et al. | Trace: Transform aggregate and compose visiolinguistic representations for image search with text feedback | |
Dai et al. | Relation classification via LSTMs based on sequence and tree structure | |
CN115168579A (en) | Text classification method based on multi-head attention mechanism and two-dimensional convolution operation | |
Upreti | Convolutional neural network (cnn). a comprehensive overview | |
Li et al. | A Survey on Deep Active Learning: Recent Advances and New Frontiers | |
KR102182619B1 (en) | Knowledge extraction system using frame based on ontology | |
CN111597816A (en) | Self-attention named entity recognition method, device, equipment and storage medium | |
Wu et al. | Visual Question Answering | |
Tian et al. | Scene graph generation by multi-level semantic tasks | |
CN115358477B (en) | Fight design random generation system and application thereof | |
Zhang et al. | CAE-GReaT: Convolutional-Auxiliary Efficient Graph Reasoning Transformer for Dense Image Predictions | |
CN117290478A (en) | Knowledge graph question-answering method, device, equipment and storage medium | |
Mathur et al. | DocEdit: language-guided document editing | |
Xu et al. | Panel-page-aware comic genre understanding |