Ngomo et al., 2017 - Google Patents
BENGAL: an automatic benchmark generator for entity recognition and linkingNgomo et al., 2017
View PDF- Document ID
- 16866407764868734758
- Author
- Ngomo A
- Röder M
- Moussallem D
- Usbeck R
- Speck R
- Publication year
- Publication venue
- arXiv preprint arXiv:1710.08691
External Links
Snippet
The manual creation of gold standards for named entity recognition and entity linking is time- and resource-intensive. Moreover, recent works show that such gold standards contain a large proportion of mistakes in addition to being difficult to maintain. We hence present …
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold data:image/svg+xml;base64,PD94bWwgdmVyc2lvbj0nMS4wJyBlbmNvZGluZz0naXNvLTg4NTktMSc/Pgo8c3ZnIHZlcnNpb249JzEuMScgYmFzZVByb2ZpbGU9J2Z1bGwnCiAgICAgICAgICAgICAgeG1sbnM9J2h0dHA6Ly93d3cudzMub3JnLzIwMDAvc3ZnJwogICAgICAgICAgICAgICAgICAgICAgeG1sbnM6cmRraXQ9J2h0dHA6Ly93d3cucmRraXQub3JnL3htbCcKICAgICAgICAgICAgICAgICAgICAgIHhtbG5zOnhsaW5rPSdodHRwOi8vd3d3LnczLm9yZy8xOTk5L3hsaW5rJwogICAgICAgICAgICAgICAgICB4bWw6c3BhY2U9J3ByZXNlcnZlJwp3aWR0aD0nMzAwcHgnIGhlaWdodD0nMzAwcHgnIHZpZXdCb3g9JzAgMCAzMDAgMzAwJz4KPCEtLSBFTkQgT0YgSEVBREVSIC0tPgo8cmVjdCBzdHlsZT0nb3BhY2l0eToxLjA7ZmlsbDojRkZGRkZGO3N0cm9rZTpub25lJyB3aWR0aD0nMzAwLjAnIGhlaWdodD0nMzAwLjAnIHg9JzAuMCcgeT0nMC4wJz4gPC9yZWN0Pgo8dGV4dCB4PScxMzguMCcgeT0nMTcwLjAnIGNsYXNzPSdhdG9tLTAnIHN0eWxlPSdmb250LXNpemU6NDBweDtmb250LXN0eWxlOm5vcm1hbDtmb250LXdlaWdodDpub3JtYWw7ZmlsbC1vcGFjaXR5OjE7c3Ryb2tlOm5vbmU7Zm9udC1mYW1pbHk6c2Fucy1zZXJpZjt0ZXh0LWFuY2hvcjpzdGFydDtmaWxsOiMzQjQxNDMnID5BPC90ZXh0Pgo8dGV4dCB4PScxNjUuNicgeT0nMTcwLjAnIGNsYXNzPSdhdG9tLTAnIHN0eWxlPSdmb250LXNpemU6NDBweDtmb250LXN0eWxlOm5vcm1hbDtmb250LXdlaWdodDpub3JtYWw7ZmlsbC1vcGFjaXR5OjE7c3Ryb2tlOm5vbmU7Zm9udC1mYW1pbHk6c2Fucy1zZXJpZjt0ZXh0LWFuY2hvcjpzdGFydDtmaWxsOiMzQjQxNDMnID51PC90ZXh0Pgo8cGF0aCBkPSdNIDE4OS44LDE1MC4wIEwgMTg5LjgsMTQ5LjggTCAxODkuOCwxNDkuNyBMIDE4OS44LDE0OS41IEwgMTg5LjcsMTQ5LjMgTCAxODkuNiwxNDkuMiBMIDE4OS42LDE0OS4wIEwgMTg5LjUsMTQ4LjkgTCAxODkuNCwxNDguNyBMIDE4OS4zLDE0OC42IEwgMTg5LjEsMTQ4LjUgTCAxODkuMCwxNDguNCBMIDE4OC44LDE0OC4zIEwgMTg4LjcsMTQ4LjIgTCAxODguNSwxNDguMSBMIDE4OC40LDE0OC4xIEwgMTg4LjIsMTQ4LjAgTCAxODguMCwxNDguMCBMIDE4Ny45LDE0OC4wIEwgMTg3LjcsMTQ4LjAgTCAxODcuNSwxNDguMCBMIDE4Ny40LDE0OC4xIEwgMTg3LjIsMTQ4LjEgTCAxODcuMCwxNDguMiBMIDE4Ni45LDE0OC4yIEwgMTg2LjcsMTQ4LjMgTCAxODYuNiwxNDguNCBMIDE4Ni41LDE0OC41IEwgMTg2LjMsMTQ4LjcgTCAxODYuMiwxNDguOCBMIDE4Ni4xLDE0OC45IEwgMTg2LjAsMTQ5LjEgTCAxODYuMCwxNDkuMiBMIDE4NS45LDE0OS40IEwgMTg1LjksMTQ5LjYgTCAxODUuOCwxNDkuNyBMIDE4NS44LDE0OS45IEwgMTg1LjgsMTUwLjEgTCAxODUuOCwxNTAuMyBMIDE4NS45LDE1MC40IEwgMTg1LjksMTUwLjYgTCAxODYuMCwxNTAuOCBMIDE4Ni4wLDE1MC45IEwgMTg2LjEsMTUxLjEgTCAxODYuMiwxNTEuMiBMIDE4Ni4zLDE1MS4zIEwgMTg2LjUsMTUxLjUgTCAxODYuNiwxNTEuNiBMIDE4Ni43LDE1MS43IEwgMTg2LjksMTUxLjggTCAxODcuMCwxNTEuOCBMIDE4Ny4yLDE1MS45IEwgMTg3LjQsMTUxLjkgTCAxODcuNSwxNTIuMCBMIDE4Ny43LDE1Mi4wIEwgMTg3LjksMTUyLjAgTCAxODguMCwxNTIuMCBMIDE4OC4yLDE1Mi4wIEwgMTg4LjQsMTUxLjkgTCAxODguNSwxNTEuOSBMIDE4OC43LDE1MS44IEwgMTg4LjgsMTUxLjcgTCAxODkuMCwxNTEuNiBMIDE4OS4xLDE1MS41IEwgMTg5LjMsMTUxLjQgTCAxODkuNCwxNTEuMyBMIDE4OS41LDE1MS4xIEwgMTg5LjYsMTUxLjAgTCAxODkuNiwxNTAuOCBMIDE4OS43LDE1MC43IEwgMTg5LjgsMTUwLjUgTCAxODkuOCwxNTAuMyBMIDE4OS44LDE1MC4yIEwgMTg5LjgsMTUwLjAgTCAxODcuOCwxNTAuMCBaJyBzdHlsZT0nZmlsbDojMDAwMDAwO2ZpbGwtcnVsZTpldmVub2RkO2ZpbGwtb3BhY2l0eToxO3N0cm9rZTojMDAwMDAwO3N0cm9rZS13aWR0aDowLjBweDtzdHJva2UtbGluZWNhcDpidXR0O3N0cm9rZS1saW5lam9pbjptaXRlcjtzdHJva2Utb3BhY2l0eToxOycgLz4KPC9zdmc+Cg== data:image/svg+xml;base64,PD94bWwgdmVyc2lvbj0nMS4wJyBlbmNvZGluZz0naXNvLTg4NTktMSc/Pgo8c3ZnIHZlcnNpb249JzEuMScgYmFzZVByb2ZpbGU9J2Z1bGwnCiAgICAgICAgICAgICAgeG1sbnM9J2h0dHA6Ly93d3cudzMub3JnLzIwMDAvc3ZnJwogICAgICAgICAgICAgICAgICAgICAgeG1sbnM6cmRraXQ9J2h0dHA6Ly93d3cucmRraXQub3JnL3htbCcKICAgICAgICAgICAgICAgICAgICAgIHhtbG5zOnhsaW5rPSdodHRwOi8vd3d3LnczLm9yZy8xOTk5L3hsaW5rJwogICAgICAgICAgICAgICAgICB4bWw6c3BhY2U9J3ByZXNlcnZlJwp3aWR0aD0nODVweCcgaGVpZ2h0PSc4NXB4JyB2aWV3Qm94PScwIDAgODUgODUnPgo8IS0tIEVORCBPRiBIRUFERVIgLS0+CjxyZWN0IHN0eWxlPSdvcGFjaXR5OjEuMDtmaWxsOiNGRkZGRkY7c3Ryb2tlOm5vbmUnIHdpZHRoPSc4NS4wJyBoZWlnaHQ9Jzg1LjAnIHg9JzAuMCcgeT0nMC4wJz4gPC9yZWN0Pgo8dGV4dCB4PSczNS4wJyB5PSc1My42JyBjbGFzcz0nYXRvbS0wJyBzdHlsZT0nZm9udC1zaXplOjIzcHg7Zm9udC1zdHlsZTpub3JtYWw7Zm9udC13ZWlnaHQ6bm9ybWFsO2ZpbGwtb3BhY2l0eToxO3N0cm9rZTpub25lO2ZvbnQtZmFtaWx5OnNhbnMtc2VyaWY7dGV4dC1hbmNob3I6c3RhcnQ7ZmlsbDojM0I0MTQzJyA+QTwvdGV4dD4KPHRleHQgeD0nNTEuMCcgeT0nNTMuNicgY2xhc3M9J2F0b20tMCcgc3R5bGU9J2ZvbnQtc2l6ZToyM3B4O2ZvbnQtc3R5bGU6bm9ybWFsO2ZvbnQtd2VpZ2h0Om5vcm1hbDtmaWxsLW9wYWNpdHk6MTtzdHJva2U6bm9uZTtmb250LWZhbWlseTpzYW5zLXNlcmlmO3RleHQtYW5jaG9yOnN0YXJ0O2ZpbGw6IzNCNDE0MycgPnU8L3RleHQ+CjxwYXRoIGQ9J00gNjcuMyw0Mi4wIEwgNjcuMyw0MS45IEwgNjcuMyw0MS44IEwgNjcuMiw0MS43IEwgNjcuMiw0MS42IEwgNjcuMiw0MS41IEwgNjcuMSw0MS40IEwgNjcuMSw0MS4zIEwgNjcuMCw0MS4zIEwgNjYuOSw0MS4yIEwgNjYuOSw0MS4xIEwgNjYuOCw0MS4xIEwgNjYuNyw0MS4wIEwgNjYuNiw0MS4wIEwgNjYuNSw0MC45IEwgNjYuNCw0MC45IEwgNjYuMyw0MC45IEwgNjYuMiw0MC44IEwgNjYuMSw0MC44IEwgNjYuMCw0MC44IEwgNjUuOSw0MC45IEwgNjUuOCw0MC45IEwgNjUuNyw0MC45IEwgNjUuNyw0MC45IEwgNjUuNiw0MS4wIEwgNjUuNSw0MS4wIEwgNjUuNCw0MS4xIEwgNjUuMyw0MS4yIEwgNjUuMyw0MS4yIEwgNjUuMiw0MS4zIEwgNjUuMSw0MS40IEwgNjUuMSw0MS41IEwgNjUuMCw0MS42IEwgNjUuMCw0MS43IEwgNjUuMCw0MS44IEwgNjUuMCw0MS45IEwgNjUuMCw0Mi4wIEwgNjUuMCw0Mi4wIEwgNjUuMCw0Mi4xIEwgNjUuMCw0Mi4yIEwgNjUuMCw0Mi4zIEwgNjUuMCw0Mi40IEwgNjUuMSw0Mi41IEwgNjUuMSw0Mi42IEwgNjUuMiw0Mi43IEwgNjUuMyw0Mi44IEwgNjUuMyw0Mi44IEwgNjUuNCw0Mi45IEwgNjUuNSw0My4wIEwgNjUuNiw0My4wIEwgNjUuNyw0My4xIEwgNjUuNyw0My4xIEwgNjUuOCw0My4xIEwgNjUuOSw0My4xIEwgNjYuMCw0My4yIEwgNjYuMSw0My4yIEwgNjYuMiw0My4yIEwgNjYuMyw0My4xIEwgNjYuNCw0My4xIEwgNjYuNSw0My4xIEwgNjYuNiw0My4wIEwgNjYuNyw0My4wIEwgNjYuOCw0Mi45IEwgNjYuOSw0Mi45IEwgNjYuOSw0Mi44IEwgNjcuMCw0Mi43IEwgNjcuMSw0Mi43IEwgNjcuMSw0Mi42IEwgNjcuMiw0Mi41IEwgNjcuMiw0Mi40IEwgNjcuMiw0Mi4zIEwgNjcuMyw0Mi4yIEwgNjcuMyw0Mi4xIEwgNjcuMyw0Mi4wIEwgNjYuMSw0Mi4wIFonIHN0eWxlPSdmaWxsOiMwMDAwMDA7ZmlsbC1ydWxlOmV2ZW5vZGQ7ZmlsbC1vcGFjaXR5OjE7c3Ryb2tlOiMwMDAwMDA7c3Ryb2tlLXdpZHRoOjAuMHB4O3N0cm9rZS1saW5lY2FwOmJ1dHQ7c3Ryb2tlLWxpbmVqb2luOm1pdGVyO3N0cm9rZS1vcGFjaXR5OjE7JyAvPgo8L3N2Zz4K [Au] 0 abstract description 14
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30731—Creation of semantic tools
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
- G06F17/30867—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems with filtering and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Medhat et al. | Sentiment analysis algorithms and applications: A survey | |
Nguyen et al. | J-NERD: joint named entity recognition and disambiguation with rich linguistic features | |
Škrlj et al. | tax2vec: Constructing interpretable features from taxonomies for short text classification | |
Penalver-Martinez et al. | Feature-based opinion mining through ontologies | |
Ling et al. | Fine-grained entity recognition | |
Gupta et al. | Analyzing the dynamics of research by extracting key aspects of scientific papers | |
Bovi et al. | Large-scale information extraction from textual definitions through deep syntactic and semantic analysis | |
Gudivada et al. | Big data driven natural language processing research and applications | |
Ghag et al. | Comparative analysis of the techniques for sentiment analysis | |
Pais et al. | NLP-based platform as a service: a brief review | |
Loureiro et al. | Don't neglect the obvious: on the role of unambiguous words in word sense disambiguation | |
Khan et al. | Opinion mining summarization and automation process: a survey | |
Pinnis | Latvian and Lithuanian named entity recognition with TildeNER | |
Vīksna et al. | Sentiment analysis in Latvian and Russian: A survey | |
Ngomo et al. | BENGAL: an automatic benchmark generator for entity recognition and linking | |
Makrynioti et al. | PaloPro: a platform for knowledge extraction from big social data and the news | |
Silva et al. | Word tagging with foundational ontology classes: Extending the wordnet-dolce mapping to verbs | |
Lau et al. | Learning context-sensitive domain ontologies from folksonomies: A cognitively motivated method | |
Rahat et al. | A recursive algorithm for open information extraction from Persian texts | |
Petrasova et al. | Building the semantic similarity model for social network data streams | |
Van Thin et al. | A Systematic Literature Review on Vietnamese Aspect-based Sentiment Analysis | |
Reiter et al. | A resource-poor approach for linking ontology classes to Wikipedia articles | |
Valls et al. | Natural language generation through case-based text modification | |
Francisco | Aspect Term Extraction in Aspect-Based Sentiment Analysis | |
Guo et al. | Recurrent neural CRF for aspect term extraction with dependency transmission |