Anand Kumar et al., 2010 - Google Patents

A sequence labeling approach to morphological analyzer for tamil language

Anand Kumar et al., 2010

Document ID: 6657470950506147006
Author: Anand Kumar M; Dhanalakshmi V; Soman K; Rajendran S
Publication year: 2010
Publication venue: International Journal on Computer Science and Engineering

External Links

Cited by

Snippet

Natural Language Processing task. Morphology is the study of internal structure of the word. Morphological analysis retrieves the grammatical features and properties of a morphologically inflected word. Capturing the agglutinative structure of Tamil words by an …

Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

230000000877 morphologic 0 title abstract description 77

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
- G06F17/30964—Querying
- G06F17/30979—Query processing
- G06F17/30985—Query processing by using string matching techniques
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Anand Kumar et al.	2010	A sequence labeling approach to morphological analyzer for tamil language
Soudi et al.	2007	Arabic computational morphology: knowledge-based and empirical methods
Kumar et al.	2010	Part of speech taggers for morphologically rich indian languages: a survey
Kumar et al.	2009	Morphological analyzer for agglutinative languages using machine learning approaches
KR101023209B1 (en)	2011-03-18	Document translation apparatus and its method
KR101072460B1 (en)	2011-10-11	Method for korean morphological analysis
Ezhilarasi et al.	2021	Depicting a Neural Model for Lemmatization and POS Tagging of words from Palaeographic stone inscriptions
Pal et al.	2020	Vartani Spellcheck--Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance
Mulugeta et al.	2012	Learning morphological rules for Amharic verbs using inductive logic programming
Belay et al.	2021	Impacts of homophone normalization on semantic models for amharic
Haq et al.	2023	NLPashto: NLP toolkit for low-resource Pashto language
Vasiu et al.	2020	Enhancing tokenization by embedding romanian language specific morphology
Masri et al.	2024	Transformer Models in Education: Summarizing Science Textbooks with AraBART, MT5, AraT5, and mBART
Doumi et al.	2016	A semi-automatic and low cost approach to build scalable lemma-based lexical resources for Arabic verbs
Voditel et al.	2023	Image Captioning-A Deep Learning Approach Using CNN and LSTM Network
KR20040018008A (en)	2004-03-02	Apparatus for tagging part of speech and method therefor
Maulud et al.	2023	Towards a Complete Kurdish NLP Pipeline: Challenges and Opportunities
kumar et al.	2014	AMRITA_CEN@ FIRE-2014: morpheme extraction and lemmatization for tamil using machine learning
Olivo et al.	2019	CRFPOST: Part-of-Speech Tagger for Filipino Texts using Conditional Random Fields
Divate	2023	Hybrid Morph-Analysis Model for Marathi
Paul et al.	2023	Bengali UPOS-Tag: A Systematic Approach to Universal Dependency-Based Dataset Creation for Enhanced NLP Research
Shanilka	2022	Learning a wide-coverage generalized classifier model for Sinhala morphology
Gamo et al.	2024	Deep Learning Based Model for a Spell Checker of Wolaita Language
Birhanu	2024	Transition Based Dependency Parser for Amharic Language using Transformer Model
Vetriselvi et al.	2023	Regex Parsing in Hybrid and Pure Approaches of Text Summarization