Liu et al., 2021 - Google Patents
Morphological segmentation for SenecaLiu et al., 2021
View PDF- Document ID
- 17862669748803266342
- Author
- Liu Z
- Jimerson R
- Prud’Hommeaux E
- Publication year
- Publication venue
- First Workshop on Natural Language Processing for Indigenous Languages of the Americas
External Links
Snippet
This study takes up the task of low-resource morphological segmentation for Seneca, a critically endangered and morphologically complex Native American language primarily spoken in what is now New York State and Ontario. The labeled data in our experiments …
- 230000011218 segmentation 0 title abstract description 38
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Mave et al. | Language identification and analysis of code-switched social media text | |
CN109325229B (en) | Method for calculating text similarity by utilizing semantic information | |
Berardi et al. | Word Embeddings Go to Italy: A Comparison of Models and Training Datasets. | |
Arshad et al. | Corpus for emotion detection on roman urdu | |
Schmaltz et al. | Sentence-level grammatical error identification as sequence-to-sequence correction | |
Liu et al. | Morphological segmentation for Seneca | |
CN111339772B (en) | Russian text emotion analysis method, electronic device and storage medium | |
Etxeberria et al. | Evaluating the noisy channel model for the normalization of historical texts: Basque, Spanish and Slovene | |
Pennell et al. | Normalization of text messages for text-to-speech | |
Vyas et al. | Real time machine translation system for english to indian language | |
Suwanbandit et al. | Thai dialect corpus and transfer-based curriculum learning investigation for dialect automatic speech recognition | |
Hamed et al. | Holy quran-italian seq2seq machine translation with attention mechanism | |
JP2016224483A (en) | Model learning device, method and program | |
Sarkar | Part-of-speech tagging for code-mixed indian social media text at icon 2015 | |
Tedla et al. | Analyzing word embeddings and improving POS tagger of tigrinya | |
Sen et al. | Bangla natural language processing: A comprehensive review of classical machine learning and deep learning based methods | |
Godard | Unsupervised word discovery for computational language documentation | |
Larkin et al. | MSLC24 submissions to the general machine translation task | |
Cui et al. | Learning effective word embedding using morphological word similarity | |
Núñez et al. | Phonetic normalization for machine translation of user generated content | |
Hossain et al. | Bert-based text simplification approach to reduce linguistic complexity of bangla language | |
Asahiah | Development of a Standard Yorùbá digital text automatic diacritic restoration system | |
Al-Banna et al. | Automatic Text Summarization Based on Pre-trained Models | |
Mitreska et al. | Syllable and Morpheme Segmentation of Macedonian Language | |
Sakr et al. | AraPunc: Arabic Punctuation Restoration Using Transformers |