Liu et al., 2021 - Google Patents

Morphological segmentation for Seneca

Liu et al., 2021

Document ID: 17862669748803266342
Author: Liu Z; Jimerson R; Prud’Hommeaux E
Publication year: 2021
Publication venue: First Workshop on Natural Language Processing for Indigenous Languages of the Americas

External Links

Cited by

Snippet

This study takes up the task of low-resource morphological segmentation for Seneca, a critically endangered and morphologically complex Native American language primarily spoken in what is now New York State and Ontario. The labeled data in our experiments …

Continue reading at par.nsf.gov (PDF) (other versions)

230000011218 segmentation 0 title abstract description 38

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling

Similar Documents

Publication	Publication Date	Title
Mave et al.	2018	Language identification and analysis of code-switched social media text
CN109325229B (en)	2023-01-31	Method for calculating text similarity by utilizing semantic information
Berardi et al.	2015	Word Embeddings Go to Italy: A Comparison of Models and Training Datasets.
Arshad et al.	2019	Corpus for emotion detection on roman urdu
Schmaltz et al.	2016	Sentence-level grammatical error identification as sequence-to-sequence correction
Liu et al.	2021	Morphological segmentation for Seneca
CN111339772B (en)	2023-11-14	Russian text emotion analysis method, electronic device and storage medium
Etxeberria et al.	2016	Evaluating the noisy channel model for the normalization of historical texts: Basque, Spanish and Slovene
Pennell et al.	2010	Normalization of text messages for text-to-speech
Vyas et al.	2020	Real time machine translation system for english to indian language
Suwanbandit et al.	2023	Thai dialect corpus and transfer-based curriculum learning investigation for dialect automatic speech recognition
Hamed et al.	2022	Holy quran-italian seq2seq machine translation with attention mechanism
JP2016224483A (en)	2016-12-28	Model learning device, method and program
Sarkar	2016	Part-of-speech tagging for code-mixed indian social media text at icon 2015
Tedla et al.	2017	Analyzing word embeddings and improving POS tagger of tigrinya
Sen et al.	2021	Bangla natural language processing: A comprehensive review of classical machine learning and deep learning based methods
Godard	2019	Unsupervised word discovery for computational language documentation
Larkin et al.	2024	MSLC24 submissions to the general machine translation task
Cui et al.	2014	Learning effective word embedding using morphological word similarity
Núñez et al.	2019	Phonetic normalization for machine translation of user generated content
Hossain et al.	2021	Bert-based text simplification approach to reduce linguistic complexity of bangla language
Asahiah	2014	Development of a Standard Yorùbá digital text automatic diacritic restoration system
Al-Banna et al.	2023	Automatic Text Summarization Based on Pre-trained Models
Mitreska et al.	2023	Syllable and Morpheme Segmentation of Macedonian Language
Sakr et al.	2023	AraPunc: Arabic Punctuation Restoration Using Transformers