[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Liu et al., 2021 - Google Patents

Morphological segmentation for Seneca

Liu et al., 2021

View PDF
Document ID
17862669748803266342
Author
Liu Z
Jimerson R
Prud’Hommeaux E
Publication year
Publication venue
First Workshop on Natural Language Processing for Indigenous Languages of the Americas

External Links

Snippet

This study takes up the task of low-resource morphological segmentation for Seneca, a critically endangered and morphologically complex Native American language primarily spoken in what is now New York State and Ontario. The labeled data in our experiments …
Continue reading at par.nsf.gov (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/277Lexical analysis, e.g. tokenisation, collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2809Data driven translation
    • G06F17/2827Example based machine translation; Alignment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2872Rule based translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling

Similar Documents

Publication Publication Date Title
Mave et al. Language identification and analysis of code-switched social media text
CN109325229B (en) Method for calculating text similarity by utilizing semantic information
Berardi et al. Word Embeddings Go to Italy: A Comparison of Models and Training Datasets.
Arshad et al. Corpus for emotion detection on roman urdu
Schmaltz et al. Sentence-level grammatical error identification as sequence-to-sequence correction
Liu et al. Morphological segmentation for Seneca
CN111339772B (en) Russian text emotion analysis method, electronic device and storage medium
Etxeberria et al. Evaluating the noisy channel model for the normalization of historical texts: Basque, Spanish and Slovene
Pennell et al. Normalization of text messages for text-to-speech
Vyas et al. Real time machine translation system for english to indian language
Suwanbandit et al. Thai dialect corpus and transfer-based curriculum learning investigation for dialect automatic speech recognition
Hamed et al. Holy quran-italian seq2seq machine translation with attention mechanism
JP2016224483A (en) Model learning device, method and program
Sarkar Part-of-speech tagging for code-mixed indian social media text at icon 2015
Tedla et al. Analyzing word embeddings and improving POS tagger of tigrinya
Sen et al. Bangla natural language processing: A comprehensive review of classical machine learning and deep learning based methods
Godard Unsupervised word discovery for computational language documentation
Larkin et al. MSLC24 submissions to the general machine translation task
Cui et al. Learning effective word embedding using morphological word similarity
Núñez et al. Phonetic normalization for machine translation of user generated content
Hossain et al. Bert-based text simplification approach to reduce linguistic complexity of bangla language
Asahiah Development of a Standard Yorùbá digital text automatic diacritic restoration system
Al-Banna et al. Automatic Text Summarization Based on Pre-trained Models
Mitreska et al. Syllable and Morpheme Segmentation of Macedonian Language
Sakr et al. AraPunc: Arabic Punctuation Restoration Using Transformers