[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Steinunn Rut Friðriksdóttir and Anton Karl Ingason

Affiliation: Faculty of Icelandic and Comparative Cultural Studies, University of Iceland, Sæmundargata 2, 102 Reykjavík, Iceland

Keyword(s): Confusion Sets, Homophones, Context Dependency, Rich Morphology, Disambiguation, Icelandic.

Abstract: The processing of strings which are semantically distinct but can be easily confused with each other, often on account of being pronounced identically, is a prime example of context dependency in Natural Language Processing. This problem arises when a system needs to distinguish whether a bank is a ‘river bank’ or a ‘financial institution’ and it also challenges systems for context-sensitive spelling and grammar correction because pairs like their/there and I/me are one common source of issues that such systems must address. In practice, this type of context-dependency can be especially prominent in languages with rich morphology where large paradigms of inflected word forms lead to a proliferation of such confusion sets. In this paper, we present our novel confusion set corpus for Icelandic as well as our findings from an experiment that uses well-known classification algorithms to disambiguate confusion sets that appear in our corpus.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 79.170.44.78

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Friðriksdóttir, S. and Ingason, A. (2020). Disambiguating Confusion Sets in a Language with Rich Morphology. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI; ISBN 978-989-758-395-7; ISSN 2184-433X, SciTePress, pages 446-451. DOI: 10.5220/0009371504460451

@conference{nlpinai20,
author={Steinunn Rut Friðriksdóttir and Anton Karl Ingason},
title={Disambiguating Confusion Sets in a Language with Rich Morphology},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI},
year={2020},
pages={446-451},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009371504460451},
isbn={978-989-758-395-7},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI
TI - Disambiguating Confusion Sets in a Language with Rich Morphology
SN - 978-989-758-395-7
IS - 2184-433X
AU - Friðriksdóttir, S.
AU - Ingason, A.
PY - 2020
SP - 446
EP - 451
DO - 10.5220/0009371504460451
PB - SciTePress

<style> #socialicons>a span { top: 0px; left: -100%; -webkit-transition: all 0.3s ease; -moz-transition: all 0.3s ease-in-out; -o-transition: all 0.3s ease-in-out; -ms-transition: all 0.3s ease-in-out; transition: all 0.3s ease-in-out;} #socialicons>ahover div{left: 0px;} </style>