SPLRE: Vol 57, No 1

Volume 57, Issue 1Mar 2023

Volume 57, Issue 1

Mar 2023

Publisher:

Springer-Verlag
Berlin, Heidelberg

ISSN:1574-020X

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

editorial

Editorial: LRE updates

Pages 1–3https://doi.org/10.1007/s10579-023-09645-4

research-article

TIARA 2.0: an interactive tool for annotating discourse structure and text improvement

Pages 5–29https://doi.org/10.1007/s10579-021-09566-0

Abstract

Discourse structure annotation aims at analysing how discourse units (e.g. sentences or clauses) relate to each other and what roles they play in the overall discourse. Several annotation tools for discourse structure have been developed. However, ...

research-article

Statistical quality estimation for partially subjective classification tasks through crowdsourcing

Pages 31–56https://doi.org/10.1007/s10579-022-09617-0

Abstract

When constructing a large-scale data resource, the quality of artifacts has great significance, especially when they are generated by creators through crowdsourcing. A widely used approach is to estimate the quality of each artifact based on ...

research-article

Public Access

COLLIE: a broad-coverage ontology and lexicon of verbs in English

Pages 57–86https://doi.org/10.1007/s10579-022-09600-9

Abstract

Progress on deep language understanding is inhibited by the lack of a broad coverage lexicon that connects linguistic behavior to ontological concepts and axioms. We have developed COLLIE-V, a deep lexical resource for verbs, with the coverage of ...

correction

Correction: COLLIE: a broad-coverage ontology and lexicon of verbs in English

Page 87https://doi.org/10.1007/s10579-023-09639-2

research-article

The WASABI song corpus and knowledge graph for music lyrics analysis

Pages 89–119https://doi.org/10.1007/s10579-022-09601-8

Abstract

We present the WASABI Song Corpus, a large corpus of songs enriched with metadata extracted from music databases on the Web, and resulting from the processing of song lyrics and from audio analysis. More specifically, given that lyrics encode an ...

research-article

Between welcome culture and border fence: A dataset on the European refugee crisis in German newspaper reports

Pages 121–153https://doi.org/10.1007/s10579-023-09641-8

Abstract

Newspaper reports provide a rich source of information on the unfolding of public debates, which can serve as basis for inquiry in political science. Such debates are often triggered by critical events, which attract public attention and incite ...

research-article

Investigating the role of swear words in abusive language detection tasks

Pages 155–188https://doi.org/10.1007/s10579-022-09582-8

Abstract

Swearing plays an ubiquitous role in everyday conversations among humans, both in oral and textual communication, and occurs frequently in social media texts, typically featured by informal language and spontaneous writing. Such occurrences can be ...

research-article

EventDNA: a dataset for Dutch news event extraction as a basis for news diversification

Pages 189–221https://doi.org/10.1007/s10579-022-09623-2

Abstract

News organizations increasingly tailor their news offering to the reader through personalized recommendation algorithms. However, automated recommendation algorithms reflect a commercial logic based on calculated relevance to the user, rather than ...

research-article

Usage disambiguation of Turkish discourse connectives

Pages 223–256https://doi.org/10.1007/s10579-022-09614-3

Abstract

This paper describes a rule-based approach and a machine learning approach to disambiguate the discourse usage of Turkish connectives, which not only has single and phrasal connectives as most languages do, but also suffixal connectives that ...

research-article

The impact of preprocessing on word embedding quality: a comparative study

Pages 257–291https://doi.org/10.1007/s10579-022-09620-5

Abstract

Data preprocessing is among the principal stages in virtually all text-based tasks. In this light, recent approaches have employed word embeddings in the majority of text-based tasks, wherein word co-occurrences are used as the basis of word ...

research-article

Spelling errors made by people with dyslexia

Pages 293–322https://doi.org/10.1007/s10579-022-09603-6

Abstract

In this paper, we present a review of studies that have collected and annotated errors produced by people with dyslexia from corpora of written texts (six studies involving English, Spanish, German and French). Such resources are useful for ...

research-article

Nonverbal communication with emojis in social media: dissociating hedonic intensity from frequency

Pages 323–342https://doi.org/10.1007/s10579-022-09611-6

Abstract

As a popular means of nonverbal communication in social media, emojis provide quick predictions about public sentiments towards social events. Previous analyses of emojis reported that people use positive emojis more frequently than negative ...

research-article

Managing, storing, and sharing long-form recordings and their annotations

Pages 343–375https://doi.org/10.1007/s10579-022-09579-3

Abstract

The technique of long-form recordings via wearables is gaining momentum in different fields of research, notably linguistics and neurology. This technique, however, poses several technical challenges, some of which are amplified by the ...

brief-report

Manipuri–English comparable corpus for cross-lingual studies

Pages 377–413https://doi.org/10.1007/s10579-021-09576-y

Abstract

This paper presents Mni-EnCC, a temporal alligned Manipuri–English comparable corpus, to facilitate cross-lingual studies between Manipuri and English. Mni-EnCC has been created by collating text from two publicly published news sources in ...

brief-report

The ParlaMint corpora of parliamentary proceedings

Pages 415–448https://doi.org/10.1007/s10579-021-09574-0

Abstract

This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are ...

research-article

Resources for Turkish natural language processing: A critical survey

Pages 449–488https://doi.org/10.1007/s10579-022-09605-4

Abstract

This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available ...

correction

Correction to: Resources for Turkish natural language processing: A critical survey

Page 489https://doi.org/10.1007/s10579-022-09625-0

correction

Correction to: Two sepedi‑english code‑switched speech corpora

Page 491https://doi.org/10.1007/s10579-022-09607-2

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Language Resources and Evaluation

Sections

Editorial: LRE updates

TIARA 2.0: an interactive tool for annotating discourse structure and text improvement

Statistical quality estimation for partially subjective classification tasks through crowdsourcing

COLLIE: a broad-coverage ontology and lexicon of verbs in English

Correction: COLLIE: a broad-coverage ontology and lexicon of verbs in English

The WASABI song corpus and knowledge graph for music lyrics analysis

Between welcome culture and border fence: A dataset on the European refugee crisis in German newspaper reports

Investigating the role of swear words in abusive language detection tasks

EventDNA: a dataset for Dutch news event extraction as a basis for news diversification

Usage disambiguation of Turkish discourse connectives

The impact of preprocessing on word embedding quality: a comparative study

Spelling errors made by people with dyslexia

Nonverbal communication with emojis in social media: dissociating hedonic intensity from frequency

Managing, storing, and sharing long-form recordings and their annotations

Manipuri–English comparable corpus for cross-lingual studies

The ParlaMint corpora of parliamentary proceedings

Resources for Turkish natural language processing: A critical survey

Correction to: Resources for Turkish natural language processing: A critical survey

Correction to: Two sepedi‑english code‑switched speech corpora