TALLIP: Vol 22, No 6

Volume 22, Issue 6June 2023

Volume 22, Issue 6

June 2023

Editor:

Imed Zitouni
Google, USA

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:2375-4699

EISSN:2375-4702

Tags:

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Issue Downloads

PDFfront matter (TOC, masthead, submission information)

Select All

Export Citations Save to Binder

research-article

Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

Article No.: 155, Pages 1–28https://doi.org/10.1145/3583988

A good benchmark dataset is a primary requirement in the offline handwritten character recognition (HCR) process. Only three handwritten numerals and alphabet datasets from Odia are publicly accessible for study, although many writers have used several ...

research-article

Detection of Offensive Language and ITS Severity for Low Resource Language

Article No.: 156, Pages 1–27https://doi.org/10.1145/3580476

Continuous proliferation of hate speech in different languages on social media has drawn significant attention from researchers in the past decade. Detecting hate speech is indispensable irrespective of the scale of use of language, as it inflicts huge ...

research-article

Contrastive Adversarial Training for Multi-Modal Machine Translation

Article No.: 157, Pages 1–18https://doi.org/10.1145/3587267

The multi-modal machine translation task is to improve translation quality with the help of additional visual input. It is expected to disambiguate or complement semantics while there are ambiguous words or incomplete expressions in the sentences. ...

research-article

Think More Ambiguity Less: A Novel Dual Interactive Model with Local and Global Semantics for Chinese Named Entity Recognition

Article No.: 158, Pages 1–21https://doi.org/10.1145/3583685

Chinese is a representative East Asian language. Chinese Named Entity Recognition (CNER) aims to recognize various entities. It is significant for other NLP tasks to utilize CNER. Recent research to develop CNER systems has been dedicated to either ...

research-article

Knowledge-enhanced Prompt-tuning for Stance Detection

Article No.: 159, Pages 1–20https://doi.org/10.1145/3588767

Investigating public attitudes on social media is important in opinion mining systems. Stance detection aims to analyze the attitude of an opinionated text (e.g., favor, neutral, or against) toward a given target. Existing methods mainly address this ...

research-article

BayesKGR: Bayesian Few-Shot Learning for Knowledge Graph Reasoning

Article No.: 160, Pages 1–21https://doi.org/10.1145/3589183

Reasoning over knowledge graphs (KGs) has received increasing attention recently due to its promising applications in many areas, such as semantic search and recommendation systems. Subsequently, most reasoning models are inherently transductive and ...

research-article

Image–Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

Article No.: 161, Pages 1–30https://doi.org/10.1145/3584861

Before the arrival of the web as a corpus, people detected positive and negative news based on the understanding of the textual content from physical newspaper rather than an automatic identification approach from readily available e-newspapers. Thus, the ...

research-article

Semi-Supervised Semantic Role Labeling with Bidirectional Language Models

Article No.: 162, Pages 1–20https://doi.org/10.1145/3587160

The recent success of neural networks in NLP applications has provided a strong impetus to develop supervised models for semantic role labeling (SRL) that forego the requirement for extensive feature engineering. Recent state-of-the-art approaches require ...

research-article

Open Access

Integrating Reconstructor and Post-Editor into Neural Machine Translation

Article No.: 163, Pages 1–15https://doi.org/10.1145/3588766

Neural machine translation (NMT) mainly comprises the encoder and decoder. The encoder is mainly used to extract the feature vector of the source language sentence. The decoder predicts the next token according to the feature vector extracted by the ...

research-article

An Efficient and Accurate Detection of Fake News Using Capsule Transient Auto Encoder

Article No.: 164, Pages 1–22https://doi.org/10.1145/3589184

Fake news is “news reports that are deliberatively and indisputably fake.” News that uses fake information is becoming a threat. It becomes challenging for humans to distinguish between fake and actual news. It has become necessary to detect fake news, ...

research-article

LFWE: Linguistic Feature Based Word Embedding for Hindi Fake News Detection

Article No.: 165, Pages 1–24https://doi.org/10.1145/3589764

It is essential for research communities to investigate ways for authenticating news. The use of linguistic feature based analysis to automatically detect false news is gaining popularity among the scientific community. However, such techniques are ...

research-article

Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

Article No.: 166, Pages 1–27https://doi.org/10.1145/3589131

Sentiment Analysis (SA) is one of the most active research areas in the Natural Language Processing (NLP) field due to its potential for business and society. With the development of language representation models, numerous methods have shown promising ...

research-article

Part-of-Speech Tagging of Odia Language Using Statistical and Deep Learning Based Approaches

Article No.: 167, Pages 1–24https://doi.org/10.1145/3588900

Automatic part-of-speech (POS) tagging is a preprocessing step of many natural language processing tasks, such as named entity recognition, speech processing, information extraction, word sense disambiguation, and machine translation. It has already ...

research-article

Komala and Kaṭhora: A Novel Approach Towards Classification of Hindi Poetry

Article No.: 168, Pages 1–13https://doi.org/10.1145/3589249

Literary compositions are very often analyzed using various constituent units like words, phrases, sentences, and paragraphs. Unlike the conventional research that focuses on the aforementioned constituent units, our task is a statistical effort carried ...

research-article

Improving Multilingual Neural Machine Translation System for Indic Languages

Article No.: 169, Pages 1–24https://doi.org/10.1145/3587932

The Machine Translation System (MTS) serves as effective tool for communication by translating text or speech from one language to another language. Recently, neural machine translation (NMT) has become popular for its performance and cost-effectiveness. ...

research-article

Prose2Poem: The Blessing of Transformers in Translating Prose to Persian Poetry

Article No.: 170, Pages 1–18https://doi.org/10.1145/3592791

Persian poetry has consistently expressed its philosophy, wisdom, speech, and rationale based on its couplets, making it an enigmatic language on its own to both native and non-native speakers. Nevertheless, the noticeable gap between Persian prose and ...

research-article

TPoet: Topic-Enhanced Chinese Poetry Generation

Article No.: 171, Pages 1–15https://doi.org/10.1145/3593805

Chinese poetry generation has been a challenging part of natural language processing due to the unique literariness and aesthetics of poetry. In most cases, the content of poetry is topic related. In other words, specific thoughts or emotions are usually ...

research-article

Metadial: A Meta-learning Approach for Arabic Dialogue Generation

Article No.: 172, Pages 1–21https://doi.org/10.1145/3590960

Dialogue generation is the automatic generation of a text response, given a user’s input. Dialogue generation for low-resource languages has been a challenging tasks for researchers. However, the advancements in deep learning models have made developing ...

research-article

Cross-lingual Text Reuse Detection at Document Level for English-Urdu Language Pair

Article No.: 173, Pages 1–22https://doi.org/10.1145/3592761

In recent years, the problem of Cross-Lingual Text Reuse Detection (CLTRD) has gained the interest of the research community due to the availability of large digital repositories and automatic Machine Translation (MT) systems. These systems are readily ...

research-article

Enhancing RDF Verbalization with Descriptive and Relational Knowledge

Article No.: 174, Pages 1–18https://doi.org/10.1145/3595293

RDF verbalization has received increasing interest, which aims to generate a natural language description of the knowledge base. Sequence-to-sequence models based on Transformer are able to obtain strong performance equipped with pre-trained language ...

research-article

Open Access

Semantic Tagging for the Urdu Language: Annotated Corpus and Multi-Target Classification Methods

Article No.: 175, Pages 1–32https://doi.org/10.1145/3582496

Extracting and analysing meaning-related information from natural language data has attracted the attention of researchers in various fields, such as natural language processing, corpus linguistics, information retrieval, and data science. An important ...

research-article

Cross-lingual Sentence Embedding for Low-resource Chinese-Vietnamese Based on Contrastive Learning

Article No.: 176, Pages 1–18https://doi.org/10.1145/3589341

Cross-lingual sentence embedding’s goal is mapping sentences with similar semantics but in different languages close together and dissimilar sentences farther apart in the representation space. It is the basis of many downstream tasks such as cross-...

research-article

Text Polishing with Chinese Idiom: Task, Datasets and Pre-trained Baselines

Article No.: 177, Pages 1–24https://doi.org/10.1145/3593806

This work presents the task of text polishing, which generates a sentence that is more graceful than the input sentence while retaining its semantic meaning. Text polishing has great value in real usage and is an important component in modern writing ...

research-article

Alabib-65: A Realistic Dataset for Algerian Sign Language Recognition

Article No.: 178, Pages 1–23https://doi.org/10.1145/3596909

Sign language recognition (SLR) is a promising research field that aims to blur boundaries between Deaf and hearing people by creating a system that can transcribe signs into a written or vocal language. There is a growing body of literature that ...

research-article

Using Data Augmentation and Bidirectional Encoder Representations from Transformers for Improving Punjabi Named Entity Recognition

Article No.: 179, Pages 1–13https://doi.org/10.1145/3595861

Named entity recognition (NER) is a task of proper noun identification from natural language text and classification into various types such as location, person, and organization. Due to NER's applications in different natural language processing (NLP) ...

research-article

From Softmax to Nucleusmax: A Novel Sparse Language Model for Chinese Radiology Report Summarization

Article No.: 180, Pages 1–21https://doi.org/10.1145/3596219

The Chinese radiology report summarization is a crucial component in smart healthcare that employs language models to summarize key findings in radiology reports and communicate these findings to physicians. However, most language models for radiology ...

research-article

The Impact of Arabic Diacritization on Word Embeddings

Article No.: 181, Pages 1–30https://doi.org/10.1145/3592603

Word embedding is used to represent words for text analysis. It plays an essential role in many Natural Language Processing (NLP) studies and has hugely contributed to the extraordinary developments in the field in the last few years. In Arabic, diacritic ...

short-paper

Robust Multi-task Learning-based Korean POS Tagging to Overcome Word Spacing Errors

Article No.: 182, Pages 1–13https://doi.org/10.1145/3591206

End-to-end neural network-based approaches have recently demonstrated significant improvements in natural language processing (NLP). However, in the NLP application such as assistant systems, NLP components are still processed to extract results using a ...

short-paper

Multilingual BERT-based Word Alignment By Incorporating Common Chinese Characters

Article No.: 183, Pages 1–13https://doi.org/10.1145/3594634

Word alignment is an important task of detecting translation equivalents between a sentence pair. Although word alignment is no longer necessarily needed for neural machine translation, it’s still useful in a wealth of applications, e.g., bilingual ...

note

Open Access

Dataset Enhancement and Multilingual Transfer for Named Entity Recognition in the Indonesian Language

Article No.: 184, Pages 1–21https://doi.org/10.1145/3592854

Named entity recognition in the Indonesian language has significantly developed in recent years. However, it still lacks standardized publicly available corpora; a small dataset is available but suffers from inconsistent annotations. Therefore, we re-...

Subjects

Comments

Please enable JavaScript to view thecomments powered by Disqus.

ACM Transactions on Asian and Low-Resource Language Information Processing

Sections

Issue Downloads

Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

Detection of Offensive Language and ITS Severity for Low Resource Language

Contrastive Adversarial Training for Multi-Modal Machine Translation

Think More Ambiguity Less: A Novel Dual Interactive Model with Local and Global Semantics for Chinese Named Entity Recognition

Knowledge-enhanced Prompt-tuning for Stance Detection

BayesKGR: Bayesian Few-Shot Learning for Knowledge Graph Reasoning

Image–Text Multimodal Sentiment Analysis Framework of Assamese News Articles Using Late Fusion

Semi-Supervised Semantic Role Labeling with Bidirectional Language Models

Integrating Reconstructor and Post-Editor into Neural Machine Translation

An Efficient and Accurate Detection of Fake News Using Capsule Transient Auto Encoder

LFWE: Linguistic Feature Based Word Embedding for Hindi Fake News Detection

Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

Part-of-Speech Tagging of Odia Language Using Statistical and Deep Learning Based Approaches

Komala and Kaṭhora: A Novel Approach Towards Classification of Hindi Poetry

Improving Multilingual Neural Machine Translation System for Indic Languages

Prose2Poem: The Blessing of Transformers in Translating Prose to Persian Poetry

TPoet: Topic-Enhanced Chinese Poetry Generation

Metadial: A Meta-learning Approach for Arabic Dialogue Generation

Cross-lingual Text Reuse Detection at Document Level for English-Urdu Language Pair

Enhancing RDF Verbalization with Descriptive and Relational Knowledge

Semantic Tagging for the Urdu Language: Annotated Corpus and Multi-Target Classification Methods

Cross-lingual Sentence Embedding for Low-resource Chinese-Vietnamese Based on Contrastive Learning

Text Polishing with Chinese Idiom: Task, Datasets and Pre-trained Baselines

Alabib-65: A Realistic Dataset for Algerian Sign Language Recognition

Using Data Augmentation and Bidirectional Encoder Representations from Transformers for Improving Punjabi Named Entity Recognition

From Softmax to Nucleusmax: A Novel Sparse Language Model for Chinese Radiology Report Summarization

The Impact of Arabic Diacritization on Word Embeddings

Robust Multi-task Learning-based Korean POS Tagging to Overcome Word Spacing Errors

Multilingual BERT-based Word Alignment By Incorporating Common Chinese Characters

Dataset Enhancement and Multilingual Transfer for Named Entity Recognition in the Indonesian Language