Starred repositories
Code for the ICASSP 2023 paper "MRML: Multimodal Rumor Detection by Deep Metric Learning".
Code for "Question Generation for Adaptive Education", to appear at ACL 2021.
Word-definition pairs scraped from words.hk for a semantic similarity task. Formatted for use with BERT.
Cantonese-Mandarin unsupervised neural translation for sw project
fastText vectors created from Hong Kong data.
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
Malaysia Cantonese Corpus (MYCanCor) - A video corpus of natural Cantonese conversations
Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project
This repository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
An audio and transcribed corpus of contemporary Hong Kong Cantonese
Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
MLNLP: This repository is a collection of papers from top AI conferences (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open-source code
An open-source tool-augmented conversational language model from Fudan University
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
Course notes for Stanford CS224n: Natural Language Processing with Deep Learning, Winter 2019 (using PyTorch)
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
A powerful HTTP client for Dart and Flutter, which supports global settings, Interceptors, FormData, aborting and canceling a request, files uploading and downloading, requests timeout, custom adap…