A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 17,178 3,875 Updated May 6, 2025

zhengyima / DHAP

Source code of SIGIR2021 Paper 'One Chatbot Per Person: Creating Personalized Chatbots based on Implicit Profiles'

Python 45 2 Updated Sep 21, 2021

songhaoyu / BoB

The released codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

Python 128 24 Updated Sep 13, 2021

perkfly / reverse-interview-zh

技术面试最后反问面试官的话

18,094 1,377 Updated Mar 4, 2024

sebastian-hofstaetter / intra-document-cascade

Jupyter Notebook 17 3 Updated Jul 11, 2021

zhengyima / pretrain4ir_tutorial

NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking

Python 13 Updated Sep 10, 2021

wasiahmad / context_attentive_ir

Official implementation of our ICLR 2018 and SIGIR 2019 papers on Context-aware Neural Information Retrieval

Python 118 30 Updated Apr 3, 2020

chuhaojin / BriVL-BUA-applications

Bling's Object detection tool

Python 56 11 Updated Jan 9, 2023

DaoD / COCA

CIKM 2021: Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking

Python 19 1 Updated Sep 28, 2022

syuqings / Fashion-MMT

Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".

Python 25 3 Updated Mar 6, 2022

zhengyima / Anchors

Source code of CIKM2021 Paper 'Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need'

Python 16 1 Updated Aug 30, 2021

jiqizhixin / Artificial-Intelligence-Terminology-Database

A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain

1,959 334 Updated Dec 30, 2022

zhengyima / knowqa

预训练模型知识量度量竞赛 Baseline F1 0.35 BERTForMaskedLM

Python 12 2 Updated Sep 2, 2021

allenai / longformer

Longformer: The Long-Document Transformer

Python 2,118 281 Updated Feb 8, 2023

qhjqhj00 / SIGIR2021-Pchatbot

Python 79 6 Updated Jul 3, 2023

taesunwhang / UMS-ResSel

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Python 51 12 Updated Jan 16, 2021

AdeDZY / DeepCT

DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.

Python 320 46 Updated May 9, 2021

Georgetown-IR-Lab / cedr

Code for CEDR: Contextualized Embeddings for Document Ranking, accepted at SIGIR 2019.

amyz zhengyima

Starred repositories

information-retrieval

seq2seq-chatbot

dialogue

dialogue-systems

Natural language processing

Deep learning

relation-extraction