Stars
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
A Gradio web UI for Large Language Models with support for multiple inference backends.
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
The Largest-scale Chinese Medical QA Dataset: with 26,000,000 question answer pairs.
A game theoretic approach to explain the output of any machine learning model.