Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
The official GitHub page for the survey paper "A Survey of Large Language Models".
Transformer related optimization, including BERT, GPT
Example models using DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ongoing research training transformer models at scale
🧠 A study guide to learn about Transformers
Parameter Server implementation in Apache Flink
Jupyter notebooks for analyzing crypto data
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
手撸解释器教程《Crafting Interpreters》中文翻译
Library for reading and writing large multi-dimensional arrays.
Applied Machine Learning Explainability Techniques, published by Packt
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
Python toolkit for quantitative finance
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为TensorFlow 2.0实现,项目已得到李沐老师的认可
Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.
A walkthrough of transformer architecture code
快速上手AI理论及应用实战:基础知识、Transformer、NLP、ML、DL、竞赛。含大量注释及数据集,力求每一位能看懂并复现。
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…