Stars
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
The Munich Open-Source Large-Scale Multimedia Feature Extractor
Face recognition with deep neural networks.
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
Pytorch Implementation of Tensor Fusion Networks for multimodal sentiment analysis.
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Frontier Multimodal Foundation Models for Image and Video Understanding
中文地址提取工具,支持中国三级区划地址(省、市、区)提取和映射,支持地址热力图绘制。
Solve Visual Understanding with Reinforced VLMs
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…
使用 Qwen2ForSequenceClassification 简单实现文本分类任务。
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
Codes for ACL 2023 findings paper "Coarse-to-fine Few-shot Learning for Named Entity Recognition"
The code and data for our paper (EMNLP 2023 findings) "Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition".
Code for ACL 2022 paper "CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning"
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程