- MiniCPM-o Public
  Forked from OpenBMB/MiniCPM-o
  MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
  Python · Apache License 2.0 · Updated Jan 24, 2025
- UniTraj Public
  Forked from vita-epfl/UniTraj
  A Unified Framework for scalable Vehicle Trajectory Prediction, ECCV 2024
  Python · Other · Updated Jan 21, 2025
- home-generative-agent Public
  Forked from goruck/home-generative-agent
  A home assistant generative agent integration based on langchain and langgraph
  Python · MIT License · Updated Jan 8, 2025
- InternLM-XComposer Public
  Forked from InternLM/InternLM-XComposer
  InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
  Python · Apache License 2.0 · Updated Dec 26, 2024
- xfeatSLAM Public
  Forked from udaysankar01/xfeatSLAM
  Real-time SLAM with deep features (XFeat + ORB-SLAM3)
  C++ · GNU General Public License v3.0 · Updated Dec 18, 2024
- Robotic-grasping-papers Public
  Forked from rhett-chen/Robotic-grasping-papers
  paper list of robotic grasping and some related works
  Updated Oct 31, 2024
- LingoQA Public
  Forked from wayveai/LingoQA
  [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"
  Python · Other · Updated Sep 26, 2024
- Driving-with-LLMs Public
  Forked from wayveai/Driving-with-LLMs
  PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
  Python · Apache License 2.0 · Updated Sep 26, 2024
- kotaemon Public
  Forked from Cinnamon/kotaemon
  An open-source RAG-based tool for chatting with your documents.
  Python · Apache License 2.0 · Updated Sep 25, 2024
- Show-o Public
  Forked from showlab/Show-o
  Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
  Python · Apache License 2.0 · Updated Sep 1, 2024
- fingervision Public
  Forked from akihikoy/fingervision
  Data processing programs for the vision-based tactile sensor FingerVision
  C++ · Other · Updated Aug 31, 2024
- open-genie Public
  Forked from myscience/open-genie
  Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
  Python · MIT License · Updated Aug 21, 2024
- ml-mdm Public
  Forked from apple/ml-mdm
  Train high-quality text-to-image diffusion models in a data & compute efficient manner
  Python · Other · Updated Aug 14, 2024
- OpenPCDet Public
  Forked from open-mmlab/OpenPCDet
  OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
  Python · Apache License 2.0 · Updated Aug 8, 2024
- jetson-containers Public
  Forked from dusty-nv/jetson-containers
  Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
  Python · MIT License · Updated Aug 5, 2024
- swift Public
  Forked from modelscope/ms-swift
  ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
  Python · Apache License 2.0 · Updated Aug 5, 2024
- MindSearch Public
  Forked from InternLM/MindSearch
  🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
  Python · Apache License 2.0 · Updated Aug 4, 2024
- lmdeploy Public
  Forked from InternLM/lmdeploy
  LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
  Python · Apache License 2.0 · Updated Aug 4, 2024
- xtuner Public
  Forked from InternLM/xtuner
  An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
  Python · Apache License 2.0 · Updated Aug 3, 2024
- glomap Public
  Forked from colmap/glomap
  GLOMAP - Global Structured-from-Motion Revisited
  C++ · BSD 3-Clause "New" or "Revised" License · Updated Aug 2, 2024
- NanoLLM Public
  Forked from dusty-nv/NanoLLM
  Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
  Python · MIT License · Updated Jul 31, 2024
- LLM-Finetuning-Toolkit Public
  Forked from georgian-io/LLM-Finetuning-Toolkit
  Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
  Python · Apache License 2.0 · Updated Jul 25, 2024
- llm-awq Public
  Forked from mit-han-lab/llm-awq
  [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
  Python · MIT License · Updated Jul 16, 2024
- langroid Public
  Forked from langroid/langroid
  Harness LLMs with Multi-Agent Programming
  Python · MIT License · Updated Jul 13, 2024
- brain-tokyo-workshop Public
  Forked from google/brain-tokyo-workshop
  🧠🗼
  Jupyter Notebook · Apache License 2.0 · Updated Jul 9, 2024
- lang-seg Public
  Forked from isl-org/lang-seg
  Language-Driven Semantic Segmentation
  Jupyter Notebook · MIT License · Updated Jul 5, 2024
- MotionPatches Public
  Forked from line/MotionPatches
  Official implementation for "Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches" (CVPR 2024)
  Python · Other · Updated Jul 4, 2024
- trl Public
  Forked from huggingface/trl
  Train transformer language models with reinforcement learning.
  Python · Apache License 2.0 · Updated Jul 3, 2024
- Ask-Anything Public
  Forked from OpenGVLab/Ask-Anything
  [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
  Python · MIT License · Updated Jul 2, 2024