-
Nexa AI Inc
Lists (1)
Sort Name ascending (A-Z)
Stars
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
A course on aligning smol models.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
This repository contains the Hugging Face Agents Course.
A fork to add multimodal model training to open-r1
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Fast and memory-efficient exact attention
Model Compression Toolbox for Large Language Models and Diffusion Models
Everything about the SmolLM2 and SmolVLM family of models
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
shenzheyu / llama.cpp
Forked from ggml-org/llama.cppLLM inference in C/C++
Composable building blocks to build Llama Apps
An interactive AI character with voice input, voice output, and profile image generation—all running locally with Nexa SDK and powered by Llama3 Uncensored Model. Enjoy a private and immersive expe…
AI for all: Build the large graph of the language models
Android ChatBot with Octopus v2 - Function Calling Demo
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Awesome LLMs on Device: A Comprehensive Survey
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
[TMLR 2024] Efficient Large Language Models: A Survey
A modular graph-based Retrieval-Augmented Generation (RAG) system
Survey Paper List - Efficient LLM and Foundation Models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.