- Chicago, IL
- sumuk.org
- https://orcid.org/0000-0002-8265-9946
- @sumukx
- in/sumuks
Stars
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
huggingface / yourbench
Forked from sumukshashidhar/yourbench. 🤗 Benchmark Large Language Models Reliably On Your Data
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Benchmark Large Language Models Reliably On Your Data
A terrible web ui and RPC server for yt-dlp. Designed to be self-hosted.
Python tool for converting files and office documents to Markdown.
Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!
Aidan Bench attempts to measure <big_model_smell> in LLMs.
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
dyrector.io is a self-hosted continuous delivery & deployment platform with version management.
Ultra Lobster offers a visually pleasing and comfortable working experience, with an emphasis on bringing rounded UI elements, modern design trends, and soft design choices to Obsidian.
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
Large Language Model Text Generation Inference
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source …
Angrave's Crowd-Sourced System Programming Book used at UIUC
An open-source, lightweight note-taking solution. The pain-less way to create your meaningful notes. Your Notes, Your Way.
Think fearlessly with end-to-end encrypted notes and files. For issues, visit https://standardnotes.com/forum or https://standardnotes.com/help.
Self-hosted YouTube downloader built on Material Design
A social networking service scraper in Python
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.