- San Francisco
- http://jherrman.com
- @jherrm
Stars
- All languages
- AGS Script
- ActionScript
- Arduino
- Assembly
- Blade
- C
- C#
- C++
- CSS
- CoffeeScript
- Cython
- Emacs Lisp
- G-code
- GDScript
- Go
- HTML
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Lua
- MATLAB
- MDX
- Markdown
- OCaml
- Objective-C
- Objective-C++
- OpenSCAD
- PHP
- PLpgSQL
- Perl
- Prolog
- Python
- QML
- R
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Svelte
- Swift
- TeX
- TypeScript
- Visual Basic
- Vue
- ZAP
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Sim Studio is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deploy LLMs that connect with your favorite tools.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Common assets for all render pipelines.
Graft is an open-source transactional storage engine optimized for lazy, partial, and strongly consistent replication—perfect for edge, offline-first, and distributed applications.
[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
The ultimate training toolkit for finetuning diffusion models
One stable, self-healing SDK to build and manage all your data pipelines. Comes with automated schema-drift detection, retries and remappings so your data keeps moving no matter what - no connector…
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at sc…
Self-Hosted, Personal Music Server, designed for collectors and music maniacs
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
A small Python module that gives you nice human readable Macintosh model names, e.g. "iMac (27-inch, Late 2009)", when given a serial number or model code.
🍏 + 🎯 + 🐍 = Everything you need to query Apple's FindMy network!
Automagically reverse-engineer REST APIs via capturing traffic
Config files for booting Mac OS 7-9, OS X and macOS on UTM emulator
Infinite Photorealistic Worlds using Procedural Generation
FastVideo is a unified framework for accelerated video generation.
Python tool for converting files and office documents to Markdown.