Stars
Recipes to scale inference-time compute of open models
Democratizing Reinforcement Learning for LLMs
[ICLR 2025 Spotlight] An open-sourced LLM judge for evaluating LLM-generated answers.
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Fixes the following messages shown by Cursor during the free trial: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs
Train transformer language models with reinforcement learning.
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Aligning pretrained language models with instruction data generated by themselves.
SGLang is a fast serving framework for large language models and vision language models.
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
(CVPR 2023) Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution…
(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"
A collection of AWESOME things about Graph-Related LLMs.
Infinite Photorealistic Worlds using Procedural Generation
Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.
An open source implementation of CLIP.
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…
Implementation of Supervised Contrastive Learning with AMP, EMA, SWA, and many other tricks