-
Mila - Quebec AI Institute / Université de Montréal
- Montréal, Québec, Canada
-
11:16
(UTC -04:00) - https://suyuchen.wang/
- in/suyuchenwang
- https://huggingface.co/sheryc
Highlights
- Pro
Stars
DeerFlow is a community-driven framework for deep research, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
💥 Blazing fast terminal file manager written in Rust, based on async I/O.
Minimalistic 4D-parallelism distributed training framework for education purpose
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the chrome extension.
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
a repository of example machine learning experiments for SLURM clusters
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
[CVPR 2025 Highlight] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
This package contains the original 2012 AlexNet code.
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory
E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.
Model Context Protocol Servers
A powerful tool for creating fine-tuning datasets for LLM
Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
Awesome Reasoning LLM Tutorial/Survey/Guide
Synthetic data curation for post-training and structured data extraction
verl: Volcano Engine Reinforcement Learning for LLMs
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL