phuongdnm (Phuong Dinh) / Starred · GitHub
🌏
Have a good day!

Highlights

  • Pro

Starred repositories

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 145,977 29,441 Updated Jun 23, 2025

Fine-tune LLMs for free with 100+ Notebooks on Google Colab, Kaggle, and more.

Jupyter Notebook 2,398 331 Updated Jun 23, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,932 225 Updated Apr 24, 2025

Error correction back-end for speaker diarization

Python 17 Updated Sep 26, 2023

Repository for "LLM-based speaker diarization correction: A generalizable approach" paper

Jupyter Notebook 15 Updated Jul 31, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 16,379 1,753 Updated Jun 8, 2025

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 2,729 400 Updated May 16, 2025

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 222 26 Updated Mar 13, 2025

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 211 15 Updated Nov 24, 2024

Integrate the DeepSeek API into popular software

32,945 3,641 Updated May 13, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 46,093 5,815 Updated Jun 23, 2025

Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase

Python 10,077 754 Updated Jun 23, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 23,172 3,321 Updated Mar 5, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,530 4,939 Updated Jun 23, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,075 80 Updated Mar 27, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,959 3,261 Updated Jun 23, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 16,762 1,737 Updated Jun 7, 2025

A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai

TypeScript 1,464 113 Updated Jul 21, 2024

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 5,205 420 Updated May 21, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 11,166 1,136 Updated Jun 21, 2025

[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement"

Jupyter Notebook 196 15 Updated Apr 11, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,271 326 Updated May 18, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,562 656 Updated Mar 27, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 32,215 7,707 Updated Jun 23, 2025

Python 3,308 360 Updated Jun 10, 2023

A Python wrapper for Kaldi

Python 1,018 245 Updated Jan 23, 2025

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,765 231 Updated Oct 16, 2024