-
LG AI Research @ EXAONE Lab
- Seoul, Korea
- lkm2835@gmail.com
- https://huggingface.co/lkm2835
Stars
Efficient Triton Kernels for LLM Training
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
GPU programming related news and material links
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
Transformer related optimization, including BERT, GPT
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
🔪 Elimination based Lightweight Neural Net with Pretrained Weights
Repository for Open Source Reinforcement Learning Framework JORLDY
object-detection-level2-cv-05 created by GitHub Classroom
image-classification-level1-21 created by GitHub Classroom
PyTorch deep learning projects made easy.