- Korea University DSBA Lab.
- Seoul, Republic of Korea
- https://velog.io/@stapers
- https://jaehee-kim.notion.site/Unknown-NLP-Study-ff54da176c164c5aa01165a255370e8a?pvs=4
Stars
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Repository to extract key information from semi-/un-structured documents using large language models.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
verl: Volcano Engine Reinforcement Learning for LLMs
Concrete ML: Privacy Preserving ML framework using Fully Homomorphic Encryption (FHE), built on top of Concrete, with bindings to traditional ML frameworks.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
AirLLM 70B inference with single 4GB GPU
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Must-read Papers on Knowledge Editing for Large Language Models.
The Universe of Data. All about data, data science, and data engineering
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification (EMNLP 2023 Findings)
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Machine Learning Engineering Open Book
Code and data for Hayati et al.'s paper "How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!"
Robust recipes to align language models with human and AI preferences
Simple replication of DPR (Dense Passage Retrieval)