KimJaehee0725 (Kim Jaehee) / Starred · GitHub
🥔
I'm a talking potato

Highlights

  • Pro


File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,755 287 Updated Feb 21, 2025

Repository to extract key information from semi-/un-structured documents using large language models.

Jupyter Notebook 8 4 Updated Aug 14, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,021 2,211 Updated Feb 25, 2025

Improved file parsing for LLMs

Python 2,823 111 Updated Nov 13, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,800 340 Updated Feb 25, 2025

Concrete ML: Privacy Preserving ML framework using Fully Homomorphic Encryption (FHE), built on top of Concrete, with bindings to traditional ML frameworks.

Python 1,096 164 Updated Feb 24, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 1,209 170 Updated Feb 25, 2025

AirLLM: 70B inference with a single 4GB GPU

Jupyter Notebook 5,709 456 Updated Nov 24, 2024

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,040 64 Updated Jan 7, 2025

LLM101n: Let's build a Storyteller

31,964 1,735 Updated Aug 1, 2024

Benchmarking library for RAG

Jupyter Notebook 169 15 Updated Feb 25, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,427 115 Updated Jan 24, 2025

PyTorch native post-training library

Python 4,923 544 Updated Feb 25, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 40,735 5,464 Updated Feb 23, 2025

Must-read Papers on Knowledge Editing for Large Language Models.

1,013 69 Updated Feb 18, 2025

The Universe of Data. All about data, data science, and data engineering

Python 541 52 Updated Jul 18, 2024

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,583 260 Updated Dec 10, 2024

Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification (EMNLP 2023 Findings)

Python 1 Updated Feb 1, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,476 269 Updated Jun 24, 2024

SILO Language Models code repository

Python 81 12 Updated Feb 23, 2024
Jupyter Notebook 24 22 Updated Nov 24, 2023
Jupyter Notebook 23 19 Updated Mar 19, 2024

Machine Learning Engineering Open Book

Python 12,913 789 Updated Feb 23, 2025

Code and data for Hayati et al.'s paper "How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!"

JavaScript 7 Updated Dec 20, 2024

Robust recipes to align language models with human and AI preferences

Python 5,024 433 Updated Nov 21, 2024

Simple replication of DPR (Dense Passage Retrieval)

Python 44 4 Updated Nov 10, 2023