Yejun Yoon yj4165

Stars

MichSchli / AVeriTeC

Python 59 4 Updated Nov 27, 2024

Raldir / FEVER-8-Shared-Task

This repository contains the baseline and example submission for the FEVER 8 Shared Task. The baseline is a computationally optimized version of the HerO system (https://github.com/ssu-humane/HerO)…

Python 4 5 Updated Apr 26, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

56,085 6,017 Updated Jun 4, 2025

deepseek-ai / DeepSeek-V3

Python 97,815 15,911 Updated Jun 16, 2025

deepseek-ai / DeepSeek-R1

90,250 11,654 Updated Apr 9, 2025

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,386 126 Updated Jun 3, 2025

HandsOnLLM / Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 10,516 2,495 Updated Jun 6, 2025

ict-bigdatalab / awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

668 48 Updated Jan 7, 2024

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,017 77 Updated May 13, 2025

ssu-humane / HerO

The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)

Python 10 2 Updated Mar 18, 2025

plageon / HtmlRAG

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)

Python 421 33 Updated Jun 11, 2025

harishsg993010 / LLM-Research-Scripts

Python 435 42 Updated Oct 4, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,872 6,472 Updated Jun 24, 2025

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,315 76 Updated Dec 3, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 20,281 1,519 Updated Jun 3, 2025

jxzhangjhu / Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,233 73 Updated Feb 24, 2025

coree / awesome-rag

A curated list of retrieval-augmented generation (RAG) in large language models

280 20 Updated Feb 14, 2025

Cartus / Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).

508 52 Updated Feb 23, 2025

eugeneyan / open-llms

📋 A list of open LLMs available for commercial use.

12,124 876 Updated Feb 13, 2025

mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

817 19 Updated Jul 31, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,018 113 Updated Jul 29, 2024

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 9,736 1,054 Updated Jun 24, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,441 430 Updated Apr 7, 2025

microsoft / MS-MARCO-Web-Search

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

331 18 Updated Dec 16, 2024

zhilizju / Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Python 335 14 Updated Jun 12, 2023

hunkim / SolarLLMChatDemo

Full Stack SolarLLM Zero to All

Python 169 34 Updated Mar 1, 2025

hyunwoongko / kss

KSS: Korean String processing Suite

Python 446 64 Updated Mar 30, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 29,516 3,645 Updated Jul 23, 2024

teddylee777 / langchain-kr

A Korean-language tutorial based on the official LangChain documentation, cookbook, and other practical examples. This tutorial shows how to use LangChain more easily and effectively.

Jupyter Notebook 1,720 574 Updated Jun 12, 2025

PLUM-Lab / Mocheg

Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)

Python 55 8 Updated Nov 24, 2023
