8000 yueyu1030 (Yue Yu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yueyu1030's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report yueyu1030

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official repository for paper "MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale"

Python 4 Updated Jun 6, 2025

Official Code Repository for WorkForceAgent-R1

Python 2 Updated Jun 1, 2025
Python 15 2 Updated Apr 8, 2025

s1: Simple test-time scaling

Python 6,431 749 Updated May 19, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,587 166 Updated Jun 5, 2025

[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning

Python 33 Updated Dec 26, 2024

Official Code Repository for paper "HYDRA: Model Factorization Framework for Black-Box LLM Personalization"

Python 11 Updated Oct 7, 2024

Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"

Python 42 2 Updated Oct 1, 2024

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Python 204 18 Updated Jun 5, 2025

[ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.

Python 37 3 Updated Sep 19, 2024

Source code of MOLLEO

Python 44 3 Updated Apr 11, 2025

[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.

Python 71 4 Updated Nov 27, 2024

[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".

Python 21 2 Updated Sep 19, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 698 516 Updated Jul 4, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 176 13 Updated Jun 20, 2024

Train Models Contrastively in Pytorch

Python 719 58 Updated Mar 26, 2025
Python 99 2 Updated Dec 22, 2023

[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electro 91F4 nic Health Records

Python 100 13 Updated Dec 26, 2024

MoraBench (Model Ranking Benchmark)

Python 5 Updated Mar 2, 2024
Python 13 Updated Jan 26, 2024

[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".

Python 39 3 Updated Jun 23, 2024

EcoAssistant: using LLM assistant more affordably and accurately

Python 132 6 Updated Jun 30, 2024

面试高频算法题总结,个人博客

C++ 1,120 274 Updated Dec 16, 2023

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 45,697 6,931 Updated Jun 9, 2025

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

527 32 Updated Oct 28, 2024
Jupyter Notebook 337 32 Updated Jan 3, 2024

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Python 389 41 Updated Jan 14, 2025

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 946 83 Updated Oct 22, 2024

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Jupyter Notebook 266 10 Updated Aug 19, 2023

MUBen: Benchmarking the Uncertainty of Molecular Representation Models

Python 7 Updated Apr 17, 2024
Next
0