jiaxin-ai

Popular repositories

ProJudge (Public)

Implementation of "ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges".

5
ResNet9 (Public)

ResNet-9 training implemented with NumPy/CuPy.

Python
ConvBench (Public)

Forked from shirlyliu64/ConvBench

ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models

Python
JudgeLM (Public)

Forked from baaivision/JudgeLM

An open-sourced LLM judge for evaluating LLM-generated answers.

Python
VLMEvalKit (Public)

Forked from open-compass/VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks.

Python
openr (Public)

Forked from openreasoner/openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python