- Shanghai Jiao Tong University
- Shanghai, China
- http://bcmi.sjtu.edu.cn/~zhangzs/
- @zhangzhuosheng
-
DocBench Public
Forked from Anni-Zou/DocBench
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
Python · Updated Jul 19, 2024
-
Auto-GUI Public
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
-
MultiHopShortcuts Public
Forked from Jometeorie/MultiHopShortcuts
Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"
Python · GNU General Public License v3.0 · Updated Jun 1, 2024
-
OOD-Math-Reasoning Public
Forked from Alsace08/OOD-Math-Reasoning
Code and Data Repo for Paper "Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning"
-
dive-into-llms Public
Forked from Lordog/dive-into-llms
Dive-into-LLMs Tutorial for Beginners
-
X-SIR Public
Forked from zwhe99/X-SIR
Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models
Python · Updated Apr 11, 2024
-
StructChem Public
Forked from ozyyshr/StructChem
Structured Chemistry Reasoning with Large Language Models
Python · Updated Mar 4, 2024
-
AmazonPriceHistory Public
This is the official repository of our paper "Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method".
Python · Apache License 2.0 · Updated Feb 27, 2024
-
FeedbackMT Public
Forked from zwhe99/FeedbackMT
Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"
Python · Updated Feb 22, 2024
-
Meta-Reasoning Public
Forked from Alsace08/Meta-Reasoning
Code and Data Repo for Paper "Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models"
-
GLaPE Public
Forked from thunderous77/GLaPE
Official implementation for "GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Models" (stay tuned & more will be updated)
Python · Updated Feb 6, 2024
-
R-Judge Public
Forked from Lordog/R-Judge
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Updated Jan 18, 2024
-
-
CSrankings Public
Forked from emeryberger/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
Python · Other · Updated Sep 24, 2023
-
AwesomeMRC Public
IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)
-
mm-cot Public
Forked from amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
-
Chain-of-ThoughtsPapers Public
Forked from Timothyxxx/Chain-of-ThoughtsPapers
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
1 · Updated Jun 16, 2023
-
FocalReasoner Public
Forked from ozyyshr/FocalReasoner
Fact-driven Logical Reasoning for Machine Reading Comprehension (AAAI 2024)
Python · MIT License · Updated May 26, 2023
-
pytorch-image-models Public
Forked from huggingface/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Python · Apache License 2.0 · Updated Mar 24, 2023
-
detr Public
Forked from facebookresearch/detr
End-to-End Object Detection with Transformers
Python · Apache License 2.0 · Updated Mar 18, 2023
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
-
Auto-CoT Public
Forked from amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
-
SemBERT Public
Semantics-aware BERT for Language Understanding (AAAI 2020)
-
SG-Net Public
SG-Net: Syntax-guided machine reading comprehension (AAAI 2020)
-
CompassMTL Public
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)