PKU-XD

pku_xd PKU-XD

I am Xiao Dong, a student in Peking university. I want to study in Github and try to make some contributions to the community.

Peking University
Peking University

Stars

jvhs0706 / zkllm-ccs2024

Cuda 80 19 Updated Jan 5, 2025

PKU-XD / EventAD

[2025 ICML spotlight] When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network

Python 3 Updated May 30, 2025

kaiyuyue / nxtp

PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR 2024 Highlight]

Python 179 8 Updated May 1, 2025

MoonBlvd / Detection-of-Traffic-Anomaly

This is the repo for our Detection of Traffic Anomaly (DoTA) dataset.

Python 232 39 Updated Dec 28, 2023

yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

679 19 Updated Jun 29, 2025

taco-group / OpenEMMA

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 729 88 Updated May 13, 2025

XduSyL / EventGPT

🔥[CVPR2025] EventGPT: Event Stream Understanding with Multimodal Large Language Models

Python 53 5 Updated Jun 24, 2025

jeffreychou777 / LOTVS-MM-AU

[CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"

Jupyter Notebook 55 2 Updated Mar 24, 2025

Yuchen413 / AnomalyRuler

Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"

Python 81 12 Updated Dec 16, 2024

uzh-rpg / bflow

Official implementation of "Dense Continuous-Time Optical Flow from Event Cameras"

Python 69 4 Updated Feb 4, 2024

BICLab / SpikeYOLO

Offical implementation of "Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection" (ECCV2024 Best Paper Candidate)

Python 181 10 Updated Jun 30, 2025

ZZY-Zhou / RENet

[ICRA'23] Dataset of Moving Object Detection; Official Implementation of "RGB-Event Fusion for Moving Object Detection in Autonomous Driving"

Python 63 7 Updated Aug 5, 2023

SensorsINI / v2e

V2E: From video frames to DVS events

Python 360 64 Updated Nov 9, 2023

monjurulkarim / risky_object

This is the implementation code for the paper, "An Attention-guided Multistream Feature Fusion Network for Early Localization of Risky Traffic Agents in Driving Videoss", IEEE Transaction on Intell…

Python 22 1 Updated Nov 1, 2023

THUDM / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,604 430 Updated May 29, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,377 155 Updated Mar 3, 2025

quangminhdinh / TrafficVLM

[CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of the AI City Challenge 2024 Track 2.

Python 39 3 Updated Feb 11, 2025