8000 chuyangchencd (Chuyang Chen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View chuyangchencd's full-sized avatar
  • New York, NY

Block or report chuyangchencd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NYU Course Notes & Resources

Jupyter Notebook 209 46 Updated May 16, 2024

MUSIC-AVQA, CVPR2022 (ORAL)

Python 85 8 Updated Dec 30, 2022

Temporal Reasoning via Audio Question Answering

Python 25 10 Updated Dec 21, 2019

Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.

Python 16 1 Updated Oct 25, 2024
Python 10 Updated May 21, 2024

🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)

Python 53 8 Updated Feb 13, 2025

Speech, Language, Audio, Music Processing with Large Language Model

Python 831 82 Updated Jun 16, 2025

Spatial Audio Generation

Python 109 20 Updated Mar 24, 2023

A curated list of resources in audio visual question answering and related area. :-)

7 Updated Aug 1, 2024

ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos

Python 7 Updated Aug 17, 2023

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,303 230 Updated May 21, 2023

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,068 304 Updated Feb 27, 2025

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Python 618 211 Updated Aug 30, 2021

Measuring compositionality in representation learning

Jupyter Notebook 73 7 Updated May 5, 2019
Python 3 Updated Aug 24, 2022

Deep Learning Model for Signal Data

Python 87 12 Updated Nov 1, 2019

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,992 393 Updated May 8, 2024

Learning audio concepts from natural language supervision

Python 564 42 Updated Sep 18, 2024

A tool to visualize DCASE format SELD labels and predictions

Python 9 1 Updated Apr 8, 2024
Jupyter Notebook 53 5 Updated Apr 28, 2025

Baseline method for sound event localization task of DCASE 2023 challenge

Python 52 14 Updated Mar 13, 2023

Example code to help people follow along with the tutorials

ChucK 19 2 Updated Aug 21, 2024

go binary for setting up singularity containers with a miniconda

Go 16 1 Updated Sep 18, 2024

Visualisation of VISOR Segmentations with Annotations and Relations

Python 21 2 Updated Aug 15, 2022

Inference code for Llama models

Python 58,380 9,779 Updated Jan 26, 2025

Instructional notebooks on music information retrieval.

Jupyter Notebook 1,241 413 Updated Nov 15, 2023
0