Lists (2)
Sort Name ascending (A-Z)
Stars
Official project page of the paper "Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges" (Accepted by CVPR 2024)
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[IEEE BigData'24] Leveraging Large Language Models for Suicide Detection on Social Media with Limited Labels
Predicting suicidal ideation on Reddit using machine learning and DistillBERT
Advanced self-intelligence sound-based women's safety system, where speech converted to text using speech recognition algorithm by Pysound and recognition of harmful words from the dictionary using…
A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (D…
Video Enhancement For Surveillance
Using two stream architecture to implement a classic action recognition method on UCF101 dataset
3D ResNets for Action Recognition (CVPR 2018)
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
The Shazam-similar app, that identify the song using audio fingerprints & spectrum analysis and Fast Fourier transform
All-in-one Toolbox for Computer Vision Research.
Code for the CVPRW GAZE 2021 paper -- GOO : A Dataset for Gaze Object Prediction in Retail Environments
Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databases by combining both a reference image and a textual descrip…
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Person Detection using HOG Feature and SVM Classifier
Hue rotation for sprites in cocos2d game engine
[ICLR 2025] From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"