8000 GitHub - Jingkang50/jingkang50
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Jingkang50/jingkang50

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 

Repository files navigation

Hi there! πŸ‘‹ I'm Jingkang Yang.

πŸŽ“ Currently pursuing a PhD in Visual Perception and Reasoning.

πŸ” My research interests revolve around Vision-Language Models 🧠, Embodied Agents πŸ€–, and Scene Graph Generation πŸ•Έ. I am passionate about creating generalist AI models capable of understanding and interacting with complex visual data.

πŸš€ My Ongoing Research Projects:

  • Visual Generalist Models: Developing models that process diverse visual data (e.g., images, videos, 3D, audio, IMU) to tackle various tasks in perception, reasoning, generation, robotics, and gaming. Notable projects include EgoLife, Octopus, FunQA, and Otter.

  • AI Safety for Foundation Models: Investigating how to mitigate hallucinations in large language models (LLMs) and multimodal models (LMMs). A key contribution is the introduction of UPD to withhold answers when faced with unsolvable questions.

πŸ† Previous Contributions:

  • PSG Series (2022-2023): Led the development of the PSG, PVSG, and PSG4D models, focusing on relation modeling for scene understanding. I also collaborated on works like Relate-Anything and PairNet.

  • OOD Detection (2021-2022): Led a comprehensive survey and developed OpenOOD, a popular codebase for Out-of-Distribution detection in AI safety.

  • Prompt Tuning (2022): Contributed to foundational works like CoOp and CoCoOp for prompt tuning in vision-language models.

πŸ“ˆ GitHub Stats

Jingkang50's GitHub stats


πŸ“¬ Get in Touch

Feel free to reach out for collaboration or just to chat about AI and technology!

Thanks for visiting my profile!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0