default search action
Deyao Zhu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny:
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions. Trans. Mach. Learn. Res. 2024 (2024) - [c9]Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Mingchen Zhuge, Jian Ding, Deyao Zhu, Jürgen Schmidhuber, Mohamed Elhoseiny:
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos. ECCV (29) 2024: 251-267 - [c8]Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny:
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models. ICLR 2024 - [i13]Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Deyao Zhu, Jian Ding, Mohamed Elhoseiny:
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens. CoRR abs/2404.03413 (2024) - [i12]Asma Alkhaldi, Raneem Alnajim, Layan Alabdullatef, Rawan Alyahya, Jun Chen, Deyao Zhu, Ahmed Alsinan, Mohamed Elhoseiny:
MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis. CoRR abs/2407.04106 (2024) - [i11]Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Mingchen Zhuge, Jian Ding, Deyao Zhu, Jürgen Schmidhuber, Mohamed Elhoseiny:
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos. CoRR abs/2407.12679 (2024) - [i10]Chenhui Gou, Abdulwahab Felemban, Faizan Farooq Khan, Deyao Zhu, Jianfei Cai, Hamid Rezatofighi, Mohamed Elhoseiny:
How Well Can Vision Language Models See Image Details? CoRR abs/2408.03940 (2024) - 2023
- [c7]Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Sean Chang Culatana, Mohamed Elhoseiny:
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only. ICCV 2023: 699-710 - [c6]Deyao Zhu, Li Erran Li, Mohamed Elhoseiny:
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning. ICLR 2023 - [i9]Deyao Zhu, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny:
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining. CoRR abs/2301.12876 (2023) - [i8]Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny:
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions. CoRR abs/2303.06594 (2023) - [i7]Jun Chen, Deyao Zhu, Kilichbek Haydarov, Xiang Li, Mohamed Elhoseiny:
Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions. CoRR abs/2304.04227 (2023) - [i6]Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny:
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models. CoRR abs/2304.10592 (2023) - [i5]Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Mohamed Elhoseiny, Sean Chang Culatana:
Exploring Open-Vocabulary Semantic Segmentation without Human Labels. CoRR abs/2306.00450 (2023) - [i4]Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu, Pengchuan Zhang, Raghuraman Krishnamoorthi, Vikas Chandra, Yunyang Xiong, Mohamed Elhoseiny:
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning. CoRR abs/2310.09478 (2023) - 2022
- [c5]Jun Chen, Aniket Agarwal, Sherif Abdelkarim, Deyao Zhu, Mohamed Elhoseiny:
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition. CVPR 2022: 19485-19495 - [c4]Abduallah A. Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian G. Claudel:
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation. ECCV (22) 2022: 463-479 - [i3]Abduallah A. Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian G. Claudel:
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation. CoRR abs/2203.03057 (2022) - [i2]Deyao Zhu, Li Erran Li, Mohamed Elhoseiny:
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning. CoRR abs/2206.04384 (2022) - 2021
- [c3]Deyao Zhu, Mohamed Zahran, Li Erran Li, Mohamed Elhoseiny:
Motion Forecasting with Unlikelihood Training in Continuous Space. CoRL 2021: 1003-1012 - [c2]Deyao Zhu, Mohamed Zahran, Li Erran Li, Mohamed Elhoseiny:
HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents. ICLR 2021 - [i1]Jun Chen, Aniket Agarwal, Sherif Abdelkarim, Deyao Zhu, Mohamed Elhoseiny:
RelTransformer: Balancing the Visual Relationship Detection from Local Context, Scene and Memory. CoRR abs/2104.11934 (2021)
2010 – 2019
- 2019
- [c1]Deyao Zhu, Marco Munderloh, Bodo Rosenhahn, Jörg Stückler:
Learning to Disentangle Latent Physical Factors for Video Prediction. GCPR 2019: 595-608
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 21:24 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint