default search action
Jiayi Ji
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j9]Yinan Li, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yunpeng Luo, Rongrong Ji:
M3ixup: A multi-modal data augmentation approach for image captioning. Pattern Recognit. 158: 110941 (2025) - 2024
- [c23]Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation. AAAI 2024: 1985-1993 - [c22]Zhipeng Qian, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks. AAAI 2024: 4551-4559 - [c21]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. AAAI 2024: 5940-5948 - [c20]Mingrui Wu, Yuqi Liu, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Toward Open-Set Human Object Interaction Detection. AAAI 2024: 6066-6073 - [c19]Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li, Rongrong Ji:
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization. LREC/COLING 2024: 11429-11439 - [c18]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CVPR 2024: 26648-26658 - [c17]Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model. ECCV (53) 2024: 161-180 - [c16]Yaxin Luo, Jiayi Ji, Xiaofu Chen, Yuxin Zhang, Tianhe Ren, Gen Luo:
APL: Anchor-Based Prompt Learning for One-Stage Weakly Supervised Referring Expression Comprehension. ECCV (13) 2024: 198-215 - [c15]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. ECCV (46) 2024: 381-398 - [c14]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. ICML 2024 - [c13]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models. ICML 2024 - [c12]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. ICML 2024 - [c11]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. ACM Multimedia 2024: 7852-7861 - [i30]Yiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji:
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation. CoRR abs/2405.00954 (2024) - [i29]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Xiaopeng Hong, Yongjian Wu, Rongrong Ji:
Image Captioning via Dynamic Path Customization. CoRR abs/2406.00334 (2024) - [i28]Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation. CoRR abs/2406.01451 (2024) - [i27]Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. CoRR abs/2406.05127 (2024) - [i26]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. CoRR abs/2406.05620 (2024) - [i25]Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji:
Evaluating and Analyzing Relationship Hallucinations in LVLMs. CoRR abs/2406.16449 (2024) - [i24]You Huang, Wenbin Lai, Jiayi Ji, Liujuan Cao, Shengchuan Zhang, Rongrong Ji:
HRSAM: Efficiently Segment Anything in High-Resolution Images. CoRR abs/2407.02109 (2024) - [i23]Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun, Rongrong Ji:
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model. CoRR abs/2407.05352 (2024) - [i22]Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun, Rongrong Ji:
Multi-branch Collaborative Learning Network for 3D Visual Grounding. CoRR abs/2407.05363 (2024) - [i21]Yiwei Ma, Zhibin Wang, Xiaoshuai Sun, Weihuang Lin, Qiang Zhou, Jiayi Ji, Rongrong Ji:
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model. CoRR abs/2407.16198 (2024) - [i20]Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo, Henghui Ding, Xiaoshuai Sun, Rongrong Ji:
3D-GRES: Generalized 3D Referring Expression Segmentation. CoRR abs/2407.20664 (2024) - [i19]Mingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei, Xiaoshuai Sun, Rongrong Ji:
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models. CoRR abs/2407.21534 (2024) - [i18]Mingrui Wu, Oucheng Huang, Jiayi Ji, Jiale Li, Xinyue Cai, Huafeng Kuang, Jianzhuang Liu, Xiaoshuai Sun, Rongrong Ji:
TraDiffusion: Trajectory-Based Training-Free Image Generation. CoRR abs/2408.09739 (2024) - [i17]Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji:
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing. CoRR abs/2408.14180 (2024) - [i16]Yaxin Luo, Gen Luo, Jiayi Ji, Yiyi Zhou, Xiaoshuai Sun, Zhiqiang Shen, Rongrong Ji:
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models. CoRR abs/2410.13859 (2024) - [i15]Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei:
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image. CoRR abs/2410.15312 (2024) - 2023
- [j8]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards local visual modeling for image captioning. Pattern Recognit. 138: 109420 (2023) - [j7]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. IEEE Trans. Multim. 25: 3723-3736 (2023) - [j6]Jiayi Ji, Xiaoyang Huang, Xiaoshuai Sun, Yiyi Zhou, Gen Luo, Liujuan Cao, Jianzhuang Liu, Ling Shao, Rongrong Ji:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. IEEE Trans. Multim. 25: 3962-3974 (2023) - [c10]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. AAAI 2023: 2528-2536 - [c9]Yiwei Ma, Haowei Wang, Xiaoqing Zhang, Guannan Jiang, Xiaoshuai Sun, Weilin Zhuang, Jiayi Ji, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. ICCV 2023: 2737-2748 - [c8]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. ACM Multimedia 2023: 3403-3414 - [c7]Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. ACM Multimedia 2023: 4157-4168 - [c6]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. ACM Multimedia 2023: 7164-7174 - [i14]Haowei Wang, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Xiaoshuai Sun:
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network. CoRR abs/2301.03160 (2023) - [i13]Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji:
Towards Local Visual Modeling for Image Captioning. CoRR abs/2302.06098 (2023) - [i12]Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji:
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance. CoRR abs/2303.15764 (2023) - [i11]Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji:
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. CoRR abs/2308.02982 (2023) - [i10]Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li, Rongrong Ji:
M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce. CoRR abs/2308.11351 (2023) - [i9]Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun:
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation. CoRR abs/2308.16632 (2023) - [i8]Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji:
JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues. CoRR abs/2310.09503 (2023) - [i7]Haowei Wang, Jiayi Ji, Tianyu Guo, Yilong Yang, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji:
NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning. CoRR abs/2310.10975 (2023) - [i6]Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji:
Semi-Supervised Panoptic Narrative Grounding. CoRR abs/2310.18142 (2023) - [i5]Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji:
X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation. CoRR abs/2312.00085 (2023) - [i4]Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji:
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation. CoRR abs/2312.12470 (2023) - 2022
- [j5]Fangfang Kang, Xuejian Li, Huaqiang Du, Fangjie Mao, Guomo Zhou, Yanxin Xu, Zihao Huang, Jiayi Ji, Jingyi Wang:
Spatiotemporal Evolution of the Carbon Fluxes from Bamboo Forests and their Response to Climate Change Based on a BEPS Model in China. Remote. Sens. 14(2): 366 (2022) - [j4]Liangyuan Hu, Jiayi Ji:
CIMTx: An R Package for Causal Inference with Multiple Treatments using Observational Data. R J. 14(3): 213-230 (2022) - [j3]Jiayi Ji, Yiwei Ma, Xiaoshuai Sun, Yiyi Zhou, Yongjian Wu, Rongrong Ji:
Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning. IEEE Trans. Image Process. 31: 4321-4335 (2022) - 2021
- [j2]Jiayi Ji, Xuejian Li, Huaqiang Du, Fangjie Mao, Weiliang Fan, Yanxin Xu, Zihao Huang, Jingyi Wang, Fangfang Kang:
Multiscale leaf area index assimilation for Moso bamboo forest based on Sentinel-2 and MODIS data. Int. J. Appl. Earth Obs. Geoinformation 104: 102519 (2021) - [j1]Jingyi Wang, Huaqiang Du, Xuejian Li, Fangjie Mao, Meng Zhang, Enbin Liu, Jiayi Ji, Fangfang Kang:
Remote Sensing Estimation of Bamboo Forest Aboveground Biomass Based on Geographically Weighted Regression. Remote. Sens. 13(15): 2962 (2021) - [c5]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. AAAI 2021: 1655-1663 - [c4]Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji:
Dual-level Collaborative Transformer for Image Captioning. AAAI 2021: 2286-2293 - [c3]Xuying Zhang, Xiaoshuai Sun, Yunpeng Luo, Jiayi Ji, Yiyi Zhou, Yongjian Wu, Feiyue Huang, Rongrong Ji:
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words. CVPR 2021: 15465-15474 - [i3]Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji:
Dual-Level Collaborative Transformer for Image Captioning. CoRR abs/2101.06462 (2021) - 2020
- [c2]Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji, Fuhai Chen, Jianzhuang Liu, Qi Tian:
Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal. ACM Multimedia 2020: 4226-4234 - [i2]Jiayi Ji, Yunpeng Luo, Xiaoshuai Sun, Fuhai Chen, Gen Luo, Yongjian Wu, Yue Gao, Rongrong Ji:
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network. CoRR abs/2012.07061 (2020)
2010 – 2019
- 2019
- [c1]Fuhai Chen, Rongrong Ji, Jiayi Ji, Xiaoshuai Sun, Baochang Zhang, Xuri Ge, Yongjian Wu, Feiyue Huang, Yan Wang:
Variational Structured Semantic Inference for Diverse Image Captioning. NeurIPS 2019: 1929-1939 - [i1]Fuhai Chen, Rongrong Ji, Chengpeng Dai, Xiaoshuai Sun, Chia-Wen Lin, Jiayi Ji, Baochang Zhang, Feiyue Huang, Liujuan Cao:
Semantic-aware Image Deblurring. CoRR abs/1910.03853 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-11 20:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint