default search action
Bin Xiao 0004
Person information
- affiliation: Microsoft Cloud+AI, Microsoft Research Asia, China
- affiliation: South China University of Technology, School of Electronic and Information Engineering, China
Other persons with the same name
- Bin Xiao — disambiguation page
- Bin Xiao 0001 — Hong Kong Polytechnic University, Department of Computing, Hung Hom, Kowloon, Hong Kong (and 1 more)
- Bin Xiao 0002 — Chongqing University of Posts and Telecommunications, Key Laboratory of Computational Intelligence, China (and 1 more)
- Bin Xiao 0003 — China Electric Power Research Institute, Wuhan, China
- Bin Xiao 0005 — Lanzhou University, College of Earth and Environment Sciences, MOE, China
- Bin Xiao 0006 — University of Chinese Academy of Sciences, Beijing, China (and 2 more)
- Bin Xiao 0007 — Stockholm University, Kista, Department of Computer and Systems Sciences, Sweden
- Bin Xiao 0008 — University of Ottawa, School of Electrical Engineering and Computer Science, Canada (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. CVPR 2024: 4818-4829 - [c20]Xu Ma, Xiyang Dai, Jianwei Yang, Bin Xiao, Yinpeng Chen, Yun Fu, Lu Yuan:
Efficient Modulation for Vision Networks. ICLR 2024 - [c19]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. NAACL-HLT (Findings) 2024: 1615-1627 - [i27]Hongyuan Yu, Cheng Wan, Mengchen Liu, Dongdong Chen, Bin Xiao, Xiyang Dai:
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search. CoRR abs/2403.10413 (2024) - [i26]Xu Ma, Xiyang Dai, Jianwei Yang, Bin Xiao, Yinpeng Chen, Yun Fu, Lu Yuan:
Efficient Modulation for Vision Networks. CoRR abs/2403.19963 (2024) - 2023
- [c18]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. AAAI 2023: 10880-10890 - [c17]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Stephen Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. ICCV 2023: 21913-21923 - [i25]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. CoRR abs/2305.12311 (2023) - [i24]Kan Wu, Houwen Peng, Zhenghong Zhou, Bin Xiao, Mengchen Liu, Lu Yuan, Hong Xuan, Michael Valenzuela, Xi Chen, Xinggang Wang, Hongyang Chao, Han Hu:
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance. CoRR abs/2309.12314 (2023) - [i23]Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. CoRR abs/2311.06242 (2023) - 2022
- [c16]Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
MiniViT: Compressing Vision Transformers with Weight Multiplexing. CVPR 2022: 12135-12144 - [c15]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao:
Unified Contrastive Learning in Image-Text-Label Space. CVPR 2022: 19141-19151 - [c14]Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
TinyViT: Fast Pretraining Distillation for Small Vision Transformers. ECCV (21) 2022: 68-85 - [c13]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. ECCV (27) 2022: 69-87 - [c12]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. ECCV (24) 2022: 74-92 - [c11]Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao:
Efficient Self-supervised Vision Transformers for Representation Learning. ICLR 2022 - [i22]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan:
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks. CoRR abs/2201.05729 (2022) - [i21]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao:
Unified Contrastive Learning in Image-Text-Label Space. CoRR abs/2204.03610 (2022) - [i20]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. CoRR abs/2204.03645 (2022) - [i19]Jinnian Zhang, Houwen Peng, Kan Wu, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
MiniViT: Compressing Vision Transformers with Weight Multiplexing. CoRR abs/2204.07154 (2022) - [i18]Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Xiyang Dai, Bin Xiao, Jianwei Yang, Haoxuan You, Kai-Wei Chang, Shih-Fu Chang, Lu Yuan:
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks. CoRR abs/2204.10496 (2022) - [i17]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. CoRR abs/2205.01818 (2022) - [i16]Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan:
TinyViT: Fast Pretraining Distillation for Small Vision Transformers. CoRR abs/2207.10666 (2022) - [i15]Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan:
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training. CoRR abs/2207.12661 (2022) - 2021
- [j2]Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, Bin Xiao:
Deep High-Resolution Representation Learning for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 43(10): 3349-3364 (2021) - [c10]Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang:
Dynamic Head: Unifying Object Detection Heads With Attentions. CVPR 2021: 7373-7382 - [c9]Changqian Yu, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, Jingdong Wang:
Lite-HRNet: A Lightweight High-Resolution Network. CVPR 2021: 10440-10450 - [c8]Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang:
Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression. CVPR 2021: 14676-14686 - [c7]Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang:
CvT: Introducing Convolutions to Vision Transformers. ICCV 2021: 22-31 - [c6]Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao:
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding. ICCV 2021: 2978-2988 - [c5]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Xiyang Dai, Bin Xiao, Lu Yuan, Jianfeng Gao:
Focal Attention for Long-Range Interactions in Vision Transformers. NeurIPS 2021: 30008-30022 - [i14]Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao:
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding. CoRR abs/2103.15358 (2021) - [i13]Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang:
CvT: Introducing Convolutions to Vision Transformers. CoRR abs/2103.15808 (2021) - [i12]Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang:
Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression. CoRR abs/2104.02300 (2021) - [i11]Changqian Yu, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, Jingdong Wang:
Lite-HRNet: A Lightweight High-Resolution Network. CoRR abs/2104.06403 (2021) - [i10]Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang:
Dynamic Head: Unifying Object Detection Heads with Attentions. CoRR abs/2106.08322 (2021) - [i9]Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao:
Efficient Self-supervised Vision Transformers for Representation Learning. CoRR abs/2106.09785 (2021) - [i8]Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Xiyang Dai, Bin Xiao, Lu Yuan, Jianfeng Gao:
Focal Self-attention for Local-Global Interactions in Vision Transformers. CoRR abs/2107.00641 (2021) - [i7]Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, Jianfeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan Zhang:
Florence: A New Foundation Model for Computer Vision. CoRR abs/2111.11432 (2021) - 2020
- [c4]Haiping Wu, Bin Xiao:
3D Human Pose Estimation via Explicit Compositional Depth Maps. AAAI 2020: 12378-12385 - [c3]Bowen Cheng, Bin Xiao, Jingdong Wang, Honghui Shi, Thomas S. Huang, Lei Zhang:
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation. CVPR 2020: 5385-5394 - [i6]Ke Sun, Zigang Geng, Depu Meng, Bin Xiao, Dong Liu, Zhaoxiang Zhang, Jingdong Wang:
Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates. CoRR abs/2006.15480 (2020)
2010 – 2019
- 2019
- [c2]Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang:
Deep High-Resolution Representation Learning for Human Pose Estimation. CVPR 2019: 5693-5703 - [i5]Ke Sun, Bin Xiao, Dong Liu, Jingdong Wang:
Deep High-Resolution Representation Learning for Human Pose Estimation. CoRR abs/1902.09212 (2019) - [i4]Ke Sun, Yang Zhao, Borui Jiang, Tianheng Cheng, Bin Xiao, Dong Liu, Yadong Mu, Xinggang Wang, Wenyu Liu, Jingdong Wang:
High-Resolution Representations for Labeling Pixels and Regions. CoRR abs/1904.04514 (2019) - [i3]Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, Bin Xiao:
Deep High-Resolution Representation Learning for Visual Recognition. CoRR abs/1908.07919 (2019) - [i2]Bowen Cheng, Bin Xiao, Jingdong Wang, Honghui Shi, Thomas S. Huang, Lei Zhang:
Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation. CoRR abs/1908.10357 (2019) - 2018
- [c1]Bin Xiao, Haiping Wu, Yichen Wei:
Simple Baselines for Human Pose Estimation and Tracking. ECCV (6) 2018: 472-487 - [i1]Bin Xiao, Haiping Wu, Yichen Wei:
Simple Baselines for Human Pose Estimation and Tracking. CoRR abs/1804.06208 (2018) - 2014
- [j1]Yongqiang Zou, Xing Jin, Yi Li, Zhimao Guo, Eryu Wang, Bin Xiao:
Mariana: Tencent Deep Learning Platform and its Applications. Proc. VLDB Endow. 7(13): 1772-1777 (2014)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-11 20:47 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint