default search action
Vasudev Lal
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Avinash Madasu, Vasudev Lal:
ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models. CVPR Workshops 2024: 1733-1743 - [c20]Zhipeng Cai, Matthias Mueller, Reiner Birkl, Diana Wofk, Shao-Yen Tseng, Junda Cheng, Gabriela Ben Melech Stan, Vasudev Lal, Michael Paulitsch:
L-MAGIC: Language Model Assisted Generation of Images with Coherence. CVPR 2024: 7049-7058 - [c19]Phillip Howard, Avinash Madasu, Tiep Le, Gustavo A. Lujan-Moreno, Anahita Bhiwandiwalla, Vasudev Lal:
SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples. CVPR 2024: 11975-11985 - [c18]Shachar Rosenman, Vasudev Lal, Phillip Howard:
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation. EACL (Demonstrations) 2024: 159-167 - [c17]Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang:
Getting it Right: Improving Spatial Consistency in Text-to-Image Models. ECCV (22) 2024: 204-222 - [c16]Musashi Hinck, Carolin Holtermann, Matthew L. Olson, Florian Schneider, Sungduk Yu, Anahita Bhiwandiwalla, Anne Lauscher, Shao-Yen Tseng, Vasudev Lal:
Why do LLaVA Vision-Language Models Reply to Images in English? EMNLP (Findings) 2024: 13402-13421 - [c15]Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi, Swabha Swayamdipta:
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge. NAACL-HLT (Findings) 2024: 4502-4520 - [i30]Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang:
Getting it Right: Improving Spatial Consistency in Text-to-Image Models. CoRR abs/2404.01197 (2024) - [i29]Musashi Hinck, Matthew L. Olson, David Cobbley, Shao-Yen Tseng, Vasudev Lal:
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model. CoRR abs/2404.01331 (2024) - [i28]Gabriela Ben Melech Stan, Raanan Y. Yehezkel Rohekar, Yaniv Gurwicz, Matthew Lyle Olson, Anahita Bhiwandiwalla, Estelle Aflalo, Chenfei Wu, Nan Duan, Shao-Yen Tseng, Vasudev Lal:
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models. CoRR abs/2404.03118 (2024) - [i27]Zhipeng Cai, Matthias Mueller, Reiner Birkl, Diana Wofk, Shao-Yen Tseng, Junda Cheng, Gabriela Ben Melech Stan, Vasudev Lal, Michael Paulitsch:
L-MAGIC: Language Model Assisted Generation of Images with Coherence. CoRR abs/2406.01843 (2024) - [i26]Xin Su, Man Luo, Kris W. Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard:
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs. CoRR abs/2406.19593 (2024) - [i25]Musashi Hinck, Carolin Holtermann, Matthew Lyle Olson, Florian Schneider, Sungduk Yu, Anahita Bhiwandiwalla, Anne Lauscher, Shao-Yen Tseng, Vasudev Lal:
Why do LLaVA Vision-Language Models Reply to Images in English? CoRR abs/2407.02333 (2024) - [i24]Sungduk Yu, Brian L. White, Anahita Bhiwandiwalla, Musashi Hinck, Matthew Lyle Olson, Tung Nguyen, Vasudev Lal:
ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution. CoRR abs/2408.15993 (2024) - [i23]Avinash Madasu, Yossi Gandelsman, Vasudev Lal, Phillip Howard:
Quantifying and Enabling the Interpretability of CLIP-like Models. CoRR abs/2409.06579 (2024) - [i22]Sungduk Yu, Man Luo, Avinash Madasu, Vasudev Lal, Phillip Howard:
Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review. CoRR abs/2410.03019 (2024) - [i21]Neale Ratzlaff, Matthew Lyle Olson, Musashi Hinck, Shao-Yen Tseng, Vasudev Lal, Phillip Howard:
Debiasing Large Vision-Language Models by Ablating Protected Attribute Representations. CoRR abs/2410.13976 (2024) - [i20]Prafulla Kumar Choubey, Xin Su, Man Luo, Xiangyu Peng, Caiming Xiong, Tiep Le, Shachar Rosenman, Vasudev Lal, Phil Mui, Ricky Ho Yin Chan, Phillip Howard, Chien-Sheng Wu:
Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency. CoRR abs/2410.16597 (2024) - 2023
- [j1]Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shachar Rosenman, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal:
MuMUR: Multilingual Multimodal Universal Retrieval. Inf. Retr. J. 26(1): 5 (2023) - [c14]Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning. AAAI 2023: 10637-10647 - [c13]Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning. ACL (1) 2023: 14507-14525 - [c12]Avinash Madasu, Vasudev Lal:
Is Multimodal Vision Supervision Beneficial to Language? CVPR Workshops 2023: 2637-2642 - [c11]Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal:
Improving Video Retrieval Using Multilingual Knowledge Transfer. ECIR (1) 2023: 669-684 - [c10]Tiep Le, Vasudev Lal, Phillip Howard:
COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs. NeurIPS 2023 - [c9]Jerry Tang, Meng Du, Vy A. Vo, Vasudev Lal, Alexander Huth:
Brain encoding models based on multimodal transformers can transfer across language and vision. NeurIPS 2023 - [i19]Avinash Madasu, Vasudev Lal:
Is multi-modal vision supervision beneficial to language? CoRR abs/2302.05016 (2023) - [i18]Gadi Singer, Joscha Bach, Tetiana Grinberg, Nagib Hakim, Phillip Howard, Vasudev Lal, Zev Rivlin:
Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding. CoRR abs/2303.12084 (2023) - [i17]Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi, Swabha Swayamdipta:
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge. CoRR abs/2305.04978 (2023) - [i16]Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Müller, Vasudev Lal:
LDM3D: Latent Diffusion Model for 3D. CoRR abs/2305.10853 (2023) - [i15]Jerry Tang, Meng Du, Vy A. Vo, Vasudev Lal, Alexander G. Huth:
Brain encoding models based on multimodal transformers can transfer across language and vision. CoRR abs/2305.12248 (2023) - [i14]Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan:
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning. CoRR abs/2306.00103 (2023) - [i13]Avinash Madasu, Vasudev Lal:
ICSVR: Investigating Compositional and Semantic Understanding in Video Retrieval Models. CoRR abs/2306.16533 (2023) - [i12]Tiep Le, Vasudev Lal, Phillip Howard:
COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs. CoRR abs/2309.14356 (2023) - [i11]Phillip Howard, Avinash Madasu, Tiep Le, Gustavo A. Lujan-Moreno, Vasudev Lal:
Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples. CoRR abs/2310.02988 (2023) - [i10]Avinash Madasu, Anahita Bhiwandiwalla, Vasudev Lal:
Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks. CoRR abs/2310.04914 (2023) - [i9]Gabriela Ben Melech Stan, Diana Wofk, Estelle Aflalo, Shao-Yen Tseng, Zhipeng Cai, Michael Paulitsch, Vasudev Lal:
LDM3D-VR: Latent Diffusion Model for 3D VR. CoRR abs/2311.03226 (2023) - [i8]Shachar Rosenman, Vasudev Lal, Phillip Howard:
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation. CoRR abs/2311.12229 (2023) - [i7]Phillip Howard, Avinash Madasu, Tiep Le, Gustavo A. Lujan-Moreno, Anahita Bhiwandiwalla, Vasudev Lal:
Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples. CoRR abs/2312.00825 (2023) - 2022
- [c8]Gadi Singer, Joscha Bach, Tetiana Grinberg, Nagib Hakim, Phillip Ryan Howard, Vasudev Lal, Zev Rivlin:
Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding. AGI 2022: 404-412 - [c7]Phillip Howard, Arden Ma, Vasudev Lal, Ana Paula Simões, Daniel Korat, Oren Pereg, Moshe Wasserblat, Gadi Singer:
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs. CIKM 2022: 780-790 - [c6]Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal:
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers. CVPR 2022: 21374-21383 - [c5]Phillip Howard, Gadi Singer, Vasudev Lal, Yejin Choi, Swabha Swayamdipta:
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation. EMNLP (Findings) 2022: 5056-5072 - [c4]Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan:
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. NAACL-HLT (Findings) 2022: 1589-1600 - [c3]Ayal Klein, Oren Pereg, Daniel Korat, Vasudev Lal, Moshe Wasserblat, Ido Dagan:
Opinion-based Relational Pivoting for Cross-domain Aspect Term Extraction. WASSA@ACL 2022: 104-112 - [i6]Estelle Aflalo, Meng Du, Shao-Yen Tseng, Yongfei Liu, Chenfei Wu, Nan Duan, Vasudev Lal:
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers. CoRR abs/2203.17247 (2022) - [i5]Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Nan Duan:
Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning. CoRR abs/2206.08657 (2022) - [i4]Avinash Madasu, Estelle Aflalo, Gabriela Ben Melech Stan, Shao-Yen Tseng, Gedas Bertasius, Vasudev Lal:
Improving video retrieval using multilingual knowledge transfer. CoRR abs/2208.11553 (2022) - [i3]Phillip Howard, Arden Ma, Vasudev Lal, Ana Paula Simões, Daniel Korat, Oren Pereg, Moshe Wasserblat, Gadi Singer:
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs. CoRR abs/2210.10144 (2022) - [i2]Phillip Howard, Gadi Singer, Vasudev Lal, Yejin Choi, Swabha Swayamdipta:
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation. CoRR abs/2210.12365 (2022) - 2021
- [c2]Vasudev Lal, Somak Aditya, Yezhou Yang, Pasquale Minervini, Sandya Mannarswamy:
First Workshop on Knowledge Injection in Neural Networks (KINN). CIKM 2021: 4882-4883 - [c1]Vasudev Lal, Arden Ma, Estelle Aflalo, Phillip Howard, Ana Paula Simões, Daniel Korat, Oren Pereg, Gadi Singer, Moshe Wasserblat:
InterpreT: An Interactive Visualization Tool for Interpreting Transformers. EACL (System Demonstrations) 2021: 135-142 - [i1]Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan:
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. CoRR abs/2109.10504 (2021)
Coauthor Index
aka: Phillip Ryan Howard
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-28 20:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint