EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Updated Nov 27, 2024 - Python
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
The back end of a cross-modal retrieval system, which will contain services such as semantic location, etc.
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning
The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.
A search engine built on the OpenAI CLIP model that retrieves images matching text queries.
Digimon Dataset for MultiModal Machine Learning