Stars
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
A curated list of Hypergraph Learning, Hypergraph Theory, Hypergraph Dataset and Hypergraph Tool.
Keras documentation, hosted live at keras.io
A Survey on multimodal learning research.
Diverse Image Captioning with Grounded Style (GCPR 2021)
Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems
Awesome Papers About Performing Prompting On Graphs
A pytorch library for graph and hypergraph computation.
The repository contains lists of papers on causality and how relevant techniques are being used to further enhance deep learning era computer vision solutions.
Scenic: A Jax Library for Computer Vision Research and Beyond
结合XrecyclerView 和BaseRecyclerViewAdapterHelper更加的方便的调用RecyclerView的下拉刷新跟上拉加载,本库基于“https://github.com/jianghejie/XRecyclerView“ 以及”https://github.com/CymChad/BaseRecyclerViewAdapterHelper“ 特此感谢
[Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Multi Task Vision and Language
cfh3c / InVQG
Forked from bcxbg/ICVQGDataset & Code for Inferential Visual Question Generation
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…
🤘 awesome-semantic-segmentation
Optimized primitives for collective multi-GPU communication
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
Google Drive Public File Downloader when Curl/Wget Fails
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
A curated list of research papers in exploring causality in vision. Link to the code if available is also present.
This repository contains the main baselines introduced in WSSTG (ACL 2019).
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)