MM-LLMs: Recent Advances in MultiModal Large Language Models
Mar 12, 2024 multi modal model arXiv (2024)
A Theory of Multimodal Learning
Jan 29, 2024 multi modal model NIPS (2023)
Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection
Dec 4, 2023 multi modal model arXiv (2023)
Grounded Language-Image Pre-training
Nov 14, 2023 multi modal model CVPR (2022)
Open-Vocabulary Object Detection Using Captions
Oct 31, 2023 multi modal model CVPR (2021)
Link-Context Learning for Multimodal LLMs
Oct 24, 2023 multi modal model arXiv (2023)
Llama 2: Open Foundation and Fine-Tuned Chat Models
Sep 27, 2023 large language model arXiv (2023)
LLaMA: Open and Efficient Foundation Language Models
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Jul 11, 2023 object detection arXiv 2022