Projects with this topic
Sort by:
-
SpatialLM: Training Large Language Models for Structured Indoor ModelingUpdated
-
https://github.com/InternLM/InternLM-XComposer InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Updated -
🔧 🔗 https://github.com/FoundationVision/GenerateU[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
Updated -
🔧 🔗 https://github.com/FoundationVision/Groma[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Updated