multimodal
Projects with this topic
-
https://github.com/Stability-AI/stability-sdk SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Updated -
https://github.com/InternLM/HuixiangDou HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Updated -
https://github.com/princeton-nlp/CharXiv CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Updated -
🔧 🔗 https://github.com/FoundationVision/Groma[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Updated -
🔧 🔗 https://github.com/forhaoliu/language-quantized-autoencodersLanguage Quantized AutoEncoders This is a Jax implementation of our work Language Quantized AutoEncoders.
Updated -
https://github.com/InternLM/InternLM-XComposer InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Updated