Projects with this topic
https://github.com/Skyvern-AI/skyvern Automate browser-based workflows with LLMs and Computer Vision
www.skyvern.com/
mlx-vlm
https://github.com/Blaizzy/mlx-vlm MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
https://github.com/om-ai-lab/RS5M RS5M: a large-scale vision language dataset for remote sensing [TGRS]
https://github.com/om-ai-lab/OmDet Real-time and accurate open-vocabulary end-to-end object detection
https://github.com/om-ai-lab/VL-CheckList Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]
https://github.com/om-ai-lab/OVDEval A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)