Projects with this topic
skyvern
https://github.com/Skyvern-AI/skyvern Automate browser-based workflows with LLMs and Computer Vision
www.skyvern.com/
mlx-vlm
https://github.com/Blaizzy/mlx-vlm MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
yzma
https://github.com/hybridgroup/yzma Write Go applications that directly integrate llama.cpp for local inference using hardware acceleration.
SymCode
https://github.com/visioncortex/SymCode The Symbolic Barcode for Humans and Machines
RS5M
https://github.com/om-ai-lab/RS5M RS5M: a large-scale vision language dataset for remote sensing [TGRS]
OmDet
https://github.com/om-ai-lab/OmDet Real-time and accurate open-vocabulary end-to-end object detection
VL-CheckList
https://github.com/om-ai-lab/VL-CheckList Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]
OVDEval
https://github.com/om-ai-lab/OVDEval A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)