G
grpo

Projects with this topic

View VLM R1 project

mirrored_repos / MachineLearning / OM AI Lab / VLM R1

🔧🔗https://github.com/om-ai-lab/VLM-R1 Solve Visual Understanding with Reinforced VLMs

vlm multimodal Large Langua... qwen deepseek grpo vlm-r1

0

Updated Mar 24, 2026

0 0 0 0

Updated Mar 24, 2026
View MLX GRPO project

mirrored_repos / MachineLearning / doriandarko / MLX GRPO

🔧🔗https://github.com/Doriandarko/MLX-GRPO MLX-GRPO is a training framework for large language models (LLMs) that leverages Apple’s MLX framework exclusively. Designed to run natively on Apple Silicon using the Metal backend, this project implements Group-based Relative Policy Optimization (GRPO) with a chain-of-thought prompting structure. The pipeline includes dataset preparation, reward function definitions, and GRPO training—all running in a pure MLX environment (no CUDA).

grpo mlx

0

Updated Oct 28, 2025

0 0 0 0

Updated Oct 28, 2025

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾