Projects with this topic
🔧 🔗 https://github.com/modelscope/ms-swift SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning): use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs.
🔧 🔗 https://github.com/om-ai-lab/VLM-R1 Solve Visual Understanding with Reinforced VLMs
🔧 🔗 https://github.com/QwenLM/online_merging_optimizers Implementations of the online merging optimizers proposed in "Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment"