D
dataset
Projects with this topic
-
🔧 🔗 https://github.com/HumanSignal/label-studioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
Updated -
https://github.com/awesome-selfhosted/awesome-selfhosted-data machine-readable data for https://awesome-selfhosted.net
Updated -
-
https://github.com/lm-sys/llm-decontaminator Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
Updated -
https://github.com/Farama-Foundation/Minari A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Updated