oscarcorpus/OSCAR2109 · Datasets at Hugging Face
OSCAR follows the OSCAR Schema, which adds metadata to each entry while staying backwardscompatible with OSCAR. The order of operations is similar as in the goclassy pipeline, with optimisations regarding IO and a finer granlularity regarding multithreading.