FineWeb-C is a community-driven project that expands upon FineWeb2, providing educational content annotations across hundreds of languages.The project enables community members to rate web content's educational value and improve Language Model development.The dataset, FineWeb-Edu, demonstrates superior performance compared to existing datasets and focuses on educational content labeling.The project prioritizes human-generated annotations, particularly for low-resource languages, and operates under open licenses.