Big Science - Modeling Metadata

non-profit
https://github.com/bigscience-workshop/metadata

Recent Activity

Muennighoff  submitted a paper 1 day ago
Composer 2 Technical Report
vumichien  authored a paper 6 months ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
vumichien  authored a paper 6 months ago
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Members: Timo Schick, Nora Kassner, Lucile Saulnier, Shanya Sharma, Christopher, Mike Tian-Jian Jiang, gerard dupont, Stella Biderman, Machine User Bs Metadata WG, Hugo Laurençon, Victor Sanh, Leo Tronchon, Paul Pommer, Masoud Jalili Sabet, Niklas Muennighoff, Jordan Clive, Manan Dey, M Saiful Bari, Jonathan Chang, vumichien

bs-modeling-metadata's datasets (6)

bs-modeling-metadata/c4-en-html-with-training_metadata_all

Viewer • Updated Apr 1, 2023 • 33.3k • 194

bs-modeling-metadata/c4-en-html-with-metadata

Viewer • Updated Aug 18, 2022 • 44.6M • 39k • 12

bs-modeling-metadata/website_metadata_c4

Viewer • Updated Nov 24, 2021 • 52.6k • 59 • 3

bs-modeling-metadata/wiki_dump

Updated Nov 23, 2021 • 49

bs-modeling-metadata/c4_newslike_url_only

Viewer • Updated Sep 20, 2021 • 13.8M • 27

bs-modeling-metadata/OSCAR_Entity_13_000

Viewer • Updated Sep 15, 2021 • 10.7k • 35 • 1