Model checkpoints accompanying the paper "Scaling Behavior of Discrete Diffusion Language Models" (https://arxiv.org/abs/2512.10858).
Dimitri von Rütte
dvruette
AI & ML interests
None yet
Recent Activity
new activity 23 days ago
dvruette/fabric:Update requirements.txt new activity 23 days ago
dvruette/fabric:Space has crashed updated a Space 23 days ago
dvruette/fabricOrganizations
models 39
dvruette/openwebtext-bpe-1k
Updated
dvruette/openwebtext-bpe-16k
Updated
dvruette/openwebtext-bpe-131k
Updated
dvruette/openwebtext-bpe-8k
Updated
dvruette/openwebtext-bpe-4k
Updated
dvruette/openwebtext-bpe-33k
Updated
dvruette/openwebtext-bpe-66k
Updated
dvruette/openwebtext-bpe-2k
Updated
dvruette/gidd-unif-3b-orbax
Updated
dvruette/gidd-unif-3b
Text Generation • 3B • Updated • 1.41k
datasets 14
dvruette/openwebtext-tokenized-131k
Viewer • Updated • 8.01M • 67
dvruette/openwebtext-tokenized-66k
Viewer • Updated • 8.01M • 8
dvruette/openwebtext-tokenized-33k
Viewer • Updated • 8.01M • 58
dvruette/openwebtext-tokenized-16k
Viewer • Updated • 8.01M • 17
dvruette/openwebtext-tokenized-8k
Viewer • Updated • 8.01M • 5
dvruette/openwebtext-tokenized-4k
Viewer • Updated • 8.01M • 152
dvruette/openwebtext-tokenized-2k
Viewer • Updated • 8.01M • 70
dvruette/openwebtext-tokenized-1k
Viewer • Updated • 8.01M • 40
dvruette/openwebtext
Viewer • Updated • 8.01M • 45
dvruette/gidd-nemotron-cc-pretok
Preview • Updated • 2.03k