Naphula's picture

In a Training Loop 🔄

Naphula

Naphula

·

AI & ML interests

Building new model tools, merges (7B-24B), ablations, finetunes, and datasets.

Recent Activity

new activity about 3 hours ago

24B-Suite/Mergedonia-Suite-24B-v1:Custom Methods (Index)

reacted to Parveshiiii's post with 🔥 about 6 hours ago

Just did something I’ve been meaning to try for ages. In only 3 hours, on 10 billion+ tokens, I trained a custom BPE + tiktoken-style tokenizer using my new library microtok — and it hits the same token efficiency as Qwen3. Tokenizers have always felt like black magic to me. We drop them into every LLM project, but actually training one from scratch? That always seemed way too complicated. Turns out it doesn’t have to be. microtok makes the whole process stupidly simple — literally just 3 lines of code. No heavy setup, no GPU required. I built it on top of the Hugging Face tokenizers library so it stays clean, fast, and actually understandable. If you’ve ever wanted to look under the hood and build your own optimized vocabulary instead of just copying someone else’s, this is the entry point you’ve been waiting for. I wrote up the full story, threw in a ready-to-run Colab template, and dropped the trained tokenizer on Hugging Face. Blog → https://parveshiiii.github.io/blogs/microtok/ Trained tokenizer → https://huggingface.co/Parveshiiii/microtok GitHub repo → https://github.com/Parveshiiii/microtok

liked a model 1 day ago

Retreatcost/Chrysologus-12B

View all activity

Organizations

Collections 5

View 5 collections

spaces 4

Model Tools

Merge and audit large language models on low‑VRAM GPUs

TAS 1.5

Analyze text files with advanced metrics

GGUF Repo Suite

Create and quantize Hugging Face models

Portable Offline Markdown Viewer

Portable Offline Markdown Viewer

models 149

Naphula/Riemannian-Redshift-12B-v1-GGUF

12B • Updated 1 day ago • 324 • 1

Naphula/Ancient-Awakening-12B-MPOA-GGUF

12B • Updated 3 days ago • 392 • 3

Naphula/Ancient-Awakening-12B-GGUF

12B • Updated 3 days ago • 190

Naphula/WBCR-SLERP-24B-v1

24B • Updated 3 days ago • 100

Naphula/Asmodeus-24B-v2-GGUF

24B • Updated 5 days ago • 15.2k • 2

Naphula/Slimaki-24B-v1.1-ramplus_tl

Text Generation • 24B • Updated 8 days ago • 131 • 2

Naphula/Ancient-Awakening-12B-MPOA

Text Generation • 12B • Updated 8 days ago • 102 • 4

Naphula/Ancient-Awakening-12B

Text Generation • 12B • Updated 9 days ago • 91 • 2

Naphula/Riemannian-Redshift-12B-v1

Text Generation • 12B • Updated 9 days ago • 55 • 4

Naphula/Kraken-Karcher-12B-v1-GGUF

12B • Updated 10 days ago • 163 • 1

View 149 models

datasets 4

Naphula/Updated_Settings

Updated 8 days ago • 9

Naphula/System_Prompts_Jailbreaks_Creative_Writing

Updated Dec 25, 2025 • 12 • 4

Naphula/Grim_Fandango

Viewer • Updated Nov 1, 2025 • 5 • 21

Naphula/Starship_Titanic

Updated Oct 26, 2025 • 160