Interactive learning for modern LLMs
Understand LLMs by feeling them
Explore latency, tokenization, retrieval (RAG), network structure, sampling, and agents—hands-on, in your browser. No accounts. No backend. Just insight.
Runs 100% in-browser · No sign-in required · Optimised for WebGPU
Latency shapes UX
- Vary tokens/sec to feel cadence
- Perceived speed vs. actual throughput
- Design for anticipation
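The cadence effect above can be sketched with a toy model: time-to-first-token drives perceived speed, while tokens/sec drives actual throughput. All numbers here are illustrative assumptions, not measurements from any real model.

```typescript
interface StreamFeel {
  timeToFirstTokenMs: number; // what users perceive as "speed"
  totalMs: number;            // actual end-to-end throughput
}

// Toy cadence model: a fixed first-token delay, then steady streaming.
function streamFeel(
  tokens: number,
  tokensPerSec: number,
  firstTokenLatencyMs: number
): StreamFeel {
  const streamingMs = (tokens / tokensPerSec) * 1000;
  return {
    timeToFirstTokenMs: firstTokenLatencyMs,
    totalMs: firstTokenLatencyMs + streamingMs,
  };
}

// A 300 ms first token reads as "instant", even though the full
// 200-token answer takes over 10 s at 20 tok/s.
const fast = streamFeel(200, 20, 300);
console.log(fast.timeToFirstTokenMs, fast.totalMs); // 300 10300
```

Designing for anticipation means optimising the first number even when you cannot change the second.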
Tokens are the atoms
- Whitespace vs. model tokens
- Why prompts ≠ characters
- Costs scale with tokens
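Why prompts ≠ characters can be sketched without a real tokenizer: compare a whitespace word count against the rough ~4-characters-per-token rule of thumb for English text. The heuristic is an assumption for illustration, not an exact count from any BPE vocabulary.

```typescript
// Whitespace "words": what humans tend to count.
function whitespaceTokens(text: string): number {
  return text.trim().split(/\s+/).length;
}

// Crude model-token estimate: ~4 characters per token (assumed heuristic,
// not a real tokenizer -- real BPE counts vary by model and language).
function estimatedModelTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

const prompt = "Costs scale with tokens, not characters.";
console.log(whitespaceTokens(prompt));     // 6 words
console.log(estimatedModelTokens(prompt)); // ~10 estimated tokens
```

The gap between the two numbers is exactly why billing and context limits are quoted in tokens, never in characters.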
RAG = open-book answers
- Retrieve relevant snippets
- Fit within a context window
- Answer with citations
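The three steps above can be sketched as a minimal loop: score snippets against the query, pack the best ones into a fixed budget, and keep their ids for citation. This toy version uses word overlap and a word budget as stand-ins; production RAG uses embedding similarity and a real token count.

```typescript
interface Snippet { id: string; text: string; }

// Toy relevance score: how many query words appear in the snippet.
function overlapScore(query: string, snippet: string): number {
  const q = new Set(query.toLowerCase().split(/\W+/).filter(Boolean));
  return snippet.toLowerCase().split(/\W+/).filter((w) => q.has(w)).length;
}

// Rank snippets, then greedily pack them into a finite context budget.
function retrieve(query: string, corpus: Snippet[], budgetWords: number): Snippet[] {
  const ranked = [...corpus].sort(
    (a, b) => overlapScore(query, b.text) - overlapScore(query, a.text)
  );
  const picked: Snippet[] = [];
  let used = 0;
  for (const s of ranked) {
    const words = s.text.split(/\s+/).length;
    if (used + words > budgetWords) continue; // context window is finite
    picked.push(s);
    used += words;
  }
  return picked; // ids double as citations in the final answer
}

const corpus: Snippet[] = [
  { id: "doc1", text: "Tokens are the atoms of LLM input." },
  { id: "doc2", text: "WebGPU runs compute shaders in the browser." },
  { id: "doc3", text: "Retrieval finds relevant snippets for a query." },
];

const picked = retrieve("how does retrieval find snippets", corpus, 10);
console.log(picked.map((s) => s.id)); // only doc3 fits and is relevant
```

The budget check is the "open-book" constraint in miniature: even a perfect retriever must drop material once the context window is full.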
Ready to peek under the hood?
Jump into any demo—each one is fast, focused, and teaches by doing.