Experiment Dashboard

A bird's-eye view of every stage. Numbers update as you progress through the pipeline.

The dashboard is empty until you load a corpus and run preprocessing. Start here →

Corpus characters

—

Total tokens

—

Vocabulary

—

Sentences

—

Train tokens

—

Validation tokens

—

Test tokens

—

—

Test perplexity (lower is better)

—

Test perplexity (lower is better)

Not yet tuned. Run sweep →

Most frequent unigrams in the training split.

No data yet.

Tokens per split.

Not split yet.

Test-set perplexity. Lower is better. LM1 with no smoothing can go to infinity when an unseen 4-gram appears.

No evaluation yet.