@@ -8,7 +8,7 @@ The idea: give an AI agent a small but real LLM training setup and let it experi
## How it works
-The repo is deliberately kept small and only really has a three files that matter:
+The repo is deliberately kept small and only really has three files that matter:
- **`prepare.py`** — fixed constants, one-time data prep (downloads training data, trains a BPE tokenizer), and runtime utilities (dataloader, evaluation). Not modified.
- **`train.py`** — the single file the agent edits. Contains the full GPT model, optimizer (Muon + AdamW), and training loop. Everything is fair game: architecture, hyperparameters, optimizer, batch size, etc. **This file is edited and iterated on by the agent**.
@@ -16,6 +16,8 @@ The repo is deliberately kept small and only really has a three files that matte
By design, training runs for a **fixed 5-minute time budget** (wall clock, excluding startup/compilation), regardless of the details of your compute. The metric is **val_bpb** (validation bits per byte) — lower is better, and vocab-size-independent so architectural changes are fairly compared.
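
To make the "vocab-size-independent" claim concrete, here is a minimal sketch of a bits-per-byte calculation. It assumes the metric is the summed token-level cross-entropy (in nats) converted to bits and divided by the number of raw bytes in the evaluated text. The function name `bits_per_byte` and the numbers are hypothetical; the repo's actual evaluation lives in `prepare.py` and may differ in detail.

```python
import math


def bits_per_byte(total_nll_nats: float, total_bytes: int) -> float:
    """Convert summed cross-entropy (nats) over a text into bits per byte."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    # nats -> bits is a division by ln(2); then normalize by raw byte count.
    return total_nll_nats / (math.log(2) * total_bytes)


# Two hypothetical tokenizers covering the same 1000 bytes of text.
# Coarse vocab: 250 tokens at 4 bits of loss per token.
coarse = bits_per_byte(250 * 4 * math.log(2), 1000)
# Fine vocab: 500 tokens at 2 bits of loss per token.
fine = bits_per_byte(500 * 2 * math.log(2), 1000)

print(coarse, fine)
```

The per-token losses differ by a factor of two, yet both runs land on the same bits-per-byte value, because the normalization is by bytes of text and not by token count. That is why a change to the tokenizer or vocabulary size does not by itself move the metric.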
+If you are new to neural networks, this ["Dummy's Guide"](https://x.com/hooeem/status/2030720614752039185) looks pretty good for a lot more context.
## Quick start
**Requirements:** A single NVIDIA GPU (tested on H100), Python 3.10+, [uv](https://docs.astral.sh/uv/).
@@ -83,6 +85,7 @@ I think these would be the reasonable hyperparameters to play with. Ask your fav
- [miolini/autoresearch-macos](https://github.com/miolini/autoresearch-macos) (MacOS)
- [trevin-creator/autoresearch-mlx](https://github.com/trevin-creator/autoresearch-mlx) (MacOS)
- [jsegov/autoresearch-win-rtx](https://github.com/jsegov/autoresearch-win-rtx) (Windows)
+- [andyluo7/autoresearch](https://github.com/andyluo7/autoresearch) (AMD)
## License