Average Joe

A superhuman generals.io bot, trained from scratch with self-play reinforcement learning.

“Its ability to flow army in complex situations is phenomenal.”

Average Joe is a bot for generals.io — a real-time, fog-of-war strategy game — that taught itself to play at a superhuman level, from zero, through millions of games against itself.

🏆 Superhuman, from scratch — trained purely by self-play; it never sees a human game.
🔥 Blazing-fast simulator — runs on generals-bots, a fully-vectorized JAX environment.
🔁 Fully reproducible — one config and one command reproduce the released agent end to end.
🛠️ Powered by JAX + Equinox — a small, pure-functional, JIT-compiled training loop.

📊 Results

Average Joe won 81.5% of its first 1,000 ranked games on the generals.io 1v1 ladder and finished as the #1-rated player, ahead of the strongest human and well clear of the prior AI state of the art.

🎮 Watch it play

Average Joe competes on the generals.io 1v1 ladder, where its live games and replays can be watched.

🧠 Architecture

The board — plus a short history of each player's army and land — is encoded as tokens and run through a small transformer with two heads: one picks the move, the other estimates who is winning.
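As a rough illustration of how the two head outputs might be consumed, the sketch below samples a move from per-cell logits while masking illegal moves, and encodes and decodes a histogram-style value. It is not the code in networks/transformer.py. The `[H, W, 4]` logit layout, the `[-1, 1]` value range, the bin count, `sigma`, and all function names are assumptions made for the example.

```python
import jax
import jax.numpy as jnp
from jax.scipy.special import erf

# Hypothetical value support: 51 bins over [-1, 1] (loss .. win).
NUM_BINS, V_MIN, V_MAX = 51, -1.0, 1.0
EDGES = jnp.linspace(V_MIN, V_MAX, NUM_BINS + 1)
CENTERS = 0.5 * (EDGES[:-1] + EDGES[1:])


def sample_move(key, move_logits, valid_mask):
    """Sample (row, col, direction) from per-cell logits, skipping invalid moves.

    move_logits and valid_mask are assumed to have shape [H, W, 4].
    """
    masked = jnp.where(valid_mask, move_logits, -jnp.inf)
    flat_idx = jax.random.categorical(key, masked.reshape(-1))
    return jnp.unravel_index(flat_idx, move_logits.shape)


def hl_gauss_target(value, sigma=0.03):
    """Encode a scalar target as a histogram of Gaussian mass per bin."""
    cdf = 0.5 * (1.0 + erf((EDGES - value) / (sigma * jnp.sqrt(2.0))))
    # Renormalize so the mass inside [V_MIN, V_MAX] sums to one.
    return (cdf[1:] - cdf[:-1]) / (cdf[-1] - cdf[0])


def value_from_logits(value_logits):
    """Decode the value head as the expected bin center under the softmax."""
    return jnp.sum(jax.nn.softmax(value_logits) * CENTERS)
```

In a setup like this, the value head would be trained with cross-entropy between its logits and `hl_gauss_target(return)`, and `value_from_logits` would supply the scalar used for advantages.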

Policy–value transformer — pre-norm self-attention over board + temporal tokens; emits per-cell move logits and a distributional (HL-Gauss) value. · networks/transformer.py
Self-play PPO — one network plays both sides; GAE, top-k advantage filtering, EMA weights for evaluation. · train/ppo.py
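The training-side pieces named above can be sketched in a few lines of JAX. This is a minimal illustration under stated assumptions, not the contents of train/ppo.py: the function names, the default `gamma`, `lam`, and `decay` values, and in particular the top-k rule (keep the samples with the largest absolute advantage) are guesses at one reasonable reading.

```python
import jax
import jax.numpy as jnp


def gae(rewards, values, last_value, dones, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation for one trajectory of length T.

    rewards, values, dones: float arrays of shape [T].
    last_value: bootstrap value for the state after the final step.
    dones[t] == 1.0 means the episode ended at step t.
    """
    next_values = jnp.append(values[1:], last_value)
    not_done = 1.0 - dones
    deltas = rewards + gamma * next_values * not_done - values

    def step(carry, x):
        delta, nd = x
        adv = delta + gamma * lam * nd * carry
        return adv, adv

    # Scan backwards in time; outputs keep the original time order.
    _, advantages = jax.lax.scan(
        step, jnp.zeros(()), (deltas, not_done), reverse=True
    )
    return advantages


def top_k_mask(advantages, keep_fraction=0.5):
    """Boolean mask keeping the samples with the largest |advantage| (assumed rule)."""
    k = max(1, int(keep_fraction * advantages.size))
    magnitudes = jnp.abs(advantages).reshape(-1)
    threshold = jax.lax.top_k(magnitudes, k)[0][-1]
    return jnp.abs(advantages) >= threshold


def ema_update(ema_params, params, decay=0.999):
    """Move the EMA copy of a pytree of arrays a small step toward the live weights."""
    return jax.tree_util.tree_map(
        lambda e, p: decay * e + (1.0 - decay) * p, ema_params, params
    )
```

The mask would multiply the per-sample policy loss, and the EMA copy would be the one loaded for evaluation. With an Equinox model, the array leaves would need to be separated from static fields before calling `ema_update`.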

📦 Install

Requires Python ≥ 3.11 and a JAX build for your accelerator (CPU/GPU/TPU).

pip install -e .

Average Joe depends on the generals-bots environment, which provides the generals.core.* package (the vectorized game, observations, and reward functions). It is a separate package that is not on PyPI: install it from source and make sure it is importable before running.
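A quick way to confirm the environment is importable before starting a run is to look the packages up without importing them. The helper below is a hypothetical convenience script, not part of this repository. It assumes only the top-level package names mentioned above.

```python
import importlib.util


def missing_packages(names=("jax", "equinox", "generals")):
    """Return the top-level packages from `names` that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("missing:", ", ".join(missing) if missing else "none")
```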

🚀 Train

python main.py --config configs/custom/L_7d_gae90.yaml

L_7d_gae90 is the config behind the released agent. Checkpoints (a regular and an EMA copy) are written to checkpoints/<run_name>/, alongside the exact config that produced them. Any Config field can be overridden on the CLI, e.g. --num_envs 256. configs/ also holds map-size presets (S / M / L / default).
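One common way to get "any field can be overridden" behaviour is to generate one CLI flag per dataclass field. The sketch below shows that pattern. It is not the repository's config.py, and apart from `num_envs` every field and default here is hypothetical. In practice the base `Config` would be built from the YAML file rather than from defaults.

```python
import argparse
from dataclasses import dataclass, fields, replace


@dataclass(frozen=True)
class Config:
    # Only num_envs is taken from the text; the rest are placeholders.
    num_envs: int = 128
    learning_rate: float = 3e-4
    run_name: str = "debug"


def apply_cli_overrides(base: Config, argv: list[str]) -> Config:
    """Return a copy of `base` with any --field value flags applied."""
    parser = argparse.ArgumentParser()
    for f in fields(Config):
        # f.type is the annotated class (int, float, str) and parses the string.
        parser.add_argument(f"--{f.name}", type=f.type, default=None)
    args = vars(parser.parse_args(argv))
    overrides = {k: v for k, v in args.items() if v is not None}
    return replace(base, **overrides)
```

For example, `apply_cli_overrides(Config(), ["--num_envs", "256"])` yields a config with `num_envs` set to 256 and every other field unchanged.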

🕹️ Evaluate / play

python evals/eval.py                                       # vs a random opponent (pygame)
python evals/eval_selfplay.py                              # the agent vs itself

📈 Logging (optional)

Training logs to Weights & Biases when a token is present at .secrets/wandb_token.txt; otherwise it runs console-only.
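The token check described above could look roughly like the following. This is a sketch, not the repository's logger.py: the function names and the `project` value are placeholders, and only `wandb.login` and `wandb.init` are real Weights & Biases calls.

```python
from pathlib import Path


def read_wandb_token(path=".secrets/wandb_token.txt"):
    """Return the stripped token, or None if the file is missing or empty."""
    p = Path(path)
    if not p.is_file():
        return None
    return p.read_text().strip() or None


def init_logging(token_path=".secrets/wandb_token.txt", project="average-joe"):
    """Start a W&B run if a token exists; return None for console-only mode."""
    token = read_wandb_token(token_path)
    if token is None:
        return None
    import wandb  # imported lazily so console-only runs do not need it

    wandb.login(key=token)
    return wandb.init(project=project)
```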
