Autoresearch

Rating:

Rate it!

Category:Research AI Agents

Overview

An open-source project that lets AI agents autonomously run LLM training experiments and keep the best model changes.

Visit website

Best For Professions:

AI researchers machine learning engineers software developers research engineers

Autoresearch is an open-source project by Andrej Karpathy that lets AI agents run autonomous machine learning research loops on a small but real LLM training setup. The repository is designed so an agent edits the main training file, launches a fixed 5-minute experiment, evaluates whether the result improved, and then keeps or discards the change before repeating the cycle. Its README describes the setup as a lightweight autonomous research organization driven by instructions in a program.md file rather than traditional manual code iteration. The project is built around a simplified single-GPU nanochat training workflow and is aimed at developers and researchers exploring automated model improvement, agent-driven experimentation, and compact research loops on their own hardware. :contentReference[oaicite:0]{index=0}

Autonomy level

74%

Reasoning: Autoresearch demonstrates high operational autonomy within defined constraints. The AI agent operates independently in a continuous feedback loop, autonomously modifying train.py code, executing 5-minute training runs, and making keep/discard decisions based on validation bits-per-byte (val_bpb) metrics without human intervention. Each experiment i...

Comparisons

Custom Comparisons

Some of the use cases of Autoresearch:

Running autonomous overnight experiments to improve small language model training setups.
Testing agent-driven code changes in a controlled single-file training workflow.
Exploring automated research loops for architecture, optimizer, and hyperparameter changes.
Studying how AI systems can manage iterative machine learning experimentation with minimal human intervention.

Loading Community Opinions...

Pricing model:

free

Code access:

open-source

Popularity level: 72%

Autoresearch

Overview

Best For Professions:

Autonomy level

Comparisons

Custom Comparisons

Some of the use cases of Autoresearch:

Pricing model:

Code access:

Popularity level: 72%

Industries:

Tags:

Autoresearch Video:

Describe the job. Get an AI worker you can actually message.

Did you find this page useful?