Experiments Index¶
Status: Experiment index Audience: Team, AI agents Purpose: Entry point into dated experiment logs, evaluations, retrains, and benchmark changes
Use this folder for dated, narrative experiment notes:
benchmark changes
evaluation rounds
retraining results
acquisition-wave measurements
autoresearch observations and decisions
Current logs¶
Recommended log shape¶
Each experiment note should stay compact and follow the same structure:
Context
Problem or goal
Commands run
Results
Interpretation
Decision or next step