Framework for building AI agents that work effectively across multiple context windows on complex, long-running tasks.
Framework for enabling AI agents to work effectively across many context windows on complex tasks.
Long-running agents must work in discrete sessions where each new session begins with no memory of previous work. Without proper scaffolding, agents tend to:
Sets up the environment with all context future agents need:
init.sh script for environment setupfeature_list.json with all requirementsclaude-progress.txt for session loggingSee references/initializer-prompt.md for the full prompt template.
Makes incremental progress while maintaining clean state:
See references/coding-prompt.md for the full prompt template.
Every coding agent session should begin:
1. pwd # Understand working directory
2. cat claude-progress.txt # Read recent progress
3. cat feature_list.json # Check feature status
4. git log --oneline -20 # Review recent commits
5. ./init.sh # Start dev environment
6. <run basic test> # Verify app works
7. <select next feature> # Choose one failing feature
Comprehensive list of all features with pass/fail status. Use JSON format to prevent inappropriate edits.
{
"features": [
{
"category": "functional",
"description": "User can create new chat",
"steps": ["Navigate to main", "Click New Chat", "Verify creation"],
"passes": false
}
]
}
Template: assets/feature_list_template.json
Session-by-session log of work completed. Each entry includes:
Template: assets/progress_template.md
Environment setup script that:
passes field status| Problem | Solution |
|---|---|
| Agent one-shots entire project | Create detailed feature list, work one at a time |
| Declares victory too early | Check feature_list.json for failing tests |
| Leaves broken state | Run basic test at session start, fix first |
| Marks features done prematurely | Require end-to-end browser testing |
| Wastes time figuring out setup | Read init.sh, use established patterns |
This framework generalizes beyond web development. Key principles: