Experimental NPX installer for TDD-focused AI agents
This installs a collection of AI agents designed for Test-Driven Development and rapid prototyping. It's experimental, opinionated, and I built it to speed up my own MVP development work.
npx claude-code-collective init
You get 30+ specialized agents that enforce TDD methodology and try to be smarter about using real documentation instead of guessing.
I got tired of:
- AI giving me code without tests
- Having to manually look up library documentation
- Inconsistent development approaches across projects
- Breaking down complex features manually
So I built agents that:
- Write tests first, always (RED β GREEN β REFACTOR)
- Use Context7 to pull real documentation
- Route work to specialists based on what needs doing
- Break down complex requests intelligently
- @component-implementation-agent - UI components with tests and modern patterns
- @feature-implementation-agent - Business logic with comprehensive testing
- @infrastructure-implementation-agent - Build systems with testing setup
- @testing-implementation-agent - Test suites that actually test things
- @polish-implementation-agent - Performance optimization with preserved tests
- @quality-agent - Code review and standards checking
- @devops-agent - Deployment and CI/CD setup
- @functional-testing-agent - Browser testing with Playwright
- @enhanced-quality-gate - Comprehensive validation gates
- @completion-gate - Task validation and completion checks
- @research-agent - Context7-powered documentation lookup
- @prd-research-agent - Intelligent requirement breakdown
- @task-orchestrator - Smart task parallelization
/van
command - Entry point that routes to @task-orchestrator- @task-orchestrator - Central routing hub that picks the right specialist
- @behavioral-transformation-agent - System behavioral setup
- @hook-integration-agent - TDD enforcement automation
- @van-maintenance-agent - System maintenance and updates
Plus 20+ other specialized agents for specific development tasks.
npx claude-code-collective init
# Just core agents for lightweight projects
npx claude-code-collective init --minimal
# Focus on testing framework only
npx claude-code-collective init --testing-only
# Just the behavioral system and hooks
npx claude-code-collective init --hooks-only
# Interactive setup with choices
npx claude-code-collective init --interactive
your-project/
βββ CLAUDE.md # Behavioral rules for agents
βββ .claude/
β βββ settings.json # Hook configuration
β βββ agents/ # Agent definitions (30+ files)
β β βββ prd-research-agent.md
β β βββ task-orchestrator.md
β β βββ lib/
β β β βββ research-analyzer.js # Complexity analysis engine
β β βββ ... (lots more agents)
β βββ hooks/ # TDD enforcement scripts
β βββ test-driven-handoff.sh
β βββ collective-metrics.sh
βββ .claude-collective/
βββ tests/ # Test framework templates
βββ metrics/ # Usage tracking (for development)
βββ package.json # Testing setup (Vitest)
/van
command routes to @task-orchestrator (the routing hub) which analyzes requests and delegates to specialists- Research phase - agents use Context7 for real documentation
- Tests written first - before any implementation
- Implementation - minimal code to make tests pass
- Refactoring - clean up while keeping tests green
- Delivery - you see what tests were added and results
## DELIVERY COMPLETE
β
Tests written first (RED phase)
β
Implementation passes tests (GREEN phase)
β
Code refactored for quality (REFACTOR phase)
π Test Results: X/X passing
# Check what's installed and working
npx claude-code-collective status
# Validate installation integrity
npx claude-code-collective validate
# Fix broken installations
npx claude-code-collective repair
# Remove everything
npx claude-code-collective clean
# Get help
npx claude-code-collective --help
- TDD enforcement prevents a lot of bugs
- Context7 integration is much better than agents guessing
- Routing usually picks the right agent for the job
- Breaking down complex tasks is genuinely helpful
- Some agents are still being refined
- Research phase can be slow sometimes
- Hook system requires restart (Claude Code limitation)
- Documentation is scattered across files
- Requires Node.js >= 16
- Need to restart Claude Code after installation
- Opinionated about TDD (if you don't like tests, skip this)
- Some agents might be too thorough/slow for simple tasks
After installing:
# 1. Validate everything installed correctly
npx claude-code-collective validate
# 2. Check status
npx claude-code-collective status
# 3. Restart Claude Code (required for hooks)
# 4. Try it out
# In Claude Code: "Build a simple todo app with React"
# Expected: routes to research β breaks down task β writes tests β implements
- Check Node.js version:
node --version
(need >= 16) - Clear npm cache:
npm cache clean --force
- Try force install:
npx claude-code-collective init --force
- Restart Claude Code (hooks need to load)
- Check
.claude/settings.json
exists - Run
npx claude-code-collective validate
- Make sure your project has a test runner (Jest, Vitest, etc.)
- Check if tests are actually being written to files
- Look at the TDD completion reports from agents
- Context7 might be having connectivity issues
- Agent might be being thorough (this varies)
- Check
.claude-collective/metrics/
for timing data
- Node.js: >= 16.0.0
- NPM: >= 8.0.0
- Claude Code: With MCP support and hook system
- Restart: Required after installation (hooks limitation)
- Experimental development aid for rapid prototyping
- Collection of TDD-focused AI agents
- Personal project that I use for my own MVPs
- Opinionated about test-first development
- Production-ready enterprise software
- Guaranteed to work perfectly
- Following any official standards
- A replacement for thinking or understanding code
Because in my experience:
- Writing tests first forces better design thinking
- Tests catch bugs when they're cheap to fix
- Refactoring is safe with good test coverage
- Code with tests is easier to change later
The agents enforce this because I believe it leads to better outcomes. If you disagree with TDD philosophy, this tool probably isn't for you.
To make agents smarter about modern development:
- Context7 integration - real, current library documentation
- ResearchDrivenAnalyzer - intelligent complexity assessment
- Smart task breakdown - only creates subtasks when actually needed
- Best practice application - research-informed patterns
This stuff is experimental and sometimes overthinks things, but generally helpful.
AI agents can be unreliable. Here's what I built to deal with that:
Agents ignoring TDD rules: Hook system enforces test-first development before any code gets written.
Agents bypassing directives: CLAUDE.md behavioral operating system with prime directives that override default behavior.
Agents stopping mid-task: Test-driven handoff validation ensures work gets completed or explicitly handed off.
Agents making up APIs: Context7 integration forces agents to use real, current documentation.
Agents taking wrong approach: Central routing through @task-orchestrator hub prevents agents from self-selecting incorrectly.
Agents breaking coordination: Hub-and-spoke architecture eliminates peer-to-peer communication chaos.
Agents skipping quality steps: Quality gates that block completion until standards are met.
Agents losing context: Handoff contracts preserve required information across agent transitions.
Agents providing inconsistent output: Standardized TDD completion reporting from every implementation agent.
Agents working on wrong priorities: ResearchDrivenAnalyzer scores complexity to focus effort appropriately.
Most of these are enforced automatically through hooks and behavioral constraints, not just hoping agents follow instructions.
This is a personal project, but:
- Issues welcome if you find bugs or have suggestions
- PRs welcome for small fixes or better agent prompts
- Don't expect rapid responses - this is a side project
Get help: Run npx claude-code-collective validate
for diagnostics
MIT License - Use it, break it, fix it, whatever works for you.
Experimental | TDD-Focused | Personal Project | Use At Your Own Risk
See CHANGELOG.md for detailed version history and release notes.