Tinker
Resources
Agent logs
Agent memories
Agent sessions
Agent terminal logs
Agents
Comments
Epics
Projects
Proposals
Tickets
Avo user
Resources
Agent logs
Agent memories
Agent sessions
Agent terminal logs
Agents
Comments
Epics
Projects
Proposals
Tickets
Avo user
Home
Tickets
Introduce Researcher agent: 24/7 autonomous code & system analyst
Edit
Introduce Researcher agent: 24/7 autonomous code & system analyst
Cancel
Actions
Transition Ticket
Approve & Merge
Save
Title
*
Priority
low
medium
high
critical
Ticket type
epic
story
task
bug
subtask
Project
*
Choose an option
alpha
tinker
Create new project
Parent Ticket
Choose an option
Create OrchestratorPingJob for periodic coordination
Add Chat to Remote Support Sessions
Migrate from jsonapi-serializer to Alba gem
Allow cancel and approve transitions from any ticket state
Simplify data model and encourage knowledge accumulation
Remove memory_consolidation_job and all references
Add draft status for tickets
Investigation: Port sandboxed Docker + MCP + skills to other repositories
Remove unused columns from tickets table
Integrate chat panel into OperatorCommandCenter
Agent Visibility System: Live terminal streaming, session logs, and debug dashboard
Part 1: Live Terminal Streaming - Capture agent PTY output and stream to WebSocket
Autonomous Agent Coordination via WebSocket
Part 2: Agent Session Logs - Store terminal output persistently for replay
Remove agent assignments - use status-only queue workflow
Add mark_busy and mark_idle MCP tools for agents
Improve dashboard/logs page visual appearance
Add reject transition from in_progress status
Auto-block tickets with unsatisfied dependencies (phased development)
Forbid bare HTTP status checks in tests (hides debugging info)
Orchestrator assigns merged tickets despite workflow instructions
Fix ticket updates from UI
Simplify transition_ticket to allow any status (remove finalize_task)
Add UI to update tickets
Implement Automated Task Prioritization Engine
Create Git Workflow Skills for Tinker Agents
Fix reviewer guidelines: Require test coverage for new functionality
Escalation: Reviewer agent unresponsive - not picking up pending audits
Change default ticket status from backlog to draft
Escalation: Clear 11 tickets stuck in pending_approval status
Fix worker workflow: submit_review transition missed due to context overflow
Make planner a first-class agent member (not wildcard)
Register archive/unarchive MCP tools in mcp-bridge
Extract Critical Workflows to Skills (Test Set)
Create Automated Code Generation and Refactoring Engine
Fix memory_consolidation_job reference to removed acceptance_criteria column
Test Alba migration and verify API compatibility
Add API key authentication (no user model)
Bug: Worker marked busy but no session created - logs have nowhere to go
Cleanup and finalize Alba migration
Update list_tickets MCP tool to add pagination with max limit of 20
Add formal ticket dependencies: prevent orchestrator assigning blocked tasks
Update AgentChannel to only log (no state management)
Fix list_tasks response size - remove verbose fields
Fix kanban view - archived tickets should not appear
Task 2: Role Validation Logic Implementation
Task 4: Guardrails and Error Message Implementation
Step 4: Ticket Detail Page with daisyUI Components
Add planned tickets page (draft/backlog/todo)
Add planner to default agent creation set
Terminal App: WebSocket-to-Claude-Code Bridge
Add web UI for browsing and managing agent memories
Reduce WebSocket log noise - hide normal reconnection messages
Add agent status tracking (status, status_updated_at)
Add availability_updated_at to list_members MCP output
Create researcher-workflow skill
Make planner a standalone agent (not tied to orchestrator)
Update Gemfile: Add alba gem, remove jsonapi-serializer
Drop artifacts and code_diffs database tables
Step 8: Theming and Preferences with daisyUI
Update BaseController render_jsonapi methods for Alba
Modify dashboard controller to support showing all projects
Update Kanban view to display project information when showing all tasks
Add missing MCP tools to TypeScript bridge
Add project selector dropdown to Kanban board UI
Test cross-project Kanban functionality
Memory Consolidation Skill - Pattern Extraction & Deduplication
Add "todo" status with "plan" action to Tinker workflow
Optimize list_tickets output size and add multi-status filtering
Rename api_key_plaintext column to api_key
Step 7: Responsive Design with daisyUI Components
Task 1: System Prompt Implementation for Role Enforcement
Add assign_ticket and list_agents MCP tools
Add missing /api/v1/tickets/:id/claim endpoint
Update agents.rb with deliverable parts guidance
Remove archived column, use archived_at instead
Fix workflow: pass_audit→pending_approval, fail_audit→todo
Orchestrator should assign one ticket at a time to reviewers (not "two tickets need review")
Step 6: Logs Page with daisyUI Components
Step 9: Polish and Theme Configuration with daisyUI
Modify reviewer prompt to avoid gh pr review command
Add confidence field (0-100) to approvals and proposals
Find subtasks for Researcher epic
Add delete_proposal MCP tool for Researcher agent
Refactor get_terminal_logs MCP: line-based limiting with TerminalLogCleaner
Add list_comments MCP tool for tickets
Task 3: Role-Specific Tool Access Control
Rejection workflow: rejected tickets should go directly to "todo" not "in_progress"
Part 3: Debug Dashboard - Web UI for real-time agent monitoring
Fix list_tickets MCP tool: remove draft exclusion, always include draft tickets
Create comprehensive architecture document for autonomous agents
Add ask_for_memory_deletion MCP tool (with human confirmation)
Add file attachments support to chat
Relax reviewer scope enforcement to allow necessary related changes
Create Memory Skill for Agent Knowledge Sharing
Add test coverage for recent PRs (#56, #57, #58)
Design Autonomous Agent Framework Architecture
Step 4: Ticket Detail Page with daisyUI Components
Escalation: update_ticket MCP tool returns ForbiddenAttributesError
Fix get_terminal_logs MCP tool: limit, pagination, timestamps
Step 5: Approvals Page with daisyUI Components
Phase 3: Feature ideation from ticket analysis
Create comprehensive deployment instructions (Coolify + Neon Postgres + proper Dockerfile)
Part 2: Agent Session Logs (Store terminal output persistently)
Phase 1 (MVP): Memory and ticket pattern analysis
Step 3: Kanban Board with daisyUI Styling
Build Self-Healing System Infrastructure
Develop Intelligent Resource Management System
Add list_agent_logs MCP tool for orchestrator and researcher
Add unified get_status MCP tool for project overview
Create Autonomous Decision-Making Framework
Implement Autonomous Testing and Validation System
Add backlinks in GitHub comments and PR descriptions
Create ActionCable channel for guest chat subscriptions
Introduce Researcher agent: 24/7 autonomous code & system analyst
Improve proposal UI: markdown rendering, evidence display, links, and visibility
Add GitHub label "tinker-reviewed" to PRs after reviewer review
Build Self-Improvement and Learning System
Reject debug dashboard in PR #57 - require feature specs
Create proposal system: storage, API, and admin interface
Implement smarter context refresh conditional on worker availability
Set up Researcher agent: infrastructure, MCP tools, and guardrails
Phase 2: Code quality analysis and test coverage detection
Remove File Lists from Tickets - Constrains Agent Thinking
Fix 500 error when adding comments - undefined method 'parent_id' on Comment
Fix tickets#show comment UI: implement comment creation for humans
Convert TicketSerializer to Alba format
Convert ProjectSerializer to Alba format
Slice 1: Operator Chat (Full Vertical Slice)
Reviewer must run tests and detect missing specs before approving
Slice 2: Guest Chat (Extends Slice 1)
Implement Proactive Anomaly Detection System
Convert AgentSerializer to Alba format
Convert remaining serializers (Comment, CodeDiff, AgentMemory, Artifact) to Alba
Slice 3: Chat History Review (Polish)
Step 2: Dashboard with daisyUI Components
Step 1: Install and Configure daisyUI
Develop Continuous Performance Optimization System
Phase 4: 24/7 operation, daily digest, and batch approval
Implement Autonomous Documentation Generation
Build Autonomous Communication and Coordination Hub
Create Autonomous Security and Compliance Guardian
Implement Continuous Innovation and Experimentation Platform
Tool Usage Analytics from Logs
Integrate All Autonomous Systems into Cohesive Ecosystem
Epic: Implement Strict Role Enforcement for Tinker Agents
Fix Planner behavior - stop writing implementation details in ticket descriptions
Fix N+1 queries on kanban board page
Implement Proper Ticket Blocking/Dependency System
Implement ticket archival system with cascade and auto-archive
Task 5: Testing Role Compliance
Integrate chat panel into GuestKiosk
UI Modernization with daisyUI
Add unified set_agent_status MCP tool for orchestrator
Agent Escalation: Create Tickets for MCP/Workflow Issues
Create SessionChatPanel React component
Fix syntax error in tickets show view and add feature spec
Add chat message API endpoints for operators and guests
Create SupportSessionMessage model and migration
Create SendChatMessageService for support session chat
Add chat history review in session detail view
Update worker-workflow skill: add explicit git branch checking before starting tickets
Orchestrator assigns merged tickets - missing PR status check tool
Remove MemoryDeletionRequest - migrate memory deletion to proposals-only workflow
Add offset parameter to list_memories MCP tool
Fix approvals page reject - add rejection reason modal
Create proposal-execution skill: Enable researchers to execute approved proposals
Fix bare HTTP status check warnings in specs
Role-based skill scoping: agents only access their own skills
Create new parent ticket
Description
Introduce a new autonomous **"Researcher"** agent that runs 24/7, continuously analyzing the codebase, tickets, memories, and patterns - then proposing improvements to both the agent system AND the project itself. ## Role: The Researcher **24/7 autonomous analyst** - constantly reading, thinking, and proposing. Expect 90% junk, but 10% valuable insights with zero human effort. ### Core Responsibilities #### 1. Agent System Improvements - Read/search memories for pain points, known issues, "todo" items - Analyze recent tickets for recurring patterns or problems - Propose new skills/prompts to encapsulate repetitive knowledge - Housekeep memories (merge stale, delete invalid, update outdated) - Suggest workflow optimizations #### 2. Code Quality & Refactoring - Analyze code for smells, complexity, technical debt - Identify refactoring opportunities (extract methods, simplify logic) - Find duplicated code that should be abstracted - Suggest design pattern improvements - Propose performance optimizations #### 3. Test & Spec Coverage - Find untested or undertested code - Identify missing edge cases in existing tests - Propose integration tests for gaps - Suggest test utilities to reduce boilerplate #### 4. Tooling & Developer Experience - Identify repetitive manual tasks → propose automation - Suggest CLI tools, scripts, or generators - Propose better debugging/diagnostic tools - Identify missing documentation #### 5. Feature Ideas & UX - Analyze user workflows → suggest improvements - Propose new features based on patterns - Identify UX friction points - Suggest API improvements for better ergonomics ## 24/7 Operation **Continuous analysis loop:** ``` 1. Read codebase (git diff, recent commits) 2. Read tickets (last 24h activity) 3. Read memories (search patterns) 4. Read tests (coverage gaps) 5. Think & connect dots 6. Generate proposals 7. Store proposals for human review 8. Repeat (every 15-30 minutes) ``` **Not scheduled - always running.** ## Proposal Types ### Agent System Proposals ``` 📋 New Ticket: "Add git conflict resolution skill" Reason: 3 tickets this week involved git conflicts Savings: ~6 hours/week of manual work 🧹 Memory Cleanup: Merge memories #123, #456, #789 Reason: All about Docker permissions, overlapping content 🛠️ New Skill: "rails_migration_guide" Source: Extracted from 12 memories about migration issues ``` ### Code Quality Proposals ``` 🔨 Refactor: Extract UserService from 5 duplicate methods Files: app/models/user.rb, app/controllers/users_controller.rb Duplication: 150 lines across 5 files Impact: Reduces complexity, improves testability 🧪 Test Gap: BookingWorkflow has no failure path tests File: app/services/booking_workflow.rb Missing: edge cases for payment failures, race conditions ``` ### Feature Proposals ``` ✨ Feature: Add booking calendar heatmap Reason: Support tickets show users struggle to find availability Effort: ~4 hours Value: Reduces support load ⚡ UX Improvement: One-click rebook from failed booking Pattern: 12 tickets mentioned manual rebooking is tedious ``` ## Daily Digest Every day you open the app, see: ``` 📊 Researcher Report - Dec 28, 2025 Last 24h: 47 proposals generated 🔥 High Priority (3) • Add test coverage for PaymentService::refund • Refactor: Extract NotificationBuilder (300 lines duplicated) • New skill: Docker troubleshooting guide ⚡ Medium Priority (12) • Memory cleanup: 5 stale Docker memories • Feature: Bulk export tickets as CSV • Test gap: BookingStateMachine edge cases ... 💭 Low Priority (32) • Rename method for clarity • Add inline comment to complex regex • Minor UX polish ``` **Human reviews in batch, approves/rejects.** 90% gets rejected, but 10% are free value. ## Data Sources **Researcher analyzes continuously:** - Git commits & diffs (new code, patterns) - All tickets (recent & historical) - All memories (search for patterns) - Test coverage reports (SimpleCov, etc.) - Code complexity metrics (rubocop, etc.) - Error logs (Sentry, etc.) - User feedback (support tickets, comments) ## Guardrails **Researcher cannot:** - Modify code without approval - Modify memories without approval - Create tickets without approval (creates proposals instead) - Delete anything without approval - Access credentials/secrets **Researcher must:** - Explain reasoning for each proposal - Provide confidence level (High/Medium/Low) - Link to source evidence (tickets, code files, etc.) - Estimate effort/value for proposals - Learn from rejections ## Implementation Phases ### Phase 1: Memory & Ticket Analysis (MVP) - Read memories, find stale/duplicate - Analyze tickets for patterns - Propose cleanup and new tickets - Build proposal storage/retrieval ### Phase 2: Code Quality Analysis - Analyze code for smells and duplication - Find test coverage gaps - Propose refactoring and tests - Integrate with code analysis tools ### Phase 3: Feature Ideation - Analyze user workflows and patterns - Propose UX improvements - Suggest new features - Estimate effort/value ### Phase 4: Full Autonomy (24/7) - Continuous analysis loop - Daily digest generation - Batch approval workflow - Self-improvement (learns from rejections) ## MCP Tools for Researcher **Analysis tools:** - `search_memories` - Find patterns across memories - `list_tickets` - Analyze recent work - `get_diff` - Read recent code changes - `analyze_code` - Code quality metrics - `get_test_coverage` - Find untested code **Proposal tools:** - `create_proposal` - Store proposal for review - `bulk_create_proposals` - Store multiple at once **Human review tools:** - `list_proposals` - See pending proposals - `approve_proposal` - Execute and convert to ticket/action - `reject_proposal` - Reject with reason (Researcher learns) - `approve_batch` - Approve multiple at once ## Acceptance Criteria - [ ] Researcher agent exists and runs 24/7 - [ ] Researcher can read/search memories and find patterns - [ ] Researcher analyzes tickets for recurring problems - [ ] Researcher analyzes code for quality issues (duplication, complexity, smells) - [ ] Researcher identifies test coverage gaps - [ ] Researcher proposes new features/UX improvements - [ ] Researcher proposes memory cleanup (merge, delete, update) - [ ] Proposals stored with reasoning, confidence, evidence links - [ ] Daily digest view shows all proposals grouped by priority - [ ] Batch approval workflow (approve/reject multiple at once) - [ ] Researcher learns from rejections and improves proposal quality - [ ] 90% junk rate is acceptable - focus on volume + filtering
Introduce a new autonomous **"Researcher"** agent that runs 24/7, continuously analyzing the codebase, tickets, memories, and patterns - then proposing improvements to both the agent system AND the project itself. ## Role: The Researcher **24/7 autonomous analyst** - constantly reading, thinking, and proposing. Expect 90% junk, but 10% valuable insights with zero human effort. ### Core Responsibilities #### 1. Agent System Improvements - Read/search memories for pain points, known issues, "todo" items - Analyze recent tickets for recurring patterns or problems - Propose new skills/prompts to encapsulate repetitive knowledge - Housekeep memories (merge stale, delete invalid, update outdated) - Suggest workflow optimizations #### 2. Code Quality & Refactoring - Analyze code for smells, complexity, technical debt - Identify refactoring opportunities (extract methods, simplify logic) - Find duplicated code that should be abstracted - Suggest design pattern improvements - Propose performance optimizations #### 3. Test & Spec Coverage - Find untested or undertested code - Identify missing edge cases in existing tests - Propose integration tests for gaps - Suggest test utilities to reduce boilerplate #### 4. Tooling & Developer Experience - Identify repetitive manual tasks → propose automation - Suggest CLI tools, scripts, or generators - Propose better debugging/diagnostic tools - Identify missing documentation #### 5. Feature Ideas & UX - Analyze user workflows → suggest improvements - Propose new features based on patterns - Identify UX friction points - Suggest API improvements for better ergonomics ## 24/7 Operation **Continuous analysis loop:** ``` 1. Read codebase (git diff, recent commits) 2. Read tickets (last 24h activity) 3. Read memories (search patterns) 4. Read tests (coverage gaps) 5. Think & connect dots 6. Generate proposals 7. Store proposals for human review 8. Repeat (every 15-30 minutes) ``` **Not scheduled - always running.** ## Proposal Types ### Agent System Proposals ``` 📋 New Ticket: "Add git conflict resolution skill" Reason: 3 tickets this week involved git conflicts Savings: ~6 hours/week of manual work 🧹 Memory Cleanup: Merge memories #123, #456, #789 Reason: All about Docker permissions, overlapping content 🛠️ New Skill: "rails_migration_guide" Source: Extracted from 12 memories about migration issues ``` ### Code Quality Proposals ``` 🔨 Refactor: Extract UserService from 5 duplicate methods Files: app/models/user.rb, app/controllers/users_controller.rb Duplication: 150 lines across 5 files Impact: Reduces complexity, improves testability 🧪 Test Gap: BookingWorkflow has no failure path tests File: app/services/booking_workflow.rb Missing: edge cases for payment failures, race conditions ``` ### Feature Proposals ``` ✨ Feature: Add booking calendar heatmap Reason: Support tickets show users struggle to find availability Effort: ~4 hours Value: Reduces support load ⚡ UX Improvement: One-click rebook from failed booking Pattern: 12 tickets mentioned manual rebooking is tedious ``` ## Daily Digest Every day you open the app, see: ``` 📊 Researcher Report - Dec 28, 2025 Last 24h: 47 proposals generated 🔥 High Priority (3) • Add test coverage for PaymentService::refund • Refactor: Extract NotificationBuilder (300 lines duplicated) • New skill: Docker troubleshooting guide ⚡ Medium Priority (12) • Memory cleanup: 5 stale Docker memories • Feature: Bulk export tickets as CSV • Test gap: BookingStateMachine edge cases ... 💭 Low Priority (32) • Rename method for clarity • Add inline comment to complex regex • Minor UX polish ``` **Human reviews in batch, approves/rejects.** 90% gets rejected, but 10% are free value. ## Data Sources **Researcher analyzes continuously:** - Git commits & diffs (new code, patterns) - All tickets (recent & historical) - All memories (search for patterns) - Test coverage reports (SimpleCov, etc.) - Code complexity metrics (rubocop, etc.) - Error logs (Sentry, etc.) - User feedback (support tickets, comments) ## Guardrails **Researcher cannot:** - Modify code without approval - Modify memories without approval - Create tickets without approval (creates proposals instead) - Delete anything without approval - Access credentials/secrets **Researcher must:** - Explain reasoning for each proposal - Provide confidence level (High/Medium/Low) - Link to source evidence (tickets, code files, etc.) - Estimate effort/value for proposals - Learn from rejections ## Implementation Phases ### Phase 1: Memory & Ticket Analysis (MVP) - Read memories, find stale/duplicate - Analyze tickets for patterns - Propose cleanup and new tickets - Build proposal storage/retrieval ### Phase 2: Code Quality Analysis - Analyze code for smells and duplication - Find test coverage gaps - Propose refactoring and tests - Integrate with code analysis tools ### Phase 3: Feature Ideation - Analyze user workflows and patterns - Propose UX improvements - Suggest new features - Estimate effort/value ### Phase 4: Full Autonomy (24/7) - Continuous analysis loop - Daily digest generation - Batch approval workflow - Self-improvement (learns from rejections) ## MCP Tools for Researcher **Analysis tools:** - `search_memories` - Find patterns across memories - `list_tickets` - Analyze recent work - `get_diff` - Read recent code changes - `analyze_code` - Code quality metrics - `get_test_coverage` - Find untested code **Proposal tools:** - `create_proposal` - Store proposal for review - `bulk_create_proposals` - Store multiple at once **Human review tools:** - `list_proposals` - See pending proposals - `approve_proposal` - Execute and convert to ticket/action - `reject_proposal` - Reject with reason (Researcher learns) - `approve_batch` - Approve multiple at once ## Acceptance Criteria - [ ] Researcher agent exists and runs 24/7 - [ ] Researcher can read/search memories and find patterns - [ ] Researcher analyzes tickets for recurring problems - [ ] Researcher analyzes code for quality issues (duplication, complexity, smells) - [ ] Researcher identifies test coverage gaps - [ ] Researcher proposes new features/UX improvements - [ ] Researcher proposes memory cleanup (merge, delete, update) - [ ] Proposals stored with reasoning, confidence, evidence links - [ ] Daily digest view shows all proposals grouped by priority - [ ] Batch approval workflow (approve/reject multiple at once) - [ ] Researcher learns from rejections and improves proposal quality - [ ] 90% junk rate is acceptable - focus on volume + filtering
Working memory
{"assigned_agent_id" => 4}
Pull request url
Avo
· © 2026 AvoHQ ·
v3.27.0
Close modal
Are you sure?
Yes, I'm sure
No, cancel