Graham Neubig
A software development agent w/ web browsing
Two-Tool Planning System:
1. Task Tracker Tool - Structured task management
{
"command": "plan",
"task_list": [
{"id": "1", "title": "Analyze requirements", "status": "done"},
{"id": "2", "title": "Implement feature", "status": "in_progress"},
{"id": "3", "title": "Write tests", "status": "todo"}
]
}
2. Think Tool - Complex reasoning and brainstorming
{
"thought": "I need to consider three approaches: direct implementation,
refactoring existing code, or creating a new module. The refactoring
approach seems most maintainable..."
}
e.g. from Tavily MCP extract function
# Language Technologies Institute
## School of Computer Science - Carnegie Mellon University
### Mission Statement
**Empowering human communication through trustworthy language technologies**
The Language Technologies Institute at Carnegie Mellon educates the leaders of tomorrow and performs groundbreaking research in the areas of:
### Research Areas
- **Natural Language Processing**
- **Computational Linguistics**
- **Information Extraction**
e.g. BrowserGym axtree
RootWebArea 'Language Technologies Institute - Language Technologies Institute - School of Computer Science - Carnegie Mellon University', focused
[40] banner '', center="(632,127)", visible
[43] link 'Carnegie Mellon University', center="(177,21)", clickable, visible
[44] button 'toggle menu', center="(1221,18)", clickable, visible, pressed='false'
StaticText '—'
StaticText '—'
StaticText '—'
[48] Section '', center="(1047,19)", visible
[52] LabelText '', center="(1047,19)", clickable, visible
[54] textbox 'Search', center="(1047,19)", clickable, visible
[55] button 'Search', center="(1183,19)", clickable, visible
[62] link 'Language Technologies Institute', center="(369,107)", clickable, visible
[63] heading 'School of Computer Science', center="(430,162)", visible
[64] link 'School of Computer Science', center="(174,162)", clickable, visible
[65] link 'LTI Logo', center="(1049,126)", clickable, visible
[66] image 'LTI Logo', center="(1161,130)", visible
e.g. VisualWebArena
OpenHands approach:
Why use multiple agents?
Why not use multiple agents?