(feat): Add a ToDo tool to track ongoing task lists by anj-s · Pull Request #8761 · google-gemini/gemini-cli

anj-s · 2025-09-18T22:05:25Z

TLDR

This PR introduces a new write_todos_list tool that allows the agent to create and manage a checklist of tasks for complex user requests. This helps the agent track its progress, organize its work, and provides the user with visibility into the agent's plan.

Dive Deeper

The write_todos_list tool is a declarative tool that enables the agent to manage a list of tasks with the following statuses: pending, in_progress, completed, and cancelled. The agent is guided by an updated system prompt on when and how to use this tool, with a focus on using it for complex, multi-step tasks and avoiding it for simple requests.

The tool is enabled by a useWriteTodos flag in the configuration. The implementation includes the tool itself, along with comprehensive unit tests to ensure its functionality and validation logic are working correctly.

Reviewer Test Plan

To test this feature, you can enable the useWriteTodos flag in your settings and give the agent a complex task. Here are a few examples:

Create a new feature:
- Prompt: add a new feature to the CLI that allows users to configure the output format of the response.
- Expected behavior: The agent should create a todo list with steps like add a new configuration option, implement the logic to format the output, add tests for the new feature, etc.
Build a simple application:
- Prompt: create a simple web app that uses the Gemini API to answer questions.
- Expected behavior: The agent should break down the task into smaller sub-tasks and create a todo list to track its progress.
Debug an issue:
- Prompt: The application is crashing when I try to upload a file. Can you help me debug and fix the issue?
- Expected behavior: The agent should create a todo list to investigate the issue, such as reproduce the crash, examine the logs, identify the root cause, implement a fix, and verify the fix.

Fixes #4580

Testing Matrix

	🍏	🪟	🐧
npm run	❓	❓	❓
npx	❓	❓	❓
Docker	❓	❓	❓
Podman	❓	-	-
Seatbelt	❓	-	-

Linked issues / bugs

anj-s · 2025-09-19T14:17:10Z

I'm confused. All the tool does is respond back with the todos it was given. It doesn't persist it anywhere and doesn't allow the model to view the list. Is this going to be part of a suite of tools?

This is a tool for the model to track what it needs to do. The todos are provided by the model and this tool is for it to communicate the updated status of that list. It doesn't need to persist it anywhere since it is in the history. Yes, this will be part of the core tools.

scidomino

It's weird that a noop tool would improve performance but I assume you have run evals and shown that this improves things.

packages/core/src/tools/write-todos.ts

owenofbrien · 2025-09-19T19:12:22Z

It's weird that a noop tool would improve performance but I assume you have run evals and shown that this improves things.

I think it's primarily a way to:
a) encourage the model to actually make a plan for complex tasks, and
b) encourage the model to explicitly update the plan as tasks are completed or become obsolete

I wonder if just adding instructions for a) and b) to the system prompt could yield a similar performance impact. @anj-s wdyt?

anj-s · 2025-09-19T20:16:49Z

It's weird that a noop tool would improve performance but I assume you have run evals and shown that this improves things.

Its not a noop tool as explained above. This helps the model create a list of items and track it. yes, this improves evals and is a known method for doing so.

anj-s · 2025-09-19T20:18:04Z

It's weird that a noop tool would improve performance but I assume you have run evals and shown that this improves things.

I think it's primarily a way to: a) encourage the model to actually make a plan for complex tasks, and b) encourage the model to explicitly update the plan as tasks are completed or become obsolete

I wonder if just adding instructions for a) and b) to the system prompt could yield a similar performance impact. @anj-s wdyt?

We have this in the system prompt but its not something the model does consistently and does not involve the model updating the plan list at every turn. We ideally want the todo list to be the only plan list that the model is tracking

Co-authored-by: gemini-cli-robot <gemini-cli-robot@google.com>

packages/cli/src/config/config.ts

Co-authored-by: joshualitt <joshualitt@google.com> Co-authored-by: Tommaso Sciortino <sciortino@gmail.com> Co-authored-by: matt korwel <matt.korwel@gmail.com> Co-authored-by: gemini-cli-robot <gemini-cli-robot@google.com> Co-authored-by: Jacob MacDonald <jakemac@google.com> Co-authored-by: Shreya Keshive <skeshive@gmail.com>

anj-s added 30 commits July 29, 2025 01:31

wip

7671b34

Merge branch 'main' into u/anj/fix-divergence

c13df02

wip

d26577a

wip

17bb9f1

wip

3e89e1b

wip, tool v1

d2271a0

wip

8cd0bee

wip

a6afa06

Merge branch 'main' into u/anj/write-todos

377487e

wip

825624c

wip

fb5aa8d

Merge branch 'main' into u/anj/yolo-add-logging

81e3c3e

wip

b8d227e

wip

2af0220

wip

241c866

wip

7b46943

wip

64868ac

wip

e54f04e

wip

7f6337d

wip

90e5a76

wip

8ae51c6

wip

4941b10

wip

8fcf52c

wip

9073656

wip

75fec27

wip

e170f5a

wip

48c64a3

wip

94dcecf

wip

9977987

wip

86f35a4

scidomino approved these changes Sep 19, 2025

View reviewed changes

Merge branch 'main' into u/anj/write-todos

6531dd2

owenofbrien reviewed Sep 19, 2025

View reviewed changes

anj-s and others added 7 commits September 19, 2025 13:21

wip

46bda4c

Revert "feat(third_party) Port get-ripgrep." (#8923)

a66eb62

Rollback shrinkwrap (#8926)

4de9aca

Release: Ensure Tag Modification works (#8931)

502da6f

Co-authored-by: gemini-cli-robot <gemini-cli-robot@google.com>

Update extension-releasing.md to have more info (#8927)

5238f63

Add skip_github_release option to Manual Release. (#8932)

3363439

Add few more license file names to generate-notices script (#8939)

35e2f88

anj-s requested a review from a team as a code owner September 19, 2025 20:21

scidomino approved these changes Sep 19, 2025

View reviewed changes

agmsb approved these changes Sep 19, 2025

View reviewed changes

silviojr approved these changes Sep 19, 2025

View reviewed changes

NTaylorMullen reviewed Sep 20, 2025

View reviewed changes

packages/cli/src/config/config.ts Outdated Show resolved Hide resolved

anj-s added 2 commits September 20, 2025 05:45

wip

a2e8e3f

Merge branch 'main' into u/anj/write-todos

b472252

anj-s added this pull request to the merge queue Sep 20, 2025

Merged via the queue into main with commit 44691a4 Sep 20, 2025
19 checks passed

anj-s deleted the u/anj/write-todos branch September 20, 2025 13:05

heltonduarte mentioned this pull request Oct 1, 2025

Investigate if we could use the ToDo tool from CLI instead of our todo and draft files gemini-cli-extensions/security#48

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(feat): Add a ToDo tool to track ongoing task lists#8761

(feat): Add a ToDo tool to track ongoing task lists#8761
anj-s merged 47 commits intomainfrom
u/anj/write-todos

anj-s commented Sep 18, 2025 •

edited

Loading

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

scidomino left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

owenofbrien commented Sep 19, 2025

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Conversation

anj-s commented Sep 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TLDR

Dive Deeper

Reviewer Test Plan

Testing Matrix

Linked issues / bugs

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

scidomino left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

owenofbrien commented Sep 19, 2025

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

anj-s commented Sep 19, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

anj-s commented Sep 18, 2025 •

edited

Loading