feat(voice): implement speech-friendly response formatter by ayush31010 · Pull Request #20989 · google-gemini/gemini-cli

ayush31010 · 2026-03-03T16:57:42Z

Summary

Closes #20985
Related: #20779 (voice mode skeleton), #20456 (RFC)

Adds packages/core/src/voice/responseFormatter.ts with formatForSpeech() — a pure-TypeScript utility that converts markdown/ANSI-formatted output into speech-clean plain text for TTS playback in voice mode
Adds 33 unit tests covering every transformation and edge case
Zero new runtime dependencies

Transformations (applied in order)

Input	Speech output
`\x1b[31mError\x1b[0m`	`Error`
```json\n{...}\n```	`(JSON object with N keys)`
Multi-frame Node.js stack trace	First line + `(and N more frames)`
`bold`, `italic`, `code`	Text only, delimiters stripped
`> blockquote`, `# Heading`, `[link](url)`	Text only
`/home/user/project/src/tools/file.ts:142`	`…/src/tools/file.ts line 142`
Output > 500 chars	Truncated + `… (N chars total)`

Test plan

npx vitest run packages/core/src/voice/responseFormatter.test.ts
# ✓ 33 tests passed

Adds packages/core/src/voice/responseFormatter.ts with a formatForSpeech() function that converts markdown/ANSI-formatted tool output and LLM responses into speech-clean plain text suitable for TTS playback in voice mode. Transformations applied (in order): 1. Strip ANSI escape codes (color, bold, cursor movement) 2. Unwrap fenced code blocks; summarise large JSON content as "(JSON object with N keys)" / "(JSON array with N items)" 3. Collapse Node.js stack traces to first frame + "(and N more frames)" 4. Strip markdown syntax: bold, italic, inline code, blockquotes, headings, links, unordered/ordered list markers 5. Abbreviate deep absolute Unix and Windows paths to last pathDepth segments prefixed with "…"; convert ":142" suffixes to "line 142" 6. Normalise whitespace (collapse excess blank lines, trim) 7. Truncate to maxLength with "… (N chars total)" suffix Public API: formatForSpeech(text, options?) → string options: { maxLength?, pathDepth?, jsonThreshold? } All defaults chosen to produce natural-sounding output from typical tool results without requiring any new runtime dependencies. Closes google-gemini#20985 Related: google-gemini#20779 (voice mode skeleton), google-gemini#20456 (RFC)

gemini-code-assist · 2026-03-03T16:58:03Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new utility function, formatForSpeech, designed to prepare complex, formatted text for clear and concise text-to-speech (TTS) playback within the voice mode. It systematically cleans up input by removing visual formatting like ANSI escape codes and markdown, summarizing verbose content such as large JSON objects and stack traces, and abbreviating file paths, ensuring that spoken output is streamlined and easily understandable for an optimal voice experience.

Highlights

New Speech-Friendly Response Formatter: Implemented the formatForSpeech() utility in packages/core/src/voice/responseFormatter.ts to convert markdown and ANSI-formatted output into plain text suitable for text-to-speech (TTS) playback.
Comprehensive Unit Testing: Added 33 unit tests covering every transformation and edge case for the formatForSpeech utility, ensuring robustness and correctness.
Zero New Dependencies: The new utility introduces no additional runtime dependencies, maintaining a lightweight footprint.
Key Transformations: The formatter applies several transformations including stripping ANSI escape codes, summarizing large JSON objects, collapsing multi-frame stack traces, stripping various markdown elements (bold, italic, code, blockquotes, headings, links, lists), abbreviating deep file paths, normalizing whitespace, and truncating long outputs.

Changelog

packages/core/src/voice/responseFormatter.test.ts
- Added a new test file to validate the formatForSpeech utility.
- Included tests for edge cases like empty input and plain text.
- Verified correct stripping of ANSI color, bold, and cursor codes.
- Tested markdown stripping for bold, italic, inline code, blockquotes, headings, links, and list markers.
- Ensured proper unwrapping and summarization of fenced code blocks, including JSON objects and arrays.
- Validated path abbreviation for Unix and Windows paths, including line number conversion.
- Confirmed stack trace collapsing functionality.
- Tested truncation behavior for long outputs.
- Included tests for whitespace normalization.
- Provided real-world examples to demonstrate end-to-end cleaning.
packages/core/src/voice/responseFormatter.ts
- Added a new utility file containing the formatForSpeech function.
- Defined FormatForSpeechOptions interface for configurable maxLength, pathDepth, and jsonThreshold.
- Implemented regular expressions for identifying and processing ANSI codes, markdown elements, code blocks, stack traces, and file paths.
- Developed abbreviatePath helper to shorten file paths and format line numbers.
- Created summariseJson helper to provide concise descriptions for large JSON structures.
- Implemented the main formatForSpeech logic, applying transformations in a specific order: ANSI stripping, code block processing, stack trace collapsing, markdown stripping, path abbreviation, whitespace normalization, and truncation.

Activity

This pull request implements a core feature for the voice mode, building upon related work (feat(cli): introduce experimental voice mode architecture skeleton #20779 for voice mode skeleton, [RFC] Architecture Proposal: Hands-Free Multimodal Voice Mode (GSoC 2026) #20456 for RFC).
It addresses and closes issue feat(voice): implement speech-friendly response formatter for voice mode output #20985.
The author has provided a detailed test plan and confirmed 33 passing unit tests, indicating thorough validation of the new functionality.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a valuable utility for converting formatted text into a speech-friendly format. While no security vulnerabilities were found, critical bugs were identified in the implementation related to Windows path handling and stack trace collapsing, which can lead to incorrect or malformed output. A less severe issue with markdown parsing for bold/italic text was also noted. Detailed suggestions and code snippets are provided to address these, along with recommendations for additional test cases to prevent future regressions.

packages/core/src/voice/responseFormatter.ts

packages/core/src/voice/responseFormatter.test.ts

The WIN_PATH_RE replacement callback previously hardcoded 'C:\' as the path prefix, so paths on any other drive (D:\, E:\, etc.) were silently reconstructed as C: paths and abbreviated incorrectly. Fix: capture the full path (drive letter included) as a single regex group, matching the pattern already used by UNIX_PATH_RE, and pass it directly to abbreviatePath() without any manual prefix concatenation. Also adds two missing test cases identified in review: - Windows path abbreviation on a non-C drive (D:\...) - Stack trace collapsing preserves surrounding text before and after the trace frames Addresses review feedback on google-gemini#20989

packages/core/src/voice/responseFormatter.ts

Scoped package paths like @google/gemini-cli-core were only matched up to the @ character, producing broken TTS output. Adding @ to the character class fixes the match and a test case is added to cover it.

mrpmohiburrahman

LGTM

spencer426

Automated Code Review

packages/core/src/voice/responseFormatter.ts

spencer426

Here are a few specific issues with the implementation that need to be addressed.

packages/core/src/voice/responseFormatter.test.ts

packages/core/src/voice/responseFormatter.ts

spencer426

Missing Export Rule

packages/core/src/voice/responseFormatter.ts

- Update copyright year to 2026 in responseFormatter.ts and test file - Fix BOLD_ITALIC_RE to exclude newlines, preventing cross-line matches that could consume list markers before they are stripped - Fix stack trace collapsing to replace frames in-place (STACK_BLOCK_RE) instead of stripping all frames and appending summary at end, which was mangling text that followed the trace - Export formatForSpeech and FormatForSpeechOptions from packages/core index

spencer426

LGTM

packages/core/src/voice/responseFormatter.ts

spencer426

Please address the Polynomial regular expression used on uncontrolled data issue

spencer426

LGTM

…ini#20989) Co-authored-by: Spencer <spencertang@google.com>

gemini-cli bot added the area/core Issues related to User Interface, OS Support, Core Functionality label Mar 3, 2026

gemini-code-assist bot reviewed Mar 3, 2026

View reviewed changes

Ayush Debnath added 2 commits March 3, 2026 22:48

Merge branch 'main' into fix/voice-response-formatter

273afbe

gemini-cli bot added priority/p3 Backlog - a good idea but not currently a priority. help wanted We will accept PRs from all issues marked as "help wanted". Thanks for your support! labels Mar 4, 2026

Merge branch 'main' into fix/voice-response-formatter

c8168e1

spencer426 self-requested a review March 7, 2026 04:57

Merge branch 'main' into fix/voice-response-formatter

68a0be5

This comment was marked as spam.

Sign in to view

This was referenced Mar 9, 2026

📊 AI CLI 工具社区动态日报 2026-03-09 duanyytop/agents-radar#105

Open

📊 AI CLI Tools Digest 2026-03-09 duanyytop/agents-radar#106

Open

mrpmohiburrahman reviewed Mar 9, 2026

View reviewed changes

packages/core/src/voice/responseFormatter.ts Outdated Show resolved Hide resolved

This was referenced Mar 9, 2026

📊 Bản tin hàng ngày công cụ AI CLI 2026-03-09 compasify/agents-radar#14

Open

📊 AI CLI 工具社区动态日报 2026-03-09 rollysys/agents-radar#57

Open

Ayush Debnath added 2 commits March 9, 2026 12:43

Merge branch 'main' into fix/voice-response-formatter

d00324f

fix(voice): add @ to UNIX_PATH_RE to support scoped npm package paths

322d3b2

Scoped package paths like @google/gemini-cli-core were only matched up to the @ character, producing broken TTS output. Adding @ to the character class fixes the match and a test case is added to cover it.

This comment was marked as spam.

Sign in to view

mrpmohiburrahman reviewed Mar 9, 2026

View reviewed changes

This comment was marked as spam.

Sign in to view

Ayush Debnath added 2 commits March 9, 2026 20:56

Merge branch 'main' into fix/voice-response-formatter

0d4bcf9

Merge branch 'main' into fix/voice-response-formatter

bf25f57

spencer426 reviewed Mar 9, 2026

View reviewed changes

packages/core/src/voice/responseFormatter.ts Outdated Show resolved Hide resolved

spencer426 reviewed Mar 9, 2026

View reviewed changes

packages/core/src/voice/responseFormatter.test.ts Outdated Show resolved Hide resolved

packages/core/src/voice/responseFormatter.ts Outdated Show resolved Hide resolved

packages/core/src/voice/responseFormatter.ts Outdated Show resolved Hide resolved

spencer426 reviewed Mar 9, 2026

View reviewed changes

packages/core/src/voice/responseFormatter.ts Show resolved Hide resolved

This comment was marked as spam.

Sign in to view

Merge branch 'main' into fix/voice-response-formatter

225cc1f

spencer426 approved these changes Mar 10, 2026

View reviewed changes

Ayush Debnath added 2 commits March 10, 2026 20:31

Merge branch 'main' into fix/voice-response-formatter

ecd623d

Merge branch 'main' into fix/voice-response-formatter

1933bad

This comment was marked as duplicate.

Sign in to view

This comment was marked as spam.

Sign in to view

spencer426 enabled auto-merge March 10, 2026 15:15

github-advanced-security bot found potential problems Mar 10, 2026

View reviewed changes

packages/core/src/voice/responseFormatter.ts Fixed Show fixed Hide fixed

spencer426 added this pull request to the merge queue Mar 10, 2026

spencer426 removed this pull request from the merge queue due to a manual request Mar 10, 2026

spencer426 self-requested a review March 10, 2026 15:32

spencer426 requested changes Mar 10, 2026

View reviewed changes

Ayush Debnath added 2 commits March 10, 2026 22:02

fix(voice): prevent polynomial regex ReDoS on ANSI and path matching

2d8c2f5

Merge branch 'main' into fix/voice-response-formatter

81c9457

This comment was marked as spam.

Sign in to view

Ayush Debnath added 2 commits March 11, 2026 00:22

Merge branch 'main' into fix/voice-response-formatter

a75fa35

Merge branch 'main' into fix/voice-response-formatter

58cafac

This comment was marked as spam.

Sign in to view

Merge branch 'main' into fix/voice-response-formatter

b78f5c1

spencer426 enabled auto-merge March 10, 2026 19:42

spencer426 approved these changes Mar 10, 2026

View reviewed changes

spencer426 added this pull request to the merge queue Mar 10, 2026

Merged via the queue into google-gemini:main with commit 9eae91a Mar 10, 2026
27 checks passed

gemini-code-assist bot mentioned this pull request Mar 11, 2026

Changelog for v0.34.0-preview.0 #21965

Merged

JaisalJain pushed a commit to JaisalJain/gemini-cli that referenced this pull request Mar 11, 2026

feat(voice): implement speech-friendly response formatter (google-gem…

7a37d70

…ini#20989) Co-authored-by: Spencer <spencertang@google.com>

liamhelmer pushed a commit to badal-io/gemini-cli that referenced this pull request Mar 12, 2026

feat(voice): implement speech-friendly response formatter (google-gem…

5a04d37

…ini#20989) Co-authored-by: Spencer <spencertang@google.com>

This comment was marked as spam.

Sign in to view

yashodipmore pushed a commit to yashodipmore/geemi-cli that referenced this pull request Mar 21, 2026

feat(voice): implement speech-friendly response formatter (google-gem…

1a50c05

…ini#20989) Co-authored-by: Spencer <spencertang@google.com>

Conversation

ayush31010 commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Transformations (applied in order)

Test plan

Uh oh!

gemini-code-assist bot commented Mar 3, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment was marked as spam.

Uh oh!

This comment was marked as spam.

mrpmohiburrahman left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as spam.

This comment was marked as spam.

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

This comment was marked as spam.

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as duplicate.

Uh oh!

This comment was marked as spam.

Uh oh!

Uh oh!

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as spam.

This comment was marked as spam.

spencer426 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

This comment was marked as spam.

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ayush31010 commented Mar 3, 2026 •

edited

Loading