feat: Redpanda Connect connector docs automation with multi-release attribution #183
JakeSCahill merged 7 commits into main from
Conversation
- Fix semver validation for version extraction (`--connect-version`, `--from-version` flags, and filename parsing)
- Add null-safety to the multi-version PR summary formatter to prevent crashes on malformed data
- Remove duplicate cleanup logic that contradicted `versionsToKeep`
- Add 14 tests for the `generateMultiVersionPRSummary()` function
- Export `generateMultiVersionPRSummary` for testing

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Implements sequential processing of intermediate releases to accurately attribute new connectors to their actual release version instead of lumping all changes into the latest version.

Key improvements:
- Process each intermediate release between the last documented and latest version
- Match the cloud binary version to each release date for accurate platform attribution
- Fix CGO-only component false positives via augmentation stripping
- Add master diff aggregation across multiple releases
- New CLI flags: `--skip-intermediate`, `--from-version`

Bug fixes:
- Strip augmentation fields (`cloudSupported`, `requiresCgo`, `cloudOnly`) before version comparisons
- Prevent cloud-only/CGO components from appearing as "new" in the wrong versions
- Fix `buildConfigYaml` to only add the `label` field for inputs/outputs/processors

New files:
- `tools/redpanda-connect/github-release-utils.js` - GitHub release discovery and cloud version matching
- `tools/redpanda-connect/multi-version-summary.js` - Master diff aggregation
- `tools/redpanda-connect/AUTOMATION.md` - Comprehensive automation documentation
- `CLAUDE.md` - AI-optimized repository overview
- Tests for new functionality

Accuracy improved from ~70% to ~95% for connector attribution.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The multi-version PR summary now includes comprehensive information for technical writers:
- Release notes links for each version
- New connector descriptions (2-sentence summaries)
- New fields table with component, field, and description columns
- Removed connectors and fields with version attribution
- Deprecated fields with migration guidance
- Changed defaults table showing old → new values
- Prioritized action items (cloud connectors first)
- Platform grouping (cloud vs self-hosted sections)

This makes the PR summary actionable for writers without needing to dig through JSON files for details.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
✅ Deploy Preview for docs-extensions-and-macros ready!
Important: Review skipped. Auto incremental reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the ⚙️ Run configuration.
📝 Walkthrough

This PR introduces multi-version connector documentation automation for Redpanda Connect. It adds a GitHub release discovery utility to find intermediate releases between versions, implements multi-release diff aggregation, extends PR summary formatting to handle multiple releases with per-release breakdowns and aggregated statistics, and updates the connector docs handler to orchestrate intermediate release processing. Supporting changes include new CLI options for version control (

Sequence Diagram

```mermaid
sequenceDiagram
    participant CLI as CLI Handler
    participant Discovery as Release Discovery
    participant DataLoader as Data Loader
    participant Analyzer as Binary Analyzer
    participant DiffGen as Diff Generator
    participant Aggregator as Master Diff Aggregator
    participant Formatter as PR Summary Formatter
    CLI->>Discovery: discoverIntermediateReleases(from, to)
    Discovery->>Discovery: Fetch releases from GitHub
    Discovery-->>CLI: [Release v1, v2, v3...]
    loop For each consecutive version pair
        CLI->>DataLoader: loadConnectorDataForVersion(version)
        DataLoader-->>CLI: Connector data (stripped)
        CLI->>Analyzer: analyzeAllBinaries(newVersion)
        Analyzer-->>CLI: Binary analysis (Cloud/OSS/CGO)
        CLI->>DiffGen: Generate & write diff JSON
        DiffGen-->>CLI: connect-diff-<from>_to_<to>.json
    end
    CLI->>Aggregator: createMasterDiff(intermediateResults, finalDiff)
    Aggregator->>Aggregator: Read & parse all diff JSON files
    Aggregator->>Aggregator: Aggregate counts across releases
    Aggregator-->>CLI: masterDiff (aggregated metadata)
    CLI->>Formatter: generateMultiVersionPRSummary(masterDiff)
    Formatter->>Formatter: Format per-release breakdown
    Formatter->>Formatter: Compute aggregated totals
    Formatter-->>CLI: PR summary with multi-version output
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs
Suggested reviewers
🚥 Pre-merge checks: ✅ Passed checks (3 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches: 🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.
Actionable comments posted: 9
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
tools/redpanda-connect/pr-summary-formatter.js (1)
Lines 397-408: ⚠️ Potential issue | 🟠 Major

Guard `binaryAnalysis.comparison` everywhere and count `cloudOnly` connectors as cloud-supported.

`tools/redpanda-connect/connector-binary-analyzer.js` lines 431-525 can return `{ comparison: null }` when cloud analysis is skipped, so these dereferences can throw and abort summary generation. The quick summary and writer-action blocks also only look at `inCloud`/`notInCloud`, which drops `cloudOnly` connectors from the cloud-doc count/checklist.

💡 Suggested pattern

```diff
+  const comparison = binaryAnalysis?.comparison;
+  const inCloud = comparison?.inCloud ?? [];
+  const cloudOnly = comparison?.cloudOnly ?? [];
+  const notInCloud = comparison?.notInCloud ?? [];
+
   if (stats.newComponents > 0) {
     lines.push(`- **${stats.newComponents}** new connector${stats.newComponents !== 1 ? 's' : ''}`);
-    if (binaryAnalysis) {
+    if (comparison) {
       const newConnectorKeys = diffData.details.newComponents.map(c => `${c.type}:${c.name}`);
       const cloudSupported = newConnectorKeys.filter(key => {
-        const inCloud = binaryAnalysis.comparison.inCloud.some(c => `${c.type}:${c.name}` === key);
-        return inCloud;
+        return inCloud.some(c => `${c.type}:${c.name}` === key) ||
+               cloudOnly.some(c => `${c.type}:${c.name}` === key);
       }).length;
```

Also applies to: 551-559, 648-669, 739-743, 951-955
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tools/redpanda-connect/pr-summary-formatter.js` around lines 397 - 408, Guard all uses of binaryAnalysis.comparison before dereferencing it and include cloudOnly connectors when counting cloud-supported connectors: check binaryAnalysis && binaryAnalysis.comparison before accessing comparison.inCloud or comparison.notInCloud, and when computing newConnectorKeys' cloud support use comparison.inCloud.concat(comparison.cloudOnly) (or check both arrays) to count cloudSupported; ensure needsCloudDocs is treated as a numeric count (not a boolean) and then use if (needsCloudDocs > 0) to push the summary line. Apply the same guards and cloudOnly-inclusive counting to the other summary/writer-action blocks that reference comparison.inCloud/notInCloud.
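As an illustration of the guard described above, here is a minimal runnable sketch. The helper name and the sample connector entries are hypothetical, not taken from the actual module:

```javascript
// Hypothetical helper mirroring the suggested guard: optional chaining plus
// array defaults turn a skipped cloud analysis ({ comparison: null }) into
// empty lists instead of a TypeError, and cloudOnly entries count as
// cloud-supported alongside inCloud.
function countCloudSupported(newConnectorKeys, binaryAnalysis) {
  const comparison = binaryAnalysis?.comparison;
  const inCloud = comparison?.inCloud ?? [];
  const cloudOnly = comparison?.cloudOnly ?? [];
  const matches = key =>
    inCloud.some(c => `${c.type}:${c.name}` === key) ||
    cloudOnly.some(c => `${c.type}:${c.name}` === key);
  return newConnectorKeys.filter(matches).length;
}
```

With `{ comparison: null }` this returns 0 instead of throwing, and a `cloudOnly` match is now counted toward the cloud-doc total.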
🧹 Nitpick comments (3)
__tests__/tools/buildConfigYaml.test.js (1)
Lines 149-224: Solid coverage for complex field rendering. Tests appropriately verify:
- Nested object fields render with children
- Array-of-objects render as empty arrays (not expanded)
- Simple arrays render correctly
Consider adding an edge case test for an empty `children` array to ensure no rendering issues occur.

🧪 Optional: Add edge case test for empty children

```diff
+  it('should handle empty children array', () => {
+    const result = buildConfigYaml('inputs', 'kafka', [], false);
+    expect(result).toContain('inputs:');
+    expect(result).toContain('  label: ""');
+    expect(result).toContain('  kafka:');
+    // Should only have header lines, no field lines
+    const lines = result.split('\n');
+    expect(lines.length).toBe(3);
+  });
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@__tests__/tools/buildConfigYaml.test.js` around lines 149 - 224, Add a unit test for the empty-children edge case by calling buildConfigYaml with a field whose children is an empty array (e.g., { name: 'foo', type: 'object', kind: 'map', children: [] } and also test kind: 'array' if desired) and assert it does not render any child keys (expect(result).not.toContain('someChild:')) and that the parent renders appropriately (e.g., contains 'foo:' for map and 'foo: []' for array-of-objects). Place the test inside the existing "complex field types" describe block next to the other cases and reference buildConfigYaml to locate the behavior to validate.

tools/redpanda-connect/AUTOMATION.md (1)
Line 41: Add languages to the unlabeled fenced blocks. These fences trigger markdownlint MD040. Tag the ASCII diagram/tree blocks as `text` to keep the new doc lint-clean.

Also applies to: 85, 92, 628
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tools/redpanda-connect/AUTOMATION.md` at line 41, Update each unlabeled fenced code block that contains ASCII diagrams or tree-style ASCII art (the blocks that currently start with just ``` and contain ASCII diagrams/tree) by adding the language tag text after the opening fence (i.e., change ``` to ```text) so markdownlint MD040 is satisfied; search for the plain ``` fences surrounding ASCII diagram content (the diagram/tree blocks referenced in the review) and update them to use ```text consistently.

__tests__/tools/pr-summary-formatter.test.js (1)
Lines 507-519: Scope the cloud-indicator assertion to the connector entry.

`expect(summary).not.toContain('☁️')` is broader than the behavior this test cares about, so an unrelated cloud legend/header will break it even when `test_connector` is still unmarked. Assert on the `test_connector` line instead.

♻️ Suggested assertion

```diff
   // Should not crash and should not show cloud indicator
   expect(summary).toContain('`test_connector`');
-  expect(summary).not.toContain('☁️');
+  const connectorLine = summary.match(/`test_connector`[^\n]*/);
+  expect(connectorLine).toBeTruthy();
+  expect(connectorLine[0]).not.toContain('☁️');
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@__tests__/tools/pr-summary-formatter.test.js` around lines 507 - 519, The test currently checks globally that the summary does not contain the cloud emoji which is too broad; update the assertion to verify the connector-specific line for `test_connector` produced by generateMultiVersionPRSummary(masterDiff) does not include '☁️' instead of using expect(summary).not.toContain('☁️'); locate the test using createMasterDiff and createRelease and change the negative cloud assertion to target the specific connector entry (e.g., find the line that contains '`test_connector`' in the summary and assert that line does not include the cloud indicator).
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@CLAUDE.md`:
- Around line 161-171: The example for the `fetch` command is out of sync with
the implementation in bin/doc-tools.js: the CLI exposes --owner, --repo,
--remote-path, --save-dir, and --filename (not --path, --tag, or --output) and
it does not perform version-specific fetches; either update the CLAUDE.md
example to use the actual flags (--owner, --repo, --remote-path, --save-dir,
--filename) and remove the claim about version-specific downloads, or implement
the missing flags/behavior in the fetch command in bin/doc-tools.js (add support
for --path/--tag/--output and versioned fetch logic) so the docs match the CLI.
- Around line 139-142: The example command is missing the required -s/--surface
flag enforced by the CLI (see bin/doc-tools.js where -s, --surface is
mandatory); update the example to include the surface flag and a concrete value
(e.g., npx doc-tools generate bundle-openapi --surface openapi --tag v25.3.1) so
the command runs successfully.
In `@tools/redpanda-connect/github-release-utils.js`:
- Around line 144-154: Ensure fromVersion and toVersion are validated as strings
before using startsWith: check typeof fromVersion === 'string' and typeof
toVersion === 'string' and if either is not a string throw the existing Invalid
starting/ending version Error for fromVersion/toVersion; only after those guards
compute normalizedFrom and normalizedTo (using startsWith('v') ? slice(1) :
value) and then run semver.valid on the normalized values. Reference variables:
fromVersion, toVersion, normalizedFrom, normalizedTo, and semver.valid.
In `@tools/redpanda-connect/pr-summary-formatter.js`:
- Around line 264-289: The current logic only emits otherConnectors when there
are no cloudConnectors or selfHostedConnectors, dropping unknown-platform
connectors; update the block that handles otherConnectors (the otherConnectors
variable and its if check) to always run when otherConnectors.length > 0 (remove
the && cloudConnectors.length === 0 && selfHostedConnectors.length === 0
condition) and add an appropriate header (e.g., lines.push('_Other
connectors:_')) before iterating so those connectors are included in the
checklist regardless of cloud/self-hosted presence.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js`:
- Around line 869-879: The catch currently logs errors and records failed pairs
in intermediateProcessingResults but the later logic still updates the Antora
`latest-connect-version` and exits 0; change the flow so any recorded failure
prevents advancing the latest version: set a failure flag (e.g.,
hadProcessingError) or rely on inspecting intermediateProcessingResults entries
(check for any item with success: false) after processing all pairs, and if any
failures exist, skip the Antora/latest-version update and exit with a non-zero
code (or throw) so CI/automation will not treat the run as fully successful;
update the code paths that write/update `latest-connect-version` to first
confirm all intermediateProcessingResults are success === true (or that
hadProcessingError is false) before proceeding.
- Around line 943-945: The oldIndex is only stripped in the Antora fallback
branch, causing shape mismatches when oldIndex is loaded via the --old-data path
or from existingDataFiles; update every code path that assigns oldIndex
(including the --old-data/oldPath load and the existingDataFiles branch) to wrap
the parsed object with stripAugmentationFields(...) (i.e., replace direct
assignments like oldIndex = JSON.parse(fs.readFileSync(oldPath,'utf8')) or
assignments from existingDataFiles with oldIndex =
stripAugmentationFields(JSON.parse(...)) ) so that generateConnectorDiffJson()
receives a consistently stripped oldIndex.
- Around line 950-958: The code currently swaps the snapshot files (creating
._connect-${newVersion}-augmented.json.tmp and renaming the original) before
calling analyzeAllBinaries(), but if analyzeAllBinaries() throws the original
connect-${newVersion}.json isn't restored; wrap the analyzeAllBinaries() call
and the rename/copy operations in a try/finally (or add a finally block) so that
regardless of errors you rename/move the temp augmented file back to its
expected filename and remove the .tmp, restoring the original snapshot;
specifically update the logic around where cleanOssDataPath, newIndex, and
analyzeAllBinaries() are used to perform the restore in finally to guarantee
cleanup and restore of connect-${newVersion}.json.
- Around line 495-507: stripAugmentationFields currently only filters out
cloudOnly connectors but misses those marked requiresCgo, so update the filter
inside stripAugmentationFields to also exclude connectors with requiresCgo
unless they have OSS config (config or fields). Concretely, change the predicate
in the cleanData[type] = cleanData[type].filter(...) for connectors to return
true only when the connector is not cloudOnly and not requiresCgo, or when it
has c.config or c.fields (e.g., return (!(c.cloudOnly || c.requiresCgo) ||
c.config || c.fields)); this ensures cgo-only injected connectors are removed
from the cleaned data.
- Around line 603-607: The current filename regex
/^connect-\d+\.\d+\.\d+\.json$/ must be relaxed to allow prerelease segments and
you must replace lexicographic .sort() calls on version strings with
semver-aware sorting; update the regex to capture prerelease (for example
/^connect-(\d+\.\d+\.\d+(?:-[0-9A-Za-z-.]+)?)\.json$/) and extract the captured
version, validate with semver.valid, then use semver.rsort(candidates) or
semver.sort(candidates) and pick the first element to get the highest semantic
version (adjust places that currently call candidates.sort() and pick last to
instead call semver.rsort/semver.sort and pick index 0); apply this change
wherever candidates are filtered and sorted (the blocks referencing the regex
and candidates.sort()).
---
Outside diff comments:
In `@tools/redpanda-connect/pr-summary-formatter.js`:
- Around line 397-408: Guard all uses of binaryAnalysis.comparison before
dereferencing it and include cloudOnly connectors when counting cloud-supported
connectors: check binaryAnalysis && binaryAnalysis.comparison before accessing
comparison.inCloud or comparison.notInCloud, and when computing
newConnectorKeys' cloud support use
comparison.inCloud.concat(comparison.cloudOnly) (or check both arrays) to count
cloudSupported; ensure needsCloudDocs is treated as a numeric count (not a
boolean) and then use if (needsCloudDocs > 0) to push the summary line. Apply
the same guards and cloudOnly-inclusive counting to the other
summary/writer-action blocks that reference comparison.inCloud/notInCloud.
---
Nitpick comments:
In `@__tests__/tools/buildConfigYaml.test.js`:
- Around line 149-224: Add a unit test for the empty-children edge case by
calling buildConfigYaml with a field whose children is an empty array (e.g., {
name: 'foo', type: 'object', kind: 'map', children: [] } and also test kind:
'array' if desired) and assert it does not render any child keys
(expect(result).not.toContain('someChild:')) and that the parent renders
appropriately (e.g., contains 'foo:' for map and 'foo: []' for
array-of-objects). Place the test inside the existing "complex field types"
describe block next to the other cases and reference buildConfigYaml to locate
the behavior to validate.
In `@__tests__/tools/pr-summary-formatter.test.js`:
- Around line 507-519: The test currently checks globally that the summary does
not contain the cloud emoji which is too broad; update the assertion to verify
the connector-specific line for `test_connector` produced by
generateMultiVersionPRSummary(masterDiff) does not include '☁️' instead of using
expect(summary).not.toContain('☁️'); locate the test using createMasterDiff and
createRelease and change the negative cloud assertion to target the specific
connector entry (e.g., find the line that contains '`test_connector`' in the
summary and assert that line does not include the cloud indicator).
In `@tools/redpanda-connect/AUTOMATION.md`:
- Line 41: Update each unlabeled fenced code block that contains ASCII diagrams
or tree-style ASCII art (the blocks that currently start with just ``` and
contain ASCII diagrams/tree) by adding the language tag text after the opening
fence (i.e., change ``` to ```text) so markdownlint MD040 is satisfied; search
for the plain ``` fences surrounding ASCII diagram content (the diagram/tree
blocks referenced in the review) and update them to use ```text consistently.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: daa2ab84-f179-4d5f-ac7d-70dfaa2c4f4d
📒 Files selected for processing (12)
- CLAUDE.md
- CLI_REFERENCE.adoc
- __tests__/tools/buildConfigYaml.test.js
- __tests__/tools/github-release-utils.test.js
- __tests__/tools/pr-summary-formatter.test.js
- bin/doc-tools.js
- tools/redpanda-connect/AUTOMATION.md
- tools/redpanda-connect/github-release-utils.js
- tools/redpanda-connect/helpers/buildConfigYaml.js
- tools/redpanda-connect/multi-version-summary.js
- tools/redpanda-connect/pr-summary-formatter.js
- tools/redpanda-connect/rpcn-connector-docs-handler.js
```js
// Normalize versions (remove 'v' prefix if present)
const normalizedFrom = fromVersion.startsWith('v') ? fromVersion.slice(1) : fromVersion;
const normalizedTo = toVersion.startsWith('v') ? toVersion.slice(1) : toVersion;

// Validate versions
if (!semver.valid(normalizedFrom)) {
  throw new Error(`Invalid starting version: ${fromVersion}`);
}
if (!semver.valid(normalizedTo)) {
  throw new Error(`Invalid ending version: ${toVersion}`);
}
```
Validate the inputs before calling startsWith().
If `fromVersion` or `toVersion` is `undefined` or a non-string, this throws a TypeError before the semver checks run, so callers get a crash instead of the intended `Invalid ... version` error.
💡 Suggested guard
```diff
- const normalizedFrom = fromVersion.startsWith('v') ? fromVersion.slice(1) : fromVersion;
- const normalizedTo = toVersion.startsWith('v') ? toVersion.slice(1) : toVersion;
+ const normalizeVersion = (value, label) => {
+   if (typeof value !== 'string') {
+     throw new Error(`Invalid ${label} version: ${value}`);
+   }
+   return value.startsWith('v') ? value.slice(1) : value;
+ };
+
+ const normalizedFrom = normalizeVersion(fromVersion, 'starting');
+ const normalizedTo = normalizeVersion(toVersion, 'ending');
```

🤖 Prompt for AI Agents
+ const normalizedTo = normalizeVersion(toVersion, 'ending');🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/github-release-utils.js` around lines 144 - 154,
Ensure fromVersion and toVersion are validated as strings before using
startsWith: check typeof fromVersion === 'string' and typeof toVersion ===
'string' and if either is not a string throw the existing Invalid
starting/ending version Error for fromVersion/toVersion; only after those guards
compute normalizedFrom and normalizedTo (using startsWith('v') ? slice(1) :
value) and then run semver.valid on the normalized values. Reference variables:
fromVersion, toVersion, normalizedFrom, normalizedTo, and semver.valid.
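The suggested guard can be exercised standalone. This sketch keeps just the type check and `v`-prefix stripping; in the real utility, `semver.valid` would then run on the returned value:

```javascript
// Sketch of the suggested normalizeVersion helper. Non-string input gets the
// intended "Invalid ... version" error instead of a raw TypeError from
// startsWith(); a leading 'v' is stripped before semver validation.
function normalizeVersion(value, label) {
  if (typeof value !== 'string') {
    throw new Error(`Invalid ${label} version: ${value}`);
  }
  return value.startsWith('v') ? value.slice(1) : value;
}
```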
```js
const cloudConnectors = allNewConnectors.filter(c => c.isCloud);
const selfHostedConnectors = allNewConnectors.filter(c => c.isSelfHostedOnly);
const otherConnectors = allNewConnectors.filter(c => !c.isCloud && !c.isSelfHostedOnly);

if (cloudConnectors.length > 0) {
  lines.push('_Cloud-supported (higher priority):_');
  cloudConnectors.forEach(conn => {
    lines.push(`- [ ] \`${conn.name}\` ${conn.type} ☁️ — introduced in **${conn.version}**`);
  });
  lines.push('');
}

if (selfHostedConnectors.length > 0) {
  lines.push('_Self-hosted only:_');
  selfHostedConnectors.forEach(conn => {
    lines.push(`- [ ] \`${conn.name}\` ${conn.type} 🖥️ — introduced in **${conn.version}**`);
  });
  lines.push('');
}

if (otherConnectors.length > 0 && cloudConnectors.length === 0 && selfHostedConnectors.length === 0) {
  otherConnectors.forEach(conn => {
    lines.push(`- [ ] \`${conn.name}\` ${conn.type} — introduced in **${conn.version}**`);
  });
  lines.push('');
}
```
Don't drop unknown-platform connectors from the multi-release checklist.
otherConnectors are only emitted when there are no cloud/self-hosted entries at all. If one release has binary-analysis data and another does not, the second release's new connectors disappear from Writer Action Items even though they still need docs.
💡 Suggested fix
```diff
-  if (otherConnectors.length > 0 && cloudConnectors.length === 0 && selfHostedConnectors.length === 0) {
+  if (otherConnectors.length > 0) {
+    lines.push('_Platform not determined:_');
     otherConnectors.forEach(conn => {
       lines.push(`- [ ] \`${conn.name}\` ${conn.type} — introduced in **${conn.version}**`);
     });
     lines.push('');
   }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/pr-summary-formatter.js` around lines 264 - 289, The
current logic only emits otherConnectors when there are no cloudConnectors or
selfHostedConnectors, dropping unknown-platform connectors; update the block
that handles otherConnectors (the otherConnectors variable and its if check) to
always run when otherConnectors.length > 0 (remove the && cloudConnectors.length
=== 0 && selfHostedConnectors.length === 0 condition) and add an appropriate
header (e.g., lines.push('_Other connectors:_')) before iterating so those
connectors are included in the checklist regardless of cloud/self-hosted
presence.
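To see the dropped-bucket behavior concretely, here is a self-contained sketch of the three-way split with the unconditional "other" branch applied. The connector entries are made up, version fields are omitted for brevity, and the `_Platform not determined:_` header is the reviewer's suggestion rather than existing output:

```javascript
// Minimal sketch of the checklist partition with the fix applied: the
// "other" bucket is emitted whenever it is non-empty, so connectors whose
// platform could not be determined still reach the writer checklist.
function buildChecklist(allNewConnectors) {
  const lines = [];
  const cloud = allNewConnectors.filter(c => c.isCloud);
  const selfHosted = allNewConnectors.filter(c => c.isSelfHostedOnly);
  const other = allNewConnectors.filter(c => !c.isCloud && !c.isSelfHostedOnly);

  if (cloud.length > 0) {
    lines.push('_Cloud-supported (higher priority):_');
    cloud.forEach(c => lines.push(`- [ ] \`${c.name}\` ${c.type} ☁️`));
  }
  if (selfHosted.length > 0) {
    lines.push('_Self-hosted only:_');
    selfHosted.forEach(c => lines.push(`- [ ] \`${c.name}\` ${c.type} 🖥️`));
  }
  if (other.length > 0) { // unconditional, per the suggested fix
    lines.push('_Platform not determined:_');
    other.forEach(c => lines.push(`- [ ] \`${c.name}\` ${c.type}`));
  }
  return lines;
}
```

A release with one cloud connector and one unknown-platform connector now lists both, where the original condition would have silently dropped the unknown one.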
```js
function stripAugmentationFields(data) {
  const cleanData = JSON.parse(JSON.stringify(data));
  const connectorTypes = ['inputs', 'outputs', 'processors', 'caches', 'rate_limits',
    'buffers', 'metrics', 'scanners', 'tracers', 'config', 'bloblang-methods'];

  for (const type of connectorTypes) {
    if (Array.isArray(cleanData[type])) {
      // Remove connectors that were added by augmentation (cloudOnly or requiresCgo without OSS data)
      cleanData[type] = cleanData[type].filter(c => {
        // Keep if it's not marked as cloudOnly
        // OR if it has a config/fields (meaning it came from OSS, not just binary analysis)
        return !c.cloudOnly || c.config || c.fields;
      });
```
stripAugmentationFields() still leaves injected cgo-only connectors behind.
Line 503 only filters `cloudOnly`, but line 1142 later injects cgo-only connectors with `requiresCgo: true`. Those survive the "clean" load path and get diffed as real OSS components.
🧹 Suggested fix
```diff
-      cleanData[type] = cleanData[type].filter(c => {
-        // Keep if it's not marked as cloudOnly
-        // OR if it has a config/fields (meaning it came from OSS, not just binary analysis)
-        return !c.cloudOnly || c.config || c.fields;
-      });
+      cleanData[type] = cleanData[type].filter(c => {
+        return !c.cloudOnly && !c.requiresCgo;
+      });
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js` around lines 495 -
507, stripAugmentationFields currently only filters out cloudOnly connectors but
misses those marked requiresCgo, so update the filter inside
stripAugmentationFields to also exclude connectors with requiresCgo unless they
have OSS config (config or fields). Concretely, change the predicate in the
cleanData[type] = cleanData[type].filter(...) for connectors to return true only
when the connector is not cloudOnly and not requiresCgo, or when it has c.config
or c.fields (e.g., return (!(c.cloudOnly || c.requiresCgo) || c.config ||
c.fields)); this ensures cgo-only injected connectors are removed from the
cleaned data.
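A small runnable sketch of the widened predicate the prompt above describes. The sample entries are hypothetical, and this variant keeps augmented connectors that also carry OSS `config`/`fields` data, as the prompt suggests:

```javascript
// Drop connectors injected purely by binary analysis (cloudOnly or
// requiresCgo) unless they also carry OSS data (config or fields),
// meaning they came from the OSS snapshot rather than augmentation.
function stripInjectedConnectors(connectors) {
  return connectors.filter(c =>
    !(c.cloudOnly || c.requiresCgo) || c.config || c.fields
  );
}
```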
```js
if (!semver.valid(options.connectVersion)) {
  console.error(`Error: Invalid --connect-version format: ${options.connectVersion}`)
  console.error('Expected format: X.Y.Z (e.g., 4.50.0)')
  process.exit(1)
}
```
🧩 Analysis chain
🏁 Script executed:
```bash
#!/bin/bash
set -euo pipefail
rg -n -C2 'semver\.valid|connect-\\d\+\\.\\d\+\\.\\d+|\.sort\(\)' tools/redpanda-connect/rpcn-connector-docs-handler.js
python - <<'PY'
import re
files = [
    "connect-v4.50.0.json",
    "connect-4.82.0-rc2.json",
    "connect-4.9.0.json",
    "connect-4.10.0.json",
]
pat = re.compile(r"^connect-\d+\.\d+\.\d+\.json$")
print("regex matches:", {f: bool(pat.match(f)) for f in files})
print("lexicographic order:", sorted(files))
PY
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 2844
🏁 Script executed:
```bash
cat > /tmp/test_semver.js << 'EOF'
const semver = require('semver');
const testVersions = [
  'v4.50.0',
  '4.82.0-rc2',
  '4.50.0',
  '4.9.0',
  '4.10.0',
];
console.log('semver.valid() results:');
testVersions.forEach(v => {
  console.log(`  ${v}: ${semver.valid(v)}`);
});
// Also test coerce
console.log('\nsemver.coerce() results:');
testVersions.forEach(v => {
  const coerced = semver.coerce(v);
  console.log(`  ${v}: ${coerced}`);
});
EOF
cd tools/redpanda-connect && node /tmp/test_semver.js
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 949
🏁 Script executed:
```bash
cat tools/redpanda-connect/package.json | grep -A5 -B5 semver
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 151
🏁 Script executed:
```bash
# Check all the locations mentioned in the review comment
rg -n 'semver\.valid|semver\.coerce' tools/redpanda-connect/rpcn-connector-docs-handler.js
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 262
🏁 Script executed:
```bash
# Check what valid semver formats are and understand the issue better
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '600,620p' && \
echo "---" && \
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '700,720p' && \
echo "---" && \
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '730,755p'
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 2856
🏁 Script executed:
```bash
# Check the "Also applies to" sections
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '710,720p'
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 578
🏁 Script executed:
```bash
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '730,740p'
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 486
🏁 Script executed:
```bash
cat tools/redpanda-connect/rpcn-connector-docs-handler.js | sed -n '740,755p'
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 651
🏁 Script executed:
```bash
# Check if there's any version coercion or normalization
rg -n 'semver\.coerce|\.replace.*v|normalize' tools/redpanda-connect/rpcn-connector-docs-handler.js | head -20
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 329
🏁 Script executed:
```bash
# Search for examples of how versions are used
rg -n 'connect-.*\.json|version.*=' tools/redpanda-connect/rpcn-connector-docs-handler.js | head -30
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 2512
🏁 Script executed:
```bash
# Check comments or documentation about version format
rg -n 'Expected format|X\.Y\.Z|version.*format' tools/redpanda-connect/rpcn-connector-docs-handler.js
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 503
🏁 Script executed:
```bash
# Look for where newVersion is used after being set
rg -n 'newVersion|dataFile' tools/redpanda-connect/rpcn-connector-docs-handler.js | grep -A2 -B2 'newVersion ='
```

Repository: redpanda-data/docs-extensions-and-macros
Length of output: 706
Use semver-aware sorting for version selection, not lexicographic string ordering.
The regex `/^connect-\d+\.\d+\.\d+\.json$/` only matches X.Y.Z format filenames. This causes two problems:

1. Prerelease versions fail file matching: `semver.valid('4.82.0-rc2')` returns `true`, so the validation passes, but the filename won't match the regex pattern, causing the file lookup to fail or select the wrong baseline.
2. Lexicographic sorting breaks semantic versions: `candidates.sort()` orders strings lexicographically, so `['4.9.0', '4.10.0'].sort()` becomes `['4.10.0', '4.9.0']`. Since the code selects the last element, it picks `4.9.0` instead of `4.10.0`.

Replace `.sort()` with semver-aware sorting using `semver.sort()` or `semver.rsort()` when selecting versions.
Affects lines 710–712, 743–751, 928–934, 983–986, 1184–1188.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js` around lines 603 -
607, The current filename regex /^connect-\d+\.\d+\.\d+\.json$/ must be relaxed
to allow prerelease segments and you must replace lexicographic .sort() calls on
version strings with semver-aware sorting; update the regex to capture
prerelease (for example /^connect-(\d+\.\d+\.\d+(?:-[0-9A-Za-z-.]+)?)\.json$/)
and extract the captured version, validate with semver.valid, then use
semver.rsort(candidates) or semver.sort(candidates) and pick the first element
to get the highest semantic version (adjust places that currently call
candidates.sort() and pick last to instead call semver.rsort/semver.sort and
pick index 0); apply this change wherever candidates are filtered and sorted
(the blocks referencing the regex and candidates.sort()).
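The two failure modes above can be reproduced without the `semver` package. The sketch below uses the prerelease-aware filename regex from the prompt and a hand-rolled comparator equivalent to semver ordering for plain `X.Y.Z` tags (two prereleases of the same version compare equal here, which the real `semver.rsort()` handles more precisely):

```javascript
// Prerelease-aware filename pattern suggested in the review
const FILE_RE = /^connect-(\d+\.\d+\.\d+(?:-[0-9A-Za-z-.]+)?)\.json$/

// Numeric major.minor.patch comparison; a prerelease sorts before its release
function compareSemver (a, b) {
  const [aMain, ...aPre] = a.split('-')
  const [bMain, ...bPre] = b.split('-')
  const av = aMain.split('.').map(Number)
  const bv = bMain.split('.').map(Number)
  for (let i = 0; i < 3; i++) {
    if (av[i] !== bv[i]) return av[i] - bv[i]
  }
  if (aPre.length && !bPre.length) return -1 // 4.82.0-rc2 < 4.82.0
  if (!aPre.length && bPre.length) return 1
  return 0
}

// Pick the highest semantic version among matching data files
function latestConnectVersion (filenames) {
  const versions = filenames
    .map(name => (name.match(FILE_RE) || [])[1])
    .filter(Boolean)
  return versions.length ? versions.sort(compareSemver).pop() : null
}
```

With this comparator, `['connect-4.9.0.json', 'connect-4.10.0.json']` resolves to `4.10.0`, whereas the plain `.sort()` the review flags would have selected `4.9.0`.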
} catch (err) {
  console.error(`❌ Error processing ${fromVer} → ${toVer}: ${err.message}`)
  console.error('  Continuing with next version...\n')

  intermediateProcessingResults.push({
    fromVersion: fromVer,
    toVersion: toVer,
    error: err.message,
    success: false
  })
}
Partial intermediate failures should not advance latest-connect-version.

These branches record failed release pairs and keep going, but the command still updates Antora and exits 0 later. A failed fromVersion → toVersion pair is then silently dropped from the next automatic run unless someone manually replays it with --from-version.
🚦 Suggested fix
+ const hasIntermediateFailures = intermediateProcessingResults.some(r => !r.success)
+
// Generate PR summary
try {
const { printPRSummary } = require('./pr-summary-formatter.js')
// Use master diff if available, otherwise use single diff
printPRSummary(masterDiff || diffJson, binaryAnalysis, draftFiles, masterDiff ? true : false)
@@
- const wrote = setAntoraValue('asciidoc.attributes.latest-connect-version', newVersion)
- if (wrote) {
- console.log(`Done: Updated Antora version: ${newVersion}`)
+ if (!hasIntermediateFailures) {
+ const wrote = setAntoraValue('asciidoc.attributes.latest-connect-version', newVersion)
+ if (wrote) {
+ console.log(`Done: Updated Antora version: ${newVersion}`)
+ }
+ } else {
+ console.warn('Warning: Skipping Antora version update because some intermediate releases failed')
+ process.exitCode = 1
}
@@
- process.exit(0)
+ process.exit(process.exitCode ?? 0)Also applies to: 1767-1777
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js` around lines 869 -
879, The catch currently logs errors and records failed pairs in
intermediateProcessingResults but the later logic still updates the Antora
`latest-connect-version` and exits 0; change the flow so any recorded failure
prevents advancing the latest version: set a failure flag (e.g.,
hadProcessingError) or rely on inspecting intermediateProcessingResults entries
(check for any item with success: false) after processing all pairs, and if any
failures exist, skip the Antora/latest-version update and exit with a non-zero
code (or throw) so CI/automation will not treat the run as fully successful;
update the code paths that write/update `latest-connect-version` to first
confirm all intermediateProcessingResults are success === true (or that
hadProcessingError is false) before proceeding.
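The gating the prompt describes can be isolated into a small sketch. The function below is hypothetical (the real handler inlines this logic), but it shows the check: scan `intermediateProcessingResults` for any `success: false` entry and only call `setAntoraValue` when there are none, returning a non-zero exit code otherwise:

```javascript
// Sketch: only advance latest-connect-version when every intermediate
// release pair processed successfully. finalizeRun is a hypothetical name.
function finalizeRun (intermediateProcessingResults, newVersion, setAntoraValue) {
  const failures = intermediateProcessingResults.filter(r => !r.success)
  if (failures.length > 0) {
    for (const f of failures) {
      console.warn(`Warning: ${f.fromVersion} → ${f.toVersion} failed: ${f.error}`)
    }
    console.warn('Skipping Antora version update; replay failed pairs with --from-version')
    return 1 // non-zero so CI does not treat the run as fully successful
  }
  setAntoraValue('asciidoc.attributes.latest-connect-version', newVersion)
  return 0
}
```

A subsequent automatic run then still sees the old `latest-connect-version` and retries the missed pairs instead of skipping past them.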
  // Strip augmentation fields to ensure clean comparisons
  oldIndex = stripAugmentationFields(JSON.parse(fs.readFileSync(oldPath, 'utf8')))
}
Apply stripAugmentationFields() to every oldIndex load path.
This only cleans the Antora fallback branch. When --old-data is used or the previous snapshot comes from existingDataFiles, oldIndex stays augmented while newIndex at Line 951 is stripped, so generateConnectorDiffJson() compares mismatched shapes.
🧼 Suggested fix
if (options.oldData && fs.existsSync(options.oldData)) {
- oldIndex = JSON.parse(fs.readFileSync(options.oldData, 'utf8'))
+ oldIndex = stripAugmentationFields(JSON.parse(fs.readFileSync(options.oldData, 'utf8')))
const m = options.oldData.match(/connect-([\d.]+)\.json$/)
if (m) oldVersion = m[1]
} else {
@@
const oldFile = existingDataFiles[existingDataFiles.length - 1]
oldVersion = oldFile.match(/connect-(\d+\.\d+\.\d+)\.json/)[1]
const oldPath = path.join(dataDir, oldFile)
- oldIndex = JSON.parse(fs.readFileSync(oldPath, 'utf8'))
+ oldIndex = stripAugmentationFields(JSON.parse(fs.readFileSync(oldPath, 'utf8')))
  console.log(`📋 Using old version data: ${oldFile}`)

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js` around lines 943 -
945, The oldIndex is only stripped in the Antora fallback branch, causing shape
mismatches when oldIndex is loaded via the --old-data path or from
existingDataFiles; update every code path that assigns oldIndex (including the
--old-data/oldPath load and the existingDataFiles branch) to wrap the parsed
object with stripAugmentationFields(...) (i.e., replace direct assignments like
oldIndex = JSON.parse(fs.readFileSync(oldPath,'utf8')) or assignments from
existingDataFiles with oldIndex = stripAugmentationFields(JSON.parse(...)) ) so
that generateConnectorDiffJson() receives a consistently stripped oldIndex.
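For reference, a minimal sketch of what `stripAugmentationFields()` must guarantee, assuming the augmentation keys named in this PR (`cloudSupported`, `requiresCgo`, `cloudOnly`). The real function also filters cloud-only/CGO-only connectors out of the index entirely; this sketch shows only the key-removal part that makes old and new indexes comparable shape-for-shape:

```javascript
// Keys added during augmentation, per the PR description
const AUGMENTATION_KEYS = ['cloudSupported', 'requiresCgo', 'cloudOnly']

// Return a deep copy with augmentation keys removed at every nesting level,
// so diff generation always compares clean OSS-to-OSS data.
function stripAugmentationFields (index) {
  const clone = JSON.parse(JSON.stringify(index)) // never mutate the caller's object
  const strip = node => {
    if (Array.isArray(node)) {
      node.forEach(strip)
    } else if (node && typeof node === 'object') {
      for (const key of AUGMENTATION_KEYS) delete node[key]
      Object.values(node).forEach(strip)
    }
  }
  strip(clone)
  return clone
}
```

Because it operates on a deep copy and recurses through every branch, the same call is safe on any `oldIndex` load path, which is exactly why the review asks for it to be applied uniformly.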
// Load and strip augmentation fields for clean comparisons
let newIndex = stripAugmentationFields(JSON.parse(fs.readFileSync(dataFile, 'utf8')))

// Save a clean copy of OSS data for binary analysis (before augmentation)
// This ensures the binary analyzer compares actual binaries, not augmented data
const cleanOssDataPath = path.join(dataDir, `._connect-${newVersion}-clean.json`)

// Strip augmentation fields to create clean data for comparison
// Use the already-stripped newIndex for clean data
const cleanData = JSON.parse(JSON.stringify(newIndex))
Always restore the original snapshot after the binary-analysis swap.
If analyzeAllBinaries() throws after the rename/copy step, the catch at Line 1099 logs and continues, but the original connect-${newVersion}.json never gets restored. That leaves ._connect-${newVersion}-augmented.json.tmp behind and changes what the later drafting/cleanup stages read from disk.
🛡️ Suggested fix
const expectedPath = path.join(dataDir, `connect-${newVersion}.json`)
+ const augmentedBackupPath = path.join(dataDir, `._connect-${newVersion}-augmented.json.tmp`)
let tempRenamed = false
- if (fs.existsSync(cleanOssDataPath)) {
- if (fs.existsSync(expectedPath)) {
- fs.renameSync(expectedPath, path.join(dataDir, `._connect-${newVersion}-augmented.json.tmp`))
- tempRenamed = true
- }
- fs.copyFileSync(cleanOssDataPath, expectedPath)
- }
-
- const analysisOptions = {
- skipCloud: false,
- skipCgo: false,
- cgoVersion: options.cgoVersion || null
- }
-
- binaryAnalysis = await analyzeAllBinaries(
- newVersion,
- options.cloudVersion || null,
- dataDir,
- analysisOptions
- )
-
- // Restore the augmented file
- if (tempRenamed) {
- const expectedPath = path.join(dataDir, `connect-${newVersion}.json`)
- fs.unlinkSync(expectedPath)
- fs.renameSync(path.join(dataDir, `._connect-${newVersion}-augmented.json.tmp`), expectedPath)
- }
+ try {
+ if (fs.existsSync(cleanOssDataPath)) {
+ if (fs.existsSync(expectedPath)) {
+ fs.renameSync(expectedPath, augmentedBackupPath)
+ tempRenamed = true
+ }
+ fs.copyFileSync(cleanOssDataPath, expectedPath)
+ }
+
+ const analysisOptions = {
+ skipCloud: false,
+ skipCgo: false,
+ cgoVersion: options.cgoVersion || null
+ }
+
+ binaryAnalysis = await analyzeAllBinaries(
+ newVersion,
+ options.cloudVersion || null,
+ dataDir,
+ analysisOptions
+ )
+ } finally {
+ if (tempRenamed && fs.existsSync(augmentedBackupPath)) {
+ if (fs.existsSync(expectedPath)) fs.unlinkSync(expectedPath)
+ fs.renameSync(augmentedBackupPath, expectedPath)
+ }
+  }

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tools/redpanda-connect/rpcn-connector-docs-handler.js` around lines 950 -
958, The code currently swaps the snapshot files (creating
._connect-${newVersion}-augmented.json.tmp and renaming the original) before
calling analyzeAllBinaries(), but if analyzeAllBinaries() throws the original
connect-${newVersion}.json isn't restored; wrap the analyzeAllBinaries() call
and the rename/copy operations in a try/finally (or add a finally block) so that
regardless of errors you rename/move the temp augmented file back to its
expected filename and remove the .tmp, restoring the original snapshot;
specifically update the logic around where cleanOssDataPath, newIndex, and
analyzeAllBinaries() are used to perform the restore in finally to guarantee
cleanup and restore of connect-${newVersion}.json.
Fixes all issues identified in code review:

**CLAUDE.md fixes:**
- Update fetch command example to match actual CLI flags (--owner, --repo, --remote-path, --save-dir)
- Add required --surface flag to bundle-openapi example

**Input validation:**
- Add string type validation for fromVersion/toVersion before calling .startsWith()

**PR summary improvements:**
- Fix otherConnectors to always show when present (remove restrictive condition)
- Add binaryAnalysis.comparison guards before accessing properties
- Include cloudOnly connectors in cloud-supported counts

**Error handling:**
- Add failure checking for intermediate processing before updating Antora version
- Exit with error code if any intermediate release processing fails
- Add try/finally for snapshot file restoration during binary analysis

**Data consistency:**
- Apply stripAugmentationFields consistently across all oldIndex load paths
- Filter both cloudOnly and requiresCgo connectors in stripAugmentationFields
- Ensure clean OSS-to-OSS comparisons for all diff generation

**Version handling:**
- Update regex to support prerelease versions (\d+\.\d+\.\d+(?:-[0-9A-Za-z-.]+)?)
- Replace lexicographic sort with semver.rsort/semver.sort for correct version ordering
- Applied to all 5 locations where version files are sorted

**Test improvements:**
- Add tests for empty children arrays in buildConfigYaml
- Make cloud emoji assertion more specific (check connector line, not entire summary)
- Add text language tags to ASCII art blocks in AUTOMATION.md for MD040 compliance

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Add support metadata (certified/community/enterprise) from info.csv to connector documentation pages.

Changes:
- parse-csv-connectors.js: Extract support field from CSV
- rpcn-connector-docs-handler.js: Parse CSV and pass metadata to generator
- generate-rpcn-connector-docs.js: Build csvMetadataMap and look up support level
- connector.hbs: Add :support: attribute to frontmatter
- Fix connector key format to use singular type (processor vs processors)

Bump version to 4.15.7

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
The CGO/cloud detection uses plural types (inputs, outputs, processors) while CSV metadata uses singular types (input, output, processor). Fixed by using separate keys:

- connectorKey (plural type) for CGO/cloud lookups
- csvKey (singular item.type) for CSV metadata lookups

This fixes the test failure in cgo-detection.test.js where the requiresCgo flag wasn't being set correctly.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
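The two-key lookup can be sketched as follows. The `type:name` key format, `lookupMetadata` name, and map/set shapes are assumptions for illustration, not the handler's actual internals:

```javascript
// Hypothetical sketch of the dual-key lookup described in the commit.
function lookupMetadata (item, cgoSet, csvMap) {
  const pluralType = item.type.endsWith('s') ? item.type : `${item.type}s`
  const connectorKey = `${pluralType}:${item.name}` // CGO/cloud maps key on plural types
  const csvKey = `${item.type}:${item.name}`        // info.csv keys on singular types
  return {
    requiresCgo: cgoSet.has(connectorKey),
    support: csvMap.get(csvKey) || null
  }
}
```

Deriving both keys from the same `item` in one place avoids the mismatch where a singular-typed key silently missed every entry in the plural-typed CGO map.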
Summary
This PR adds multi-release attribution to the Redpanda Connect connector documentation automation. When releases are missed, the automation now processes each intermediate release separately instead of lumping all changes into the latest version.
Key improvements:
Multi-Release Attribution
Problem
When the automation missed weekly releases (e.g., 4.81.0 through 4.85.0), all changes were attributed to version 4.85.0 instead of their actual release version. Writers couldn't tell which features appeared in which version.
Solution
New CLI Flags
- --skip-intermediate: Legacy mode (single comparison only)
- --from-version <version>: Override starting version instead of using antora.yml

Bug Fixes
CGO-Only Component False Positives
Problem: Components like tigerbeetle_cdc, zmq4, and ffi existed in OSS binaries all along but showed as "new" in 4.85.0 because augmented data was used for diff generation.

Fix: Added a stripAugmentationFields() function that removes cloud/CGO augmentation before version comparisons. This ensures diffs compare clean OSS-to-OSS data.

Result: 4.85.0 now shows 2 new components (correct) instead of 7 (eliminating 5 false positives).
Cloud Binary Version Mismatch
Problem: When processing intermediate releases, automation used latest cloud version (4.85.0) for ALL comparisons, causing incorrect platform attribution.
Fix: Added a findCloudVersionForDate() function that finds the appropriate cloud version for each OSS release date.

Result: Each intermediate release now uses its contemporary cloud version (4.82.0 uses cloud 4.82.0, 4.83.0 uses cloud 4.83.0, and so on).
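A plausible sketch of the date-matching rule: pick the newest cloud release published on or before the OSS release date. The release-object shape (`{ version, date }`) is an assumption for illustration; the actual `findCloudVersionForDate()` lives in github-release-utils.js:

```javascript
// Hypothetical sketch: match each OSS release to its contemporary cloud version.
function findCloudVersionForDate (cloudReleases, ossReleaseDate) {
  const target = new Date(ossReleaseDate).getTime()
  const eligible = cloudReleases
    .filter(r => new Date(r.date).getTime() <= target) // published on or before
    .sort((a, b) => new Date(a.date) - new Date(b.date))
  // Newest eligible release wins; null if the OSS release predates all cloud releases
  return eligible.length ? eligible[eligible.length - 1].version : null
}
```

Under this rule, an OSS release dated between the 4.83.0 and 4.85.0 cloud releases is compared against cloud 4.83.0, not the latest binary.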
Cloud-Only Connector Labeling
Problem: aws_cloudwatch_logs was placed in the cloud-only directory, but the PR summary labeled it "self-hosted only".

Fix: Changed from negative checks (!inCloud) to explicit positive checks for isSelfHostedOnly and isCloudOnly.

Configuration YAML Improvements
Label field restriction: The label field is now only added for components that support it (inputs, outputs, processors). Previously it was added for all types, including caches and metrics, where it's invalid.

Common vs Advanced deduplication: When common and advanced configurations are identical, only the common config is shown with a leading sentence. No tabs are generated for duplicate content.
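The label-field restriction amounts to a type gate. The real buildConfigYaml does considerably more; this sketch (with a hypothetical `configSkeleton` helper) isolates just that rule:

```javascript
// Only these component types accept a top-level label field
const LABELED_TYPES = new Set(['inputs', 'outputs', 'processors'])

// Sketch: emit a label only where the schema allows one
function configSkeleton (componentType, componentName) {
  const entry = {}
  if (LABELED_TYPES.has(componentType)) {
    entry.label = '' // invalid for caches, metrics, and other unlabeled types
  }
  entry[componentName] = {}
  return entry
}
```

Gating on an explicit allow-list, rather than special-casing the invalid types, means newly added component types default to the safe no-label behavior.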
Enhanced PR Summary
The multi-version PR summary now includes:
- Release notes links for each version
- New connector descriptions (2-sentence summaries)
- New fields table with component, field, and description columns
- Removed connectors and fields with version attribution
- Deprecated fields with migration guidance
- Changed defaults table showing old → new values
- Prioritized action items (cloud connectors first)
- Platform grouping (cloud vs self-hosted sections)
New Files
- tools/redpanda-connect/github-release-utils.js
- tools/redpanda-connect/multi-version-summary.js
- tools/redpanda-connect/AUTOMATION.md
- CLAUDE.md
- __tests__/tools/github-release-utils.test.js
- __tests__/tools/buildConfigYaml.test.js

Modified Files
- rpcn-connector-docs-handler.js: stripAugmentationFields(), intermediate release processing loop, cloud version detection
- pr-summary-formatter.js
- buildConfigYaml.js
- bin/doc-tools.js: --skip-intermediate and --from-version flags

Test Coverage
66 tests across 3 test files (all passing)
- buildConfigYaml.test.js: Label inclusion, deprecated fields, object/array rendering
- github-release-utils.test.js: Version parsing, release discovery, findCloudVersionForDate()
- pr-summary-formatter.test.js: Platform detection, multi-version summary, action items

Test plan
- npm test: all 66 tests pass
- --skip-intermediate

🤖 Generated with Claude Code