feat: treat `query_knowledge_sources` results as sensitive by default by abhinav-m22 · Pull Request #4404 · archestra-ai/archestra

abhinav-m22 · 2026-05-06T13:55:38Z

Fixes Prompt Injection via Knowledge Base Ingestion #4348

This PR closes a critical prompt-injection and RAG-poisoning escalation path by ensuring that results from the built-in query_knowledge_sources tool are treated as sensitive by default.

Backend

Special-cased query_knowledge_sources in TrustedDataPolicyModel.evaluateBulk to skip the automatic trust bypass for built-in tools.
Implemented a fail-closed fallback: if the tool record is missing from the database, the system defaults to untrusted rather than trusted.

UI

Modified /api/tools/with-assignments to support an explicit inclusion filter for specific built-in tools.
Updated the Tool Guardrails page to expose query_knowledge_sources in the tools table while keeping other infrastructure tools hidden.

Docs and Tests

Updated codegen-archestra-mcp-server-docs.ts to reflect the trust exception for this tool in the auto-generated platform documentation.
Implemented comprehensive unit tests verifying that both standard and branded KB tool results are treated as untrusted and follow normal policy evaluation.
Added integration tests proving that restricted tools are successfully blocked when invoked after a KB query.

abhinav-m22 · 2026-05-07T13:34:57Z

Hi @joeyorlando!
PR is ready for review. The KB query results (query_knowledge_sources) are marked as sensitive by default and exposed the tool in the UI as suggested.

Konstantinov-Innokentii · 2026-05-11T08:51:14Z

@joeyorlando - assigning you as a reviewer since you own the original issue.

Co-authored-by: Joey Orlando <joseph.t.orlando@gmail.com>

joeyorlando · 2026-05-11T16:01:24Z

I pushed a few minor changes to platform/backend/src/standalone-scripts/codegen-archestra-mcp-server-docs.ts - you'll just need to rerun pnpm codegen to regenerate the contents here

Co-authored-by: Joey Orlando <joseph.t.orlando@gmail.com>

joeyorlando

in terms of "migration", where are the default tool invocation + tool result policies being set/assigned for the query_knowledge_sources tool?

abhinav-m22 · 2026-05-12T01:55:42Z

in terms of "migration", where are the default tool invocation + tool result policies being set/assigned for the query_knowledge_sources tool?

@joeyorlando missed that point. I was leaning on the evaluator's "no policy = untrusted" fallback at evaluation time.

Just fixed it in seedArchestraTools which is the same startup-seed pattern as migratePlaywrightToolsToDynamicCredential.

It now ensures default rows in both tool_invocation_policies (allow_when_context_is_untrusted) and trusted_data_policies (mark_as_untrusted).

abhinav-m22 added 9 commits May 6, 2026 17:30

fix: treat query_knowledge_sources as untrusted by default

1d612f8

test: cover query_knowledge_sources untrusted

67da615

test: block restricted tool after KB query result

7051c8b

feat: allow including specific Archestra tools when excluded

c63ef4e

feat: show query_knowledge_sources on ui

3585399

fix: show kb tool In guardrails

d63cebe

docs: document query_knowledge_sources trust exception

3eac359

test: harden security tests

cd94421

fix tool filter type error

4b97bda

abhinav-m22 marked this pull request as draft May 6, 2026 14:04

abhinav-m22 added 2 commits May 6, 2026 19:39

fix: remove execution trust bypass for query_knowledge_sources

232a9eb

fix tests and lint errors

52294c4

abhinav-m22 marked this pull request as ready for review May 6, 2026 14:29

github-actions Bot requested a review from Konstantinov-Innokentii May 6, 2026 14:40

abhinav-m22 added 2 commits May 7, 2026 08:27

Merge branch 'main' into fix/untrust-knowledge-results

8e7831f

Merge branch 'main' into fix/untrust-knowledge-results

25e75a1

abhinav-m22 added 2 commits May 9, 2026 10:04

Merge branch 'main' into fix/untrust-knowledge-results

1fbf3aa

Merge branch 'main' into fix/untrust-knowledge-results

51e6c58

Konstantinov-Innokentii requested review from joeyorlando and removed request for Konstantinov-Innokentii May 11, 2026 08:50