fix: accumulate additive events in memory staging area by yashhzd · Pull Request #165 · mesa/mesa-llm

yashhzd · 2026-03-06T14:00:33Z

Summary

Fixes #137

The base Memory.add_to_memory() method uses a plain dict (step_content) where each event type maps to a single value. When multiple events of the same type occur within a single step (e.g., an agent receiving messages from several neighbors), subsequent calls silently overwrite previous entries, causing permanent data loss.

Changes

memory.py: Introduce ADDITIVE_EVENT_TYPES (frozenset({"message", "action"})) — events of these types are collected in a list rather than overwritten. Observations remain state-based (overwrite semantics preserved). Other types (plans, reasoning steps, etc.) also keep overwrite semantics.
memory.py (MemoryEntry.__str__): Updated to render list-valued content with numbered sub-entries for clean display.
st_memory.py / st_lt_memory.py: Updated get_communication_history() to iterate over list-valued message entries.
test_llm_agent.py: Updated test_apply_plan_adds_to_memory assertion to match the new list-based action storage.
New: tests/test_memory/test_memory_staging.py — 9 focused tests covering:
- Single and multiple messages preserved in a list
- Multiple actions preserved
- Observations still overwrite (state-based)
- Non-additive types still overwrite
- Mixed types coexist correctly
- MemoryEntry.__str__ formats lists properly
- ShortTermMemory.get_communication_history handles list messages

Design Decision

Per the discussion in #76, communicative events (messages, actions) should be additive — an agent receiving five messages in one step should have all five recorded. Observations may remain state-based to prevent LLM context bloat.

The ADDITIVE_EVENT_TYPES set is a class attribute on Memory, making it easy for subclasses to extend with additional event types if needed.

Test Plan

All 271 existing tests pass
9 new tests for staging area behavior
Pre-commit hooks pass (ruff check, ruff format, codespell)

Summary by CodeRabbit

Release Notes

New Features
- Messages and actions within a step now accumulate as lists, preserving all events instead of overwriting.
- Added support for rendering lists in memory content with indexed enumeration.
- New async API for memory updates.
Bug Fixes
- Improved communication history handling to properly display multiple messages per step.

codecov · 2026-03-06T14:01:47Z

Codecov Report

❌ Patch coverage is 98.38710% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 91.48%. Comparing base (1e91f24) to head (0b8111f).

Files with missing lines	Patch %	Lines
mesa_llm/memory/memory.py	96.29%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #165      +/-   ##
==========================================
+ Coverage   91.20%   91.48%   +0.27%     
==========================================
  Files          19       19              
  Lines        1592     1632      +40     
==========================================
+ Hits         1452     1493      +41     
+ Misses        140      139       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

coderabbitai · 2026-03-07T16:29:56Z

Important

Review skipped

Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 6d87aa8e-8c78-41d7-91c0-ef83d6a09735

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

Walkthrough

The PR implements additive semantics for certain event types in the memory staging area to fix data loss when multiple events of the same type occur within a single simulation step. Messages and actions now accumulate in lists rather than overwriting each other. Supporting changes update memory rendering and communication history retrieval to handle list-valued content.

Changes

Cohort / File(s)	Summary
Core Memory Logic `mesa_llm/memory/memory.py`	Added ADDITIVE_EVENT_TYPES set containing "message" and "action" types. Modified add_to_memory to accumulate additive event types into lists across multiple calls within a step, while observation types retain overwrite semantics. Added aadd_to_memory async wrapper. Enhanced list rendering in memory content display.
Communication History Retrieval `mesa_llm/memory/st_memory.py`, `mesa_llm/memory/st_lt_memory.py`	Refactored get_communication_history to handle messages stored as either single strings or lists, preserving multiple messages per step and ensuring robust formatting of both legacy and new list-based message storage.
Test Updates `tests/test_llm_agent.py`	Updated test assertion to expect action events as lists accumulating multiple entries within a step, reflecting new additive semantics.
Comprehensive Test Coverage `tests/test_memory/test_memory_staging.py`	Added extensive unit tests validating additive vs. overwrite semantics across event types, list rendering, communication history handling, legacy single-dict migration, and edge cases with ConcreteMemory test fixture.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

PR #69: Also adds async wrapper aadd_to_memory to Memory class in mesa_llm/memory/memory.py with overlapping implementation details.

Suggested labels

bug

Suggested reviewers

sanika-n

Poem

🐰 A rabbit's ode to additive cheer,
No more lost messages, we hold them dear!
Lists grow where singles once did reign,
Each step's events preserved, none in vain. 🌟

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 46.15% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly summarizes the main change: introducing additive event semantics to prevent data loss in memory staging.
Description check	✅ Passed	The description is comprehensive, covering changes, design decisions, and testing. It directly references the linked issue and includes a detailed test plan.
Linked Issues check	✅ Passed	All requirements from issue `#137` are met: additive event types (messages, actions) are collected in lists, observations retain overwrite semantics, and data loss is prevented.
Out of Scope Changes check	✅ Passed	All changes are directly related to fixing issue `#137`: memory staging improvements, helper method updates, test updates, and new comprehensive tests.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

colinfrisch · 2026-03-10T09:11:43Z

@CodeRabbit review

coderabbitai · 2026-03-10T09:11:53Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

🧹 Nitpick comments (3)

mesa_llm/memory/memory.py (2)
56-57: Minor formatting inconsistency for non-dict list items.

Line 57 appends a trailing colon after non-dict items ({item} :), but since these are values rather than key-value pairs, the colon appears unnecessary and inconsistent with how scalar values are displayed elsewhere (line 61 also has this pattern for non-list scalars).
💡 Suggested fix
                     else:
-                        lines.append(f"      [blue]└──[/blue] [cyan]{item} :[/cyan]")
+                        lines.append(f"      [blue]└──[/blue] [cyan]{item}[/cyan]")
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@mesa_llm/memory/memory.py` around lines 56 - 57, The list branch currently
appends non-dict items with a trailing colon using the lines.append call that
builds f"      [blue]└──[/blue] [cyan]{item} :[/cyan]"; remove the colon so
scalar list items render like other scalars (match the pattern used elsewhere
for non-list scalars, e.g., the similar append at the non-list scalar branch),
updating the string to omit " :" and ensure whitespace/formatting matches other
scalar lines.
143-146: Type annotation mismatch: frozenset assigned to set[str].

The annotation declares set[str] but the value is a frozenset. While this works at runtime, it's technically incorrect for static type checkers and could mislead subclasses about mutability.
💡 Suggested fix
-    ADDITIVE_EVENT_TYPES: set[str] = frozenset({"message", "action"})
+    ADDITIVE_EVENT_TYPES: frozenset[str] = frozenset({"message", "action"})
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@mesa_llm/memory/memory.py` around lines 143 - 146, The annotation for
ADDITIVE_EVENT_TYPES is incorrect: it declares set[str] but is assigned a
frozenset; update the type to match immutability by changing the annotation to
frozenset[str] (or typing.FrozenSet[str]) so ADDITIVE_EVENT_TYPES:
frozenset[str] = frozenset({"message", "action"}) and keep the existing value;
this ensures static type checkers and readers see the correct immutable type for
the constant.
mesa_llm/memory/st_memory.py (1)
97-111: LGTM with a consistency note.

The implementation correctly handles both list-valued and scalar message entries. However, EpisodicMemory.get_communication_history() (lines 150-160) directly accesses entry.content['message'] without checking if it's a list, unlike the updated approach here. If both classes are used with additive message storage, they should handle messages consistently to avoid formatting errors.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@mesa_llm/memory/st_memory.py` around lines 97 - 111,
EpisodicMemory.get_communication_history currently assumes
entry.content['message'] is a scalar; change it to match the logic used in
get_communication_history of the short-term memory: first check that "message"
exists in entry.content, retrieve msgs = entry.content["message"], then if
isinstance(msgs, list) iterate and append each msg with the same formatting
(e.g., f"step {entry.step}: {msg}\n\n") else append the scalar message with the
same formatting; ensure you reference EpisodicMemory.get_communication_history
and use entry.step and entry.content consistently.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@mesa_llm/memory/memory.py`:
- Around line 56-57: The list branch currently appends non-dict items with a
trailing colon using the lines.append call that builds f"      [blue]└──[/blue]
[cyan]{item} :[/cyan]"; remove the colon so scalar list items render like other
scalars (match the pattern used elsewhere for non-list scalars, e.g., the
similar append at the non-list scalar branch), updating the string to omit " :"
and ensure whitespace/formatting matches other scalar lines.
- Around line 143-146: The annotation for ADDITIVE_EVENT_TYPES is incorrect: it
declares set[str] but is assigned a frozenset; update the type to match
immutability by changing the annotation to frozenset[str] (or
typing.FrozenSet[str]) so ADDITIVE_EVENT_TYPES: frozenset[str] =
frozenset({"message", "action"}) and keep the existing value; this ensures
static type checkers and readers see the correct immutable type for the
constant.

In `@mesa_llm/memory/st_memory.py`:
- Around line 97-111: EpisodicMemory.get_communication_history currently assumes
entry.content['message'] is a scalar; change it to match the logic used in
get_communication_history of the short-term memory: first check that "message"
exists in entry.content, retrieve msgs = entry.content["message"], then if
isinstance(msgs, list) iterate and append each msg with the same formatting
(e.g., f"step {entry.step}: {msg}\n\n") else append the scalar message with the
same formatting; ensure you reference EpisodicMemory.get_communication_history
and use entry.step and entry.content consistently.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 3cee3f10-7388-4e91-98bc-3ecaacc8aac4

📥 Commits

Reviewing files that changed from the base of the PR and between 4c0549e and 26e4c03.

📒 Files selected for processing (5)

mesa_llm/memory/memory.py
mesa_llm/memory/st_lt_memory.py
mesa_llm/memory/st_memory.py
tests/test_llm_agent.py
tests/test_memory/test_memory_staging.py

sanika-n

Thank you! This would probably have been the approach that I'd have gone with as well. I haven't done a thorough check of the code, but the idea seems right to me. I will wait for @colinfrisch’s approval as well before merging:)

wang-boyu · 2026-03-24T16:00:11Z

Thanks @sanika-n and @yashhzd. Before merging this I'll need to also look into #243 which tries to fix the same issue, once I have the chance.

The base Memory.add_to_memory() method uses a plain dict where each event type maps to a single value. When multiple events of the same type occur within a single step, subsequent calls silently overwrite previous entries, causing permanent data loss. Introduces ADDITIVE_EVENT_TYPES (message, action) that accumulate in lists rather than overwriting. Updates MemoryEntry.__str__, get_communication_history(), and adds 9 focused tests. Closes mesa#137

Fixes ruff PLC0415 (import should be at top-level) by moving the EpisodicMemory import from inside test functions to the top of the file.

wang-boyu · 2026-04-01T16:29:26Z

Hi @sanika-n and @yashhzd, I have updated this PR by making additive event types a configurable parameter in Memory classes (except EpisodicMemory). The default is still "message" and "action". So it is now possible to do:

memory = ShortTermMemory(
  agent=agent,
  additive_event_types=["message", "observation"],
)

I have also updated docstrings and tests, and merged the latest main branch to resolve merge conflicts. When you have a chance, could you pull the latest changes and verify that it's working as expected? Happy to discuss more about it.

wang-boyu mentioned this pull request Mar 6, 2026

fix: store graded EpisodicMemory entries as MemoryEntry objects and use correct llm instance #109

Merged

colinfrisch assigned colinfrisch and sanika-n Mar 10, 2026

coderabbitai bot reviewed Mar 10, 2026

View reviewed changes

sanika-n reviewed Mar 15, 2026

View reviewed changes

yashhzd force-pushed the fix/memory-staging-overwrite-137 branch from 3ff130d to bcd7d35 Compare March 19, 2026 10:05

yashhzd force-pushed the fix/memory-staging-overwrite-137 branch from bcd7d35 to a5a4b2d Compare March 25, 2026 21:28

Yash Goel added 2 commits March 27, 2026 04:53

fix: address review feedback on memory staging

675d638

yashhzd force-pushed the fix/memory-staging-overwrite-137 branch from a5a4b2d to 675d638 Compare March 26, 2026 23:28

style: move EpisodicMemory import to module level

1ba216c

Fixes ruff PLC0415 (import should be at top-level) by moving the EpisodicMemory import from inside test functions to the top of the file.

wang-boyu added the memory label Mar 29, 2026

wang-boyu added 2 commits April 1, 2026 12:19

memory: make additive event types configurable

cc60ae7

Merge branch 'main' into fix/memory-staging-overwrite-137

0b8111f

wang-boyu added the bug Release notes label label Apr 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: accumulate additive events in memory staging area#165

fix: accumulate additive events in memory staging area#165
yashhzd wants to merge 5 commits intomesa:mainfrom
yashhzd:fix/memory-staging-overwrite-137

yashhzd commented Mar 6, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

codecov bot commented Mar 6, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Mar 7, 2026 •

edited

Loading

Review skipped

❌ Failed checks (1 warning)

Uh oh!

colinfrisch commented Mar 10, 2026

Uh oh!

coderabbitai bot commented Mar 10, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

sanika-n left a comment •

edited

Loading

Uh oh!

wang-boyu commented Mar 24, 2026

Uh oh!

wang-boyu commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

yashhzd commented Mar 6, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Design Decision

Test Plan

Summary by CodeRabbit

Release Notes

Uh oh!

codecov bot commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai bot commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

colinfrisch commented Mar 10, 2026

Uh oh!

coderabbitai bot commented Mar 10, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

sanika-n left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wang-boyu commented Mar 24, 2026

Uh oh!

wang-boyu commented Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yashhzd commented Mar 6, 2026 •

edited by coderabbitai bot

Loading

codecov bot commented Mar 6, 2026 •

edited

Loading

coderabbitai bot commented Mar 7, 2026 •

edited

Loading

sanika-n left a comment •

edited

Loading