#9011: fix(session): auto-recovery for corrupted tool responses [AI-assisted]

by cheenu1092-oss open 2026-02-04 19:04 View on GitHub →

agents size: L

## Summary Fixes #8946 - Session should auto-recover when corrupted tool response makes history invalid ### Problem When a tool response is corrupted (JSON parse error, truncated output, invalid UTF-8), the session history becomes invalid and the entire session fails. ### Solution 1. **Pre-persist validation** - Validate tool responses before storing: - JSON structure validation - UTF-8 encoding validation 2. **Graceful degradation** - When validation fails: - Store sanitized placeholder: `[Tool output corrupted - original truncated]` - Log original to `~/.openclaw/debug/corrupted-tool-results/` - Continue session with warning 3. **Integration** - Applied in session-tool-result-guard-wrapper: - Before plugin hooks (ensures plugins get valid input) - After plugin transformations (catches plugin-introduced corruption) ### Files Changed - `src/agents/session-tool-result-validation.ts` (new) - `src/agents/session-tool-result-validation.test.ts` (new) - 12 tests - `src/agents/session-tool-result-guard-wrapper.ts` (modified) - `src/agents/session-tool-result-guard.validation-integration.test.ts` (new) - 5 tests ### Impact - Sessions survive tool failures instead of crashing - Corrupted outputs logged for investigation - Conversation context preserved ### AI Disclosure - [x] AI-assisted (Claude Code via Clawdbot) - [x] Tests included - [x] I understand what the code does  <h2>Greptile Overview</h2> <h3>Greptile Summary</h3> This PR adds a pre-persist validation/sanitization layer for `toolResult` messages to keep sessions running when tool outputs are corrupted (non-JSON-serializable structures, invalid UTF-8, etc.). It introduces `validateAndSanitizeToolResult()` plus unit + integration tests, and wires the validator into `guardSessionManager` so tool results are validated before persistence, then again after `tool_result_persist` plugin hooks. It also updates the Telegram monitor to enforce single-instance startup via in-memory instance state (running/starting + debounce) and adds tests for duplicate start prevention/debounce behavior. <h3>Confidence Score: 3/5</h3> - This PR is close to mergeable but has a real runtime handler-leak bug that should be fixed first. - Most changes are additive and well-tested, and the tool-result sanitization approach is straightforward. However, `monitorTelegramProvider` currently installs a process-level unhandled-rejection handler and can return early (debounce/duplicate) without unregistering it, which will accumulate handlers and duplicate logging. There’s also some test env leakage and a potential debug-log overwrite edge case. - src/telegram/monitor.ts; src/agents/session-tool-result-guard.validation-integration.test.ts; src/agents/session-tool-result-validation.ts