cookbook/08_learning/TEST_LOG.md
Last updated: 2026-06-12
Changes in this pass:
- gpt-5.5 (was gpt-5.2, plus two OpenAIChat/gpt-4o stragglers in 09_decision_logs/)
- claude-sonnet-4-6
- 10_demo/: AgentOS demo with all six learning stores enabled on Postgres + pgvector, a seed script, and the Learning UI walkthrough

Verified in this pass:
Status: PASS
Description: Imported the demo agent against Postgres + pgvector, confirmed all six stores initialize (user_profile, user_memory, session_context, entity_memory, learned_knowledge, decision_log), built the AgentOS app, and exercised GET /learnings, GET /learnings/users, and learning_type filtering with a FastAPI TestClient.
Result: App builds and the /learnings endpoints respond with paginated results.
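
The filtering and pagination checks described above can be pictured with a stdlib-only stand-in. This is a rough sketch under stated assumptions, not the AgentOS implementation: `list_learnings`, the record fields, and the `data`/`meta` response shape are all hypothetical. Only the six store names are taken from this log.

```python
from typing import Optional

# The six store names listed in this log.
STORES = [
    "user_profile", "user_memory", "session_context",
    "entity_memory", "learned_knowledge", "decision_log",
]

# Hypothetical records (two per store), purely illustrative.
RECORDS = [{"id": i, "learning_type": STORES[i // 2]} for i in range(12)]


def list_learnings(records, learning_type: Optional[str] = None,
                   page: int = 1, limit: int = 5):
    """Filter by learning_type, then return one page plus paging metadata.

    The response shape here is an assumption, not the real endpoint's schema.
    """
    if learning_type is not None:
        records = [r for r in records if r["learning_type"] == learning_type]
    start = (page - 1) * limit
    return {
        "data": records[start:start + limit],
        "meta": {"page": page, "limit": limit, "total_count": len(records)},
    }


first_page = list_learnings(RECORDS)
filtered = list_learnings(RECORDS, learning_type="decision_log")
```

A check in this style asserts that an unfiltered request returns at most `limit` items and that a filtered request returns only records of the requested type.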
Status: PENDING (live re-run)
Description: Model id swap is mechanical; imports verified. Live extraction runs with gpt-5.5 still need a full pass (requires OPENAI_API_KEY and the pgvector container for Postgres-based examples).
.venvs/demo/bin/python
Status: PASS
Description: Tests AGENTIC mode for LearnedKnowledgeStore with the restructured prompt (Rules 1-4 consolidated in CRITICAL RULES section).
Result: Agent correctly:
Status: PASS
Description: Tests PROPOSE mode where agent proposes learnings for user approval before saving.
Result: Agent correctly:
Status: PASS
Description: Tests the learning=True shorthand which now enables both UserProfile and UserMemory stores by default.
Result:
['user_profile', 'user_memory']
Status: PASS
Description: Basic ALWAYS mode learning with automatic extraction.
Result: Agent learned user info (Alice, Anthropic research scientist, prefers concise responses) and recalled it in session 2.
Status: PASS
Description: Basic AGENTIC mode where agent has tools to update memory.
Result: Agent used update_user_memory tool and correctly recalled user info.
Status: PASS
Description: Tests learned knowledge sharing across users.
Result:
Status: PASS
Description: UserProfileStore with ALWAYS mode extraction.
Result: Extracted profile (Alice Chen / Ali) and recalled correctly in session 2.
Status: PASS
Description: UserMemoryStore with ALWAYS mode extraction.
Result: Extracted memories about user's work and preferences, applied them in session 2 response.
Status: PASS
Description: SessionContextStore tracking conversation state.
Result: Maintained session summary across turns, correctly summarized the API design discussion when asked "What did we decide?"
Status: PASS
Description: Basic LearnedKnowledgeStore functionality.
Result: Agent searched learnings and incorporated the egress cost goal into its cloud provider recommendations.
| Category | Tests | Passed | Failed |
|---|---|---|---|
| Priority 1 (Recent Changes) | 3 | 3 | 0 |
| Priority 2 (Smoke Tests) | 3 | 3 | 0 |
| Priority 3 (User Profile/Memory) | 2 | 2 | 0 |
| Priority 4 (Other Stores) | 2 | 2 | 0 |
| Total | 10 | 10 | 0 |
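
As a quick consistency check on the summary table, the per-category counts can be re-added and compared with the Total row. The snippet below is only a sketch: the numbers are copied from the table, and the variable names are illustrative.

```python
# (tests, passed, failed) per category, copied from the summary table.
rows = {
    "Priority 1 (Recent Changes)": (3, 3, 0),
    "Priority 2 (Smoke Tests)": (3, 3, 0),
    "Priority 3 (User Profile/Memory)": (2, 2, 0),
    "Priority 4 (Other Stores)": (2, 2, 0),
}

# Column-wise sums across all categories.
total_tests = sum(tests for tests, _, _ in rows.values())
total_passed = sum(passed for _, passed, _ in rows.values())
total_failed = sum(failed for _, _, failed in rows.values())

# Each row should satisfy tests == passed + failed.
rows_consistent = all(t == p + f for t, p, f in rows.values())

print(total_tests, total_passed, total_failed)  # 10 10 0
```
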
All tests passing after the following changes:
- learning=True now enables both user_profile and user_memory by default
- learning_saved state reset bug
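
The `learning=True` default can be illustrated with a small stand-in. This is a hypothetical sketch, not the library's code: `resolve_learning_stores` and `DEFAULT_STORES` are names invented here, and the handling of values other than `True` is an assumption.

```python
# Stores enabled by the shorthand, per this log.
DEFAULT_STORES = ["user_profile", "user_memory"]


def resolve_learning_stores(learning):
    """Map a `learning` setting to a list of store names (hypothetical helper)."""
    if learning is True:
        # Shorthand: enable both default stores.
        return list(DEFAULT_STORES)
    if not learning:
        # False or None: learning disabled (assumed behavior).
        return []
    # An explicit iterable of store names is passed through (assumed behavior).
    return list(learning)


enabled = resolve_learning_stores(True)
print(enabled)  # ['user_profile', 'user_memory']
```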