cookbook/00_quickstart/TEST_LOG.md
Date: 2026-05-19
Environment: .venvs/quickstart/bin/python (Python 3.12.8)
Model: gemini-3.5-flash
Pre-flight: All .py files pass py_compile; GOOGLE_API_KEY loaded via .envrc. 01-03 run serially; 04-12 run in parallel (10 serialized after 08 due to shared learnings Chroma collection).
Status: PASS
Description: Agent uses YFinanceTools to fetch real-time data for NVIDIA. Tool calling, data retrieval, and brief formatting all work correctly.
Result: 5 tool calls (get_current_stock_price, get_stock_fundamentals, get_company_info, get_company_news, get_historical_stock_prices). Delivered a markdown investment brief: NVDA at $220.61, market cap $5.34T, P/E 45.11. Response in 17.4s.
Status: PASS
Description: Agent returns a typed StockAnalysis Pydantic model with all required fields populated.
Result: Valid StockAnalysis for NVIDIA: price $220.61, market cap "5.34T", P/E 45.11, 52-week range $129.16-$236.54, recommendation "Strong Buy". All fields populated and printed programmatically without errors. 9.3s.
Status: PASS
Description: Agent accepts typed AnalysisRequest input (dict and Pydantic model) and returns typed StockAnalysis. Tests deep analysis with risks (NVDA) and quick analysis without risks (AAPL).
Result: Both input modes work. NVDA deep returned populated key_drivers and key_risks. AAPL quick (price $298.97, recommendation "Buy") returned null for both optional fields as expected by analysis_type="quick" and include_risks=False.
Status: PASS
Description: Agent persists conversation across 3 sequential turns using SQLite + a fixed session_id="finance-agent-session".
Result: All 3 turns completed. Agent correctly referenced NVIDIA from turn 1 when comparing to Tesla in turn 2, and synthesized both analyses into a final recommendation in turn 3. Session persistence works.
Note: A first attempt at this test hung at 0% CPU for several minutes (before any output). Killing and re-running cleanly resolved it; root cause not investigated. If it recurs, look at SQLite locking from leftover state in tmp/agents.db.
Status: PASS
Description: Agent uses MemoryManager with enable_agentic_memory=True to capture user preferences. First prompt sets preferences (AI/semiconductor stocks, moderate risk), second asks for personalized recommendations.
Result: Agent stored a single consolidated memory: "User is interested in AI and semiconductor stocks and has a moderate risk tolerance." (topics: investment_interests, risk_tolerance, finance). Second prompt used the stored memory to tailor recommendations. get_user_memories(user_id="[email protected]") returned the memory correctly.
Note: The previous 2026-02-20 run on gemini-3-flash-preview produced 2 separate memory records; gemini-3.5-flash chose to consolidate into 1. Both are valid behavior for the cookbook.
Status: PASS
Description: Agent manages a stock watchlist via session_state. Custom tools (add_to_watchlist, remove_from_watchlist) modify session_state["watchlist"]; state injected into instructions via {watchlist}.
Result: Agent added NVDA, AAPL, GOOGL via parallel tool calls. Second prompt fetched current prices for all 3. Final get_session_state() returned ['NVDA', 'AAPL', 'GOOGL'].
Status: PASS
Description: Loads https://docs.agno.com/ into ChromaDb (hybrid search, RRF), then answers "What is Agno?" by searching the knowledge base.
Result: Knowledge load succeeded against the updated URL. Agent searched the knowledge base and returned a comprehensive answer covering Agno's SDK code example, AgentOS production APIs, control plane UI, and data-ownership story.
Note: Original URL https://docs.agno.com/introduction.md was failing with httpx.HTTPStatusError: 307 Temporary Redirect to a broken target (/.md). Switched to https://docs.agno.com/ in this run.
Status: PASS
Description: Custom save_learning tool persists insights to a ChromaDb knowledge base. Three turns: ask about P/E ratios, approve learning, query saved learnings.
Result: Agent proposed and saved "Tech Stock P/E Benchmarks" (covers mature mega-caps 20-35x, high-growth SaaS 35-60x+, semiconductors 15-25x, PEG cross-reference). On the third prompt the agent successfully retrieved and presented the saved learning from the knowledge base.
Status: PASS
Description: Three guardrails — PIIDetectionGuardrail, PromptInjectionGuardrail, custom SpamDetectionGuardrail. Four test cases.
Result: All 4 cases behaved correctly:
CheckTrigger.PII_DETECTEDCheckTrigger.PROMPT_INJECTIONCheckTrigger.INPUT_NOT_ALLOWEDNote: Same pre-existing quirk as the 2026-02-20 run — guardrail blocks are surfaced as ERROR logs by print_response rather than raising InputCheckError to the caller, so the demo's except InputCheckError branch is cosmetic. The guardrails themselves function correctly.
Status: PASS
Description: @tool(requires_confirmation=True) on save_learning. Flow pauses for confirmation, accepts "y" from stdin, resumes with agent.continue_run().
Result: Agent paused on save_learning call, displayed confirmation prompt with tool name and args, accepted "y", executed the tool, and saved "Tech Stock P/E Ratio Benchmarks" to the knowledge base. Final response included a polished markdown explanation with PEG-ratio formula. continue_run flow works.
Status: PASS
Description: Team of 3 agents — Bull Analyst, Bear Analyst, Lead Analyst (team leader). Two prompts: analyze NVDA, then compare to AMD.
Result: Both prompts completed. For NVDA: bull and bear agents independently fetched data and produced opposing arguments; leader synthesized into a balanced recommendation. For the AMD comparison: leader delegated to both analysts, produced a comprehensive comparison with bull case, bear case, synthesis, recommendation ("UNDERWEIGHT / SELL relative to NVDA, Confidence 8.5/10"), and key metrics table. Total run time ~97s.
Status: PASS
Description: Three-step workflow pipeline — Data Gatherer → Analyst → Report Writer.
Result: All 3 steps completed in sequence. Data Gatherer fetched NVDA market data. Analyst interpreted P/E, P/S, strengths, weaknesses, and benchmark comparisons. Report Writer produced a concise brief with metric table covering price/market cap ($220.61 / $5.34T), Forward P/E and PEG (18.98 / 0.71), margins (71.07% / 55.60%), net cash ($51.15B), ROE (101.49%). Total workflow time: 39.2s.
| # | File | Status |
|---|---|---|
| 01 | agent_with_tools.py | PASS |
| 02 | agent_with_structured_output.py | PASS |
| 03 | agent_with_typed_input_output.py | PASS |
| 04 | agent_with_storage.py | PASS |
| 05 | agent_with_memory.py | PASS |
| 06 | agent_with_state_management.py | PASS |
| 07 | agent_search_over_knowledge.py | PASS |
| 08 | custom_tool_for_self_learning.py | PASS |
| 09 | agent_with_guardrails.py | PASS |
| 10 | human_in_the_loop.py | PASS |
| 11 | multi_agent_team.py | PASS |
| 12 | sequential_workflow.py | PASS |
Result: 12/12 PASS