Back to Agno

TEST_LOG - Gemini Interactions API Cookbooks

cookbook/90_models/google/gemini_interactions/TEST_LOG.md

2.6.73.4 KB
Original Source

TEST_LOG - Gemini Interactions API Cookbooks

Test Environment

  • Python: 3.12
  • SDK: google-genai (latest)
  • Model: gemini-3-flash-preview
  • Date: 2026-05-14

basic.py

Status: PASS

Description: Tests all four invocation modes: sync, sync+streaming, async, async+streaming. All modes returned coherent text responses.

Result: All four modes produced valid 2-sentence horror stories. Async mode showed expected aiohttp fallback warning.


tool_use.py

Status: PASS

Description: Tests function calling with WebSearchTools. Agent correctly invokes the web search tool and returns a synthesized answer.

Result: Sync, streaming, and async streaming all produced detailed current events responses about France.


multi_turn.py

Status: PASS

Description: Tests multi-turn conversation with server-side history via previous_interaction_id. Agent maintains context across turns.

Result: Agent correctly referenced prior conversation context (hiking interest) in follow-up responses.


thinking.py

Status: PASS

Description: Tests thinking/reasoning mode with thinking_budget configuration.

Result: Model produced detailed mathematical reasoning about triangle angles with multiple proof approaches.


search.py

Status: PASS

Description: Tests built-in Google Search tool (search=True). Agent uses Google Search grounding for real-time information.

Result: Returned current news with citations including geopolitics, tech, sports, and entertainment.


image_understanding.py

Status: PASS

Description: Tests image understanding from URL. Image is downloaded via httpx and sent as base64 data.

Result: Model correctly identified and described a black Labrador puppy on wooden floorboards with detailed analysis of composition, lighting, and mood.


audio_understanding.py

Status: PASS

Description: Tests audio understanding from URL. Audio file is downloaded and sent as base64 data.

Result: Model identified and described the audio clip contents correctly.


video_understanding.py

Status: PASS

Description: Tests video understanding from URL. Video file is downloaded and sent as base64 data.

Result: Model described the video scene in detail including setting, background elements, people, and atmosphere.


document_processing.py

Status: PASS

Description: Tests PDF document processing from URL (arxiv "Attention Is All You Need" paper).

Result: Model produced a comprehensive summary of the Transformer paper including key innovations, performance results, and significance.


structured_output.py

Status: PASS

Description: Tests structured output with Pydantic schema (MovieReview). Uses output_schema parameter with response_format using TextResponseFormatParam.

Result: Returned a properly typed MovieReview object with title="The Matrix", year=1999, genre="Sci-Fi", rating=8.7, and a coherent summary.


image_generation.py

Status: FAIL (API limitation)

Description: Tests image generation using response_modalities=["text", "image"]. The Interactions API does not currently support image generation output with gemini-3-flash-preview.

Result: Model returned text instead of generating an image. Image generation is not yet supported through the Interactions API. The cookbook is kept as a reference for when support is added.