Back to Agno

multimodal

cookbook/03_teams/19_multimodal/README.md

2.6.4875 B
Original Source

multimodal

Examples for team workflows in multimodal.

Prerequisites

  • Load environment variables (for example, OPENAI_API_KEY) via direnv allow.
  • Use .venvs/demo/bin/python to run cookbook examples.
  • Some examples require additional services (for example PostgreSQL, LanceDB, or Infinity server) as noted in file docstrings.

Files

  • audio_sentiment_analysis.py - Demonstrates audio sentiment analysis.
  • audio_to_text.py - Demonstrates audio to text.
  • generate_image_with_team.py - Demonstrates generate image with team.
  • image_to_image_transformation.py - Demonstrates image to image transformation.
  • image_to_structured_output.py - Demonstrates image to structured output.
  • image_to_text.py - Demonstrates image to text.
  • media_input_for_tool.py - Demonstrates media input for tool.
  • video_caption_generation.py - Demonstrates video caption generation.