multimodal

Examples of team workflows that work with multimodal content (audio, images, and video).

Prerequisites

Load environment variables (for example, OPENAI_API_KEY) by running direnv allow.
Run cookbook examples with .venvs/demo/bin/python.
Some examples require additional services (for example, PostgreSQL, LanceDB, or an Infinity server), as noted in each file's docstring.
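
A quick pre-flight check can confirm that the environment is loaded before running an example. The following is a minimal sketch, not part of the cookbook: the helper name `missing_env_vars` is hypothetical, and OPENAI_API_KEY is used only because it is the example variable named above. Other examples may need different variables.

```python
import os


def missing_env_vars(required, env=None):
    """Return the names in `required` that are unset or empty in `env`.

    `missing_env_vars` is a hypothetical helper for illustration only.
    `env` defaults to the current process environment.
    """
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


if __name__ == "__main__":
    # OPENAI_API_KEY is the example variable from the prerequisites;
    # adjust the list to whatever the chosen example actually needs.
    missing = missing_env_vars(["OPENAI_API_KEY"])
    if missing:
        print("Missing environment variables:", ", ".join(missing))
    else:
        print("Environment looks ready.")
```

If the check reports a missing variable, running direnv allow in the repository root and retrying is the likely fix.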

Files

audio_sentiment_analysis.py - Demonstrates audio sentiment analysis.
audio_to_text.py - Demonstrates audio-to-text conversion.
generate_image_with_team.py - Demonstrates generating an image with a team.
image_to_image_transformation.py - Demonstrates image-to-image transformation.
image_to_structured_output.py - Demonstrates producing structured output from an image.
image_to_text.py - Demonstrates image-to-text conversion.
media_input_for_tool.py - Demonstrates passing media input to a tool.
video_caption_generation.py - Demonstrates video caption generation.