.agents/skills/xsai/references/media-and-embeddings.md
Use this reference for embeddings, image generation, speech, and transcription.
Use `@xsai/embed` for:

- `embed`
- `embedMany`

Typical inputs:

- `input`
- `dimensions`
- `model`
- `apiKey`
- `baseURL`

Use `@xsai/generate-image` for OpenAI-compatible image generation.
Useful options include:
- `prompt`
- `n`
- `size`
- `responseFormat`

Returned images are normalized to data URLs plus MIME types.
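Because images come back as data URLs, callers often need the raw bytes, for example to write a file. The sketch below is one way to do that, assuming the data URL carries a base64 payload (this reference does not say which encoding is used). `decodeDataUrl` and `DecodedImage` are hypothetical names for illustration, not part of xsAI.

```typescript
// Hypothetical helper, not an xsAI export.
// Assumption: the data URL has the form "data:<mime>;base64,<payload>".
interface DecodedImage {
  mimeType: string
  bytes: Uint8Array
}

function decodeDataUrl(dataUrl: string): DecodedImage {
  const match = /^data:([^;,]+);base64,(.+)$/.exec(dataUrl)
  if (match === null) {
    throw new Error('expected a base64-encoded data URL')
  }

  const mimeType = match[1]
  // atob yields a binary string with one character per byte.
  const binary = atob(match[2])
  const bytes = new Uint8Array(binary.length)
  for (let i = 0; i < binary.length; i++) {
    bytes[i] = binary.charCodeAt(i)
  }

  return { mimeType, bytes }
}
```

The decoded `bytes` can then be handed to whatever file or upload API the host environment provides.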
Use `@xsai/generate-speech` for text-to-speech.
Useful options include:

- `input`
- `voice`
- `responseFormat`
- `speed`

The result is binary audio data.
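Binary audio usually needs a MIME type before it can be played or uploaded. The following is a minimal sketch of wrapping it in a `Blob`, assuming the audio arrives as an `ArrayBuffer` and that `responseFormat` takes OpenAI-style values such as `mp3` or `wav`; neither is confirmed by this reference. `audioToBlob` and `AUDIO_MIME_TYPES` are hypothetical names, not part of xsAI.

```typescript
// Hypothetical mapping from an assumed responseFormat value to a MIME type.
const AUDIO_MIME_TYPES = new Map<string, string>([
  ['aac', 'audio/aac'],
  ['flac', 'audio/flac'],
  ['mp3', 'audio/mpeg'],
  ['wav', 'audio/wav'],
])

// Hypothetical helper, not an xsAI export.
// Assumption: the speech result is an ArrayBuffer of encoded audio.
function audioToBlob(audio: ArrayBuffer, responseFormat: string = 'mp3'): Blob {
  // Fall back to a generic binary type for formats not listed above.
  const type = AUDIO_MIME_TYPES.get(responseFormat) ?? 'application/octet-stream'
  return new Blob([audio], { type })
}
```

In a browser, the resulting `Blob` can be passed to `URL.createObjectURL` for playback; in Node.js, writing the bytes straight to disk is usually simpler.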
- `@xsai/generate-transcription`: unary speech-to-text
- `@xsai/stream-transcription`: streaming speech-to-text

Use the unary API for batch transcription and the streaming API for live or incremental output.
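To illustrate what incremental output means in practice, the sketch below accumulates text pieces as they arrive and reports each partial transcript. It assumes the streaming result can be consumed as an async iterable of text deltas, which this reference does not confirm. `collectTranscript` and `fakeDeltas` are hypothetical names, and `fakeDeltas` only stands in for a live source.

```typescript
// Hypothetical helper, not an xsAI export.
// Assumption: streamed transcription arrives as an async iterable of text deltas.
async function collectTranscript(
  deltas: AsyncIterable<string>,
  onPartial: (partial: string) => void = () => {},
): Promise<string> {
  let transcript = ''
  for await (const delta of deltas) {
    transcript += delta
    // Report the transcript so far, e.g. to update live captions.
    onPartial(transcript)
  }
  return transcript
}

// Stand-in for a live stream, used only to exercise the helper.
async function* fakeDeltas(): AsyncGenerator<string> {
  yield 'Hello'
  yield ', '
  yield 'world'
}
```

With a unary call there is nothing to accumulate: the full text is available only once the request completes, which is why it suits batch work better than live captions.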
These APIs still follow the same xsAI rule: