optional-skills/creative/kanban-video-orchestrator/references/intake.md
The discovery process is adaptive. Always start with three baseline questions to identify the broad style category, then drill into a per-style question bank. Ask 2-4 questions at a time, listen, then proceed. Make reasonable assumptions whenever the user implies an answer.
From these answers, classify the style category and pick the relevant Tier 1 follow-ups. Do not continue asking until you have at least these three.
Map the brief to one of these archetypes (or a hybrid):
| Archetype | Tells |
|---|---|
| Narrative film | Plot, characters, scenes-with-events, dialogue, location |
| Product / marketing | A specific product or feature being shown / sold; CTA at end |
| Music video | A specific track exists; visuals sync to music |
| Explainer / educational | A concept being taught; voiceover-driven |
| Tutorial / changelog | Software demo, terminal-heavy, technical |
| ASCII / terminal art | Retro terminal aesthetic explicit, character-grid |
| Abstract / loop | Generative, no plot, often perfect-loop |
| Documentary / interview cut | Real footage, transcription-driven |
| Real-time / installation | Audio-reactive, gallery installation, VJ output |
If ambiguous, ask which category fits — don't guess. Hybrids are common (e.g., a product video with a narrative arc); decompose into the dominant mode + secondary modifiers.
Recursive / meta ("a video that shows its own production") is a rendering technique, not a separate style — compose it from any of the above by adding a two-pass render step where pass 2 uses pass 1's output as texture inside the final scene.
songwriting-and-ai-music Suno
prompts, or local heartmula), licensed track provided, silentmanim-video (math), p5js (generative),
Remotion (UI motion graphics), comfyui (AI-generated visuals),
ascii-video (technical/retro), hybridexcalidraw,
architecture-diagram, concept-diagrams)touchdesigner-mcp for full TD operator graphs;
p5js for web-canvas; comfyui for generative-AI fed by audio featuresmax_runtime_seconds and acceptable scopeWhen the user under-specifies, fill in these defaults rather than asking:
| Question | Default |
|---|---|
| Frame rate | 30 fps for X / IG; 60 fps for tutorials/explainers; 24 fps for narrative film |
| Resolution | 1080×1080 for square, 1920×1080 for 16:9, 1080×1920 for 9:16 |
| Codec | H.264 / yuv420p, CRF 18 |
| Audio codec | AAC 192 kbps |
| Voice | Provider's mid-range neutral voice unless brand calls for distinctive timbre |
| Music | Silent (require user to specify if music is wanted) |
| Captions | On for explainer/tutorial; off for narrative/abstract unless requested |
| Quality bar | Polished final unless user says draft |
State the assumption explicitly: "Assuming 30fps and AAC audio unless you say otherwise — proceed?"