CausalCine
Real-time Autoregressive Generation for Multi-Shot Video Narratives
CausalCine generates multi-shot video autoregressively, in real time and under interactive control. It streams video causally, follows new shot-level prompts as they arrive, and reuses content-aware KV memory to preserve long-range story context and cross-shot consistency.
[Illustration: a timeline of shots t-3, t-2, t-1, and t, with the example prompt "neon alley, rain, close-up"]
Real-time directing
16 FPS streaming generation on 8 NVIDIA H200 GPUs.
Causal multi-shot
Rollouts stay stable across shot boundaries, including when a shot introduces new content.
Content-aware memory
Retrieve relevant earlier shots by semantic content.
Prompt anytime
Append new directions without recomputing past shots.
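The last two features can be illustrated with a toy sketch. This is not CausalCine's implementation: the `ShotMemory` class, its `append` and `retrieve` methods, the three-dimensional embeddings, and the shot descriptions are all hypothetical, and cosine similarity is only one plausible reading of "retrieve by semantic content". The point is the shape of the idea: each finished shot is appended to memory once and never recomputed, and earlier shots are ranked by similarity to the current query rather than by recency.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class ShotMemory:
    """Toy stand-in for a per-shot memory (hypothetical, not CausalCine's API)."""
    keys: list = field(default_factory=list)      # one content embedding per shot
    payloads: list = field(default_factory=list)  # placeholder for cached state

    def append(self, embedding, payload):
        # Appending never touches earlier entries, so past shots are not recomputed.
        self.keys.append(list(embedding))
        self.payloads.append(payload)

    def retrieve(self, query, top_k=2):
        # Rank stored shots by similarity to the query, not by how recent they are.
        ranked = sorted(
            range(len(self.keys)),
            key=lambda i: cosine(query, self.keys[i]),
            reverse=True,
        )
        return [self.payloads[i] for i in ranked[:top_k]]


# Made-up embeddings and descriptions, for illustration only.
memory = ShotMemory()
memory.append([1.0, 0.0, 0.0], "shot t-3: neon alley")
memory.append([0.0, 1.0, 0.0], "shot t-2: rooftop")
memory.append([0.9, 0.1, 0.0], "shot t-1: alley close-up")

# A new prompt for shot t is embedded (here by hand) and used as the query.
relevant = memory.retrieve([1.0, 0.0, 0.0], top_k=2)
print(relevant)
```

In this toy run the two alley shots are returned and the unrelated rooftop shot is skipped, even though it is more recent than shot t-3.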