CausalCine

Real-time Autoregressive Generation for Multi-Shot Video Narratives

CausalCine enables real-time interactive autoregressive multi-shot video generation. It streams video causally, follows new shot-level prompts, and reuses content-aware KV memory to preserve long-range story context and cross-shot consistency.

Interactive Demo
Shot t-3
Shot t-2
Shot t-1
Shot t
generating...
prompt: neon alley, rain, close-up
Real-time directing 16 FPS streaming generation on 8 NVIDIA H200 GPUs.
Causal multi-shot Stable rollouts across shot boundaries with new content.
Content-aware memory Retrieve relevant earlier shots by semantic content.
Prompt anytime Append new directions without recomputing past shots.

Comparison