In February 2024, OpenAI introduced Sora on a page titled “Sora: Creating video from text.” Sora is a model that generates video clips directly from natural-language descriptions, extending the text-to-image capabilities of earlier systems into moving images.
Sora represented a significant step in generative media because video is far harder to produce than still images. It requires the system to keep objects, characters, and scenes consistent across many frames and to render plausible motion over time. OpenAI presented Sora as a demonstration that AI could synthesize coherent video from a prompt, opening a new modality for generative AI.
For business readers, Sora signaled that video, one of the most expensive and labor-intensive forms of media to produce, was entering the generative-AI era. The announcement set expectations for how AI might reshape marketing, entertainment, and communication, even as the technology continued to mature toward production use.