Generative AI Video Generator Sora: Transforming Text to Video and Creating Digital Worlds

VIVE POST-WAVE Team • Feb. 22, 2024

2-minute read

Have you also been bombarded with images created by the generative AI Sora?

In the early hours of February 16, OpenAI CEO Sam Altman took to X (the platform formerly known as Twitter) to solicit image descriptions—the more complex and detailed, the better. Shortly after, he unveiled a series of videos generated by Sora:

"Two golden retrievers recording a podcast on a mountaintop." (Cuteness overload)

"A cycling race at sea captured from a drone's perspective, showcasing various marine animals riding bicycles." (Outrageously quirky)

"A blue-robed wizard casting lightning with one hand while holding a spell book in the other." (Relatively normal?)

This marks the official debut of Sora.

What can Sora do?

1. Based on text prompts, generate 60-second, hyper-realistic videos featuring detailed scenes, expressive characters, complex camera movements, and a resolution of up to 2048×2048.

2. In addition to text prompts (text to video), Sora can generate videos by using existing images and footage.

Sora animates a picture of surfing in the Sistine Chapel. (Source: OpenAI)

3. Extend the length of existing videos and create footage that can be played forward or backward. Even infinite looping is possible.

4. Video editing: Easily change video backgrounds and simulate different "digital worlds." For example, enter "Minecraft" to render a video in the style of the game's graphics.

The original version had fewer trees. (Source: OpenAI)

Transformed into a lush forest. (Source: OpenAI)

Converted into a game scene. (Source: OpenAI)

5. Seamlessly merge two separate videos. For instance, blending drone footage with a butterfly in flight to transform a flying drone into a butterfly.

What are current limitations of Sora?

Sora cannot accurately simulate certain fundamental physical interactions, such as how a glass should shatter on a table (see the video below). It is still in the testing phase and only accessible to select individuals, such as artists, designers, and filmmakers, to gather feedback and refine the model. OpenAI emphasizes that safety and drawbacks from usage ethics are crucial considerations.

What is the ramification of Sora?

It is not difficult to imagine the chaos if the Internet is filled with photos and videos that cannot be authenticated. It's important to note that OpenAI does not position Sora merely as a video generation model but as a "world simulator" with data-driven physics engine capability.

There is already much discussion about Sora's potential seismic impact, such as the future of 3D animators, the profitability of stock footage businesses, and the devaluation of content production services due to a lower entry barrier.

However, foundational creativity and storytelling will become even more precious when everyone has the power of Hollywood-level CGI at their fingertips. We look forward to seeing Sora's true capabilities and how humanity balances critical thinking and creativity while collaborating with AI.

Headsets | Artificial Intelligence

I Took the VIVE Eagle to Seoul for Charli XCX — A First-Person View of Brat Summer’s Finale

Remember last summer when everyone around you was sporting neon green backgrounds with slightly blurry names in Arial font? Yep, that was the Brat Summer trend kicked off by Charli XCX! A year later, it's not just Facebook memories reminding us of...

Generative AI Video Generator Sora: Transforming Text to Video and Creating Digital Worlds

Artificial Intelligence

What can Sora do?

What are current limitations of Sora?

What is the ramification of Sora?

Related Posts

Brain-Computer Interface: Apple, Amazon Partner with Synchron

DeepMind Unveils Genie 3, Google’s World Model Toward AGI

I Took the VIVE Eagle to Seoul for Charli XCX — A First-Person View of Brat Summer’s Finale