Please Select Your Location
Australia
Österreich
België
Canada
Canada - Français
中国
Česká republika
Denmark
Deutschland
France
HongKong
Iceland
Ireland
Italia
日本
Korea
Latvija
Lietuva
Lëtzebuerg
Malta
المملكة العربية السعودية (Arabic)
Nederland
New Zealand
Norge
Polska
Portugal
Russia
Saudi Arabia
Southeast Asia
Suisse
Suomi
Sverige
台灣
Ukraine
United Kingdom
United States
Please Select Your Location
België
Česká republika
Denmark
Iceland
Ireland
Italia
Latvija
Lietuva
Lëtzebuerg
Malta
Nederland
Norge
Polska
Portugal
Suisse
Suomi
Sverige
<< Back to Blog

Generative AI Video Generator Sora: Transforming Text to Video and Creating Digital Worlds

VIVE POST-WAVE Team • Feb. 22, 2024

2-minute read

Have you also been bombarded with images created by the generative AI Sora?

In the early hours of February 16, OpenAI CEO Sam Altman took to X (the platform formerly known as Twitter) to solicit image descriptions—the more complex and detailed, the better. Shortly after, he unveiled a series of videos generated by Sora:

"Two golden retrievers recording a podcast on a mountaintop." (Cuteness overload)

"A cycling race at sea captured from a drone's perspective, showcasing various marine animals riding bicycles." (Outrageously quirky)

"A blue-robed wizard casting lightning with one hand while holding a spell book in the other." (Relatively normal?)

This marks the official debut of Sora.

What can Sora do?

 1. Based on text prompts, generate 60-second, hyper-realistic videos featuring detailed scenes, expressive characters, complex camera movements, and a resolution of up to 2048×2048.

 2. In addition to text prompts (text to video), Sora can generate videos by using existing images and footage. 

Sora animates a picture of surfing in the Sistine ChapelSora animates a picture of surfing in the Sistine Chapel. (Source: OpenAI) 
 
 

 3. Extend the length of existing videos and create footage that can be played forward or backward. Even infinite looping is possible. 

 

 

 

 4. Video editing: Easily change video backgrounds and simulate different "digital worlds." For example, enter "Minecraft" to render a video in the style of the game's graphics. 

The original version had fewer trees
The original version had fewer trees. (Source: OpenAI) 
 
Transformed into a lush forest
Transformed into a lush forest. (Source: OpenAI) 
 
Converted into a game scene
Converted into a game scene. (Source: OpenAI) 
 

5. Seamlessly merge two separate videos. For instance, blending drone footage with a butterfly in flight to transform a flying drone into a butterfly.

 

 

 

What are current limitations of Sora?

 Sora cannot accurately simulate certain fundamental physical interactions, such as how a glass should shatter on a table (see the video below). It is still in the testing phase and only accessible to select individuals, such as artists, designers, and filmmakers, to gather feedback and refine the model. OpenAI emphasizes that safety and drawbacks from usage ethics are crucial considerations.

 

 

 

What is the ramification of Sora?

 It is not difficult to imagine the chaos if the Internet is filled with photos and videos that cannot be authenticated. It's important to note that OpenAI does not position Sora merely as a video generation model but as a "world simulator" with data-driven physics engine capability. 

 There is already much discussion about Sora's potential seismic impact, such as the future of 3D animators, the profitability of stock footage businesses, and the devaluation of content production services due to a lower entry barrier. 

However, foundational creativity and storytelling will become even more precious when everyone has the power of Hollywood-level CGI at their fingertips. We look forward to seeing Sora's true capabilities and how humanity balances critical thinking and creativity while collaborating with AI.