2-minute read
Have you also been bombarded with videos created by the generative AI model Sora?
In the early hours of February 16, OpenAI CEO Sam Altman took to X (the platform formerly known as Twitter) to solicit video prompts from users: the more complex and detailed, the better. Shortly after, he unveiled a series of videos that Sora had generated from them:
"Two golden retrievers recording a podcast on a mountaintop." (Cuteness overload)
"A cycling race at sea captured from a drone's perspective, showcasing various marine animals riding bicycles." (Outrageously quirky)
"A blue-robed wizard casting lightning with one hand while holding a spell book in the other." (Relatively normal?)
This marked the official debut of Sora. Here is what it can do:
1. Based on text prompts, generate hyper-realistic videos up to 60 seconds long, featuring detailed scenes, expressive characters, and complex camera movements. It can also generate still images at a resolution of up to 2048×2048.
2. In addition to text prompts (text to video), Sora can generate videos by using existing images and footage.
3. Extend existing videos forward or backward in time. Even seamless infinite loops are possible.
4. Video editing: Easily change video backgrounds and simulate different "digital worlds." For example, enter "Minecraft" to render a video in the style of the game's graphics.
5. Seamlessly merge two separate videos. For instance, it can blend drone footage with footage of a butterfly in flight, so that the flying drone transforms into a butterfly.
That said, Sora still cannot accurately simulate certain fundamental physical interactions, such as how a glass should shatter on a table (see the video below). It is still in the testing phase and accessible only to select individuals, such as artists, designers, and filmmakers, so that OpenAI can gather feedback and refine the model. OpenAI emphasizes that safety and the ethical risks of misuse are crucial considerations.
It is not difficult to imagine the chaos if the Internet were flooded with photos and videos that cannot be authenticated. It's important to note that OpenAI positions Sora not merely as a video generation model but as a "world simulator" with the capabilities of a data-driven physics engine.
There is already much discussion about Sora's potentially seismic impact: on the future of 3D animators, on the profitability of stock footage businesses, and on the value of content production services as the barrier to entry falls.
However, foundational creativity and storytelling will become even more precious when everyone has the power of Hollywood-level CGI at their fingertips. We look forward to seeing Sora's true capabilities and how humanity balances critical thinking and creativity while collaborating with AI.