OpenAI Sora Text-to-Video AI Model: Generate 1-Minute Videos from Text Prompt and results are impressive

By Aadil Raval • Updated On 16 Feb 2024

Like
Comment
Share

After transforming the world with text-based generative AI, OpenAI is back and now, it can generate videos. OpenAI introduced Sora, its AI text-to-video diffusion model, earlier on Thursday that can generate photo-realistic videos based on user prompts. It is currently available for red teams and a group of experts and visual designers who can use it through and through to provide insights and valuable feedback before the AI model is made available to the public.

OpenAI Sora set to disrupt text-to-video AI market

https://t.co/uCuhUPv51N pic.twitter.com/nej4TIwgaP
— Sam Altman (@sama) February 15, 2024

OpenAI disrupted the tech world with its ChatGPT which currently has 100 million users. It started with text-based prompts and generative AI and has introduced voice prompts and image prompts, the latest ChatGPT 4.0 is connected to the internet to furnish updated data and so on. With Sora, OpenAI is set to disrupt AI video generation capabilities.

You might have seen video-generating models on the internet, however, these are limited to a few seconds or may not have promising results. Google is working on a text-to-video model, Meta already has one but for short videos. However, OpenAI Sora can create AI videos based on text up to a minute long and can perfectly emulate realistic graphics and complex settings such as a couple of people walking through on the sidewalk of a busy road that would require immense computing and processing.

Announcing Sora — our model which creates minute-long videos from a text prompt: https://t.co/SZ3OxPnxwz pic.twitter.com/0kzXTqK9bG
— Greg Brockman (@gdb) February 15, 2024

According to OpenAI, Sora can contemplate complex scenes involving specific types of motion, multiple characters, vibrant emotions, and attention to details in the background gracefully. The AI analyses user prompts and understands what the user wants and how it will exist in the physical world to create realistic videos based on text.

The AI still requires fine-tuning given the early stage. For instance, it might not understand the cause and effect of certain scenarios such as no cookie eating marks when someone takes a bite off the cookie or it could be specific to the camera trajectory to follow or others. However, we can confidently say that just like ChatGPT which took a trove of data to train upon and furnish promising results, Sora will get to that level soon.

Introducing Sora, our text-to-video model.

Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. https://t.co/7j2JN27M3W

Prompt: “Beautiful, snowy… pic.twitter.com/ruTEWn87vf
— OpenAI (@OpenAI) February 15, 2024

OpenAI is fortifying the safety nets on Sora

When it comes to safety, OpenAI will use safety nets to prevent users from generating videos based on hateful sentiments, sexual, extreme violence, likeliness to celebrities, or using IPs reserved by others. Additionally, it will use sophisticated image classifiers to scan through each frame of videos generated to keep the results as per usage policies.

Following the C2PA metadata guidelines, OpenAI will watermark videos generated using Sora (its text-to-video diffusion model) to identify original versus AI-generated videos. This should help prevent the spread of AI videos (often marked as legit or real) reducing the virality of unwanted content.

You can follow Smartprix on Twitter, Facebook, Instagram, and Google News. Visit smartprix.com for the most recent news, reviews, and tech guides.

Aadil Raval

Google Pixel 7 Pro User Shares Frustrating Reality of Google Service Centers in India

The service experience at Google Pixel service centers in India can be mixed, as illustrated by a recent experience shared by a user-facing slow charging issues with his Google Pixel 7 Pro. This article delves into the specifics of his ordeal and the challenges encountered with the service center. The Service Center Saga The user’s journey (MohipGhosh1 …

OpenAI Text Classifier To Flag Content Written By AI

The popular ChatGPT reached a million users in just five days of its launch. You might have seen countless examples of how text-generating AI offers impeccable performance and it continues to be the case. Its maker OpenAI has released an AI text classifier that helps users detect whether the text submitted was written by a …

ChatGPT 5 Expected Launch date, Features, and more

Artificial Intelligence (AI) has become a household name since the arrival of ChatGPT back in 2022. It’s a chatbot that anyone can use for free, and it quickly became very important. It’s used by businesses and regular people for many things like writing poems, helping with schoolwork, and talking to customers. The version called GPT …

ElevenLabs’ AI Sound Generator Adds Background Score To Sora’s Mute Videos

A few days ago, OpenAI released Sora, a breakthrough text-to-video generator tool capable of generating extremely high-quality videos. The tool sent a shockwave down the industry. The samples shared by OpenAI are nothing like any competing company has achieved. However, just days later, ElevenLabs has showcased a new AI-based tool capable of text-to-sound generation. Here’s …

Google Lumiere Joins The AI Video Generation Race, Can Add Animation To Images

Google has revealed its AI-based video generation model called Lumiere. It is a multimodal video generation tool that generates five-second-long videos based on text or image prompts. Currently, there are very few models out there that can create videos based on a given description, including Runway Gen-2. Even though the platform isn’t publicly available, here’s …