OpenAI launches Sora, an AI that can create 60-second videos from text instructions


By Che Browndon
Written on February 17, 2024


OpenAI took the world by storm in November 2022 when it launched ChatGPT (Chat Generative Pre-trained Transformer), a highly sophisticated AI chatbot able to write computer programs and essays, answer questions, generate business ideas, compose music, translate and even summarize text. It is based on a large language model and responds to text prompts. It became one of the fastest-growing software applications in history, gaining over 100 million users in a span of


What is "Sora"?


OpenAI recently announced a brand new AI model. Known as “Sora”, it can create videos up to 60 seconds long from nothing more than text prompts. A text prompt is a sentence or set of keywords given to an AI model, which then generates a reply, otherwise known as output. To use Sora, all you need to do is type in a prompt. The prompt should be specific, whether it is detailed or concise. Sora is able to generate complex scenes with multiple characters, specific types of motion and accurate details of the subject and background because, according to OpenAI, the model understands not only what the prompt asks for but also how those things exist in the real world. OpenAI also says the model has a deep understanding of language, enabling it to interpret prompts accurately and generate compelling characters. Sora can also create multiple shots within a single generated video.

Sora is a diffusion model: it generates a video by starting off with one that looks like static noise and gradually transforming it by removing the noise over many steps. The model is given foresight of many frames at a time, which helps a subject stay the same even after it temporarily goes out of view. Like the GPT models, Sora uses a transformer architecture, which OpenAI says gives it superior scaling performance. Sora builds on past research on the DALL-E and GPT models. It uses the recaptioning technique from DALL-E 3, which generates descriptive captions for the visual training data, so the model is able to follow the user’s text instructions more faithfully. Videos and images are represented as collections of smaller units called patches, each of which is akin to a token in GPT. By unifying how data is represented, diffusion transformers can be trained on a wider range of visual data, spanning different durations, resolutions and aspect ratios.
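To make the two ideas in the paragraph above a little more concrete, here is a toy sketch in Python. It is not Sora's code and makes no claim about how OpenAI implemented the model. The function names (split_into_patches, denoise, oracle), the patch sizes, the step count and the rate are all made up for illustration. The "oracle" stands in for a trained neural network and simply cheats by knowing the clean answer, which a real model never does.

```python
import random


def split_into_patches(video, pt, ph, pw):
    """Cut a video (nested lists indexed [frame][row][col]) into flat patches.

    Each patch covers pt frames, ph rows and pw columns, so it spans both
    space and time. Sizes here are illustrative assumptions.
    """
    frames, height, width = len(video), len(video[0]), len(video[0][0])
    patches = []
    for t0 in range(0, frames - pt + 1, pt):
        for y0 in range(0, height - ph + 1, ph):
            for x0 in range(0, width - pw + 1, pw):
                patch = [video[t][y][x]
                         for t in range(t0, t0 + pt)
                         for y in range(y0, y0 + ph)
                         for x in range(x0, x0 + pw)]
                patches.append(patch)
    return patches


def denoise(noisy, predict_noise, steps=50, rate=0.2):
    """Remove noise gradually: each step subtracts a fraction of the predicted noise."""
    current = list(noisy)
    for _ in range(steps):
        noise = predict_noise(current)
        current = [value - rate * n for value, n in zip(current, noise)]
    return current


# A tiny 2-frame, 4x4 "video" whose pixel values are just counters.
video = [[[t * 16 + y * 4 + x for x in range(4)] for y in range(4)]
         for t in range(2)]
patches = split_into_patches(video, pt=1, ph=2, pw=2)

# Hypothetical stand-in for a trained network: it already knows the clean patch.
clean = patches[0]


def oracle(current):
    return [c - k for c, k in zip(current, clean)]


# Start from pure random noise and refine it over many small steps.
rng = random.Random(0)
noisy = [rng.gauss(0.0, 1.0) for _ in clean]
restored = denoise(noisy, oracle)
```

In this toy setup the video is cut into eight small patches, and the noisy starting values drift toward the clean patch a little more with every step. A real diffusion model has to learn its noise prediction from training data rather than being handed the answer.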

Sora can also generate a video from an existing still image, animating the image’s content accurately and with attention to even tiny details. It can also extend an existing video and even fill in missing frames.



Some limitations


Sora may struggle to accurately simulate the physics of a complex scene. OpenAI gave an example: a person might bite a cookie, but afterwards the cookie may not have a bite mark. Sora might also confuse the spatial details of a prompt, for instance mixing up left and right. It may also struggle with precise descriptions of events that take place over time, such as following a specific camera trajectory.


Access and Safety


OpenAI has given access to Sora to red teamers, as well as to some visual artists, designers, and filmmakers, to gather feedback on how to advance the model so that it best assists professionals. The company wants that feedback as early as possible, and wants the public to eventually get a “feel” for this new AI model. OpenAI also emphasized safety, saying it is working on various ways to ensure the product doesn’t cause “issues”. One such step is the creation of policies meant to govern the use of the AI model.


Conclusion


OpenAI announced Sora with a showcase of what it can do, and the results were quite impressive. The videos were extremely detailed and visually striking. To see the videos Sora created and to get more information about this brand new AI model, you can click here.


