OpenAI launches Sora (Indian Express)
- 17 Feb 2024
Why is it in the News?
OpenAI has unveiled a new generative artificial intelligence (GenAI) model that can convert a text prompt into video, an area of GenAI that was thus far fraught with inconsistencies.
What is OpenAI's Sora?
- Sora is an AI model developed by OpenAI –– built on past research in DALL·E and GPT models –– and is capable of generating videos based on text instructions.
- It can also animate a static image, transforming it into a dynamic video presentation.
- Sora can create full videos in one go or add more to already created videos to make them longer.
- It can produce videos up to one minute in duration, ensuring high visual quality and accuracy.
- Sora can generate complex scenes with various characters, precise actions, and detailed backgrounds.
- Not only does the model understand the user's instructions, but it also interprets how these elements would appear in real-life situations.
- The model has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions.
- Sora can also create multiple shots within a single generated video that accurately portrays characters and visual style.
Limitations:
- Despite its impressive capabilities, OpenAI acknowledges certain limitations in the current iteration of Sora.
- The model may encounter challenges in accurately simulating complex physics within scenes, leading to potential discrepancies in cause-and-effect scenarios.
- For instance, while depicting a person taking a bite out of a cookie, Sora may struggle to consistently render a corresponding bite mark on the cookie in subsequent frames.