OpenAI launches Sora (Indian Express)

  • 17 Feb 2024

Why is it in the News?

OpenAI has unveiled a new generative artificial intelligence (GenAI) model that can convert a text prompt into video, an area of GenAI that was thus far fraught with inconsistencies.

What is OpenAI's Sora?

  • Sora is an AI model developed by OpenAI –– built on past research in DALL·E and GPT models –– and is capable of generating videos based on text instructions.
  • It can also animate a static image, transforming it into a dynamic video presentation.
  • Sora can create full videos in one go or add more to already created videos to make them longer.
  • It can produce videos up to one minute in duration, ensuring high visual quality and accuracy.
  • Sora can generate complex scenes with various characters, precise actions, and detailed backgrounds.
    • Not only does the model understand the user's instructions, but it also interprets how these elements would appear in real-life situations.
  • The model has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions.
  • Sora can also create multiple shots within a single generated video that accurately portrays characters and visual style.

Limitations:

  • Despite its impressive capabilities, OpenAI acknowledges certain limitations in the current iteration of Sora.
  • The model may encounter challenges in accurately simulating complex physics within scenes, leading to potential discrepancies in cause-and-effect scenarios.
    • For instance, while depicting a person taking a bite out of a cookie, Sora may struggle to consistently render a corresponding bite mark on the cookie in subsequent frames.