OpenAI unveiled Sora and ChatGPT Pro, Google introduced Veo 2 and Imagen 3 — the top 3 AI news stories of the week
Our latest AI Digest covers the biggest breaking AI news for the week. Nikolai Chesalin, Product Architect at EPAM, comments on key stories.
#1 — OpenAI’s Sora: revolutionizing video creation
OpenAI has unveiled Sora, a groundbreaking AI video generator that transforms simple text prompts into high-quality videos. This tool offers unprecedented creative flexibility for content creators across various industries, with capabilities to produce 1080p videos up to 20 seconds long in various aspect ratios. Creators can incorporate their own images and videos into projects, blending AI-generated elements with existing content to produce unique and personalized outputs.
Sora utilizes advanced diffusion models to produce videos with exceptional detail, realistic textures, and seamless motion. It revolutionizes the storyboarding process by allowing creators to design and visualize videos frame by frame. This precise control over narrative flow and visual consistency enhances the efficiency of video production.
Currently, Sora is available to ChatGPT Plus and Pro subscribers, with plans to expand access in the coming months.
#2 — ChatGPT Pro: elevating AI interaction
OpenAI has introduced ChatGPT Pro, a subscription plan priced at $200 per month. This plan offers unlimited access to OpenAI’s most sophisticated models, including o1, o1-mini, GPT-4o, and Advanced Voice.
Exclusive to ChatGPT Pro users, o1 pro mode utilizes increased computational resources to provide more refined and insightful responses, especially beneficial for complex problems in fields like data science, programming, and legal analysis. Pro users can select o1 pro mode within the model picker in the ChatGPT interface. To accommodate the longer response times associated with o1 pro mode, a progress bar will indicate wait times, and users will receive notifications upon task completion.
ChatGPT Pro is tailored for researchers, engineers, and professionals who require research-grade intelligence for complex tasks. For casual users, the Free or Plus plans may offer sufficient functionality.
#3 — Google’s Veo 2 and Imagen 3: pushing the boundaries of AI-generated media
Google has announced significant updates to its AI tools, introducing Veo 2 and Imagen 3. Veo 2, Google’s latest AI model, is designed to produce high-quality videos with remarkable realism. Supporting resolutions up to 4K, it can generate videos lasting several minutes and demonstrates an improved understanding of human movement and interactions, resulting in more natural and coherent outputs. Veo 2 also offers advanced cinematic effects, allowing users to specify camera angles, lenses, and shot types—such as “18mm lens” or “low-angle tracking shot” — to achieve their desired visual styles.
The upgraded Imagen 3 model is focused on delivering photorealistic images that feature richer textures and brighter colors. It exhibits a deeper understanding of user prompts, enabling it to generate images that closely align with detailed descriptions and reduces visual artifacts, resulting in cleaner and more accurate outputs.
Both tools are currently available through Google’s platforms, with Veo 2 accessible via VideoFX and Imagen 3 through ImageFX.