Google's new AI video model sucks less at physics

Joystiq

Joystiq News
Google may have only recently begun rolling out its
Please, Log in or Register to view URLs content!
generative AI to
Please, Log in or Register to view URLs content!
, but the company is not wasting any time getting a new version of the video tool out to early testers. On Monday, Google announced a
Please, Log in or Register to view URLs content!
. According to the company, Veo 2 “understands the language of cinematography.” In practice, that means you can reference a specific genre of film, cinematic effect or lens when prompting the model.

Additionally, Google says the new model has a better understanding of real-world physics and human movement. Correctly modeling humans in motion is something all generative models struggle to do. So the company’s claim that Veo 2 is better when it comes to both of those trouble points is notable. Of course, the samples the company provided aren’t enough to know for sure; the true test of Veo 2’s capabilities will come when someone prompts it to
Please, Log in or Register to view URLs content!
. Oh, and speaking of things video models struggle with, Google says Veo will produce artifacts like extra fingers “less frequently.”

A sample image of a squirrel Google's Imagen 3 generated.

Google

Separately, Google is rolling out improvements to
Please, Log in or Register to view URLs content!
. Of its text-to-image model, the company says the latest version generates brighter and better-composed images. Additionally, it can render more diverse art styles with greater accuracy. At the same time, it’s also better at following prompts more faithfully. Prompt adherence was an issue I highlighted when the company made Imagen 3 available to Google Cloud customers earlier this month, so if nothing else, Google is aware of the areas where its AI models need work.

Veo 2 will gradually roll out to
Please, Log in or Register to view URLs content!
users in the US. For now, Google will limit testers to generating up to eight seconds of footage at 720p. For context,
Please, Log in or Register to view URLs content!
can generate up to 20 seconds of 1080p footage, though doing so requires a $200 per month
Please, Log in or Register to view URLs content!
. As for the latest enhancements to Imagen 3, those are available to Google Labs users in more than 100 countries through
Please, Log in or Register to view URLs content!
.

This article originally appeared on Engadget at
Please, Log in or Register to view URLs content!


Please, Log in or Register to view URLs content!


Console Bang News!
 

Users who are viewing this thread

Top