Google VideoPoet : An AI Tool That Crafts Videos from Text Input

By Ronik | 2 min read

Last updated: November 13, 2025

December 23, 2023: Google’s software engineers, Dan Kondratyuk and David Ross, have recently introduced an innovative tool named VideoPoet, which is set to change the world of AI video generation.

VideoPoet

This new tool, based on a large language model (LLM), can perform a range of video generation tasks, including text-to-video, image-to-video, video stylization, and even video-to-audio conversions.

VideoPoet stands out in its field by integrating various video generation capabilities into a single LLM, unlike other models, which rely on separate components for each task.

This integration allows for more seamless and coherent video creation, especially in tasks involving large motions, which has been a challenge for current models.

One of the key features of VideoPoet is its ability to animate still images and edit videos for tasks like inpainting, outpainting, and stylization.

For example, it can take a static image of a ship at sea and animate it to show the ship navigating through a thunderstorm. This capability is enhanced by the use of text prompts, which guide the motion and style of the generated videos.

videopoet example videos

The model’s training and inference inputs and outputs across different tasks are particularly intriguing.

VideoPoet uses multiple tokenizers (MAGVIT V2 for video and image, and SoundStream for audio) to convert various modalities into tokens and vice versa.

This process enables the model to generate tokens based on context, which are then converted back into a viewable representation.

VideoPoet has also shown promise in generating longer videos maintaining the appearance and consistency of objects over several iterations. Additionally, the model can interactively edit existing video clips, allowing users to change the motion of objects within a video.

The evaluation results of VideoPoet are equally impressive. In terms of text fidelity and motion interestingness, VideoPoet was preferred over competing models, showcasing its ability to follow prompts and produce interesting motions accurately.

For those interested in seeing more examples of VideoPoet’s capabilities, a demo is available on their website.

Recent Blogs

Tools We Recommendd

Weam AI

Not sure where AI fits in your business?

Take the free AI Cost Saving Audit to find where AI can cut costs in your business — and which area to start with first. It only takes a few minutes.

Wonder how AI fits into your workflow? Let’s start by automating your first one.

Book a Call

Get Cost-Saving Audit!

Ready to bring secure AI workflows into your multi-unit business?

Partner with Weam to build an automated AI system for your franchise network, multi-unit operation, or location-based brand.