Veo 3.1 used for both text-led and image-led short video generation
A reliable creator example for seeing Veo 3.1 used in both text-led and image-led short video generation.
Google's newer video model family with native audio, 4K output, and text-to-video or image-to-video workflows, designed for higher-spec short-form generation and mode-based quality comparison.
Ready to create videos
Generate in this workspace and the latest result will appear here with the supporting content below.
Veo 3.1 is Google's latest Veo video generation family on Vertex AI. Official docs support text-to-video, image-to-video, prompt rewriting, and generating video from first and last frames, while the current Epochal page exposes one unified Veo 3.1 entry with Lite, Fast, and Standard modes. In practice, this page focuses Veo 3.1 into short 4, 6, or 8 second clips with native audio, 16:9 or 9:16 output, and 720p or 1080p rendering.
Veo 3.1 preview 1
Google positions Veo 3.1 as one video model family that supports both text-to-video and image-to-video, instead of splitting those paths into separate products.
The Veo 3.1 family supports synchronized audio generation, which makes it more useful when the first draft should already include sound rather than silent motion only.
Official docs highlight prompt rewriting plus video generation from first and last frames, which gives Veo 3.1 a stronger bridge between prompt-led and visually guided work.
On Epochal, Lite, Fast, and Standard modes share one public page and the same core control surface, so teams can change cost and generation strength without changing tools.
Creator walkthroughs that are useful for judging Veo 3.1 prompt structure, native audio behavior, and the difference between faster and higher-quality operating modes.
A reliable creator example for seeing Veo 3.1 used in both text-led and image-led short video generation.
Useful when you want a walkthrough focused on prompt construction, mode comparison, and native audio output in short clips.
Helpful for judging how Veo 3.1 is being compared on realism, speed, and creator usability.
A useful creator-side pass on how Veo 3.1 behaves in practical prompting and iteration loops.
Useful when you want a comparison-focused read on where Veo 3.1 sits against other current short-video models.
Public rollout notes and creator examples that are useful for judging Veo 3.1 upgrades, selfie workflows, and lower-cost Lite usage.
Write the subject, motion, camera, and atmosphere you want, or use the matching image-led workflow when a starting visual should guide the shot.
Set Lite, Fast, or Standard, then choose 16:9 or 9:16, 720p or 1080p, 4, 6, or 8 seconds, and whether audio should be generated.
Review motion, timing, framing, and sound together, then refine the prompt or switch modes if the next pass should trade off speed, cost, or output quality differently.
Veo 3.1 is strongest when you need short, polished video drafts with native audio, flexible text or image guidance, and clear tradeoffs between speed and generation strength.
Use Veo 3.1 when you need to test scene direction, camera motion, and audio mood before committing to a heavier production workflow.
It is a strong fit for short 9:16 clips where mobile framing and native audio matter from the first render.
Use it when a source image or first-and-last-frame setup should guide a short motion result more tightly than prompt-only generation.
It works well when the same concept needs to be compared across Lite, Fast, and Standard to balance cost, speed, and result quality.
Each generation with Veo 3.1 consumes credits inside Epochal.
Processing time varies with mode, duration, resolution, audio, and queue state.
This page uses one Veo 3.1 model entry and changes operating tier through Lite, Fast, and Standard mode selection.
The current integration supports both text-to-video and image-to-video under the same Veo 3.1 family, with one shared model page.
Start with free credits on sign-up. Upgrade only when recurring production, private generation, or higher volume starts to matter.
For lighter recurring creation.
Switch fixed steps to match your monthly output.
3,000 credits/month
Up to 12,000 images
Up to 996 videos
Higher monthly capacity
No watermark
Private generation
Faster speed
Image and video workflows
Try the core flow before you upgrade.
Keep reading the newest posts on model capabilities, workflow tips, and creative practice.

HappyHorse 1.0 supports text-to-video and image-to-video for creative drafts, first-frame animation, ad testing, and short cinematic shots.

A practical guide to the best image to video AI tools in 2026, comparing Kling 3.0, Veo 3.1, Seedance 2.0, Wan 2.7, and Grok Imagine Video for frame preservation, motion quality, speed, and workflow fit.

A practical comparison of the best AI video generators available in 2026, covering output quality, audio generation, prompt control, speed, and which model fits each workflow.