Vidu,Lee Do the generative video platform from Beijing-based ShengShu Technology, has rolled out an upgrade with the launch of Vidu Q1. The browser-based generative video model turns two still images and a text prompt into a five second, 1080p cinematic clip. Its “First-to-Last Frame” system guides motion smoothly between unrelated frames, giving solo creators access to transitions that once required pro VFX teams. Audio is now baked into the workflow, too. Vidu Q1 generates 48 kHz background music and sound effects via text, supports ten second multitrack layering, and responds to timestamped cues, eliminating the need for external sound libraries. Anime-style outputs have also improved, with crisper lines and better frame consistency, the company said. Internal benchmarks put Q1 ahead of OpenAI’s Sora, Runway Gen-2, and Luma Dream Machine in prompt fidelity and frame coherence, while rivals still rely on outside tools for audio or longer render times. Founded in March 2023, ShengShu Technology is a Beijing-based AI startup specializing in multimodal large language models and creative tools for film, advertising, and digital creators. [TechNode report]
Related Articles
2025-06-26 05:23
1776 views
How to cancel your Hulu subscription on desktop and in the app
In a world where new streaming services seem to be announced every day, it can be exhausting to keep
Read More
2025-06-26 05:16
1872 views
Girl drunkenly takes photos of her locked doors to reassure her sober self
Ever wake up after a night out partying and panic that you forgot to perform a crucial task? You're
Read More