Synthetic Media

Image, video, audio, and 3D generation — model reviews, creative workflows, and the tools reshaping content creation.

ByteDance Put a Director's Chair Inside CapCut

video-generationseedancebytedancecapcutmultimodalai-video

A video model that ingests nine reference images, three video clips, and three audio tracks in a single generation pass just hit number one on Artificial Analysis. Seedance 2.0 posted an Elo of 1,269

Wan2.2 Splits Its Brain in Half — And That's the Point

video-generationwan2.2mixture-of-expertsopen-sourcealibabacomfyui

Every video diffusion model released in the last year has followed the same playbook: train bigger, throw more VRAM at inference, charge accordingly. Alibaba's Tongyi Lab just published something

Microsoft Built a Top-Three Image Model and Locked It in a Square

image-generationmai-image-2microsoftenterprise-aiazureapi-pricing

Microsoft's MAI-Image-2 debuted on April 2nd and immediately landed third on the Arena.ai leaderboard — behind only Google's Gemini 3.1 Flash and OpenAI's GPT-Image 1.5. That ranking got t

ACE-Step 1.5 Ships and Suno's Moat Gets Thinner

music-generationace-stepopen-sourcelocal-aicomfyuilora

Two months ago, paying Suno $24/month felt like the only realistic path to AI-generated music that didn't sound like a MIDI ringtone from 2004. That calculus just broke. ACE-Step 1.5 dropped in la

Gaussian Splats Cast Shadows Now

gaussian-splattingnvidia3d-renderingvulkangame-engines

Shadows were the tell. You could always spot a Gaussian Splatting scene in a demo reel because everything floated in ambient light — beautiful geometry, gorgeous color, zero ground contact. NVIDIA&#39

Midjourney V8 Rewrites Your Prompts and Your Budget

midjourneyimage-generationworkflowpromptingv8-alpha

Two weeks into Midjourney V8 Alpha, I can confirm it's the fastest image generator I've used from any commercial platform. I can also confirm it broke half my prompt library. The V8 that shipp

Mistral's Voxtral Fits in 3 GB and Makes ElevenLabs Optional

voice-aitext-to-speechvoxtralmistralopen-weightsvoice-cloning

Four days ago Mistral quietly shipped the most disruptive model in the TTS space this year, and it wasn't even their main announcement that week. Voxtral TTS is a 4-billion-parameter speech synthe

Sora Was Never a Product

ai-videosoraopenairunwayklingmarket-analysis

OpenAI pulled the plug on Sora this week. The app goes dark in April; the API follows in September. Everyone's writing eulogies. I want to write something different: a thank-you note. Because Sora