| Text-to-video with best-in-class photorealism for lighting, skin texture, and material surfaces | Yes | No |
| Camera control modes — Orbit, Dolly, Pan, and Push-in — for cinematic shot composition | Yes | No |
| Character consistency across shots — same face and clothing maintained across multiple generations | Yes | No |
| Video extension to lengthen existing clips by adding more seconds at the end | Yes | No |
| Image-to-video for animating reference photos with motion | Yes | No |
| API access for building video generation into applications | Yes | No |
| Text-to-video generation up to 10 seconds per clip | No | Yes |
| Image-to-video — animate a still photo with motion prompts | No | Yes |
| Lip sync — drop in audio and match mouth movements to generated characters | No | Yes |
| AI sound effects generation from text descriptions, no audio files needed | No | Yes |
| Fast generation speeds — most clips ready in under 90 seconds | No | Yes |
| Aspect ratio control for 16:9, 9:16, and 1:1 outputs | No | Yes |