Using Large Language Models for Zero-Shot Video Generation: A VideoPoet Case Study
VideoPoet is a transformer-based model for generating high-quality videos from diverse inputs, excelling in zero-shot ...
VideoPoet is a transformer-based model for generating high-quality videos from diverse inputs, excelling in zero-shot ...
VideoPoet uses task-specific prefixes with text, visual, and audio tokens, training only on outputs like ...
This research paper looks into VideoPoet and compares it to previous diffusion-based works on text-to-video ...