Video Generation Using Large Language Models: Work in Progress

3 weeks, 3 days ago hackernoon.com

by @teleplay

This research paper looks into VideoPoet and compares it to previous diffusion-based works on text-to-video generation.

Table of Links

Abstract and 1 Introduction

2. Related Work

3. Model Overview and 3.1. Tokenization

3.2. Language Model Backbone and 3.3. Super-Resolution

4. LLM Pretraining for Generation

4.1. Task Prompt Design

4.2. Training Strategy

5. Experiments

5.1. Experimental Setup

5.2. Pretraining Task Analysis

5.3. Comparison with the State-of-the-Art

5.4. LLM’s Diverse Capabilities in Video Generation and 5.5. Limitations

6. Conclusion, Acknowledgements, and References

2. Related Work

Video diffusion models. Many recent video generation methods use diffusion models for text-to-video generation (Ho et al., 2022a; Blattmann et al., 2023b; Zhang et al., 2023a; Blattmann et al., 2023a; He et al., 2023; Zhou et al., 2022; Wang et al., 2023a; Ge ...