Tech »  Topic »  Video Generation Using Large Language Models: Work in Progress

Video Generation Using Large Language Models: Work in Progress


Video Generation Using Large Language Models: Work in Progress by @teleplay

This research paper looks into VideoPoet and compares it to previous diffusion-based works on text-to-video generation.

Table of Links

Abstract and 1 Introduction

2. Related Work

3. Model Overview and 3.1. Tokenization

3.2. Language Model Backbone and 3.3. Super-Resolution

4. LLM Pretraining for Generation

4.1. Task Prompt Design

4.2. Training Strategy

5. Experiments

5.1. Experimental Setup

5.2. Pretraining Task Analysis

5.3. Comparison with the State-of-the-Art

5.4. LLM’s Diverse Capabilities in Video Generation and 5.5. Limitations

6. Conclusion, Acknowledgements, and References

A. Appendix

2. Related Work

Video diffusion models. Recently, numerous video generation methods use diffusion-based methods for text-to video (Ho et al., 2022a; Blattmann et al., 2023b; Zhang et al., 2023a; Blattmann et al., 2023a; He et al., 2023; Zhou et al., 2022; Wang et al., 2023a; Ge ...


Copyright of this story solely belongs to hackernoon.com . To see the full text click HERE