Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Video Generation From Text - 2018

Video Generation From Text

Research paper on Video Generation From Text

Research Area:  Machine Learning

Abstract:

Generating videos from text has proven to be a significant challenge for existing generative models. We tackle this problem by training a conditional generative model to extract both static and dynamic information from text. This is manifested in a hybrid framework, employing a Variational Autoencoder (VAE) and a Generative Adversarial Network (GAN). The static features, called "gist," are used to sketch text-conditioned background color and object layout structure. Dynamic features are considered by transforming input text into an image filter. To obtain a large amount of data for training the deep-learning model, we develop a method to automatically create a matched text-video corpus from publicly available online videos. Experimental results show that the proposed framework generates plausible and diverse short-duration smooth videos, while accurately reflecting the input text information. It significantly outperforms baseline models that directly adapt text-to-image generation procedures to produce videos. Performance is evaluated both visually and by adapting the inception score used to evaluate image generation in GANs.

Keywords:  
video generation
variational autoencoder
generative adversarial network
machine learning

Author(s) Name:  Yitong Li , Martin Min, Dinghan Shen, David Carlson ,Lawrence Carin

Journal name:  

Conferrence name:  Proceedings of the AAAI Conference on Artificial Intelligence

Publisher name:  AAAI

DOI:  10.1609/aaai.v32i1.12233

Volume Information: