Author of the publication

Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation.

CoRR, (2022)

Other publications of authors with the same name

Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling. CoRR, (2019)
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents. CoRR, (2021)
Remedying BiLSTM-CNN Deficiency in Modeling Cross-Context for NER. CoRR, (2019)
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models. CoRR, (2023)
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis. CoRR, (2022)
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals. EACL, pages 2315-2320. Association for Computational Linguistics, (2021)
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation. EACL, pages 1207-1221. Association for Computational Linguistics, (2021)
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers. CVPR, pages 10503-10512. IEEE, (2022)
Dynamic Video Segmentation Network. CVPR, pages 6556-6565. Computer Vision Foundation / IEEE Computer Society, (2018)
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation. CVPR, pages 10681-10692. IEEE, (2023)