Author of the publication

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

, , , , , , , , , and . Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Boundary-sensitive Pre-training for Temporal Localization in Videos., , , , , , , and . CoRR, (2020)Negative Frames Matter in Egocentric Visual Query 2D Localization., , , , , and . CoRR, (2022)TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification., , , , , and . BMVC, page 304. BMVA Press, (2021)Space-time Mixing Attention for Video Transformer., , , , and . NeurIPS, page 19594-19607. (2021)Efficient Progressive Neural Architecture Search., , and . BMVC, page 150. BMVA Press, (2018)Boundary-sensitive Pre-training for Temporal Localization in Videos., , , , , , , and . ICCV, page 7200-7210. IEEE, (2021)Background-foreground tracking for video object segmentation., , and . ICIP, page 1613-1617. IEEE, (2015)Boundary Denoising for Video Activity Localization., , , , , and . ICLR, OpenReview.net, (2024)GenTron: Diffusion Transformers for Image and Video Generation., , , , , , , , , and . CVPR, page 6441-6451. IEEE, (2024)FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing, , , , , , , , , and . International Conference on Learning Representations (ICLR), (2024)