Author of the publication

End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with Visual Perceptions.

, , , , and . ICPR, page 2289-2294. IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Attentive Relational Networks for Mapping Images to Scene Graphs., , , , and . CoRR, (2018)MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning., , , , , , , and . CoRR, (2023)Grounding-Tracking-Integration., , , and . CoRR, (2019)SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation., , , , and . AAAI, page 4035-4043. AAAI Press, (2024)Improving One-Stage Visual Grounding by Recursive Sub-query Construction., , , and . ECCV (14), volume 12359 of Lecture Notes in Computer Science, page 387-404. Springer, (2020)Action Recognition with Visual Attention on Skeleton Images., , , and . ICPR, page 3309-3314. IEEE Computer Society, (2018)SAT: 2D Semantics Assisted Training for 3D Visual Grounding., , , and . ICCV, page 1836-1846. IEEE, (2021)ReCo: Region-Controlled Text-to-Image Generation., , , , , , , , , and 1 other author(s). CVPR, page 14246-14255. IEEE, (2023)Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition., , , , , , , , and . CoRR, (2024)MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos., , , , , , , , , and 4 other author(s). CoRR, (2024)