Recent studies have shown that vision transformer (ViT) models can attain better results than most state-of-the-art convolutional neural networks (CNNs) across various image recognition tasks, and can do so while using considerably fewer computational resources. This has led some researchers to propose ViTs could replace CNNs in this field.However, despite their promising performance, ViTs areContinue Reading
Jiqizhixin("The heart of the machine") is China's leading cutting-edge technology media and industry service platform, focusing on artificial intelligence, robotics and neurocognitive science, and insisting on providing high-quality content and various industrial services for practitioners.
机器之心是国内领先的前沿科技媒体和产业服务平台,关注人工智能、机器人和神经认知科学,坚持为从业者提供高质量内容和多项产业服务。
W. Hung, Y. Tsai, Y. Liou, Y. Lin, and M. Yang. (2018)cite arxiv:1802.07934Comment: Accepted in BMVC 2018. Code and models available at https://github.com/hfslyc/AdvSemiSeg.
A. Ulusoy, A. Geiger, and M. Black. Proceedings of the 2015 International Conference on 3D Vision, page 10--18. Washington, DC, USA, IEEE Computer Society, (2015)
P. Wu, R. Wang, K. Kin, C. Twigg, S. Han, M. Yang, and S. Chien. Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, page 365--374. New York, NY, USA, ACM, (2017)