In the past few years, object detection has attracted a lot of attention in the context of human–robot collaboration and Industry 5.0 due to enormous quality improvements in deep learning technologies. In many applications, object detection models have to be able to quickly adapt to a changing environment, i.e., to learn new objects. A crucial but challenging prerequisite for this is the automatic generation of new training data which currently still limits the broad application of object detection methods in industrial manufacturing. In this work, we discuss how to adapt state-of-the-art object detection methods for the task of automatic bounding box annotation in a use case where the background is homogeneous and the object’s label is provided by a human. We compare an adapted version of Faster R-CNN and the Scaled-YOLOv4-p5 architecture and show that both can be trained to distinguish unknown objects from a complex but homogeneous background using only a small amount of training data. In contrast to most other state-of-the-art methods for bounding box labeling, our proposed method neither requires human verification, a predefined set of classes, nor a very large manually annotated dataset. Our method outperforms the state-of-the-art, transformer-based object discovery method LOST on our simple fruits dataset by large margins.
Gromit-MPX is an on-screen annotation tool that works with any Unix desktop environment under X11 as well as Wayland. - GitHub - bk138/gromit-mpx: Gromit-MPX is an on-screen annotation tool that works with any Unix desktop environment under X11 as well as Wayland.
@SafeVarargs
Is a cure for the warning: [unchecked] Possible heap pollution from parameterized vararg type Foo.
Is part of the method's contract, hence why the annotation has runtime retention.
Is a promise to the caller of the method that the method will not mess up the heap using the generic varargs argument.
W. Wu, B. Zhang, и M. Ostendorf. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, стр. 689--692. Stroudsburg, PA, USA, Association for Computational Linguistics, (2010)
S. Repp, S. Linckels, и C. Meinel. Proceedings of the international workshop on Educational multimedia and multimedia education, стр. 19--26. New York, NY, USA, ACM, (2007)
A. Stent, и A. Loui. Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, стр. 59--65. New York, NY, USA, ACM, (2001)
P. Chirita, S. Costache, W. Nejdl, и S. Handschuh. WWW '07: Proceedings of the 16th International Conference on World Wide Web, стр. 845--854. New York, NY, USA, ACM, (2007)
S. Pyysalo, F. Ginter, K. Haverinen, J. Heimonen, T. Salakoski, и V. Laippala. Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing, стр. 25--32. Stroudsburg, PA, USA, Association for Computational Linguistics, (2007)
A. Russo, и D. Peacock. Archives & Museum Informatics: Museums and the Web 2009, (2009)Under Creative Commons License: Attribution Non-Commercial No Derivatives.
C. Marshall. Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems, стр. 40--49. New York, NY, USA, ACM, (1998)
X. Wu, L. Zhang, и Y. Yu. WWW '06: Proceedings of the 15th international conference on World Wide Web, стр. 417--426. New York, NY, USA, ACM Press, (2006)
J. Schnasse, V. Heydegger, и E. Weiper. The eXtensible Chacterisation Languages -- XCL, том 3 из Kölner Beiträge zu einer geisteswissenschaftlichen Fachinformatik, глава 3, Verlag Dr. Kovac, Hamburg, (2009)