Article,

Siamese coding network and pair similarity prediction for near-duplicate image detection

M. Fisichella.
International Journal of Multimedia Information Retrieval, 11 (2): 159--170 (Jun 1, 2022)
DOI: 10.1007/s13735-022-00233-w

Abstract

Near-duplicate detection in a dataset involves finding the elements that are closest to a new query element according to a given similarity function and proximity threshold. The brute force approach is very computationally intensive as it evaluates the similarity between the queried item and all items in the dataset. The potential application domain is an image sharing website that checks for plagiarism or piracy every time a new image is uploaded. Among the various approaches, near-duplicate detection was effectively addressed by SimPair LSH (Fisichella et al., in Decker, Lhotská, Link, Spies, Wagner (eds) Database and expert systems applications, Springer, 2014). As the name suggests, SimPair LSH uses locality sensitive hashing (LSH) and computes and stores in advance a small set of near-duplicate pairs present in the dataset and uses them to reduce the candidate set returned for a given query using the Triangle inequality. We develop an algorithm that predicts how the candidate set will be reduced. We also develop a new efficient method for near-duplicate image detection using a deep Siamese coding neural network that is able to extract effective features from images useful for building LSH indices. Extensive experiments on two benchmark datasets confirm the effectiveness of our deep Siamese coding network and prediction algorithm.
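
The triangle-inequality pruning mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the Euclidean metric, the dictionary layout of the precomputed pairs, and the threshold name `tau` are all assumptions made for illustration. The idea shown is only the general bound: for a metric d, d(q, p2) >= d(q, p) - d(p, p2), so if a candidate p is far from the query and a stored pair says p2 is close to p, then p2 can be discarded without computing its distance to the query.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def prune_candidates(query, candidates, points, pair_dist, tau):
    """Filter an LSH candidate list using precomputed pair distances.

    query      -- feature vector of the new item
    candidates -- ids returned by some LSH lookup (hypothetical input)
    points     -- dict: id -> feature vector
    pair_dist  -- dict: id -> {neighbour id: precomputed distance}
    tau        -- proximity threshold for "near-duplicate"

    Returns (ids within tau of the query, number of distances computed).
    """
    pruned = set()
    result = []
    computed = 0
    for p in candidates:
        if p in pruned:
            continue  # already ruled out by the triangle inequality
        d_qp = euclidean(query, points[p])
        computed += 1
        if d_qp <= tau:
            result.append(p)
        # Lower bound: d(q, p2) >= d(q, p) - d(p, p2).
        # If that bound already exceeds tau, p2 cannot be a near-duplicate.
        for p2, d_pp2 in pair_dist.get(p, {}).items():
            if d_qp - d_pp2 > tau:
                pruned.add(p2)
    return result, computed
```

With four candidates of which two form a stored close pair far from the query, only three distances are computed in this sketch; the fourth candidate is skipped because the bound alone rules it out.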

BibTeX key: Fisichella2022
entry type: article
year: 2022
month: jun
day: 01
journal: International Journal of Multimedia Information Retrieval
number: 2
pages: 159--170
volume: 11
issn: 2192-662X
DOI: 10.1007/s13735-022-00233-w
url: https://doi.org/10.1007/s13735-022-00233-w

Comments and Reviews

anonymous 2 years ago (last updated 2 years ago)
using siamese and indexing technique, quite useful

Cite this publication

@article{Fisichella2022,
  abstract = {Near-duplicate detection in a dataset involves finding the elements that are closest to a new query element according to a given similarity function and proximity threshold. The brute force approach is very computationally intensive as it evaluates the similarity between the queried item and all items in the dataset. The potential application domain is an image sharing website that checks for plagiarism or piracy every time a new image is uploaded. Among the various approaches, near-duplicate detection was effectively addressed by SimPair LSH (Fisichella et al., in Decker, Lhotsk{\'a}, Link, Spies, Wagner (eds) Database and expert systems applications, Springer, 2014). As the name suggests, SimPair LSH uses locality sensitive hashing (LSH) and computes and stores in advance a small set of near-duplicate pairs present in the dataset and uses them to reduce the candidate set returned for a given query using the Triangle inequality. We develop an algorithm that predicts how the candidate set will be reduced. We also develop a new efficient method for near-duplicate image detection using a deep Siamese coding neural network that is able to extract effective features from images useful for building LSH indices. Extensive experiments on two benchmark datasets confirm the effectiveness of our deep Siamese coding network and prediction algorithm.},
  added-at = {2022-12-08T11:00:22.000+0100},
  author = {Fisichella, Marco},
  biburl = {https://www.bibsonomy.org/bibtex/2454a19dd47fdab6f76f2034853520748/mfisichella},
  day = 01,
  doi = {10.1007/s13735-022-00233-w},
  interhash = {6eab435cc164048e5896380d800a50b6},
  intrahash = {454a19dd47fdab6f76f2034853520748},
  issn = {2192-662X},
  journal = {International Journal of Multimedia Information Retrieval},
  keywords = {},
  month = jun,
  number = 2,
  pages = {159--170},
  timestamp = {2022-12-08T11:00:22.000+0100},
  title = {Siamese coding network and pair similarity prediction for near-duplicate image detection},
  url = {https://doi.org/10.1007/s13735-022-00233-w},
  volume = 11,
  year = 2022
}
