Header menu link for other important links
X
Improving Hashing Algorithms for Similarity Search via MLE and the Control Variates Trick
K. Kang, S. Kushnarev, W.P. Wong, , H. Yeo, Y. Chen
Published in ML Research Press
2021
Volume: 157
   
Pages: 814 - 829
Abstract
Hashing algorithms are continually used for large-scale learning and similarity search, with computationally cheap and better algorithms being proposed every year. In this paper we focus on hashing algorithms which involve estimating a distance measure d(xi,xj) between two vectors xi,xj. Such hashing algorithms require generation of random variables, and we propose two approaches to reduce the variance of our hashed estimates: control variates and maximum likelihood estimates. We explain how these approaches can be immediately applied to a wide subset of hashing algorithms. Further, we evaluate the impact of these methods on various datasets. We finally run empirical simulations to verify our results. © 2021 K. Kang, S. Kushnarev, W.P. Wong, R. Pratap, H. Yeo & Y. Chen.
About the journal
JournalProceedings of Machine Learning Research
PublisherML Research Press
ISSN26403498