Cited 2 time in
BRB-KMeans: Enhancing Binary Data Clustering for Binary Product Quantization
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Suwon | - |
| dc.contributor.author | Choi, Sang-Min | - |
| dc.date.accessioned | 2024-12-03T05:00:42Z | - |
| dc.date.available | 2024-12-03T05:00:42Z | - |
| dc.date.issued | 2024-07 | - |
| dc.identifier.issn | 0000-0000 | - |
| dc.identifier.uri | https://scholarworks.gnu.ac.kr/handle/sw.gnu/74200 | - |
| dc.description.abstract | In Binary Product Quantization (BPQ), where product quantization is applied to binary data, the traditional k-majority method is used for clustering, with centroids determined based on Hamming distance and majority vote for each bit. However, this approach often leads to a degradation in clustering quality, negatively impacting BPQ's performance. To address these challenges, we introduce Binary-to-Real-and-Back K-Means (BRB-KMeans), a novel method that initially transforms binary data into real-valued vectors, performs k-means clustering on these vectors, and then converts the generated centroids back into binary data. This innovative approach significantly enhances clustering quality by leveraging the high clustering quality of k-means in the real-valued vector space, thereby facilitating future quantization for binary data. Through extensive experiments, we demonstrate that BRB-KMeans significantly enhances clustering quality and overall BPQ performance, notably outperforming traditional methods. © 2024 Owner/Author. | - |
| dc.format.extent | 5 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computing Machinery, Inc | - |
| dc.title | BRB-KMeans: Enhancing Binary Data Clustering for Binary Product Quantization | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1145/3626772.3657898 | - |
| dc.identifier.scopusid | 2-s2.0-85200546356 | - |
| dc.identifier.wosid | 001273410002042 | - |
| dc.identifier.bibliographicCitation | SIGIR 2024 - Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 2306 - 2310 | - |
| dc.citation.title | SIGIR 2024 - Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval | - |
| dc.citation.startPage | 2306 | - |
| dc.citation.endPage | 2310 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordAuthor | binary clustering | - |
| dc.subject.keywordAuthor | binary data | - |
| dc.subject.keywordAuthor | binary vector | - |
| dc.subject.keywordAuthor | product quantization | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Gyeongsang National University Central Library, 501, Jinju-daero, Jinju-si, Gyeongsangnam-do, 52828, Republic of Korea+82-55-772-0532
COPYRIGHT 2022 GYEONGSANG NATIONAL UNIVERSITY LIBRARY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
