Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/142480
Title: Optimal algorithms for selecting top-k combinations of attributes : theory and applications
Authors: Lin, Chunbin
Lu, Jiaheng
Wei, Zhewei
Wang, Jianguo
Xiao, Xiaokui
Keywords: Engineering::Computer science and engineering
Issue Date: 2017
Source: Lin, C., Lu, J., Wei, Z., Wang, J., & Xiao, X. (2018). Optimal algorithms for selecting top-k combinations of attributes : theory and applications. The VLDB Journal, 27(1), 27-52. doi:10.1007/s00778-017-0485-2
Journal: The VLDB Journal
Abstract: Traditional top-k algorithms, e.g., TA and NRA, have been successfully applied in many areas such as information retrieval, data mining and databases. They are designed to discover k objects, e.g., top-k restaurants, with highest overall scores aggregated from different attributes, e.g., price and location. However, new emerging applications like query recommendation require providing the best combinations of attributes, instead of objects. The straightforward extension based on the existing top-k algorithms is prohibitively expensive to answer top-k combinations because they need to enumerate all the possible combinations, which is exponential to the number of attributes. In this article, we formalize a novel type of top-k query, called top-k, m, which aims to find top-k combinations of attributes based on the overall scores of the top-m objects within each combination, where m is the number of objects forming a combination. We propose a family of efficient top-k, m algorithms with different data access methods, i.e., sorted accesses and random accesses and different query certainties, i.e., exact query processing and approximate query processing. Theoretically, we prove that our algorithms are instance optimal and analyze the bound of the depth of accesses. We further develop optimizations for efficient query evaluation to reduce the computational and the memory costs and the number of accesses. We provide a case study on the real applications of top-k, m queries for an online biomedical search engine. Finally, we perform comprehensive experiments to demonstrate the scalability and efficiency of top-k, m algorithms on multiple real-life datasets.
URI: https://hdl.handle.net/10356/142480
ISSN: 1066-8888
DOI: 10.1007/s00778-017-0485-2
Schools: School of Computer Science and Engineering 
Rights: © 2017 Springer-Verlag GmbH Germany. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SCSE Journal Articles

SCOPUSTM   
Citations 50

8
Updated on Mar 12, 2025

Web of ScienceTM
Citations 20

5
Updated on Oct 30, 2023

Page view(s)

276
Updated on Mar 17, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.