Please use this identifier to cite or link to this item:
Title: Selecting the right search term in query-based systems for deduplication
Authors: Jele, Harald
Keywords: Library and information science
Issue Date: 2013
Source: Jele, H. (2013). Selecting the right search term in query-based systems for deduplication. Library and Information Science Research E-Journal, 23(2), 1-13.
Journal: Library and Information Science Research E-Journal 
Abstract: Essentially three approaches could be identified when choosing a proper search term to detect bibliographic duplicates. Stop words are excluded in all of them, then (1) just the first term of an entry will be selected or (2) that term is selected, which produces the smallest number of hits or finally (3) that term will be used, which has a certain number of hits below a defined threshold. These three procedures are compared with each other here. The results derive from series of measurements done with bibliographic data from the Austrian Central Catalog.
ISSN: 1058-6768
DOI: 10.32655/LIBRES.2013.2.1
Rights: © 2013 Harald Jele. All rights reserved.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:Library and Information Science Research E-journal (LIBRES)

Files in This Item:
File Description SizeFormat 
NTU-Jele_2012_anfragebegriff_main(1).pdf572.16 kBAdobe PDFThumbnail

Page view(s)

Updated on Jan 18, 2022


Updated on Jan 18, 2022

Google ScholarTM




Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.