Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/180718
Title: Open-vocabulary object detection via debiased curriculum self-training
Authors: Zhang, Hanlue
Guan, Dayan
Ke, Xiangrui
El Saddik, Abdulmotaleb
Lu, Shijian
Keywords: Computer and Information Science
Issue Date: 2024
Source: Zhang, H., Guan, D., Ke, X., El Saddik, A. & Lu, S. (2024). Open-vocabulary object detection via debiased curriculum self-training. Expert Systems With Applications, 255, 124762-. https://dx.doi.org/10.1016/j.eswa.2024.124762
Journal: Expert Systems with Applications 
Abstract: Open-vocabulary object detection aims to train a detector capable of recognizing various novel classes. Most existing studies exploit image-level weak supervision to generate pseudo object boxes for novel class training. However, the generated pseudo boxes are often noisy and biased towards base classes, leading to sub-optimal open-vocabulary detectors. We propose DCS, a novel Debiased Curriculum Self-Training technique that generates pseudo object boxes progressively and adaptively for training accurate open-vocabulary detectors. DCS consists of two complementary designs, namely, progressive pseudo-label filtering (PPF) and adaptive pseudo-label selection (APS). Specifically, PPF discards confident but mismatched detection progressively at the early training stage when the trained detector is biased towards the base classes, APS instead fuses class-aware and class-agnostic pseudo labels by prioritizing class-aware pseudo labels at the late training stage when the detector can better recognize novel classes. Without bells and whistles, DCS achieves superior detection performance over two open-vocabulary detection benchmarks.
URI: https://hdl.handle.net/10356/180718
ISSN: 0957-4174
DOI: 10.1016/j.eswa.2024.124762
Schools: College of Computing and Data Science 
Rights: © 2024 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:CCDS Journal Articles

Page view(s)

84
Updated on May 7, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.