Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175416
Title: Multi-armed linear bandits with latent biases
Authors: Kang, Qiyu
Tay, Wee Peng
She, Rui
Wang, Sijie
Liu, Xiaoqian
Yang, Yuan-Rui
Keywords: Computer and Information Science
Issue Date: 2024
Source: Kang, Q., Tay, W. P., She, R., Wang, S., Liu, X. & Yang, Y. (2024). Multi-armed linear bandits with latent biases. Information Sciences, 660, 120103. https://dx.doi.org/10.1016/j.ins.2024.120103
Project: MOE-T2EP20220-0002 
Journal: Information Sciences 
Abstract: In a linear stochastic bandit model, each arm corresponds to a vector in Euclidean space, and the expected return observed at each time step is determined by an unknown linear function of the selected arm. This paper addresses the challenge of identifying the optimal arm in a linear stochastic bandit model, where latent biases corrupt each arm's expected reward. Unlike traditional linear bandit problems, where the observed return directly represents the reward, this paper considers a scenario where the unbiased reward at each time step remains unobservable. This model is particularly relevant in situations where the observed return is influenced by latent biases that need to be carefully excluded from the learning model. For example, in recommendation systems designed to prevent racially discriminatory suggestions, it is crucial to ensure that the users' race does not influence the system. However, the observed return, such as click-through rates, may have already been influenced by racial attributes. In the case where there are finitely many arms, we develop a strategy to achieve $O(|D| \log n)$ regret, where $|D|$ is the number of arms and $n$ is the number of time steps. In the case where each arm is chosen from an infinite compact set, our strategy achieves $O(n^{2/3} (\log n)^{1/2})$ regret. Experiments verify the efficiency of our strategy.
URI: https://hdl.handle.net/10356/175416
ISSN: 0020-0255
DOI: 10.1016/j.ins.2024.120103
Schools: School of Electrical and Electronic Engineering 
Rights: © 2024 Elsevier Inc. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1016/j.ins.2024.120103.
Fulltext Permission: embargo_20260407
Fulltext Availability: With Fulltext
Appears in Collections: EEE Journal Articles

Files in This Item:
File: V18.pdf
Description: Until 2026-04-07
Size: 904.94 kB
Format: Adobe PDF
Availability: Under embargo until Apr 07, 2026

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.