Stochastic Low-Rank Bandits

Kveton, Branislav; Szepesvari, Csaba; Rao, Anup; Wen, Zheng; Abbasi-Yadkori, Yasin; Muthukrishnan, S.

Computer Science > Machine Learning

arXiv:1712.04644 (cs)

[Submitted on 13 Dec 2017]

Title:Stochastic Low-Rank Bandits

Authors:Branislav Kveton, Csaba Szepesvari, Anup Rao, Zheng Wen, Yasin Abbasi-Yadkori, S. Muthukrishnan

View PDF

Abstract:Many problems in computer vision and recommender systems involve low-rank matrices. In this work, we study the problem of finding the maximum entry of a stochastic low-rank matrix from sequential observations. At each step, a learning agent chooses pairs of row and column arms, and receives the noisy product of their latent values as a reward. The main challenge is that the latent values are unobserved. We identify a class of non-negative matrices whose maximum entry can be found statistically efficiently and propose an algorithm for finding them, which we call LowRankElim. We derive a $\DeclareMathOperator{\poly}{poly} O((K + L) \poly(d) \Delta^{-1} \log n)$ upper bound on its $n$-step regret, where $K$ is the number of rows, $L$ is the number of columns, $d$ is the rank of the matrix, and $\Delta$ is the minimum gap. The bound depends on other problem-specific constants that clearly do not depend $K L$. To the best of our knowledge, this is the first such result in the literature.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1712.04644 [cs.LG]
	(or arXiv:1712.04644v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1712.04644

Submission history

From: Branislav Kveton [view email]
[v1] Wed, 13 Dec 2017 07:59:48 UTC (34 KB)

Computer Science > Machine Learning

Title:Stochastic Low-Rank Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Stochastic Low-Rank Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators