Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
Mar 5, 2020 · We prove regret guarantees for the problems of top arm identification, top m-arms identification, contextual modal bandits, and infinite ...
Robustness Guarantees for Mode Estimation with an Application to Bandits ... for mode estimation and (ii) a new application of mode es- timation the bandit ...
Robustness Guarantees · Reward Distributions · Privacy Guarantees · Perturbations · Multi-Armed Bandits · Regret Guarantees · Adversarial Corruptions · Bandit ...
Mar 5, 2020 · ... robustness and privacy guarantees for mode estimation and (ii) a new application of mode estimation the bandit problem, which we call modal ...
Robustness Guarantees for Mode Estimation with an Application to Bandits. Aldo Pacchiano, Heinrich Jiang, Michael Jordan. March 2020. PDF. Type. Conference ...
Sep 4, 2023 · Bibliographic details on Robustness Guarantees for Mode Estimation with an Application to Bandits.
We show in simulations that our algorithms are robust to perturbation of the arms by adversarial noise sequences, thus rendering modal bandits an attractive ...
Connected Papers is a visual tool to help researchers and applied scientists find academic papers relevant to their field of work.
Estimating Optimal Policy Value in General Linear Contextual Bandits. TMLR Jorunal ... Robustness Guarantees for Mode Estimation with an Application to Bandits.
Robustness Guarantees for Mode Estimation with an Application to Bandits. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n ...