The statistical complexity of interactive decision making DJ Foster, SM Kakade, J Qian, A Rakhlin arXiv preprint arXiv:2112.13487, 2021 | 156 | 2021 |
Exploration bonus for regret minimization in discrete and continuous average reward mdps J Qian, R Fruit, M Pirotta, A Lazaric Advances in Neural Information Processing Systems 32, 2019 | 41* | 2019 |
Importance resampling for off-policy prediction M Schlegel, W Chung, D Graves, J Qian, M White Advances in Neural Information Processing Systems 32, 2019 | 41 | 2019 |
Towards minimax optimal reinforcement learning in factored markov decision processes Y Tian, J Qian, S Sra Advances in Neural Information Processing Systems 33, 19896-19907, 2020 | 25 | 2020 |
Concentration inequalities for multinoulli random variables J Qian, R Fruit, M Pirotta, A Lazaric arXiv preprint arXiv:2001.11595, 2020 | 19 | 2020 |
Model-free reinforcement learning with the decision-estimation coefficient DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 16* | 2023 |
Byzantine-robust federated linear bandits A Jadbabaie, H Li, J Qian, Y Tian 2022 IEEE 61st Conference on Decision and Control (CDC), 5206-5213, 2022 | 13 | 2022 |
Convex and Non-Convex Optimization under Generalized Smoothness H Li, J Qian, Y Tian, A Rakhlin, A Jadbabaie arXiv preprint arXiv:2306.01264, 2023 | 8 | 2023 |
Robust learning under clean-label attack A Blum, S Hanneke, J Qian, H Shao Conference on Learning Theory, 591-634, 2021 | 7 | 2021 |
Online Estimation via Offline Estimation: An Information-Theoretic Framework DJ Foster, Y Han, J Qian, A Rakhlin arXiv preprint arXiv:2404.10122, 2024 | 1 | 2024 |