Maximum entropy gain exploration for long horizon multi-goal reinforcement learning S Pitis, H Chan, S Zhao, B Stadie, J Ba International Conference on Machine Learning, 7750-7761, 2020 | 128 | 2020 |
Proximal learning with opponent-learning awareness S Zhao, C Lu, RB Grosse, J Foerster Advances in Neural Information Processing Systems 35, 26324-26336, 2022 | 21 | 2022 |
Joint energy-based models for semi-supervised classification S Zhao, JH Jacobsen, W Grathwohl ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning 1, 2020 | 18 | 2020 |
Probabilistic inference in language models via twisted sequential monte carlo S Zhao, R Brekelmans, A Makhzani, R Grosse arXiv preprint arXiv:2404.17546, 2024 | 7 | 2024 |
Reproducing" Are Sixteen Heads Really Better than One?" S Zhao, S Yuan | | |
Layer-Wise Contrastive Unsupervised Representation Learning S Zhao | | |