Follow
Sneha Kudugunta
Sneha Kudugunta
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
7652023
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
120*2022
Investigating Multilingual NMT Representations at Scale
SR Kudugunta, A Bapna, I Caswell, N Arivazhagan, O Firat
arXiv preprint arXiv:1909.02197, 2019
1082019
Beyond distillation: Task-level mixture-of-experts for efficient inference
S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin, MT Luong, O Firat
arXiv preprint arXiv:2110.03742, 2021
732021
Leveraging monolingual data with self-supervision for multilingual neural machine translation
A Siddhant, A Bapna, Y Cao, O Firat, M Chen, S Kudugunta, ...
arXiv preprint arXiv:2005.04816, 2020
732020
Mural: multimodal, multitask retrieval across languages
A Jain, M Guo, K Srinivasan, T Chen, S Kudugunta, C Jia, Y Yang, ...
arXiv preprint arXiv:2109.05125, 2021
682021
A loss curvature perspective on training instabilities of deep learning models
J Gilmer, B Ghorbani, A Garg, S Kudugunta, B Neyshabur, D Cardoze, ...
International Conference on Learning Representations, 2021
52*2021
Madlad-400: A multilingual and document-level large audited dataset
S Kudugunta, I Caswell, B Zhang, X Garcia, D Xin, A Kusupati, R Stella, ...
Advances in Neural Information Processing Systems 36, 2024
242024
Buffet: Benchmarking large language models for few-shot cross-lingual transfer
A Asai, S Kudugunta, XV Yu, T Blevins, H Gonen, M Reid, Y Tsvetkov, ...
arXiv preprint arXiv:2305.14857, 2023
15*2023
MatFormer: Nested Transformer for Elastic Inference
Devvrit*, S Kudugunta*, A Kusupati*, T Dettmers, K Chen, I Dhillon, ...
arXiv preprint arXiv:2310.07707, 2023
22023
Systems and methods for routing within multitask mixture-of-experts models
Y Huang, D Lepikhin, M Krikun, O Firat, A Bapna, T Luong, S Kudugunta
US Patent App. 17/159,437, 2022
12022
MiTTenS: A Dataset for Evaluating Misgendering in Translation
K Robinson, S Kudugunta, R Stella, S Dev, J Bastings
arXiv preprint arXiv:2401.06935, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–12