Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021 | 26 | 2021 |
Creating new voices using normalizing flows P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ... arXiv preprint arXiv:2312.14569, 2023 | 19 | 2023 |
Text-free non-parallel many-to-many voice conversion using normalising flow T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ... ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
Varying speaking styles with neural text-to-speech T Wood, T Merritt Amazon Science, 2018 | 11 | 2018 |
Enhancing audio quality for expressive Neural Text-to-Speech A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ... arXiv preprint arXiv:2108.06270, 2021 | 9 | 2021 |
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows A Ezzerg, T Merritt, K Yanagisawa, P Bilinski, M Proszewska, K Pokora, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 984-990, 2023 | 4 | 2023 |
On granularity of prosodic representations in expressive text-to-speech M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov 2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023 | 2 | 2023 |
Cross-lingual knowledge distillation via flow-based voice conversion for robust polyglot text-to-speech D Piotrowski, R Korzeniowski, A Falai, S Cygert, K Pokora, G Tinchev, ... International Conference on Neural Information Processing, 252-264, 2023 | | 2023 |
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ... arXiv preprint arXiv:2307.16679, 2023 | | 2023 |