‪Kamil Pokora‬ - ‪Google Scholar‬

Cited by

	All	Since 2019
Citations	86	86
h-index	5	5
i10-index	4	4

Citations per year: 2019: 3, 2020: 1, 2021: 3, 2022: 31, 2023: 37, 2024: 11

Kamil Pokora

Applied Scientist at Amazon

Verified email at amazon.com

speech processing speech synthesis text-to-speech Deep learning Machine learning


Title	Cited by	Year
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021	26	2021
Creating new voices using normalizing flows P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ... arXiv preprint arXiv:2312.14569, 2023	19	2023
Text-free non-parallel many-to-many voice conversion using normalising flow T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
Varying speaking styles with neural text-to-speech T Wood, T Merritt Amazon Science, 2018	11	2018
Enhancing audio quality for expressive Neural Text-to-Speech A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ... arXiv preprint arXiv:2108.06270, 2021	9	2021
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows A Ezzerg, T Merritt, K Yanagisawa, P Bilinski, M Proszewska, K Pokora, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 984-990, 2023	4	2023
On granularity of prosodic representations in expressive text-to-speech M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov 2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023	2	2023
Cross-lingual knowledge distillation via flow-based voice conversion for robust polyglot text-to-speech D Piotrowski, R Korzeniowski, A Falai, S Cygert, K Pokora, G Tinchev, ... International Conference on Neural Information Processing, 252-264, 2023		2023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ... arXiv preprint arXiv:2307.16679, 2023		2023