Artículos
- Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E. Unsupervised adaptation of deep speech activity detection models to unseen domains. APPLIED SCIENCES (SWITZERLAND). 2022
- Mingote, Victoria; Viñals, Ignacio; Gimeno, Pablo; Miguel, Antonio; Ortega, Alfonso; Lleida, Eduardo. Multimodal Diarization Systems by Training Enrollment Models as Identity Representations. APPLIED SCIENCES (SWITZERLAND). 2022
- Gimeno, P; Mingote, V; Ortega, A; Miguel, A; Lleida, E. Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data. IEEE SIGNAL PROCESSING LETTERS. 2021
- Gimeno, P.; Mingote, V.; Ortega, A.; Miguel, A.; Lleida, E. Partial AUC optimisation using recurrent neural networks for music detection with limited training data. INTERSPEECH (USB). 2020
- Gimeno, Pablo; Viñals, Ignacio; Ortega, Alfonso; Miguel, Antonio; Lleida, Eduardo. Multiclass audio segmentation based on recurrent neural networks for broadcast domain data. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING. 2020
- Viñals, I.; Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E. Vivolab speaker diarization system for the Dihard 2019 challenge. INTERSPEECH (USB). 2019
- Viñals, I.; Ribas, D.; Mingote, V.; Llombart, J.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E. Phonetically-aware embeddings, wide residual networks with time-delay neural networks and self attention models for the 2018 NIST speaker recognition evaluation. INTERSPEECH (USB). 2019
Proyectos
- ESPERANTO / Exchanges for SPEech ReseArch aNd TechnOlogies (G.A. No. 101007666). 01/01/21 - 31/12/24
- T36_20R: Vivolab. 01/01/20 - 31/12/22
Contratos
- LABORATORIO DE TECNOLOGÍAS DEL HABLA. 01/11/15 - 31/10/25
Participaciones en congresos
- Iberspeech 2016. Participativo - Ponencia oral (comunicación oral). Automatic Text-to-Audio Alignment of Multimedia Broadcast Content. Lisboa. 20/11/16
Estancias
- ELSA Corp. Lisboa. Portugal. 06/01/19 - 14/04/19
Docencia UNIZAR de los últimos seis cursos
|