default search action
Santiago Pascual
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c32]Heng Wang, Jianbo Ma, Santiago Pascual, Richard Cartwright, Weidong Cai:
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models. AAAI 2024: 15492-15501 - [c31]Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà:
GASS: Generalizing Audio Source Separation with Large-Scale Data. ICASSP 2024: 546-550 - [i29]Ioannis Tsiamas, Santiago Pascual, Chunghsin Yeh, Joan Serrà:
Sequential Contrastive Audio-Visual Learning. CoRR abs/2407.05782 (2024) - [i28]Santiago Pascual, Chunghsin Yeh, Ioannis Tsiamas, Joan Serrà:
Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity. CoRR abs/2407.10387 (2024) - [i27]Xiaoyu Liu, Xu Li, Joan Serrà, Santiago Pascual:
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility. CoRR abs/2409.09357 (2024) - 2023
- [c30]Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini:
Upsampling Layers for Music Source Separation. EUSIPCO 2023: 311-315 - [c29]Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà:
Full-Band General Audio Synthesis with Score-Based Diffusion. ICASSP 2023: 1-5 - [c28]Emilian Postolache, Jordi Pons, Santiago Pascual, Joan Serrà:
Adversarial Permutation Invariant Training for Universal Sound Separation. ICASSP 2023: 1-5 - [c27]Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle:
Mono-to-Stereo Through Parametric Stereo Generation. ISMIR 2023: 304-310 - [c26]Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. WASPAA 2023: 1-5 - [i26]Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley:
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models. CoRR abs/2306.09635 (2023) - [i25]Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle:
Mono-to-stereo through parametric stereo generation. CoRR abs/2306.14647 (2023) - [i24]Heng Wang, Jianbo Ma, Santiago Pascual, Richard Cartwright, Weidong Cai:
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models. CoRR abs/2308.09300 (2023) - [i23]Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà:
GASS: Generalizing Audio Source Separation with Large-scale Data. CoRR abs/2310.00140 (2023) - 2022
- [c25]Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà:
On Loss Functions and Evaluation Metrics for Music Source Separation. ICASSP 2022: 306-310 - [i22]Enric Gusó, Jordi Pons, Santiago Pascual, Joan Serrà:
On loss functions and evaluation metrics for music source separation. CoRR abs/2202.07968 (2022) - [i21]Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, Davide Scaini:
Universal Speech Enhancement with Score-based Diffusion. CoRR abs/2206.03065 (2022) - [i20]Emilian Postolache, Jordi Pons, Santiago Pascual, Joan Serrà:
Adversarial Permutation Invariant Training for Universal Sound Separation. CoRR abs/2210.12108 (2022) - [i19]Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà:
Full-band General Audio Synthesis with Score-based Diffusion. CoRR abs/2210.14661 (2022) - 2021
- [c24]Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects. ICASSP 2021: 71-75 - [c23]Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: Semi-Supervised Learning for Speech Quality Assessment. ICASSP 2021: 381-385 - [c22]Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling Artifacts in Neural Audio Synthesis. ICASSP 2021: 3005-3009 - [c21]Santiago Pascual, Joan Serrà, Jordi Pons:
Adversarial Auto-Encoding for Packet Loss Concealment. WASPAA 2021: 71-75 - [i18]Joan Serrà, Santiago Pascual, Jordi Pons:
On tuning consistent annealed sampling for denoising score matching. CoRR abs/2104.03725 (2021) - [i17]Santiago Pascual, Joan Serrà, Jordi Pons:
Adversarial Auto-Encoding for Packet Loss Concealment. CoRR abs/2107.03100 (2021) - [i16]Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini:
Upsampling layers for music source separation. CoRR abs/2111.11773 (2021) - 2020
- [c20]Tina Raissi, Santiago Pascual, Maurizio Omologo:
Sample drop detection for asynchronous devices distributed in space. EUSIPCO 2020: 815-819 - [c19]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-Task Self-Supervised Learning for Robust Speech Recognition. ICASSP 2020: 6989-6993 - [c18]Baybars Külebi, Alp Öktem, Alex Peiró Lilja, Santiago Pascual, Mireia Farrús:
CATOTRON - A Neural Text-to-Speech System in Catalan. INTERSPEECH 2020: 490-491 - [i15]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-task self-supervised learning for Robust Speech Recognition. CoRR abs/2001.09239 (2020) - [i14]Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: semi-supervised learning for speech quality assessment. CoRR abs/2010.00368 (2020) - [i13]Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic multitrack mixing with a differentiable mixing console of neural audio effects. CoRR abs/2010.10291 (2020) - [i12]Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling artifacts in neural audio synthesis. CoRR abs/2010.14356 (2020)
2010 – 2019
- 2019
- [j1]Santiago Pascual, Joan Serrà, Antonio Bonafonte:
Time-domain speech enhancement using generative adversarial networks. Speech Commun. 114: 10-21 (2019) - [c17]Amanda Cardoso Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giró-i-Nieto:
WAV2PIX: Speech-conditioned Face Generation using Generative Adversarial Networks. CVPR Workshops 2019: 21-24 - [c16]Amanda Cardoso Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giró-i-Nieto:
Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks. ICASSP 2019: 8633-8637 - [c15]Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio:
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks. INTERSPEECH 2019: 161-165 - [c14]Santiago Pascual, Joan Serrà, Antonio Bonafonte:
Towards Generalized Speech Enhancement with Generative Adversarial Networks. INTERSPEECH 2019: 1791-1795 - [c13]Joan Serrà, Santiago Pascual, Carlos Segura:
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion. NeurIPS 2019: 6790-6800 - [c12]David Álvarez, Santiago Pascual, Antonio Bonafonte:
Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN. SSW 2019: 35-39 - [i11]Amanda Cardoso Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giró-i-Nieto:
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks. CoRR abs/1903.10195 (2019) - [i10]Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio:
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks. CoRR abs/1904.03416 (2019) - [i9]Santiago Pascual, Joan Serrà, Antonio Bonafonte:
Towards Generalized Speech Enhancement with Generative Adversarial Networks. CoRR abs/1904.03418 (2019) - [i8]David Álvarez, Santiago Pascual, Antonio Bonafonte:
Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN. CoRR abs/1906.00733 (2019) - [i7]Joan Serrà, Santiago Pascual, Carlos Segura:
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion. CoRR abs/1906.00794 (2019) - [i6]Tina Raissi, Santiago Pascual, Maurizio Omologo:
Sample Drop Detection for Distant-speech Recognition with Asynchronous Devices Distributed in Space. CoRR abs/1911.06713 (2019) - 2018
- [c11]Joan Serrà, Santiago Pascual, Alexandros Karatzoglou:
Towards a Universal Neural Network Encoder for Time Series. CCIA 2018: 120-129 - [c10]Oriol Barbany, Antonio Bonafonte, Santiago Pascual:
Multi-Speaker Neural Vocoder. IberSPEECH 2018: 30-34 - [c9]Santiago Pascual, Antonio Bonafonte, Joan Serrà, José Andrés González López:
Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks. IberSPEECH 2018: 117-121 - [c8]Santiago Pascual, Antonio Bonafonte, Joan Serrà:
Self-Attention Linguistic-Acoustic Decoder. IberSPEECH 2018: 152-156 - [c7]Santiago Pascual, Maruchan Park, Joan Serrà, Antonio Bonafonte, Kang-Hun Ahn:
Language and Noise Transfer in Speech Enhancement Generative Adversarial Network. ICASSP 2018: 5019-5023 - [c6]Antonio Bonafonte, Santiago Pascual, Georgina Dorca:
Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder. INTERSPEECH 2018: 1998-2001 - [i5]Joan Serrà, Santiago Pascual, Alexandros Karatzoglou:
Towards a universal neural network encoder for time series. CoRR abs/1805.03908 (2018) - [i4]Santiago Pascual, Antonio Bonafonte, Joan Serrà:
Self-Attention Linguistic-Acoustic Decoder. CoRR abs/1808.10678 (2018) - [i3]Santiago Pascual, Antonio Bonafonte, Joan Serrà, José A. González:
Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks. CoRR abs/1808.10687 (2018) - 2017
- [c5]Santiago Pascual, Antonio Bonafonte, Joan Serrà:
SEGAN: Speech Enhancement Generative Adversarial Network. INTERSPEECH 2017: 3642-3646 - [i2]Santiago Pascual, Antonio Bonafonte, Joan Serrà:
SEGAN: Speech Enhancement Generative Adversarial Network. CoRR abs/1703.09452 (2017) - [i1]Santiago Pascual, Maruchan Park, Joan Serrà, Antonio Bonafonte, Kang-Hun Ahn:
Language and Noise Transfer in Speech Enhancement Generative Adversarial Network. CoRR abs/1712.06340 (2017) - 2016
- [c4]Igor Jauk, Antonio Bonafonte, Santiago Pascual:
Acoustic feature prediction from semantic features for expressive speech using deep neural networks. EUSIPCO 2016: 2320-2324 - [c3]Santiago Pascual, Antonio Bonafonte:
Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation. EUSIPCO 2016: 2325-2329 - [c2]Santiago Pascual, Antonio Bonafonte:
Prosodic Break Prediction with RNNs. IberSPEECH 2016: 64-72 - [c1]Santiago Pascual, Antonio Bonafonte:
Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model. SSW 2016: 112-117
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-15 00:26 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint