selected

A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis
RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Speech dereverberation constrained on room impulse response characteristics
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation