A Hybrid Model for Weakly-Supervised Speech Dereverberation

Louis Bahrman; Mathieu Fontaine; Gael Richard

Conference paper, Year: 2025

Louis Bahrman

Role: Author
PersonId : 1179676
IdHAL : louis-bahrman
ORCID : 0000-0002-4207-2067

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Mathieu Fontaine

Role: Author
PersonId : 13405
IdHAL : mathieu-fontaine
ORCID : 0000-0002-7657-6271
IdRef : 236886681

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Gael Richard

Role: Author
PersonId : 14146
IdHAL : gael-richard
IdRef : 094977208

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Abstract

This paper introduces a new training strategy to improve speech dereverberation systems using minimal acoustic information and reverberant (wet) speech. Most existing algorithms rely on paired dry/wet data, which is difficult to obtain, or on target metrics that may not adequately capture reverberation characteristics and can lead to poor results on non-target metrics. Our approach uses limited acoustic information, such as the reverberation time (RT60), to train a dereverberation system. The system's output is resynthesized using a generated room impulse response and compared with the original reverberant speech, providing a novel reverberation matching loss that replaces the standard target metrics. During inference, only the trained dereverberation model is used. Experimental results demonstrate that our method achieves more consistent performance than state-of-the-art approaches across the various objective metrics used in speech dereverberation.
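
The resynthesis step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify how the room impulse response is generated or how the loss is computed, so the sketch assumes an exponentially decaying white-noise impulse response parameterized only by RT60 and a plain time-domain mean squared error. The names `synth_rir` and `reverberation_matching_loss` are hypothetical.

```python
import numpy as np


def synth_rir(rt60, sr=16000, seed=0):
    """Assumed RIR model: white noise under an exponential envelope
    whose level has dropped by 60 dB after rt60 seconds."""
    rng = np.random.default_rng(seed)
    n = int(round(sr * rt60))
    t = np.arange(n) / sr
    # Amplitude factor of 10**-3 (i.e. -60 dB) reached at t = rt60.
    envelope = np.exp(-3.0 * np.log(10.0) * t / rt60)
    rir = rng.standard_normal(n) * envelope
    return rir / np.linalg.norm(rir)


def reverberation_matching_loss(dry_estimate, wet, rir):
    """Re-reverberate the dry estimate and compare it with the wet input.
    The mean squared error is an illustrative choice, not the paper's loss."""
    resynth = np.convolve(dry_estimate, rir)[: len(wet)]
    return float(np.mean((resynth - wet) ** 2))


# Toy usage: random noise stands in for the output of a dereverberation model.
rng = np.random.default_rng(1)
dry = rng.standard_normal(4000)
rir = synth_rir(rt60=0.5)
wet = np.convolve(dry, rir)[: len(dry)]
loss = reverberation_matching_loss(dry, wet, rir)
```

In an actual training loop the same computation would have to be written in a differentiable framework so that gradients reach the dereverberation model; only that model would be kept at inference, as the abstract states.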

Keywords

Speech dereverberation; Hybrid deep learning; Reverberation modeling; Speech processing

Domains

Signal and image processing [eess.SP]; Artificial intelligence [cs.AI]

Main file

camera_ready.pdf (654)

Origin: Files produced by the author(s)

https://hal.science/hal-04931672

Submitted on: Thursday, February 6, 2025, 00:50:38

Last modified on: Friday, February 7, 2025, 03:31:26

Dates and versions

hal-04931672 , version 1 (06-02-2025)

Identifiers

HAL Id : hal-04931672 , version 1

Cite

Louis Bahrman, Mathieu Fontaine, Gael Richard. A Hybrid Model for Weakly-Supervised Speech Dereverberation. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Apr 2025, Hyderabad, India. ⟨hal-04931672⟩

Collections

GENCI LTCI IDS S2A IP_PARIS INSTITUT-MINES-TELECOM
