Bayesian models for unit discovery on a very low resource language - Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Bayesian models for unit discovery on a very low resource language

Résumé

Developing speech technologies for low-resource languages has become a very active research field over the last decade. Among others, Bayesian models have shown some promising results on artificial examples but still lack of in situ experiments. Our work applies state-of-the-art Bayesian models to unsupervised Acoustic Unit Discovery (AUD) in a real low-resource language scenario. We also show that Bayesian models can naturally integrate information from other resourceful languages by means of informative prior leading to more consistent discovered units. Finally, discovered acoustic units are used, either as the 1-best sequence or as a lattice, to perform word segmentation. Word segmentation results show that this Bayesian approach clearly outperforms a Segmental-DTW baseline on the same corpus.
Fichier principal
Vignette du fichier
main.pdf (216.98 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01709589 , version 1 (15-02-2018)

Identifiants

Citer

Lucas Ondel, Pierre Godard, Laurent Besacier, Elin Larsen, Mark Hasegawa-Johnson, et al.. Bayesian models for unit discovery on a very low resource language. 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Apr 2018, Calgary, Alberta, Canada. ⟨10.1109/ICASSP.2018.8461545⟩. ⟨hal-01709589⟩
549 Consultations
1056 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More