Europarl-ST is a multilingual speech translation corpus of paired audio-text samples, built from the debates held in the European Parliament between 2008 and 2012. https://mllp.upv.es/europarl-st/

Gonçal V. Garcés Díaz-Munío 2e70c2d629 First version 2 years ago

Europarl-ST

Europarl-ST is a multilingual Spoken Language Translation (SLT) corpus containing paired audio-text samples for translation from and into 9 European languages, for a total of 72 translation directions. The corpus was compiled from the debates held in the European Parliament between 2008 and 2012.
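
The figure of 72 directions follows from counting ordered source-target pairs of distinct languages (9 × 8). A minimal sketch of that count is below; the language labels are placeholders chosen for illustration, since this README does not list the nine languages.

```python
from itertools import permutations

# Placeholder labels (an assumption): the README does not name the nine languages.
languages = [f"lang{i}" for i in range(1, 10)]

# A translation direction is an ordered (source, target) pair of distinct languages.
directions = list(permutations(languages, 2))

print(len(directions))  # 9 * 8 = 72
```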

Get the corpus

You can download the corpus from: https://mllp.upv.es/europarl-st/