MLLP research group (VRAIN, Universitat Politècnica de València)
Europarl-ST is a multilingual speech translation corpus of paired audio-text samples, built from European Parliament debates held between 2008 and 2012.
Updated 6 months ago
Early software by MLLP researchers (2010-2015): AK, GIDOC, jaf_Tools, Bilingual Text Classification.
Updated 1 year ago
This repository contains the code for the paper "Stream-level Latency Evaluation for Simultaneous Machine Translation".
Updated 1 year ago
This repository contains the code for the segmentation system proposed in "Direct Segmentation Models for Streaming Speech Translation".
Updated 1 year ago