Anything I do that may help others, I'll post it here.
Tuesday, April 15, 2014
Europarl corpus v.7 en-fr word-aligned with GIZA++
Finally, I finished aligning the Europarl corpus with GIZA++. Since this took me several days, I thought some people would be happy the find directly the word-aligned version online (saving processor power consumption at the same time!). So here it is, along with the config file that produced it. The source language is English, the target language is French. I basically followed instructions given here (many thanks to the author!).