Matxin is Transfer-based MT system from Spanish into Basque. It is an open, reusable and interoperable framework which can be improved in the next future combining it with the statistical model. The MT architecture reuses several open tools and it is based on an unique XML format for the flow between the different modules, which makes easer the interaction among different developers of tools and resources. Being Basque a resource-poor language this is a key feature in our aim for future improvements and extensions of the engine. The result is an open source software which can be downloaded from matxin.sourceforge.net, and we think it could be adapted to translating between other languages with few resources.
MT form Spanish to Basque. English to Basque in progress.
C++. It uses Freeling for analyzing Spanish or English and Lttoolbox-Apertium for access to the dictionaries. It reuses Basque morphology for morphological generation.
Lexical transfer, structural transfer, syntactical generation and morphological generation.
It is the first MT system for Basque.
OpenTrad and EurOpenTrad projects (Profit).
Alegria I., Díaz de Ilarraza A., Labaka G., Lersundi M., Mayor A., Sarasola K. Transfer-based MT from Spanish into Basque: reusability, standardization and open source. LNCS 4394. 374-384. Cicling 2007. ISBN-10: 3-540-70938-X ISSN 0302-9734. 2007.