Analysis and evaluation of Comparable Corpora for Under Resourced Areas of machine Translation

D2.6 Toolkit for multi-level alignment and information extraction from comparable corpora

Deliverable can be dowloaded from here.

 


D3.5 Tools for building comparable corpus from the Web

Deliverable can be dowloaded from here.

 

Free access is granted to the ACCURAT Toolkit (tools from both D2.6 and D3.5) after filling out the registration form below.



You are welcome to use the ACCURAT Toolkit (including the code) under the terms of the Apache 2.0 licence, however please acknowledge its use with a citation:

Pinnis, M., Ion, R., Ştefănescu, D., Su, F., Skadiņa, I., Vasiļjevs, A., & Babych, B. (2012). ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora. Proceedings of the ACL 2012 System Demonstrations (pp. 91–96). Association for Computational Linguistics. Jeju, South Korea

Skadiņa, I., Aker, A., Mastropavlos, N., Su, F., Tufiș, D., Verlic, M., Vasiļjevs, A., Babych, B., Clough, P., Gaizauskas, R., Glaros, N., Paramita, M. & Pinnis, M. (2012). Collecting and Using Comparable Corpora for Statistical Machine Translation. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) (pp. 438–445). European Language Resources Association (ELRA). Istanbul, Turkey.

If you would like to cite a particular tool included in the ACCURAT Toolkit, please refer to the documentation of the ACCURAT Toolkit (or the paper above) for specific tool references.


0