The details of the shared task on cleaning of translation memories organised at the 2nd Workshop on Natural Language Processing for Translation Memories have been published. We hope it will be a successful task given the amount of traffic the page has received in the first 24h since it was announced.
The second call for papers for the 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM 2016) to be organised in conjunction with LREC 2016 has been distributed. The deadline for paper submission is in 2 week. For more details please visit the workshop’s web page.
- Constantin Orasan (University of Wolverhampton, UK)
- Marcello Federico (FBK, Italy)
Submission deadline: May 15, 2016
1. Call For Papers
Translation Memories (TM) are amongst the most widely used tools by professional translators. The underlying idea of TMs is that a translator should benefit as much as possible from previous translations by being able to retrieve the way in which a similar sentence was translated before. Moreover, the usage of TMs aims to guarantee that new translations follow the client’s specified style and terminology. Despite the fact that the core idea of these systems relies on comparing segments (typically of sentence length) from the document to be translated with segments from previous translations, most of the existing TM systems hardly use any language processing for this. Instead of addressing this issue, most of the work on translation memories focused on improving the user experience by allowing processing of a variety of document formats, intuitive user interfaces, etc. Continue reading
The Research Group in Computational Linguistics is looking to develop a new Masters course in Corpus Lexicography. To ensure we get the content just right we need you! Please help us by giving your input via the following short questionnaire. Thank you!
The advanced translation memory tool developed by Rohit Gupta is now available on Github at https://github.com/rohitguptacs/TMAdvanced
Current Translation Memory (TM) systems work at the surface level and lack semantic knowledge while matching. This tool implements an approach to incorporating semantic knowledge in the form of paraphrasing in matching and retrieval. Most of the TMs use Levenshtein edit- distance or some variation of it. This tool implements an efficient approach to incorporating paraphrasing with edit-distance. The approach is based on greedy approximation and dynamic programming. We have obtained significant improvement in both retrieval and translation of retrieved segments. More details about the approach and evaluations given in the following publications:
Approach: Rohit Gupta and Constantin Orasan. 2014. Incorporating Paraphrasing in Translation Memory Matching and Retrieval. In Proceedings of the European Association of Machine Translation (EAMT-2014).
Human Evaluations: Rohit Gupta, Constantin Orasan, Marcos Zampieri, Mihaela Vela and Josef van Genabith. 2015. Can Transfer Memories afford not to use paraphrasing? In Proceeding of EAMT-2015, Antalya Turkey.
The tool was developed part of the EXPERT project.
We are delighted to announce that Prof. Michael Zock of Aix-Marseille Universite was recently awarded an honorary professorship at the University of Wolverhampton, following successful nomination by Prof. Ruslan Mitkov, Director of RIILP.
Prof. Zock’s research interests lie in communication, cognitive science and language production or language generation. Starting from user needs and empirical findings (psycholinguistics, neurosciences), his work involves building tools to help people acquire the skill of speaking or writing in a foreign language and in their mother tongue. Continue reading
If you are considering applying for a Masters or PhD programme with the Research Group in Computational Linguistics, you might like to come along to the University’s next Postgraduate Open Evening on Wednesday 16th September from 5pm – 7.30pm. A member of staff will be on hand to discuss our courses, as well as the areas that the Group specialises in. You can also find out more about the different levels of postgraduate study, how your research interests might fit in with the Group’s work, as well as the potential career opportunities that a degree in this field might open up. There will also be other stands where you can enquire about the application process and how to finance your study. Continue reading
The European Society of Phraseology (EUROPHRAS) organised its 2015 conference in Malaga, Spain, from the 29th June to the 1st July 2015. The topic of the conference was ‘Computerised an Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives’. Continue reading
The 2nd Call for Papers of the Workshop on Natural Language Processing for Translation Memories (NLP4TM) organised at RANLP 2015 by Constantin Orasan and Rohit Gupta has been published. Information about the topics addressed by the workshop and important dates can be found on the workshop’s webpage.
Research carried out in the EXPERT project between researchers from University of Wolverhampton and Saarland University, Germany is being presented at the European Association for Machine Translation 2015 conference. The work shows how paraphrasing can help the task of translators who use translation memories. Continue reading