Bimodal Corpora Terminology Extraction: Another Brick in the Wall Paper

Mihaila, C. and Mekhaldi, D. (2009) Bimodal Corpora Terminology Extraction: Another Brick in the Wall Paper. In Proceedings of RANLP'2009, Borovetz, Bulgaria, Sept 14-16, pp. 236-240.

Abstract

This paper presents a new study on automatic terminology extraction in the context of bimodal corpora generated from lectures and meetings. More specifically, the study aims to observe to what extent written text (discussed documents) and spoken text (dialogue transcripts) share keywords. Using a hybrid terminology extraction approach, experiments were performed on a collection of bimodal English corpora, comprising one corpus of scientific conference presentations and two corpora of decision-making meetings. The evaluation results highlight a difference between keywords extracted from written text and those extracted from spoken text. Moreover, the results emphasise the importance of considering text obtained from different modalities in order to generate rich and consistent keyword lists for bimodal corpora.

Electronic version

http://clg.wlv.ac.uk/papers/mihaila-RANLP-09.pdf