The first Anaphora Resolution Exercise (ARE) is organised in conjunction with the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC2007). Its general objective is to develop discourse anaphora resolution methods and to evaluate them in a common and consistent manner. In this first edition, the evaluation is organised only for English and covers pronominal anaphora and NP coreference. If the exercise is successful, future editions may extend it to other types of referential expressions and anaphoric relations (e.g. bridging references), as well as to other languages.
Four tasks will be organised:
- Pronominal resolution on pre-annotated texts: in this task, participants will be provided with documents in which the NPs are annotated. Some of these NPs will carry an attribute indicating that they are referential and have to be resolved. In this task only referential pronouns will have this attribute (i.e. pleonastic pronouns will not be annotated). For each referential pronoun, participants will have to identify any correct antecedent from the list of annotated NPs.
- Coreferential chains resolution on pre-annotated texts: in this task participants will be provided with documents in which the NPs are annotated. The participants will have to identify full coreferential chains which contain the annotated NPs.
- Pronominal resolution on raw texts: in this task, participants will have to resolve referential pronouns in unannotated texts (i.e. the referential expressions and antecedent candidates are not marked as in task 1).
- Coreferential chains resolution on raw texts: in this task, participants will have to identify full coreferential chains in unannotated texts (i.e. the NPs are not pre-marked).
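The phrase "any correct antecedent" in the first task can be illustrated with a small scoring sketch. This is not the official ARE scorer, which is not described here. It assumes that gold coreferential chains are available as sets of NP identifiers and that a system outputs one antecedent per referential pronoun. The function name `pronoun_accuracy` and the identifiers `np1` to `np5` are hypothetical.

```python
from typing import Dict, List, Set


def pronoun_accuracy(gold_chains: List[Set[str]],
                     pronouns: List[str],
                     predicted: Dict[str, str]) -> float:
    """Fraction of referential pronouns linked to any NP of their gold chain.

    Assumption (not from the ARE guidelines): an answer is correct when the
    chosen antecedent is a different NP belonging to the same gold chain.
    """
    # Map every NP id to the index of the gold chain that contains it.
    chain_of = {np_id: i
                for i, chain in enumerate(gold_chains)
                for np_id in chain}
    correct = 0
    for pron in pronouns:
        guess = predicted.get(pron)
        if (guess is not None and guess != pron
                and pron in chain_of
                and chain_of.get(guess) == chain_of[pron]):
            correct += 1
    return correct / len(pronouns) if pronouns else 0.0


# Hypothetical data: two gold chains and three referential pronouns.
gold = [{"np1", "np3", "np5"}, {"np2", "np4"}]
pronouns = ["np3", "np4", "np5"]
system = {"np3": "np1", "np4": "np1", "np5": "np3"}
print(f"{pronoun_accuracy(gold, pronouns, system):.2f}")  # prints 0.67
```

In this example `np4` is linked to an NP from a different chain, so two of the three pronouns count as resolved.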
Important dates:
- 2nd Nov 2006: the first announcement of ARE
- 2nd Nov 2006: the registration opens
- 31st Jan 2007: the training data becomes available
- 15th March 2007: the test data becomes available
- 23rd March 2007: the competition finishes
- 28th - 29th March 2007: DAARC2007
The format of the input and output, as well as the training data, is available for each task and can be viewed in the guidelines. In order to keep the task simple, the only relations kept in the data are those of IDENTITY, SYNONYMY, GENERALIZATION and SPECIALIZATION. In this first edition of the competition, participants are not required to distinguish between these relations. They are only required to identify that there is a relation between two entities.
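Since the relation type does not have to be distinguished, pairwise links can be collapsed into untyped chains. The following is a minimal sketch of that idea using a union-find structure. It assumes that links arrive as `(np_a, np_b, relation)` triples and that all four relations may be treated as transitive for chain building. Neither assumption comes from the guidelines, and the name `build_chains` and the NP identifiers are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def build_chains(links: Iterable[Tuple[str, str, str]]) -> List[List[str]]:
    """Merge typed pairwise links into untyped chains with union-find."""
    parent: Dict[str, str] = {}

    def find(x: str) -> str:
        # Register unseen NPs as their own root, then walk up to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b, _relation in links:  # the relation label is deliberately ignored
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Group every NP under its root; sort for a deterministic result.
    groups: Dict[str, List[str]] = {}
    for np_id in parent:
        groups.setdefault(find(np_id), []).append(np_id)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical links between annotated NPs.
links = [
    ("np1", "np2", "IDENTITY"),
    ("np2", "np3", "SYNONYMY"),
    ("np4", "np5", "GENERALIZATION"),
]
print(build_chains(links))  # [['np1', 'np2', 'np3'], ['np4', 'np5']]
```

A system that keeps the relation labels internally can still produce this output by dropping the third field at the end.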
- Antonio Branco, University of Lisbon, Portugal
- Dan Cristea, "Alexandru Ioan Cuza" University, Iasi, Romania
- Ruslan Mitkov, University of Wolverhampton, UK
- Constantin Orasan, University of Wolverhampton, UK
To indicate your intention to participate or to find out more, please contact Constantin Orasan.