Pablo Calleja, from the Ontology Engineering Group at the Universidad Politécnica de Madrid, Spain, is currently completing an internship with RGCL as part of his PhD. Yesterday, Pablo gave a talk to the group about his research.
Title: Role-based Named Entity Recognition over unstructured texts
Abstract:
Named Entity Recognition (NER) poses new challenges in real-world documents, where entities play different roles according to their purpose or meaning. When only the subset of entities with a given role is needed, retrieving every possible entity introduces noise and lowers overall precision.
The talk will present a Role-based NER task that relies on hierarchical role classification models to recognize entities with a specific role. The proposed task has been implemented in two use cases: one in the biomedical domain, using Spanish Summaries of Product Characteristics for drugs, and the other in the legal domain, using multilingual and heterogeneous emails from the Panama Papers investigation.