The Treatment of Word Sense Inventories in the ‘LACELL WSD Project’
AbstractThe WSD community has long debated whether the criteria for representing polysemy in general purpose dictionaries meet the specific demands of sense disambiguation tasks. Concern is growing that pre-defined sense inventories might not adjust well to the needs of WSD, because word occurrences can rarely be paired with rigid sense classes in a one-toone fashion. A second cause for concern is the level of sense granularity adopted in conventional dictionary entries.Fine-grained distinctions can be useful for a dictionary user but complicate the design and evaluation of WSD systems in a way that is often unnecessary. As a result of these objections, many experts have voiced the opinion that dictionaries are not adequate sources of sense inventories for WSD. However, the problem of word sense overlaps can also be resolved by modifying the way in which dictionary entries are processed by WSD programs. This is the solution applied in the LACELL WSD system. The algorithm selects simultaneously two or more dictionary senses if the context does not allow sufficient discrimination between/among them. This article explains the underpinnings of such proposal, as well as discussing some advantages and disadvantages.
The works published in this journal are subject to the following terms:
1. The Publications Services at the University of Murcia (the publisher) retains the property rights (copyright) of published works, and encourages and enables the reuse of the same under the license specified in item 2.
2. The works are published in the electronic edition of the magazine under a Creative Commons Attribution Non-commercial Share Alike 4.0.
3.Conditions of self-archiving. Authors are encouraged to disseminate pre-print (draft papers prior to being assessed) and/or post-print versions (those reviewed and accepted for publication) of their papers before publication, because it encourages distribution earlier and thus leads to a possible increase in citations and circulation among the academic community.
RoMEO color: green