English-Chinese news Cross-Lingual WSD dataset

For Cross-Lingual Word Sense Disambiguation from English to Chinese, using news articles from the real-world

Download .zip Download .tar.gz View on GitHub

WordNews English-Chinese Cross-Lingual Word Sense Disambiguation dataset

This dataset allows evaluation of WSD systems on a dataset consisting of sentences from news articles written recently in 2015. The format is in a similar format as Senseval-2 English Lexical Sample task. The .dtd file of Senseval-2 Lexical Sample task is provided. The dataset was originally built as an evaluation dataset for the translation component of WordNews, an education software for language learning.