RuCor Russian coreference corpus by John Doe

Book Information

Title: RuCor Russian coreference corpus
Year: 2015-10-29
Language: eng, Russian
Mediatype: data
Subject: Russian, language, Russian language, linguistics, русский, язык, русский язык, лингвистика
Collection: folkscanomy_science, folkscanomy, additional_collections
Uploader: adydunch7642
Identifier: rucoref_29.10.2015-data

Description

RuCor is the first open corpus of the Russian language in which anaphoric and coreferential relations between noun groups are annotated. The current version of RuCor contains 156636 tokens. In addition to the annotation of coreferential and anaphoric relations, morphological annotation is provided. Development of RuCor started in 2013 as part of the RU-EVAL-2014 project, a campaign evaluating how well Russian NLP tools resolve anaphora and extract coreference chains. RuCor includes prose texts of varying lengths and genres: news, science, fiction, blogs. The resource is aimed at theoretical linguists working on anaphora and coreference, at developers of NLP systems, and at anyone fascinated by Russian syntax and discourse. All materials are open and available for download. If you quote examples retrieved from RuCor, please cite RuCor as the source, along with the author and the title of the text in question. The web interface was designed by Dmitrij Gorshkov. The tool uses the MySQL database engine for corpus management. http://rucoref.maimbava.net/