Document Type

Conference Proceeding

Publication Date

1-1-2019

Journal / Book Title

Proceedings of the 11th International Conference on Language Resources and Evaluation

Abstract

This paper describes the development of an idiom-annotated corpus of Russian. The corpus is compiled from freely available resources online and contains texts of different genres. The idiom extraction, annotation procedure, and a pilot experiment using the new corpus are outlined in the paper. Considering the scarcity of publicly available Russian annotated corpora, the corpus is a much-needed resource that can be utilized for literary and linguistic studies, pedagogy as well as for various Natural Language Processing tasks.

Journal ISSN / Book ISBN

ISBN: 979-10-95546-00-9

Rights

The LREC 2018 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Published Citation

Aharodnik, Katsiaryna, Anna Feldman, and Jing Peng. "Designing a Russian idiom-annotated corpus." Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). 2018.

Included in

Linguistics Commons

Share

COinS