Document Type
Preprint
Publication Date
2018
Journal / Book Title
arXiv.org
Abstract
This paper investigates censorship from a linguistic perspective. We collect a corpus of censored and uncensored posts on a number of topics and build a classifier that predicts censorship decisions independently of discussion topic. Our investigation reveals that the strongest linguistic indicator of censored content in our corpus is its readability.
Montclair State University Digital Commons Citation
Ng, Kei Yin; Feldman, Anna; Peng, Jing; and Leberknight, Christopher, "Linguistic Characteristics of Censorable Language on SinaWeibo" (2018). Department of Computer Science Faculty Scholarship and Creative Works. 3.
https://digitalcommons.montclair.edu/compusci-facpubs/3