"Detecting Censorable Content on Sina Weibo: A Pilot Study" by Kei Yin Ng, Anna State Feldman 6557500 et al.

Department of Linguistics Faculty Scholarship and Creative Works

Title

Detecting Censorable Content on Sina Weibo: A Pilot Study

Authors

Kei Yin Ng, Montclair State University
Anna State Feldman 6557500Follow
Chris Leberknight, Montclair State UniversityFollow

Document Type

Conference Proceeding

Publication Date

7-9-2018

Abstract

This study provides preliminary insights into the linguistic features that contribute to Internet censorship in mainland China. We collected a corpus of 344 censored and uncensored microblog posts that were published on Sina Weibo and built a Naive Bayes classifier based on the linguistic, topic-independent, features. The classifier achieves a 79.34% accuracy in predicting whether a blog post would be censored on Sina Weibo.

DOI

10.1145/3200947.3201037

MSU Digital Commons Citation

Ng, Kei Yin; Feldman, Anna State 6557500; and Leberknight, Chris, "Detecting Censorable Content on Sina Weibo: A Pilot Study" (2018). Department of Linguistics Faculty Scholarship and Creative Works. 26.
https://digitalcommons.montclair.edu/linguistics-facpubs/26

Published Citation

Ng, K. Y., Feldman, A., & Leberknight, C. (2018, July). Detecting censorable content on sina weibo: A pilot study. In Proceedings of the 10th Hellenic Conference on Artificial Intelligence (pp. 1-5).

Download

Included in

Linguistics Commons

COinS

Department of Linguistics Faculty Scholarship and Creative Works

Title

Authors

Document Type

Publication Date

Abstract

DOI

MSU Digital Commons Citation

Published Citation

Included in

Search

Browse

Author Corner

Links

Department of Linguistics Faculty Scholarship and Creative Works

Title

Authors

Document Type

Publication Date

Abstract

DOI

MSU Digital Commons Citation

Published Citation

Included in

Search

Browse

Author Corner

Links

//<![CDATA[ document.write("<a href='mailto:" + "digitalcommons" + "@" + "mail.montclair.edu" + "'>" + "Contact Us" + "<\/a>") //]]>