Publications and creative works by faculty in the Department of Linguistics, College of Humanities and Social Sciences, are collected here.

Follow


Submissions from 2025

PDF

De acordo com a…: Lexical Bundles and Disciplinary Variation in Academic Brazilian Portuguese, Larissa Goulart (2025)

Submissions from 2024

PDF

Examining the source of island effects in native speakers and second language learners of English, Saad Aldosari, Lauren Covey, and Alison Gabriele (2024)

PDF

Exploring Language Patterns: Integrating Corpus Linguistics in the Pedagogical Grammar Class, Larissa Goulart (2024)

PDF

AI or student writing? Analyzing the situational and linguistic characteristics of undergraduate student writing and AI-generated assignments, Larissa Goulart, Marine Laísa Matte, Alanna Mendoza, Lee Alvarado, and Ingrid Veloso (2024)

PDF

Young Children’s Directed Question Asking in Preschool Classrooms, Michelle Wong, Koeun Choi, Libby Barak, Elizabeth Lapidow, Jennifer Austin, Patrick Shafto, and Elizabeth Bonawitz (2024)

Submissions from 2023

Link

When Children's Production Deviates From Observed Input: Modeling the Variable Production of the English Past Tense, Libby Barak, Zara Harmon, Naomi H. Feldman, Jan Edwards, and Patrick Shafto (2023)

Submissions from 2022

PDF

CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms, Martha Gavidia, Patrick Lee, Anna Feldman, and Jing Peng (2022)

PDF

Register variation in understudied academic contexts, Larissa Goulart (2022)

PDF

Towards better structured and less noisy Web data: Oscar with Register annotations, Veronika Laippala, Anna Salmela, Samuel Rönnqvist, Alham Fikri Aji, Li Hsin Chang, Asma Dhifallah, Larissa Goulart, Henna Kortelainen, Marc Pàmies, Deise Prina Dutra, Valtteri Skantsi, Lintang Sutawika, and Sampo Pyysalo (2022)

PDF

A Report on the Euphemisms Detection Shared Task, Patrick Lee, Anna Feldman, and Jing Peng (2022)

PDF

Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms, Patrick Lee, Martha Gavidia, Anna Feldman, and Jing Peng (2022)

Submissions from 2021

PDF

Examining Variability in the Processing of Agreement in Novice Learners: Evidence From Event-Related Potentials, Alison Gabriele, José Alemán Bañón, Lesa Hoffman, Lauren Covey, Amy Rossomondo, and Robert Fiorentino (2021)

PDF

Findings of the NLP4IF-2021 Shared Tasks on Fighting the COVID-19 Infodemic and Censorship Detection, Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Alex Nikolov, Wajdi Zaghouani, Preslav Nakov, and Anna Feldman (2021)

Submissions from 2020

PDF

You don’t say... Linguistic features in sarcasm detection, Martina Ducret, Lauren Kruse, Carlos Martinez, Anna Feldman, and Jing Peng (2020)

Link

Linguistic fingerprints of internet censorship: The case of sina weibo, Kei Yin Ng, Anna Feldman, and Jing Peng (2020)

PDF

Investigating the relationship between individual differences and island sensitivity, Catherine Pham, Lauren Covey, Alison Gabriele, Saad Aldosari, and Robert Fiorentino (2020)

Submissions from 2019

PDF

Designing a Russian Idiom-Annotated Corpus, Katsiaryna Aharodnik, Anna Feldman, and Jing Peng (2019)

WordPrep: Word-based Preposition Prediction Tool, Pooja Bhagat, Aparna S. Varde, and Anna Feldman (2019)

PDF

Leveraging NLP and social network analytic techniques to detect censored Keywords: System design and experiments, Christopher S. Leberknight and Anna Feldman (2019)

Submissions from 2018

PDF

An eye-tracking study examining the role of question-answer congruency in children’s comprehension of only: A preliminary report, Lauren Covey, Caitlin E. Coughlin, and Utako Minai (2018)

Link

Controversy and sentiment: An exploratory study, Kateryna Kaplun, Christopher Leberknight, and Anna Feldman (2018)

Detecting Censorable Content on Sina Weibo: A Pilot Study, Kei Yin Ng, Anna Feldman, and Chris Leberknight (2018)

PDF

Linguistic Characteristics of Censorable Language on SinaWeibo, Kei Yin Ng, Anna Feldman, Jing Peng, and Christopher Leberknight (2018)

Submissions from 2017

PDF

When Technology Is Too Hot, Too Cold Or Just Right, Jonathan Howell (2017)

PDF

Acoustic classification of focus: On the web and in the lab, Jonathan Howell, Mats Rooth, and Michael Wagner (2017)

PDF

Automatic idiom recognition with word embeddings, Jing Peng and Anna Feldman (2017)

Submissions from 2016

PDF

Experiments in idiom recognition, Jing Peng and Anna Feldman (2016)

PDF

In God we trust. All others must bring data. - W. Edwards Deming Using word embeddings to recognize idioms, Jing Peng and Anna Feldman (2016)

PDF

T. S. Eliot’s ‘obscurity’ in the love song of J. Alfred Prufrock, Longxing Wei (2016)

Submissions from 2015

PDF

Focus Placement on Adjacent Words in Yes/No Questions, Jonathan Howell (2015)

PDF

Classifying idiomatic and literal expressions using vector space representations, Jing Peng, Anna Feldman, and Hamza Jazmati (2015)

Submissions from 2014

PDF

Classifying idiomatic and literal expressions using topic models and intensity of emotions, Jing Peng, Anna Feldman, and Ekaterina Vylomova (2014)

PDF

Evaluating and Automating the Annotation of a Learner Corpus, Alexandr Rosen, Jirka Hana, Barbora Stindlova, and Anna Feldman (2014)

Submissions from 2013

PDF

Automatic Identification of Learners’ Language Background based on their Writing in Czech, Katsiaryna Aharodnik, Marco Chang, Anna Feldman, and Jirka Hana (2013)

PDF

Automatic Detection of Idiomatic Clauses, Anna Feldman and Jing Peng (2013)

Submissions from 2012

Resource-Light Approaches to Computational Morphology. Part 1: Monolingual Approaches, Jirka Hana and Anna Feldman (2012)

Submissions from 2011

PDF

Prosodylab-aligner: A tool for forced alignment of laboratory speech, Kyle Gorman, Jonathan Howell, and Michael Wagner (2011)

PDF

A Low-budget Tagger for Old Czech, Jirka Hana, Anna Feldman, and Katsiaryna Aharodnik (2011)

PDF

Theme: Mobile Language Learning, Glenn Stockwell and Susana Sotillo (2011)

Submissions from 2010

PDF

A Positional Tagset for Russian, Jirka Hana and Anna Feldman (2010)

PDF

Challenges of Cheap Resource Creation for Morphological Tagging, Jirka Hana and Anna Feldman (2010)

PDF

Intelligent Time-Aware Query Translation for Text Sources, Amal Kaluarachchi, Aparna S. Varde, Jing Peng, and Anna Feldman (2010)

Link

Incorporating terminology evolution for query translation in text retrieval with association rules, Amal Kalurachchi, Aparna S. Varde, Srikanta Bedathur, Gerhard Weikum, Jing Peng, and Anna Feldman (2010)

PDF

Like Finding A Needle In a Haystack: Annotating The American National Corpus for Idiomatic Expressions, Laura Street, Nathan Michalov, Rachel Silverstein, Michael Reynolds, Lurdes Ruela, Felicia Flowers, Angela Talucci, Priscilla Pereira, Gabriella Morgon, Samantha Siegel, Marci Barousse, Antequa Anderson, Tashom Carroll, and Anna Feldman (2010)

PDF

Semantic enrichment of text representation with wikipedia for text classification, Hiroki Yamakawa, Jing Peng, and Anna Feldman (2010)

Submissions from 2009

PDF

Second Occurrence Focus and the Acoustics of Prominence, Jonathan Howell (2009)

PDF

The Composite Nature of Interlanguage as a Developing System, Longxing Wei (2009)

Submissions from 2008

PDF

Annotating an Arabic Learner Corpus for Error, Ghazi Abuhakema, Reem Faraj, Anna Feldman, and Eileen Fitzpatrick (2008)

PDF

Verification and Implementation of Language-Based Deception Indicators in Civil and Criminal Narratives, Joan Bachenko, Eileen Fitzpatrick, and Michael Schonwetter (2008)

PDF

Tagset Design, Inflected Languages, and N-gram Tagging, Anna Feldman (2008)

PDF

ARIDA: An Arabic Interlanguage Database and Its Applications: A Pilot Study, Anna Feldman, Ghazi Abuhakema, and Eileen Fitzpatrick (2008)

PDF

Designing and Evaluating a Russian Tagset, Serge Sharoff, Mikhail Kopotev, Tomaž Erjavec, Anna Feldman, and Dagmar Divjak (2008)

Submissions from 2006

PDF

A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources, Anna Feldman, Jirka Hana, and Chris Brew (2006)

Experiments in Cross-Language Morphological Annotation Transfer, Anna Feldman, Jirka Hana, and Chris Brew (2006)

The Multilingual Mental Lexicon and Lemma Transfer in Third Language Learning, Longxing Wei (2006)

Submissions from 2002

The Bilingual Mental Lexicon and Speech Production Process, Longxing Wei (2002)

Submissions from 2001

Lemma Congruence Checking Between Languages as an Organizing Principle in Intrasentential Codeswitching, Longxing Wei (2001)

Submissions from 2000

PDF

Discourse Functions and Syntactic Complexity in Synchronous and Asynchronous Communication, Susana Sotillo (2000)

Types of morphemes and their implications for second language morpheme acquisition, Longxing Wei (2000)

Unequal election of morphemes in adult Second Language Acquisition, Longxing Wei (2000)

Submissions from 1999

Political Discourse of a Working-Class Town, Susana Sotillo and Dana Starace-Nastasi (1999)

Submissions from 1995

A Rule-Based Phrase Parser for Real-Time Text-To-Speech Synthesis, Joan Bachenko, Eileen Fitzpatrick, and Jeffrey Daugherty (1995)

Submissions from 1989

Parsing for prosody: What a text-to-speech system needs from syntax, Eileen Fitzpatrick and Joan Bachenko (1989)