Department of Computer Science Faculty Scholarship and Creative Works

Classifying Idiomatic and Literal Expressions using Vector Space Representations

Jing Peng, Montclair State UniversityFollow
Anna Feldman, Montclair State UniversityFollow
Hamza Jazmati, Montclair State University

Document Type

Conference Proceeding

Publication Date

1-1-2015

Abstract

We describe an algorithm for automatic classification of idiomatic and literal expressions. Our starting point is that idioms and literal expressions occur in different contexts. Idioms tend to violate cohesive ties in local contexts, while literals are expected to fit in. Our goal is to capture this intuition using a vector representation of words. We propose two approaches: (1) Compute inner product of context word vectors with the vector representing a target expression. Since literal vectors predict well local contexts, their inner product with contexts should be larger than idiomatic ones, thereby telling apart literals from idioms; and (2) Compute literal and idiomatic scatter (covariance) matrices from local contexts in word vector space. Since the scatter matrices represent context distributions, we can then measure the difference between the distributions using the Frobenius norm. We provide experimental results validating the proposed techniques.

Montclair State University Digital Commons Citation

Peng, Jing; Feldman, Anna; and Jazmati, Hamza, "Classifying Idiomatic and Literal Expressions using Vector Space Representations" (2015). Department of Computer Science Faculty Scholarship and Creative Works. 162.
https://digitalcommons.montclair.edu/compusci-facpubs/162

This document is currently not available here.

COinS

Department of Computer Science Faculty Scholarship and Creative Works

Classifying Idiomatic and Literal Expressions using Vector Space Representations

Document Type

Publication Date

Abstract

Montclair State University Digital Commons Citation

Search

Browse

Author Corner

Links

Department of Computer Science Faculty Scholarship and Creative Works

Classifying Idiomatic and Literal Expressions using Vector Space Representations

Authors

Document Type

Publication Date

Abstract

Montclair State University Digital Commons Citation

Share

Search

Browse

Author Corner

Links

//<![CDATA[ document.write("<a href='mailto:" + "digitalcommons" + "@" + "mail.montclair.edu" + "'>" + "Contact Us" + "<\/a>") //]]>