Multiword Expressions, idioms and fixed-expressions
已有 4240 次阅读2012-12-14 09:40|个人分类:个人收藏 My favorites|系统分类:科研笔记|Multiword, Idioms, Expressions, 短语, 习语
Here enclosed a reading group's material about multiword expressions, idioms and fixed/semi-fixed expressions which may shed some new light for our research.
Working Definitions, Postulations, Points of Contention
Multiword expression (MWE): any phrase that is not entirely predictable on the basis of standard grammar rules and lexical entries
No immediate counterexamples to the claim that any expression that can be realised hyphenated/as a single lexeme or alternatively with spaces (e.g. mailman/postman vs. mail/post man), is a MWE. This could be used in the evaluation of extraction techniques, possibly using external resources to determine whether extracted expressions can be expressed hyphenated/without spaces (e.g. determine "optimal extraction volume" as the point where the ratio of such expressions is maximised)
Qualities of Multiword Expressions
Institutionalisation/conventionalisation: process of an expression becoming recognised and accepted as a lexical item, through consistent use over time (necessary but not sufficient condition on MWE-hood)
Lexicogrammatical fixedness: formal rigidity, preferred lexical realisation, restrictions on aspect, mood, voice, etc. (neither necessary nor sufficient condition on MWE-hood)
lexicogrammatically fixed MWE: kick the bucket, #the bucket was kicked, #slowly kick the bucket
Semantic/pragmatic non-compositionality: there is a mismatch between the semantics/pragmatics of the parts and the whole; includes the case of the component lexical items having specialised meanings within the context of the MWE, not accessible in simplex contexts (not necessary but sufficient)
idiomatic expression (non-compositional): the expression is semantically opaque and functions as a gestalt (e.g. kick the bucket)
idiomatically combining expression (idiosyncratically compositional): the lexical parts can be seen to (post hoc) assume components of the semantics of the whole, whereby the sum of the parts equals the whole (e.g. let the cat out of the bag)
Syntactic irregularity: the expression cannot be parsed based on the simplex morphology (parts of speech) of the components (not necessary but sufficient)
syntactically-irregular MWEs: all of a sudden, the be all and end all of NP
syntactically regular MWEs: kick the bucket, fly off the handle
Non-identifiability: when first exposed to the expression, the meaning cannot be predicted from its surface form (not necessary but sufficient)
idiom of decoding (non-identifiable): "misleading lexical clusters" (e.g. kick the bucket, fly off the handle)
idiom of encoding (identifiable): idiosyncratic lexical combination; note that all idioms of decoding are also idioms of encoding (example strict idioms of encoding -- wide awake, plain truth)
Situatedness: the expression is associated with a fixed pragmatic point (neither necessary nor sufficient)
situated MWEs: good morning, all aboard
non-situated MWEs: first off, to and fro
Figuration: the expression encodes some metaphor, metonymy, hyperbole, etc, even if the nature thereof is underspecified (neither necessary nor sufficient)
figurative expressions:bull market, beat around the bush
non-figurative expressions:first off, to and fro
Proverbiality: the expression is used "to describe--and implicitly, to explain--a recurrent situation of particular social interest ... in virtue of its resemblance or relation to a scenario involving homely, concrete things and relations" (neither necessary nor sufficient)
Informality: the expression is associated with more informal or colloquial registers (neither necessary nor sufficient)
Affect: the expression encodes a certain evaluation of affective stance toward the thing it denotes (neither necessary nor sufficient)