词汇统计发现,汉语既适合吹嘘拍马亦长于恶意构陷
日期: 05/01/2012 00:49:08
日常汉语词汇不但比类似规模的英语日常词汇多出了两倍到三倍的褒义词和贬义词,更有意思的是,这些褒贬词汇中特别针对人的 就多达约1/3(3809/11968)。由于1/3以外的褒贬词汇大多既可以用于物,也可以用于人,汉语所提供的吹拍谀词和构陷咒骂语,远远多于英语。
不怪中文网络社区,硝烟四起,一地鸡毛。吹拍之风不绝,个人崇拜盛行。
# of 描述人的动作和评价的词语总数: 9108
# of 其中带有主观褒贬色彩的词语数: 3809
# of 带有主观褒贬色彩的词语总数: 11968
国家不幸诗家幸,我们做社会媒体舆情分析的人有“福”了。
Is Chinese more sentiment intensive than English?
Interesting finding: that Chinese more than doubles the negative words and more than triples the positive words in comparison with the English vocabulary. This is based on the Chinese lexicon (50+ k entries) and the English lexicon (44k entries). They are comparable because if we exclude bound morpheme (some character entries) from the Chinese lexicon, we end up with roughly the same size of vocabulary as the comparison basis.
English: Negative: 2950 Positive: 1511
Chinese: Negative: 6568 Positive: 5400
Does Chinese provide more vocabulary vehicles for attacking people or blowing others' or their own trumpets?
Seems to be yes from the perspective of the vocabulary size statistics. Here are the facts of the Chinese lexicon for everyday Mandarin:
# of sentiment entries only related to human: 3809
total # of sentiments entries: 11968
total # of all entries (minus bound morphemes not used in Contemporary Mandarin): ~ 48000
See, about 1/4 of the vocabulary is sentiment words, in which 1/3 of the sentiment words are created only for human. Given that the majority of the remaining 2/3 of sentiment vocabulary can also be used for human, we conclude --
CONCLUSION
Indeed in the contemporary Mandarin, there are more words that can be used to express the sentiments for or against a person. Not only that the Chinese language has 2-3 times more sentiment words in the basic vocabulary than those of English, but also that a bigger portion of the sentiment words are created only for praising or attacking people.
NO WONDER the Chinese social media such as weibo.com is full of political struggle and personal attacks.
Quote
Chinese is a more sentiment-intensive language than English?? FW: Counts of sentiment words in Chinese and English
Interesting finding:
that Chinese more than doubles the negative words and more than triples the positive words in comparison with the English vocabulary.
This is based on the Chinese feature lexicon (50+ k entries) we have just completed manual tagging of for the first pass and the seasoned English feature lexicon (44k entries). They are very comparable in terms of the vocabulary size because the Chinese lexicon contains some single characters which are never used alone in Contemporary Mandarin (bound morphemes). If we exclude these characters, we end up with roughly the same size of vocabulary as the comparison basis.
English:
Negative: 2950
Positive: 1511
Chinese:
Negative: 6568
Positive: 5400
【置顶:立委科学网博客NLP博文一览(定期更新版)】