《镜子大全》《朝华午拾》分享 http://blog.sciencenet.cn/u/liwei999 曾任红小兵,插队修地球,1991年去国离乡,不知行止。

博文

立委统计发现,汉语既适合吹嘘拍马亦长于恶意构陷

已有 4911 次阅读 2012-5-1 13:59 |个人分类:立委科普|系统分类:科研笔记|关键词:汉语,统计,褒贬| 汉语, 统计, 褒贬

词汇统计发现,汉语既适合吹嘘拍马亦长于恶意构陷
作者: 立委 (*)
日期: 05/01/2012 00:49:08

日常汉语词汇不但比类似规模的英语日常词汇多出了两倍到三倍的褒义词和贬义词,更有意思的是,这些褒贬词汇中特别针对人的 就多达约1/3(3809/11968)。由于1/3以外的褒贬词汇大多既可以用于,也可以用于,汉语所提供的吹拍谀词和构陷咒骂语,远远多于英语。

不怪中文网络社区,硝烟四起,一地鸡毛。吹拍之风不绝,个人崇拜盛行。

# of 描述人的动作和评价的词语总数: 9108
# of 其中带有主观褒贬色彩的词语数: 3809
# of 带有主观褒贬色彩的词语总数: 11968

国家不幸诗家幸,我们做社会媒体舆情分析的人有“福”了。

Is Chinese more sentiment intensive than English?

Interesting finding: that Chinese more than doubles the negative words and more than triples the positive words in comparison with the English vocabulary. This is based on the Chinese lexicon (50+ k entries) and the English lexicon (44k entries). They are comparable because if we exclude bound morpheme (some character entries) from the Chinese lexicon, we end up with roughly the same size of vocabulary as the comparison basis.

English:   Negative: 2950 Positive: 1511
Chinese: Negative: 6568 Positive: 5400 

Does Chinese provide more vocabulary vehicles for attacking people or blowing others' or their own trumpets?

Seems to be yes from the perspective of the vocabulary size statistics. Here are the facts of the Chinese lexicon for everyday Mandarin:

# of sentiment entries only related to human: 3809
total # of sentiments entries: 11968
total # of all entries (minus bound morphemes not used in Contemporary Mandarin): ~ 48000

See, about 1/4 of the vocabulary is sentiment words, in which 1/3 of the sentiment words are created only for human. Given that the majority of the remaining 2/3 of sentiment vocabulary can also be used for human, we conclude --

CONCLUSION
 
Indeed in the contemporary Mandarin, there are more words that can be used to express the sentiments for or against a person. Not only that the Chinese language has 2-3 times more sentiment words in the basic vocabulary than those of English, but also that a bigger portion of the sentiment words are created only for praising or attacking people.

NO WONDER the Chinese social media such as weibo.com is full of political struggle and personal attacks.


Quote


Chinese is a more sentiment-intensive language than English?? FW: Counts of sentiment words in Chinese and English

Interesting finding:

that Chinese more than doubles the negative words and more than triples the positive words in comparison with the English vocabulary.

This is based on the Chinese feature lexicon (50+ k entries) we have just completed manual tagging of for the first pass and the seasoned English feature lexicon (44k entries). They are very comparable in terms of the vocabulary size because the Chinese lexicon contains some single characters which are never used alone in Contemporary Mandarin (bound morphemes). If we exclude these characters, we end up with roughly the same size of vocabulary as the comparison basis.

English:
Negative: 2950
Positive: 1511


Chinese:
Negative: 6568
Positive: 5400


【置顶:立委科学网博客NLP博文一览(定期更新版)】



http://blog.sciencenet.cn/blog-362400-565538.html

上一篇:不在于“毒胶囊”究竟有多毒,而在于做事情的手续
下一篇:从美国职业排行榜背后的IT劳工市场谈一谈留学求职的问题

6 陈安 曹聪 徐迎晓 章成志 lixuke2005 nm

该博文允许注册用户评论 请点击登录 评论 (4 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备14006957 )

GMT+8, 2019-1-21 20:17

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部