《镜子大全》《朝华午拾》分享 http://blog.sciencenet.cn/u/liwei999 曾任红小兵,插队修地球,1991年去国离乡,不知行止。

博文

【立委科普:所谓大数据(BIG DATA)】

已有 6458 次阅读 2013-3-21 04:58 |个人分类:立委科普|系统分类:科普集锦| 大数据, Big, Data

Big data is not just data that are big. In the sense of data load, "big data" has been there for quite a while in Internet, on which the entire search industry was based and developed. The current buzz word big data is different, it is innately associated with posters' background and social network, it represents data from social media perspective. That makes the big data a gold mine, waiting to be mined for intelligence as well as opportunities.  The area of data mining is fairly mature.  Data mining from structured data of consumers' behaviour, combined with their background and demographic information, has been put to practical use for some time now and is proven to be very powerful for target ads and marketing.  Sentiment mining on natural language text from social media big data can be regarded as a natural extension of data mining.  Due to its open-endedness, the potential is even greater.  When used properly, the sentiment intelligence mined from big data is beneficial to both businesses and consumers.  The related innovation can be revolutionary in changing the ways businesses interact with consumers and make better products and services to satisfy the consumers.  


所谓大数据,实际上是社会媒体火热以后的专指,所以已经与用户背景相关联,而不是搜索引擎从开放互联网搜罗来的混杂集合。没有社会媒体及其用户社会网络作为背景,纯粹从量上看,“大数据”早就存在了,它催生了搜索产业。但那不是如今的 buzz word,如今的大数据与社会媒体密不可分。当然,数据挖掘领域把用户信息和消费习惯的数据结合起来,已经有很多成果和应用。自然语言的大数据可以看作是那个应用的继续,从术语上说就是,text mining (from social media big data)是 data mining 的自然延伸。


【置顶:立委科学网博客NLP博文一览(定期更新版)】



https://blog.sciencenet.cn/blog-362400-672385.html

上一篇:广而告之:科学网“双百”博主立委四月一日在北京演讲大数据挖掘
下一篇:“芦柑皮干后什么形状”的话题
收藏 IP: 192.168.0.*| 热度|

4 吕喆 章成志 刘钢 bridgeneer

该博文允许注册用户评论 请点击登录 评论 (3 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-11-21 18:26

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部