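Sharing 《镜子大全》 and 《朝华午拾》 http://blog.sciencenet.cn/u/liwei999 Once a Little Red Guard, then sent down to the countryside to "repair the earth"; left the country in 1991, destination unknown.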


Liwei's April Fools' Day talk on big data mining: time and venue confirmed

Viewed 3914 times | 2013-3-25 11:50 | Personal category: Miscellany | System category: Blog news | Tags: big data, talk, Chinese, Liwei, mining

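(1) The time and venue of Liwei's April Fools' Day talk in Beijing have been confirmed. Thanks to Professor Sun of the Chinese Information Processing Society of China (中文信息学会) for the invitation and arrangements: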


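Time: April 1, 10:00 a.m. to 12:00 noon

Venue: Room 334, Building 5, Institute of Software, Chinese Academy of Sciences

No. 4 Zhongguancun South 4th Street, Beijing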


The location is:

Room 334, 3rd floor,  building 5
Institute of Software, Chinese Academy of Sciences,
No. 4 Zhongguancun South 4th Street

10:00 a.m. to 12:00 noon, April 1, 2013

It is best to take the subway. The nearest station on Line 13 is Zhichunlu (知春路).


Sentiment Mining from Chinese Social Media in Big Data Age


by Wei Li, Ph.D. Computational Linguistics


In this information age of big data, social media such as WeiBo (micro-blogging, the Chinese counterpart of Twitter) are increasingly influential. The popularity of mobile devices such as smartphones makes it possible for anyone to share observations, experiences, opinions and sentiments anytime, anywhere, on social networks such as WeiXin (WeChat). The social media big data from WeiBo, WeiXin, customer review sites, blogs and forums is a gold mine of intelligence yet to be mined. It takes the form of natural language (Chinese in this case) and carries intelligence about public opinion and consumer sentiment on any topic, brand or product. Automated sentiment mining via Natural Language Processing (NLP) is a must if we (or businesses) do not want to be overwhelmed by information overload.


Dr. Li's talk will present the design philosophy behind a sentiment mining system that he designed and whose development he led. He will first discuss the value and scope of NLP in sentiment extraction and mining, the pros and cons of rule-based systems versus learning-based classification, and the different levels of sentiment mining that serve different information needs. He will then demonstrate a list of real-life hot topics from Chinese social media, as mined by the system, to show the value and future of big data and NLP in areas such as automatic surveys and social media listening and monitoring for consumer insights.



Public Opinion Mining from Chinese Social Media in the Big Data Era

Dr. Wei Li


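With the arrival of the big data era, social media such as Weibo are growing ever more influential. The spread of smartphones and other mobile devices lets ordinary people convey what they see, think and feel anytime and anywhere, for example through WeChat. The social media big data from Weibo, WeChat, blogs and forums is like a range of gold mountains rich in intelligence, waiting for us to mine. Facing big data, if we do not want to be drowned by the information explosion, we must turn to automated means, especially natural language technology that can automatically extract and mine public sentiment.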


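Dr. Li's talk is based on the automated customer sentiment extraction and mining system whose development he led. The talk has two parts. The first describes the scope of natural language technology in sentiment extraction, compares the pros and cons of statistical classification and rule-based systems, and lays out the hierarchy of levels in sentiment analysis. The second uses a series of hot topics from social media as examples to show the value and prospects of big data mining.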


About Dr. Li


A hands-on computational linguist with nearly 30 years of professional experience in Natural Language Processing (NLP), Dr. Li has a track record of making NLP work robustly. He has built three large-scale NLP systems, all of which became real-life, globally distributed products.


He is now Chief Scientist at a fast-growing Silicon Valley company that serves global Fortune 500 companies with consumer insights and social media monitoring.





(2) Taipei talk on automatic Chinese parsing





Institute of Information Science, Academia Sinica

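Title: Towards robust large-scale Chinese parsing
Speaker: Dr. Wei Li
Time: 2013-03-29 (Fri) 10:00 – 12:00
Venue: Auditorium 106, New Building, Institute of Information Science (資訊所新館106演講廳)
Host: Keh-Jiann Chen (陳克健)
Abstract: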

As a seasoned NLP practitioner with nearly 30 years of professional experience, Dr. Li has built a real-life robust Chinese parser to support sentiment mining from Chinese social media.

In this talk, he will present the infrastructure and platform required to build a Chinese parser. He will discuss the architecture of the system, including the interfaces between word segmentation, shallow parsing and deep parsing.


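[Acknowledgment] Thanks to Professor Dong Zhendong (董振东), a senior colleague in the field, for suggesting and recommending this talk.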


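Announcement: ScienceNet "Double Hundred" blogger Liwei to speak on big data mining in Beijing on April 1

A small plug: my talk is on 2013-03-29 at 10:00 in Auditorium 106, New Building, Institute of Information Science (資訊所新館106演講廳)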



https://blog.sciencenet.cn/blog-362400-673742.html
