jiangdm's personal blog http://blog.sciencenet.cn/u/jiangdm


Review: From Databases to Big Data (Sam Madden)

3163 views | 2011-8-11 21:16 | Personal category: CHI | System category: Research notes | SOA, dependability

From Databases to Big Data

Sam Madden

© 2012 IEEE INTERNET COMPUTING

data management

1 What Is Big Data?

Big data: data that is too big, too fast, or too hard for existing tools to process
2 On Databases and MapReduce
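Two shortcomings of databases for big data:
-- First, databases must slowly import data into a native representation before they can be queried
-- Second, they lack support for in-database statistics and modeling
Alternative that many turn to: MapReduce, or its open-source implementation Hadoop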
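
As a rough illustration of the MapReduce programming model named above (this is not code from Madden's article), here is a minimal single-process sketch in Python. The function names map_phase, shuffle, reduce_phase, and word_count are hypothetical; a real Hadoop job would distribute these steps across many machines. Note that it works directly on raw text lines with no import step, which is the contrast with the first point above.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one raw input line."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values seen for one key."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence over in-memory records."""
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big databases"]))
```

The map and reduce functions are the only parts a user writes; grouping by key is the framework's job, which is why it is kept as a separate function here.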
3 The State of the Art
Tools: SAS, R, and Matlab
Approaches:
-- extend the relational model
-- extend the MapReduce/Hadoop model
-- build something entirely different
4 What’s Left?
What’s missing is twofold:
-- First, we must improve statistics and machine learning algorithms to be more robust and easier for unsophisticated users to apply, while simultaneously training students in their intricacies.
-- Second, we need to develop a data management ecosystem around these algorithms so that users can manage and evolve their data, enforce consistency properties over it, and browse, visualize, and understand their algorithms’ results.


https://blog.sciencenet.cn/blog-468147-474253.html
