-- First, databases must import data into a native representation
before it can be queried, and that import step is slow
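A minimal sketch of the import-before-query step described above, assuming SQLite (via Python's standard `sqlite3` module) as the database. The CSV contents, the `readings` table, and the `load_and_query` helper are hypothetical names chosen only for illustration. The point is that the raw text must first be parsed and copied into the database's own tables before any SQL can run over it.

```python
import csv
import io
import sqlite3

# Hypothetical raw data, as it might sit in a file outside the database.
RAW_CSV = """sensor,reading
a,1.5
b,2.5
a,3.0
"""


def load_and_query(raw_csv: str) -> list[tuple[str, float]]:
    """Import CSV rows into SQLite, then run an aggregate query over them."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE readings (sensor TEXT, reading REAL)")
        # Import step: parse the text and copy every row into the table.
        rows = [
            (r["sensor"], float(r["reading"]))
            for r in csv.DictReader(io.StringIO(raw_csv))
        ]
        conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)
        conn.commit()
        # Only after the import can the data be queried with SQL.
        cur = conn.execute(
            "SELECT sensor, AVG(reading) FROM readings "
            "GROUP BY sensor ORDER BY sensor"
        )
        return cur.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(load_and_query(RAW_CSV))  # [('a', 2.25), ('b', 2.5)]
```

On three rows the import cost is negligible; the concern in the notes is that this parse-and-copy pass has to touch every record, so it grows with the size of the data set.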
Tools: SAS, R, and MATLAB
Approaches:
-- extend the relational model
-- extend the MapReduce/Hadoop model
-- build something entirely different
4 What’s Left?
What’s missing is twofold:
-- First, we must improve statistics and machine learning algorithms so
that they are more robust and easier for unsophisticated users to apply,
while simultaneously training students in their intricacies.
-- Second, we need to develop a data management ecosystem around these
algorithms so that users can manage and evolve their data, enforce
consistency properties over it, and browse, visualize, and understand
their algorithms’ results.