jingweimo的个人博客分享 http://blog.sciencenet.cn/u/jingweimo

博文

[转载]Machine Learning vs Statistical Learning

已有 1315 次阅读 2019-3-3 10:26 |系统分类:科研笔记|文章来源:转载

 

There is a subtle difference between statistical learning models and machine learning models.
Statistical learning involves forming a hypothesis before we proceed with building a model. The hypothesis could involve making certain assumptions which we validate after building the models.


For example, let us consider Linear Regression(LR) which is an example of a statistical model. While building a LR model, a set of 3 assumptions are made.

  • All the residuals follow a normal distribution around the mean.

  • The attributes in the dataset are all independent.

  • There is homoscedasticity in the data.

The model is assumed to take form, Y= b1 + b2X. So, we finally end up with an equation of precisely this form, b1 and b2 being the unknown coefficients.


With the assumptions regarding the model and the type of equation being made, a cost function is calculated and minimized using methods like gradient descent and thus we finally arrive at a LR model and diagnose our model if the assumptions we made are followed by the data. If the assumptions are not fulfilled, we reject the initial hypothesis and start over again.


So, our initial hypothesis certainly plays an important role in the case of statistical learning models.

But, in the case of machine learning(ML) models, we directly run the ML algorithms on the model, thus allowing the data to speak out instead of directing it in a certain direction with our initial hypothesis/assumptions.


For example, while building a decision tree/random forest, we assume no hypotheses and directly run the algorithms. The ML algorithm returns the crucial features and their importance. Here, we are not setting up any hypotheses which might affect our final model. The model totally learns the data without any user imposed conditions.

Thus, the machine learning models are said to be flexible in nature, because the user doesn't intervene in telling a model how to build an equation/classifier and thus learning the data better!



https://blog.sciencenet.cn/blog-578676-1165331.html

上一篇:[转载]Hydraulics vs Pneumatics 
下一篇:[转载]Hall Effect Sensor with Arduino
收藏 IP: 68.83.204.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-4-17 07:39

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部