网络机器人吧社区

【推荐】机器学习学习曲线分析与偏差/方差诊断

机器学习研究会 2018-11-08 13:57:56

                                                                                                                                                                                      
点击上方“机器学习研究会”可以订阅
摘要
 

转自:爱可可-爱生活

When building machine learning models, we want to keep error as low as possible. Two major sources of error are bias and variance. If we managed to reduce these two, then we could build more accurate models.

But how do we diagnose bias and variance in the first place? And what actions should we take once we've detected something?

In this post, we'll learn how to answer both these questions using learning curves. We'll work with a real world data set and try to predict the electrical energy output of a power plant.

We'll generate learning curves while trying to predict the electrical energy output of a power plant. Image source: Pexels.


Some familiarity with scikit-learn and machine learning theory is assumed. If you don't frown when I say cross-validation or supervised learning, then you're good to go. If you're new to machine learning and have never tried scikit, a good place to start is this blog post.


We begin with a brief introduction to bias and variance.


链接:

https://www.dataquest.io/blog/learning-curves-machine-learning/


原文链接:

https://m.weibo.cn/1402400261/4192706915802973

“完整内容”请点击【阅读原文】
↓↓↓

Copyright © 网络机器人吧社区@2017