Practical Machine Learning, Chapter 1

1.1 Prediction motivation

About this course

This course covers the basic ideas behind machine learning/prediction:
· Study design: training vs. test sets
· Conceptual issues: out-of-sample error, ROC curves
· Practical implementation: the caret package

What this course depends on:
· The Data Scientist's Toolbox
· R Programming

What would be useful:
· Exploratory analysis
· Reporting Data and Reproducible Research
· Regression models

What machine learning is used for

· Local governments -> pension payments
· Google -> whether you will click on an ad
· Amazon -> what movies you will watch
· Insurance companies -> what your risk of death is
· Johns Hopkins -> who will succeed in their programs

Recommended books and resources

The elements of statistical learning

Machine learning (more advanced material)

· List of machine learning resources on Quora
· List of machine learning resources from Science
· Advanced notes from MIT OpenCourseWare
· Advanced notes from CMU
· Kaggle: machine learning competitions

1.2 What is prediction

The central dogma of prediction

The task is to predict, for each of a set of dots, whether it is red or blue.

Choosing the right dataset and knowing what the specific question is are again paramount.

What can go wrong

An example: the Google Flu Trends algorithm did not account for the fact that the search terms people use change over time. People might search with different terms, and that would degrade the algorithm's performance. In addition, how those terms were actually being used inside the algorithm was not well understood, so when the role of a particular search term changed, it could cause problems.
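The effect described above can be illustrated with a toy sketch. This is not the Google Flu Trends method, and the numbers are invented for illustration only. It assumes a single hypothetical search term and a simple line through the origin fitted by least squares (the helper names `fit_slope` and `mean_abs_error` are made up here). The course itself works in R with the caret package; Python is used only to keep the sketch self-contained.

```python
def fit_slope(xs, ys):
    """Least-squares slope for a line through the origin: y ~ slope * x."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


def mean_abs_error(slope, xs, ys):
    """Average absolute gap between predicted and observed values."""
    return sum(abs(slope * x - y) for x, y in zip(xs, ys)) / len(xs)


# Hypothetical, made-up numbers (NOT real Google Flu Trends data).
# Period 1: each search for the term corresponds to about 0.5 flu cases.
searches_old = [100, 200, 300, 400]
cases_old = [50, 100, 150, 200]

# Period 2: people now use the term for other reasons as well, so the
# same search volume corresponds to only about 0.2 flu cases per search.
searches_new = [100, 200, 300, 400]
cases_new = [20, 40, 60, 80]

# The predictor is learned on period 1 only and never updated.
slope = fit_slope(searches_old, cases_old)

error_old = mean_abs_error(slope, searches_old, cases_old)
error_new = mean_abs_error(slope, searches_new, cases_new)

print(f"slope learned on period 1: {slope:.2f}")
print(f"mean absolute error, period 1: {error_old:.1f}")
print(f"mean absolute error, period 2: {error_new:.1f}")
```

The fitted predictor is exact on the period it was trained on, but its error grows once the meaning of the search term shifts, even though the search volumes themselves look the same.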

Components of a predictor