Introduction
介绍
Welcome to Kaggle's Intermediate Machine Learning course!
欢迎来到 Kaggle 的 中级机器学习 课程!
If you have some background in machine learning and you'd like to learn how to quickly improve the quality of your models, you're in the right place! In this course, you will accelerate your machine learning expertise by learning how to:
如果您有一些机器学习背景并且想了解如何快速提高模型质量,那么您来对地方了! 在本课程中,您将通过学习以下内容来加速您的机器学习专业知识:
- tackle data types often found in real-world datasets (missing values, categorical variables),
- 处理现实世界数据集中常见的数据类型(缺失值、分类变量),
- design pipelines to improve the quality of your machine learning code,
- 设计管道以提高机器学习代码的质量,
- use advanced techniques for model validation (cross-validation),
- 使用先进的技术进行模型验证(交叉验证),
- build state-of-the-art models that are widely used to win Kaggle competitions (XGBoost), and
- 构建最先进广泛用于赢得 Kaggle 竞赛的模型 (XGBoost),以及
- avoid common and important data science mistakes (leakage).
- 避免常见和重要的数据科学错误(泄漏)。
Along the way, you'll apply your knowledge by completing a hands-on exercise with real-world data for each new topic. The hands-on exercises use data from the Housing Prices Competition for Kaggle Learn Users, where you'll use 79 different explanatory variables (such as the type of roof, number of bedrooms, and number of bathrooms) to predict home prices. You'll measure your progress by submitting predictions to this competition and watching your position rise on the leaderboard!
在此过程中,您将通过针对每个新主题使用真实数据完成实践练习来应用您的知识。 实践练习使用来自 Kaggle Learn 用户住房价格竞赛 的数据,其中您将使用 79 种不同的解释 变量(例如屋顶类型、卧室数量和浴室数量)来预测房价。 您将通过向本次比赛提交预测并观察您在排行榜上的排名上升来衡量您的进步!
Prerequisites
先决条件
You're ready for this course if you've built a machine learning model before, and you're familiar with topics such as model validation, underfitting and overfitting, and random forests.
如果您之前构建过机器学习模型,并且熟悉诸如模型验证、 欠拟合和过拟合,以及随机森林。
If you're completely new to machine learning, please check out our Intro to Machine Learning course, which covers everything you need to prepare for this course.
如果您对机器学习完全陌生,请查看我们的机器学习简介课程,其中涵盖了您需要准备的所有内容 对于这门课程。
Your Turn
到你了
Continue to the first exercise to learn how to submit predictions to a Kaggle competition and determine what you might need to review before getting started.
继续进行第一个练习,了解如何向 Kaggle 竞赛提交预测,并确定在开始之前可能需要查看的内容。