Flashield's Blog

Just For My Daily Diary

Flashield's Blog

Just For My Daily Diary

Month: March 2024

05.course-principal-component-analysis【主成分分析】

Introduction 介绍 In the previous lesson we looked at our first model-based method for feature engineering: clustering. In this lesson we look at our next: principal component analysis (PCA). Just like clustering is a partitioning of the dataset based on proximity, you could think of PCA as a partitioning of the variation in the data. […]

04.exercise-clustering-with-k-means【练习:K均值聚类】

This notebook is an exercise in the Feature Engineering course. You can reference the tutorial at this link. Introduction 介绍 In this exercise you’ll explore our first unsupervised learning technique for creating features, k-means clustering. 在本练习中,您将探索我们第一个用于创建特征的无监督学习技术:k 均值聚类。 Run this cell to set everything up! 运行这个单元格来设置一切! # Setup feedback system from learntools.core import binder binder.bind(globals()) from […]

04.course-clustering-with-k-means【K均值聚类】

Introduction 介绍 This lesson and the next make use of what are known as unsupervised learning algorithms. Unsupervised algorithms don’t make use of a target; instead, their purpose is to learn some property of the data, to represent the structure of the features in a certain way. In the context of feature engineering for prediction, […]

03.exercise-creating-features【练习:创建特征】

This notebook is an exercise in the Feature Engineering course. You can reference the tutorial at this link. Introduction 介绍 In this exercise you’ll start developing the features you identified in Exercise 2 as having the most potential. As you work through this exercise, you might take a moment to look at the data documentation […]

03.course-creating-features【创建特征】

Introduction 介绍 Once you’ve identified a set of features with some potential, it’s time to start developing them. In this lesson, you’ll learn a number of common transformations you can do entirely in Pandas. If you’re feeling rusty, we’ve got a great course on Pandas. 一旦您确定了一组具有一定潜力的特征,就可以开始开发它们了。 在本课程中,您将学习一些完全可以在 Pandas 中完成的常见转换。 如果您感到生疏,我们有一个很棒的Pandas 课程。 We’ll use four datasets […]

02.exercise-mutual-information【练习:互信息】

This notebook is an exercise in the Feature Engineering course. You can reference the tutorial at this link. Introduction 介绍 In this exercise you’ll identify an initial set of features in the Ames dataset to develop using mutual information scores and interaction plots. 在本练习中,您将确定 Ames 数据集中的一组初始特征,以使用互信息分数和交互图进行开发。 Run this cell to set everything up! 运行这个单元格来设置一切! # […]

02.course-mutual-information【互信息】

Introduction 介绍 First encountering a new dataset can sometimes feel overwhelming. You might be presented with hundreds or thousands of features without even a description to go by. Where do you even begin? 第一次遇到新的数据集有时会让人感到不知所措。 您可能会看到成百上千个特征,甚至没有任何说明。 你从哪里开始呢? A great first step is to construct a ranking with a feature utility metric, a function measuring associations between […]

01.course-what-is-feature-engineering【什么是特征工程】

Welcome to Feature Engineering! 欢迎来到特征工程! In this course you’ll learn about one of the most important steps on the way to building a great machine learning model: feature engineering. You’ll learn how to: 在本课程中,您将了解构建出色的机器学习模型的最重要步骤之一:特征工程。 您将学习如何: determine which features are the most important with mutual information 通过互信息确定哪些特征最重要 invent new features in several real-world problem domains 在几个现实问题领域发明新功能 […]

08.course-creating-your-own-notebooks【创建你自己的笔记本】

Congratulations for making it to the end of the course! 恭喜您来到最后一课! In this final tutorial, you’ll learn an efficient workflow that you can use to continue creating your own stunning data visualizations on the Kaggle website. 在这最后一个教程中,您将学习一个高效的工作流程,您可以使用该工作流程继续在 Kaggle 网站上创建您自己的令人惊叹的数据可视化。 Workflow 工作流 Begin by navigating to the site for Kaggle Notebooks: 首先导航到 Kaggle Notebooks 站点: https://www.kaggle.com/code […]

07.exercise-final-project【练习:最终项目】

This notebook is an exercise in the Data Visualization course. You can reference the tutorial at this link. Now it’s time for you to demonstrate your new skills with a project of your own! 现在是您通过自己的项目展示您的新技能的时候了! In this exercise, you will work with a dataset of your choosing. Once you’ve selected a dataset, you’ll design and […]

Scroll to top