Data preprocessing is a process using which we can process raw data and make the raw data suitable for machine learning models. In this data preprocessing course, we have discussed how to handle missing data, how to encode categorical data, how to discretize continuous values, how to handle outliers in a dataset, how to perform feature scaling, etc. We have also discussed how to use a column transformer or pipeline for data preprocessing in machine learning.
1.1 How to find the total number of missing values in a DataFrame using the pandas Python library?
1.2 How to drop missing values from a DataFrame using the pandas Python library?
1.3 How to remove duplicate rows from a 1.4 DataFrame using the pandas Python library?
1.5 How to change the data type of one or more columns of a DataFrame using pandas?
1.6 How to remove duplicate columns of a DataFrame using the pandas Python library?
1.7 How to find the percentage of missing values in a DataFrame using pandas?
1.8 How to handle missing numerical data in a dataset using the pandas Python library?
1.9 How to handle missing categorical data in a dataset using the pandas Python library?
1.10 How to perform mean or median imputation using the sklearn Python library?
1.11 How to perform end of distribution imputation in machine learning?
1.12 How to perform the frequent category imputation in machine learning?
1.13 Arbitrary Value Imputation in Machine Learning
1.14 How to perform missing category imputation in machine learning?
All the courses are fully accessible to premium members.