For data analysis to be useful and accurate, the analyzed data needs to be cleaned and curated. There are copious methods to achieve this in pandas. In this course, you'll learn how to identify and eliminate duplicates in pandas. You'll start by using the pandas cut method to discretize data into bins, using bins to plot histograms and identify outliers using box-and-whisker plots. You'll parse and work with datetime objects read in from strings and convert string columns to datetime using the dateutils python library. Moving on, you'll master different pandas methods for aggregating data - including the groupby, pivot, and pivot_table methods. Lastly, you'll perform various joins - inner, left outer, right outer, and full outer - using both the merge and join methods.
Perks of Course
Certificate: Yes
CPD Points: 113
Compliance Standards: AICC