Handling data with pandas
Overview
You can consult the slides about pandas at the following link
In this folder you will find some laboratories about database management with pandas (a Python library). The purpose is not to present notebooks ready to deploy, but rather to demonstrate some methods and functions.
DataFrame structure
The DataFrame is the main object used to interact with data.
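As a minimal sketch of what this object looks like, the snippet below builds a small DataFrame and inspects its shape, column types, and labels. The column names and values are hypothetical, made up purely for illustration; they do not come from the laboratories.

```python
import pandas as pd

# Hypothetical data, made up for illustration only.
df = pd.DataFrame(
    {
        "department": ["A", "B", "C"],
        "population": [100, 250, 175],
        "gdp": [1.5, 3.0, 2.25],
    }
)

print(df.shape)             # (number of rows, number of columns)
print(df.dtypes)            # data type of each column
print(df.columns.tolist())  # column labels
print(df.index)             # row labels (a default integer range here)
```

Each column is a pandas Series with its own data type, and all columns share the same row index.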
Laboratories
The goal is to help you find how to solve specific problems by keyword
Load data
Change name and drop columns
Counting missing values
Filter values
Groupby
Dates
Duplicate values
Summation or mean over columns or rows
Sometimes you need to calculate the mean or sum across a set of columns, or over the values of a single variable
Replace
Merge and append dataframes
Imputing values
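To illustrate the "Summation or mean over columns or rows" keyword from the list above, here is a minimal sketch of aggregating across a set of columns versus over the values of one variable. The column names (`q1`, `q2`, `q3`) and values are hypothetical, and this is only one possible approach, not necessarily the one used in the notebook.

```python
import pandas as pd

# Hypothetical scores, made up for illustration only.
df = pd.DataFrame(
    {
        "q1": [10.0, 20.0, 30.0],
        "q2": [1.0, 2.0, 3.0],
        "q3": [4.0, 5.0, 6.0],
    }
)

cols = ["q1", "q2"]

# axis=1 aggregates across the selected columns, giving one value per row.
df["row_sum"] = df[cols].sum(axis=1)
df["row_mean"] = df[cols].mean(axis=1)

# The default axis=0 aggregates down the rows, giving one value per column.
col_means = df[cols].mean()

# Sum over the values of a single variable.
q3_total = df["q3"].sum()
```

By default both `sum` and `mean` skip missing values (`skipna=True`), which is worth keeping in mind when the data has gaps.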
Practical laboratories
The following laboratory reviews some concepts with practical results. To complete this lab, download the database pibs-dptos.xlsx, available (Click here)
Regular expressions (Regex)
This topic deserves its own section given its importance. At first, regex can be messy, but with practice its implementation becomes more intuitive each time.
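As a small, hedged example of how regex is commonly used with pandas, the sketch below applies three string methods to a Series. The sample strings and the patterns are hypothetical, chosen only to show the mechanics.

```python
import pandas as pd

# Hypothetical strings, made up for illustration only.
s = pd.Series(["Lima 2019", "Cusco 2020", "Arequipa", "Piura 2021"])

# str.contains returns a boolean mask: True where the pattern is found.
has_year = s.str.contains(r"\d{4}", regex=True)

# str.extract returns the first capture group; rows without a match become NaN.
year = s.str.extract(r"(\d{4})", expand=False)

# str.replace with regex=True strips a trailing four-digit year and the
# whitespace before it, leaving only the name.
name = s.str.replace(r"\s*\d{4}$", "", regex=True)
```

The boolean mask from `str.contains` can be used directly to filter rows, for example `s[has_year]`.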