EDA1: Exploring Data

Model Workflow, Feature Selection

This Week

Warm-up: Describe this Data

Imagine you’ve just been this random sample of a dataset:

species island bill_length_mm bill_depth_mm flipper_length_mm body_mass_g sex
205 Chinstrap Dream 50.7 19.7 203.0 4050.0 Male
42 Adelie Dream 36.0 18.5 186.0 3100.0 Female
127 Adelie Torgersen 41.5 18.3 195.0 4300.0 Male
33 Adelie Dream 40.9 18.9 184.0 3900.0 Male
339 Gentoo Biscoe NaN NaN NaN NaN NaN
253 Gentoo Biscoe 59.6 17.0 230.0 6050.0 Male

We would like to unpack the stories that this data might tell. How many different ways can we describe or summarize this data—both its overall shape and its finer details?