EDA1: Exploring Data
Model Workflow, Feature Selection
Warm-up: Describe this Data
Imagine you’ve just been this random sample of a dataset:
| 205 |
Chinstrap |
Dream |
50.7 |
19.7 |
203.0 |
4050.0 |
Male |
| 42 |
Adelie |
Dream |
36.0 |
18.5 |
186.0 |
3100.0 |
Female |
| 127 |
Adelie |
Torgersen |
41.5 |
18.3 |
195.0 |
4300.0 |
Male |
| 33 |
Adelie |
Dream |
40.9 |
18.9 |
184.0 |
3900.0 |
Male |
| 339 |
Gentoo |
Biscoe |
NaN |
NaN |
NaN |
NaN |
NaN |
| 253 |
Gentoo |
Biscoe |
59.6 |
17.0 |
230.0 |
6050.0 |
Male |
We would like to unpack the stories that this data might tell. How many different ways can we describe or summarize this data—both its overall shape and its finer details?