EDA1: Exploring Data

Model Workflow, Feature Selection

This Week

Warm-up: Describe this Data

Imagine you’ve just been given this random sample of a dataset:

	species	island	bill_length_mm	bill_depth_mm	flipper_length_mm	body_mass_g	sex
205	Chinstrap	Dream	50.7	19.7	203.0	4050.0	Male
42	Adelie	Dream	36.0	18.5	186.0	3100.0	Female
127	Adelie	Torgersen	41.5	18.3	195.0	4300.0	Male
33	Adelie	Dream	40.9	18.9	184.0	3900.0	Male
339	Gentoo	Biscoe	NaN	NaN	NaN	NaN	NaN
253	Gentoo	Biscoe	59.6	17.0	230.0	6050.0	Male

We would like to unpack the stories that this data might tell. How many different ways can we describe or summarize this data—both its overall shape and its finer details?
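
One possible starting point, sketched in Python with pandas (an assumption; the slides do not name a tool). The six rows are retyped by hand from the sample above, and the variable names are illustrative only. Each line answers a different "describe this data" question: how big is it, what types does it hold, what is missing, how are the numbers distributed, and how do groups compare.

```python
import numpy as np
import pandas as pd

# The six sampled rows from the warm-up, keyed by their original row labels.
sample = pd.DataFrame(
    {
        "species": ["Chinstrap", "Adelie", "Adelie", "Adelie", "Gentoo", "Gentoo"],
        "island": ["Dream", "Dream", "Torgersen", "Dream", "Biscoe", "Biscoe"],
        "bill_length_mm": [50.7, 36.0, 41.5, 40.9, np.nan, 59.6],
        "bill_depth_mm": [19.7, 18.5, 18.3, 18.9, np.nan, 17.0],
        "flipper_length_mm": [203.0, 186.0, 195.0, 184.0, np.nan, 230.0],
        "body_mass_g": [4050.0, 3100.0, 4300.0, 3900.0, np.nan, 6050.0],
        "sex": ["Male", "Female", "Male", "Male", np.nan, "Male"],
    },
    index=[205, 42, 127, 33, 339, 253],
)

# Overall shape: number of rows and columns.
shape = sample.shape

# Column types: which variables are numeric and which are categorical.
dtypes = sample.dtypes

# Missing values per column (row 339 has no measurements at all).
missing_per_column = sample.isna().sum()

# Numeric summary: count, mean, std, min, quartiles, max.
numeric_summary = sample.describe()

# Categorical summary: how often each species appears.
species_counts = sample["species"].value_counts()

# Finer detail: mean body mass within each species (NaN is skipped).
mass_by_species = sample.groupby("species")["body_mass_g"].mean()

print(shape)
print(missing_per_column)
print(numeric_summary)
print(species_counts)
print(mass_by_species)
```

With only six rows these summaries are rough, so they are better read as prompts for questions about the full dataset than as conclusions.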