Lesson 5: Standardizing
- To find the proportion of values less than or greater than a certain value, use relative frequencies (proportions) rather than absolute counts
- Smaller bins show more detail in a histogram, but bins that are too small can hide the shape of the distribution
- Continuous distribution
- Described by an equation that lets you calculate the proportion of values between any two points on the x-axis
- The total area under the curve of a normal distribution is always 1 (100%)
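The points above can be sketched in Python. This is only an illustration, not part of the lesson: the sample data, the mean of 100, and the standard deviation of 15 are made-up values, and the helper names normal_cdf and proportion_between are hypothetical. The normal curve's area is computed with the standard library's math.erf.

```python
import math

# Relative frequency from a (made-up) sample: proportion of values below 5.
data = [2, 4, 4, 5, 7, 9]
prop_below_5 = sum(1 for v in data if v < 5) / len(data)

def normal_cdf(x, mu=0.0, sigma=1.0):
    """Proportion of a normal(mu, sigma) distribution that lies below x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def proportion_between(a, b, mu=0.0, sigma=1.0):
    """Area under the normal curve between a and b (assumes a <= b)."""
    return normal_cdf(b, mu, sigma) - normal_cdf(a, mu, sigma)

# Assumed example distribution: mean 100, standard deviation 15.
within_one_sd = proportion_between(85, 115, mu=100, sigma=15)

# The area over the whole x-axis is 1 (100%).
total_area = proportion_between(-math.inf, math.inf, mu=100, sigma=15)

print(prop_below_5, round(within_one_sd, 4), total_area)
```

About 68% of the area falls within one standard deviation of the mean, and the area over the entire x-axis comes out to 1.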
z-score
- The number of standard deviations a value lies away from the mean on the x-axis
- Formula:
z = (x - mu) / std_dev
- Negative z-scores are to the left of the mean (less than the mean)
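A minimal sketch of the formula in Python. The function name z_score and the example numbers (mean 100, standard deviation 15) are assumptions chosen for illustration.

```python
def z_score(x, mu, std_dev):
    """Return how many standard deviations x lies from the mean mu."""
    if std_dev <= 0:
        raise ValueError("std_dev must be positive")
    return (x - mu) / std_dev

# Assumed example distribution: mean 100, standard deviation 15.
z_above = z_score(130, mu=100, std_dev=15)  # 2.0: two std devs right of the mean
z_below = z_score(85, mu=100, std_dev=15)   # -1.0: one std dev left of the mean

print(z_above, z_below)
```

A value equal to the mean gets a z-score of 0, and the sign shows which side of the mean the value falls on.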
- Standardizing a distribution
- Subtract the mean from every value so the mean becomes 0; this shifts the distribution left or right
- Divide by the standard deviation so the standard deviation becomes 1
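Standardizing a whole data set can be sketched as follows. The data are made up, and the sketch assumes the population standard deviation (statistics.pstdev) is the one intended; the helper name standardize is hypothetical.

```python
import statistics

def standardize(values):
    """Convert each value to a z-score using the population mean and std dev."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)  # assumption: population standard deviation
    if sigma == 0:
        raise ValueError("all values are identical; cannot standardize")
    return [(v - mu) / sigma for v in values]

# Made-up sample with mean 5 and population standard deviation 2.
data = [2, 4, 4, 4, 5, 5, 7, 9]
z_values = standardize(data)

new_mean = statistics.fmean(z_values)   # approximately 0
new_std = statistics.pstdev(z_values)   # approximately 1

print(z_values)
print(new_mean, new_std)
```

Subtracting the mean shifts the data so it is centered on 0, and dividing by the standard deviation rescales it so the spread is 1. The shape of the distribution does not change.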