-
Graphing Data 4
-
Lecture1.1
-
Lecture1.2
-
Lecture1.3
-
Lecture1.4
-
-
Mean and Standard Deviation 5
-
Lecture2.1
-
Lecture2.2
-
Lecture2.3
-
Lecture2.4
-
Lecture2.5
-
-
Distributions 6
-
Lecture3.1
-
Lecture3.2
-
Lecture3.3
-
Lecture3.4
-
Lecture3.5
-
Lecture3.6
-
-
Correlation and Linear Regression 7
-
Lecture4.1
-
Lecture4.2
-
Lecture4.3
-
Lecture4.4
-
Lecture4.5
-
Lecture4.6
-
Lecture4.7
-
-
Probability 3
-
Lecture5.1
-
Lecture5.2
-
Lecture5.3
-
-
Counting Principles 3
-
Lecture6.1
-
Lecture6.2
-
Lecture6.3
-
-
Binomial Distribution 3
-
Lecture7.1
-
Lecture7.2
-
Lecture7.3
-
-
Confidence Interval 7
-
Lecture8.1
-
Lecture8.2
-
Lecture8.3
-
Lecture8.4
-
Lecture8.5
-
Lecture8.6
-
Lecture8.7
-
-
Proportion Confidence Interval 3
-
Lecture9.1
-
Lecture9.2
-
Lecture9.3
-
-
Hypothesis Testing 5
-
Lecture10.1
-
Lecture10.2
-
Lecture10.3
-
Lecture10.4
-
Lecture10.5
-
-
Comparing Two Means 5
-
Lecture11.1
-
Lecture11.2
-
Lecture11.3
-
Lecture11.4
-
Lecture11.5
-
-
Chi-squared Test 3
-
Lecture12.1
-
Lecture12.2
-
Lecture12.3
-
Z-Score
One of the most common measures in statistics is the z-score. What it measures is how many standard deviations away from the mean a data point is.
Equation
$$z = \frac{x-\overline{x}}{\sigma}$$
First, let’s check out how far each point in our y data set is away from the mean.
xVals = list(range(100))
yVals = [x+randint(-20,20) for x in xVals]
xVals = np.array(xVals)
yVals = np.array(yVals)
print(yVals-yVals.mean())
Now, if we divide everything by the standard deviation…
print((yVals-yVals.mean())/yVals.std())
Now I want to introduce the correlation equation, there will be two versions, a standard one and a one using z-scores.
Equation
$$r_{xy} = \frac{\Sigma (x-\overline{x})(y-\overline{y})}{\sqrt{\Sigma (x-\overline{x})^2 \Sigma (y-\overline{y})^2}}$$
Challenge
Find the equation which uses z-scores
Prev
Correlation
Next
Z-Score Part 2