Introduction to Discrete & Continuous Probability Distributions

  ✅ 1. What is a Probability Distribution? A probability distribution describes how probabilities are distributed over the values of a random variable . Random Variable : A variable whose values are outcomes of a random phenomenon. ๐Ÿงฎ 2. Types of Probability Distributions Type Description GIS Example Discrete           Takes countable values  Number of landslides per year in a          valley Continuous          Takes infinite values over an                 interval Rainfall (mm), elevation, temperature  ๐Ÿ“Œ Discrete Probability Distributions ๐ŸŽฏ 3. Binomial Distribution ✅ Definition : Used when an experiment is repeated n times , and each trial has two outcomes : success or failure. ✅ Conditions : Fixed number of trials (n) Only two possible outcomes per trial (success/failure) Constant probability of success (p) Trials are in...

Variance and Standard Deviation for Grouped Data

 

 

Variance and standard deviation are statistical measures used to understand the spread or dispersion of data points in a set, whether the data is grouped or ungrouped. However, calculating variance and standard deviation for grouped data involves some modifications to the formulas used for ungrouped data.



Variance for Grouped Data

 When dealing with grouped data, where data points are organized into intervals or classes, we use a slightly different formula to calculate the variance. The formula for variance in grouped data is as follows:

Variance = ฮฃ((f * (x - ฮผ)²)) / N

Where:

  • ฮฃ represents the sum of the values.
  • f is the frequency (number of observations) in each class.
  • x is the midpoint of each class interval.
  • ฮผ (mu) is the mean of the data set.
  • N is the total number of observations (sum of frequencies).

Standard Deviation for Grouped Data

The formula for standard deviation in grouped data is similar to the formula for variance. We take the square root of the variance to obtain the standard deviation:

Standard Deviation = √(ฮฃ((f * (x - ฮผ)²)) / N)

Let's work through an example to illustrate how to calculate variance and standard deviation for grouped data:

Consider the following grouped data:

Class Intervals

Class Intervals

Frequency (fi)

Xi

fi*xi

Xi- ฮผ

(Xi- ฮผ)2

Fi(Xi- ฮผ)2

10 – 19 -

5

14.5

5*14.5=72.5

14.5-29.5= -15

225

1125

20 – 29

8

24.5

8*24.5=196

24.5-29.5= -5

25

200

30 – 39

12

34.5

12*34.5=414

34.5-29.5= 5

25

300

40 – 49

7

44.5

7*44.5=311.5

44.5-29.5= 15

225

1575

Sum

32

 

944

 

 

3200

 Step 1

 Calculate the midpoint (x) for each class interval. It is the average of the lower and upper limits of the interval.

x1 = (10 + 19) / 2 = 14.5

x2 = (20 + 29) / 2 = 24.5

 x3 = (30 + 39) / 2 = 34.5

x4 = (40 + 49) / 2 = 44.5

Step 2

Calculate the mean (ฮผ) of the data set. It is the weighted average of the midpoints, using the frequencies as weights.

ฮผ = ((x1 * f1) + (x2 * f2) + (x3 * f3) + (x4 * f4)) / N =

((14.5 * 5) + (24.5 * 8) + (34.5 * 12) + (44.5* 7)) / (5 + 8 + 12 + 7)

= 29.5

Step 3

Calculate the squared differences between each midpoint and the mean, weighted by the frequencies. ((f1 * (x1 - ฮผ)²) + (f2 * (x2 - ฮผ)²) + (f3 * (x3 - ฮผ)²) + (f4 * (x4 - ฮผ)²)) / N = ((5 * (14.5 – 29.5)²) + (8 * (24.5 – 29.5)²) + (12 * (34.5 – 29.5)²) + (7 * (44.5 – 29.5)²)) / (5 + 8 + 12 + 7) = 100

Step 4: 

Calculate the variance. Variance= 100

Step 5: 

Calculate the standard deviation. Standard Deviation ≈ √100 ≈ 10

Therefore, for the given grouped data, the variance is approximately 100, and the standard deviation is approximately 10.

 

Comments

Popular posts from this blog

Propositional Logic and Its Applications

Introduction to Probabilistic Modelling

Common Logical Fallacies