Posts

Introduction to Discrete & Continuous Probability Distributions

  ✅ 1. What is a Probability Distribution? A probability distribution describes how probabilities are distributed over the values of a random variable . Random Variable : A variable whose values are outcomes of a random phenomenon. 🧮 2. Types of Probability Distributions Type Description GIS Example Discrete           Takes countable values  Number of landslides per year in a          valley Continuous          Takes infinite values over an                 interval Rainfall (mm), elevation, temperature  📌 Discrete Probability Distributions 🎯 3. Binomial Distribution ✅ Definition : Used when an experiment is repeated n times , and each trial has two outcomes : success or failure. ✅ Conditions : Fixed number of trials (n) Only two possible outcomes per trial (success/failure) Constant probability of success (p) Trials are in...

Description and properties of stratified random sampling

Image
  Description of Stratified Random Sampling Stratified random sampling (SRS) is a probability sampling method in which the population is divided into groups or strata, and then a random sample is selected from each stratum. Stratification is often used when the population is heterogeneous, meaning that there is a lot of variation in the variable of interest within the population. By stratifying the population, we can ensure that the sample is representative of all of the different groups in the population. Properties of Stratified Random Sampling SRS has a number of desirable properties, including: Unbiasedness: SRS samples are unbiased, meaning that they have the same expected value as the population they are drawn from. Efficiency: SRS is a relatively efficient sampling method, meaning that it requires a smaller sample size than other sampling methods to achieve the same level of precision. Representativeness: SRS samples are more representative of the ...

Requirement of a good sample

Image
  A good sample is one that is representative of the population from which it is drawn. This means that the sample should have the same characteristics as the population in terms of important variables such as age, gender, race, ethnicity, income, education level, and so on. There are a number of requirements for a good sample, including: Accuracy:  The sample should be an accurate representation of the population. This means that the sample results should be generalizable to the population as a whole. Precision : The sample should be precise, meaning that the results of the sample should be consistent from one sample to another. Reliability:  The sample should be reliable, meaning that the results of the sample can be reproduced over time. Unbiasedness:  The sample should be unbiased, meaning that the results of the sample are not influenced by any systematic errors. In addition to these general requirements, there are a number of s...

Description and Properties of simple random sampling

Image
  Description of Simple Random Sampling Simple random sampling (SRS) is a probability sampling method in which every member of the population has an equal chance of being selected for the sample. This is the most basic type of probability sampling and is often used as a benchmark for other sampling methods. Properties of Simple Random Sampling SRS has a number of desirable properties, including: Unbiasedness: SRS samples are unbiased, meaning that they have the same expected value as the population they are drawn from. Efficiency: SRS is a relatively efficient sampling method, meaning that it requires a smaller sample size than other sampling methods to achieve the same level of precision. Simplicity: SRS is a simple sampling method to implement, both in theory and in practice. Mathematical Proofs of the Properties of SRS Unbiasedness: To prove that SRS samples are unbiased, we can use the following mathematical argument: Let X be the population mean ...

Variance and Standard Deviation for Grouped Data

Image
    Variance and standard deviation are statistical measures used to understand the spread or dispersion of data points in a set, whether the data is grouped or ungrouped. However, calculating variance and standard deviation for grouped data involves some modifications to the formulas used for ungrouped data. Variance for Grouped Data  When dealing with grouped data, where data points are organized into intervals or classes, we use a slightly different formula to calculate the variance. The formula for variance in grouped data is as follows: Variance = Σ((f * (x - μ)²)) / N Where: Σ represents the sum of the values. f is the frequency (number of observations) in each class. x is the midpoint of each class interval. μ (mu) is the mean of the data set. N is the total number of observations (sum of frequencies). Standard Deviation for Grouped Data The formula for standard deviation in grouped data is similar to the formula for variance. W...

Variance and Standard Deviation

Image
  Variance and Standard Deviation Variance and standard deviation are statistical measures that help us understand the spread or dispersion of a set of data points. They provide insights into how closely the data points are clustered around the mean or average value. Variance  Variance measures the average of the squared differences between each data point and the mean of the data set. It gives us an idea of how much the individual data points deviate from the mean. A higher variance indicates a greater spread or dispersion of the data points, while a lower variance suggests they are more closely clustered around the mean. For ungrouped data, the formula for variance is as follows: Variance = Σ((x - μ)²) / N Where: Σ represents the sum of the values. x is each individual data point. μ (mu) is the mean of the data set. N is the total number of data points. Standard Deviation  Standard deviation is the square root of the variance. It provides ...

Confidence Interval for Mean Using Z-Test

  Confidence Interval for Mean When the standard deviation of the population is known, the confidence interval for the population mean can be calculated using the Z-test. The Z-test assumes that the population follows a normal distribution or that the sample size is large enough to apply the Central Limit Theorem. Here's how you can calculate a confidence interval for the population mean with a known standard deviation: Determine the confidence level: Specify the desired level of confidence for the interval. Common choices are 90%, 95%, or 99%. Identify the critical value: Determine the critical value associated with the chosen confidence level. This value corresponds to the z-score from the standard normal distribution. For example, for a 95% confidence level, the critical value is approximately 1.96. Gather sample data: Take a random sample from the population of interest. While the sample size doesn't affect the calculation of ...