Sampling Distributions
8.3 Sampling Distributions
The field of statistical inference is basically concerned with generalizations and predictions. For example, we might claim, based on the opinions of several people interviewed on the street, that in a forthcoming election 60% of the eligible voters in the city of Detroit favor a certain candidate. In this case, we are dealing with
a random sample of opinions from a very large finite population. As a second il- lustration we might state that the average cost to build a residence in Charleston, South Carolina, is between $330,000 and $335,000, based on the estimates of 3 contractors selected at random from the 30 now building in this city. The popu- lation being sampled here is again finite but very small. Finally, let us consider a soft-drink machine designed to dispense, on average, 240 milliliters per drink. A company official who computes the mean of 40 drinks obtains ¯ x = 236 milliliters and, on the basis of this value, decides that the machine is still dispensing drinks with an average content of μ = 240 milliliters. The 40 drinks represent a sam- ple from the infinite population of possible drinks that will be dispensed by this machine.
Inference about the Population from Sample Information
In each of the examples above, we computed a statistic from a sample selected from the population, and from this statistic we made various statements concerning the values of population parameters that may or may not be true. The company official made the decision that the soft-drink machine dispenses drinks with an average content of 240 milliliters, even though the sample mean was 236 milliliters, because
he knows from sampling theory that, if μ = 240 milliliters, such a sample value could easily occur. In fact, if he ran similar tests, say every hour, he would expect the values of the statistic ¯ x to fluctuate above and below μ = 240 milliliters. Only when the value of ¯ x is substantially different from 240 milliliters will the company official initiate action to adjust the machine.
Since a statistic is a random variable that depends only on the observed sample, it must have a probability distribution.
Definition 8.5: The probability distribution of a statistic is called a sampling distribution.
The sampling distribution of a statistic depends on the distribution of the pop- ulation, the size of the samples, and the method of choosing the samples. In the
8.4 Sampling Distribution of Means and the Central Limit Theorem 233 remainder of this chapter we study several of the important sampling distribu-
tions of frequently used statistics. Applications of these sampling distributions to problems of statistical inference are considered throughout most of the remaining chapters. The probability distribution of ¯ X is called the sampling distribution of the mean.
What Is the Sampling Distribution of ¯ X ?
We should view the sampling distributions of ¯ X and S 2 as the mechanisms from which we will be able to make inferences on the parameters μ and σ 2 . The sam- pling distribution of ¯ X with sample size n is the distribution that results when an experiment is conducted over and over (always with sample size n) and the many values of ¯ X result. This sampling distribution, then, describes the variability of sample averages around the population mean μ. In the case of the soft-drink machine, knowledge of the sampling distribution of ¯ X arms the analyst with the knowledge of a “typical” discrepancy between an observed ¯ x value and
true μ. The same principle applies in the case of the distribution of S 2 . The sam- pling distribution produces information about the variability of s 2 values around σ 2 in repeated experiments.
Parts
» Probability Statistics for Engineers Scientists
» Sampling Procedures; Collection of Data
» Measures of Location: The Sample Mean and Median
» Discrete and Continuous Data
» Statistical Modeling, Scientific Inspection, and Graphical Diagnostics
» Conditional Probability, Independence, and the Product Rule
» Concept of a Random Variable
» Discrete Probability Distributions
» Continuous Probability Distributions
» Joint Probability Distributions
» Variance and Covariance of Random Variables
» Means and Variances of Linear Combinations of Random Variables
» Binomial and Multinomial Distributions
» Negative Binomial and Geometric Distributions
» Poisson Distribution and the Poisson Process
» Areas under the Normal Curve
» Applications of the Normal Distribution
» Normal Approximation to the Binomial
» Gamma and Exponential Distributions
» Weibull Distribution (Optional)
» Transformations of Variables
» Moments and Moment-Generating Functions
» Sampling Distribution of Means and the Central Limit Theorem
» Quantile and Probability Plots
» Classical Methods of Estimation
» Single Sample: Estimating the Mean
» Two Samples: Estimating the Difference between Two Means
» Single Sample: Estimating a Proportion
» Two Samples: Estimating the Difference between Two Proportions
» Two Samples: Estimating the Ratio of Two Variances
» Maximum Likelihood Estimation (Optional)
» Statistical Hypotheses: General Concepts
» Testing a Statistical Hypothesis
» The Use of P -Values for Decision Making in Testing Hypotheses
» Single Sample: Tests Concerning a Single Mean
» Two Samples: Tests on Two Means
» Choice of Sample Size for Testing Means
» Graphical Methods for Comparing Means
» One Sample: Test on a Single Proportion
» Two Samples: Tests on Two Proportions
» One- and Two-Sample Tests Concerning Variances
» Test for Independence (Categorical Data)
» Introduction to Linear Regression
» The Simple Linear Regression (SLR) Model
» Least Squares and the Fitted Model
» Inferences Concerning the Regression Coefficients
» Analysis-of-Variance Approach
» Test for Linearity of Regression: Data with Repeated Observations
» Data Plots and Transformations
» Simple Linear Regression Case Study
» Linear Regression Model Using Matrices
» Inferences in Multiple Linear Regression
» Choice of a Fitted Model through Hypothesis Testing
» Special Case of Orthogonality (Optional)
» Categorical or Indicator Variables
» Sequential Methods for Model Selection
» Study of Residuals and Violation of Assumptions (Model Checking)
» Cross Validation, C p , and Other Criteria for Model Selection
» Special Nonlinear Models for Nonideal Conditions
» Analysis-of-Variance Technique
» One-Way Analysis of Variance: Completely Randomized Design (One-Way ANOVA)
» Tests for the Equality of Several Variances
» Single-Degree-of-Freedom Comparisons
» Comparing a Set of Treatments in Blocks
» Randomized Complete Block Designs
» Graphical Methods and Model Checking
» Data Transformations in Analysis of Variance
» Interaction in the Two-Factor Experiment
» Two-Factor Analysis of Variance
» Factorial Experiments for Random Effects and Mixed Models
» Factorial Experiments in a Regression Setting
» Fractional Factorial Experiments
» Analysis of Fractional Factorial Experiments
» Introduction to Response Surface Methodology
» Rank Correlation Coefficient
» Control Charts for Variables
» Control Charts for Attributes
» Bayes Estimates Using Decision Theory Framework
Show more