Tải bản đầy đủ (.ppt) (35 trang)

Statistics for business decision making and analysis robert stine and foster chapter 17

Bạn đang xem bản rút gọn của tài liệu. Xem và tải ngay bản đầy đủ của tài liệu tại đây (258.49 KB, 35 trang )


Chapter 17

Alternative
Approaches to
Inference
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
An auto insurance company is thinking about
compensating agents by comparing the
number of claims they produce to a
standard. Annual claims average near
$3,200 with a median claim of $2,000.



Claims are highly skewed
Use nonparametric methods that don’t rely on a normal
sampling distribution
3 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Distribution of Sample of Claims (n = 42)

For this sample, the average claim is $3,632 with


s = $4,254. The median claim is $2,456.
4 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Is Sample Mean Compatible with µ=$3,200?


To answer this question, construct a 95% confidence
interval for µ



This interval is
$3,632 ± 2.02 x $4,254 /
[$2,306 to $4,958]

42

5 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Is Sample Mean Compatible with µ=$3,200?



The national average of $3,200 lies within the 95%
confidence t-interval for the mean.



BUT…the sample does not satisfy the sample size
condition necessary to use the t-interval.



The t-interval is unreliable with unknown coverage when
the conditions are not met.
6 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Nonparametric Statistics


Avoid making assumptions about the shape of the
population.



Often rely on sorting the data.




Suited to parameters such as the population median θ
(theta).

7 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Nonparametric Statistics


For the claims data that are highly skewed to the right, θ <
µ.



If the population distribution is symmetric, then
θ = µ.

8 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Nonparametric Confidence Interval


First step in finding a confidence interval for θ is to sort

the observed data in ascending order (known as order
statistics).



Order statistics are denoted as
X(1) < X(2) < … < X(n)

9 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Nonparametric Confidence Interval


If data are an SRS from a population with median θ, then
we know

1.

The probability that a random draw from the population is
less than or equal to θ is ½,
The observations in the random sample are independent.

2.

10 of 35
Copyright © 2011 Pearson Education, Inc.



17.1 A Confidence Interval for the
Median
Nonparametric Confidence Interval


Determine the probabilities that the population median lies
between ordered observations using the binomial
distribution.



To form the confidence interval for θ combine several
segments to achieve desired coverage.

11 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Nonparametric Confidence Interval


In general, can’t construct a confidence interval for θ
whose coverage is exactly 0.95.




The 94.6% confidence interval for the median claim is
[$1,217 to $3,168].

12 of 35
Copyright © 2011 Pearson Education, Inc.


17.1 A Confidence Interval for the
Median
Parametric versus Nonparametric


Limitations of nonparametric methods

1.

Coverage is limited to certain values determined by sums
of binomial probabilities (difficult to obtain exactly 95%
coverage).
Median is not equal to the mean if the population
distribution is skewed. This prohibits obtaining estimates
for the total (total = nµ).

2.

13 of 35
Copyright © 2011 Pearson Education, Inc.


17.2 Transformations

Transform Data into Symmetric Distributions
Taking base 10 logs of the claims data results in a more
symmetric distribution.

14 of 35
Copyright © 2011 Pearson Education, Inc.


17.2 Transformations
Transform Data into Symmetric Distributions
Taking base 10 logs of the claims data results in data that
could be from a normal distribution.

15 of 35
Copyright © 2011 Pearson Education, Inc.


17.2 Transformations
Transform Data into Symmetric Distributions


If y = log10 x, then

=y3.312 with sy = 0.493.



The 95% confidence t-interval for µy is
[3.16 to 3.47].




If we convert back to the original scale of dollars, this
interval resembles that for the median rather than that for
the mean.
16 of 35
Copyright © 2011 Pearson Education, Inc.


17.3 Prediction Intervals


Prediction Interval: an interval that holds a future draw
from the population with chosen probability.



For the auto insurance example, a prediction interval
anticipates the size of the next claim, allowing for the
random variation associated with an individual.

17 of 35
Copyright © 2011 Pearson Education, Inc.


17.3 Prediction Intervals
For a Normal Population
The 100 (1 – α)% prediction interval for an independent draw
from a normal population is


where

1
x ± tα / 2 ,n−1 s 1 +
n

and s estimate µ and σ.

x
18 of 35
Copyright © 2011 Pearson Education, Inc.


17.3 Prediction Intervals
Nonparametric Prediction Interval


Relies on the properties of order statistics:
P(X(i) ≤ X ≤ X(i+1)) = 1/(n + 1)
P(X ≤ X(1)) = 1/(n + 1)
P(X(n) ≤ X) = 1/(n + 1)

19 of 35
Copyright © 2011 Pearson Education, Inc.


17.3 Prediction Intervals
Nonparametric Prediction Interval



Combine segments to get desired coverage.



P (X(2) ≤ X ≤ X(41)) = P ($255 ≤ X ≤ $17,305)
= (41 – 2)/43 0.91




There is a 91% chance that the next claim is between
$255 and
$17,305.

,
20 of 35
Copyright © 2011 Pearson Education, Inc.


4M Example 17.1:
EXECUTIVE
Motivation SALARIES
Fees earned by an executive placement
service are 5% of the starting annual total
compensation package. How much can
the firm expect to earn by placing a current
client as a CEO in the telecom industry?

21 of 35
Copyright © 2011 Pearson Education, Inc.



4M Example 17.1:
EXECUTIVE
SALARIES
Method
Obtain data (n = 23 CEOs from telecom industry).

22 of 35
Copyright © 2011 Pearson Education, Inc.


4M Example 17.1:
EXECUTIVE
SALARIES
Method
The distribution of total compensation for
CEOs in the telecom industry is not normal.
Construct a nonparametric prediction
interval for the client’s anticipated total
compensation package.

23 of 35
Copyright © 2011 Pearson Education, Inc.


4M Example 17.1:
EXECUTIVE
Mechanics SALARIES
Sort the data:


24 of 35
Copyright © 2011 Pearson Education, Inc.


4M Example 17.1:
Mechanics
EXECUTIVE
SALARIES
The interval x(3) to x(21) is
$743,801 to $29,863,393
and is a 75% prediction interval.

25 of 35
Copyright © 2011 Pearson Education, Inc.


×