My Office Hours:

This week's office hours are Tuesday 16:30 - 18:00 and Friday 13:30 - 15:00. You may check the campus map to find my office. I am prepared for your questions, so please feel free to stop by.

Calculus Review:

\(\bullet\) To compute the Maximum Likelihood Estimator (MLE), you may find the partial differentiation difficult. If you have taken MATH 215 or an equivalent course before, you may review your notes. If you do not know how to perform partial differentiation, you may refer to the following websites:

\(\bullet\) http://tutorial.math.lamar.edu/Classes/CalcIII/PartialDerivsIntro.aspx

\(\bullet\) https://www.khanacademy.org/math/multivariable-calculus/multivariable-derivatives/partial-derivative-and-gradient-articles/a/introduction-to-partial-derivatives

These are great resources for building the essential calculus skills and knowledge of distributions you will need for the subsequent homework and Exam 2.

Homework Grading Policy:

Please include the final answer for each homework question. If a final answer is missing, you risk losing 0.5 points for each missing part.

Homework 8 Reminder:

Steps for constructing a confidence interval for the population mean:

Professor Miller listed on Piazza the conditions for performing a hypothesis test. Here we adapt them slightly to fit the process of constructing a confidence interval.

\(\bullet\) First, check whether the sample is random. If the problem says so, great! Just move on! If not, you can either explain with your own reasoning why you think the collected sample is (or is not) random, or draw a scatterplot if the data are given (if you have a good feel for the data, a clear explanation in plain words works too). This is important because we need a random sample to perform either a t-test or a z-test.

Then, we check normality: \(\bullet\) 1. Whether the underlying population distribution is normal: check whether we are told that the distribution of the population from which the measurements are taken is normal.

\(\bullet\) 2. If condition 1 fails, check whether the sample size is large enough (\(n\ge 30\)) to employ the central limit theorem, which says that the sampling distribution of the sample mean of the measurements is approximately normal.

\(\bullet\) 3. If neither 1 nor 2 holds, we may use a QQ-plot (or perform a normality test) on the data to see whether the population seems to be normally distributed.

\(\bullet\) 4. If the data are not given and the sample size is small (\(n<30\)), we rely on the robustness of the t procedures against violations of normality. In short, we should check (not PROVE) randomness and normality.
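Step 3 can be sketched in R (the sample `x` below is just an illustrative numeric vector; any sample works):

```r
# Illustrative sample of measurements
x <- c(95.3, 101.2, 92.3, 90.1, 96.4, 99.2, 110.3, 103.4, 91.2, 89.9)

# QQ-plot: if the points fall close to the reference line,
# the normality assumption looks reasonable
qqnorm(x)
qqline(x)

# An alternative formal check: the Shapiro-Wilk normality test.
# A large p-value means no evidence against normality.
shapiro.test(x)
```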

Clarification for the caveat:

The caveat says the t-distribution tolerates violations of normality due to small sample size, so we can apply the t procedures whenever the sample is random. However, the caveat does not mean the t-distribution also tolerates outliers, which may make the sample deviate badly from the (approximately) normal distribution.

Using t-score or z-score:

During office hours there were questions about when to use the z-score versus the t-score. My explanation is that, after checking randomness:

\(\bullet\) If the sample size is small (\(n<30\)), use t-test.

\(\bullet\) If the population standard deviation \(\sigma\) is not known, use t-test.

\(\bullet\) If the underlying distribution is given normal and the population standard deviation is known, use z-test.

\(\bullet\) If the sample size is large (\(n \ge 30\)) and the population standard deviation is known, we employ the Central Limit Theorem and use the z-test with the approximation \(P(\bar X>t) \approx P\left(Z>\frac{t-\mu}{\sigma/\sqrt{n}}\right)\).
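The CLT approximation in the last bullet can be computed directly in R (the numbers \(\mu = 90\), \(\sigma = 10\), \(n = 50\), threshold \(t = 93\) are illustrative, not from any homework problem):

```r
# Illustrative numbers: population mean 90, sd 10, sample size 50
mu <- 90; sigma <- 10; n <- 50; t <- 93

# P(Xbar > t) is approximated by P(Z > (t - mu) / (sigma / sqrt(n)))
z <- (t - mu) / (sigma / sqrt(n))
1 - pnorm(z)   # upper-tail probability under the CLT approximation
```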

Then you can construct the confidence interval following the steps listed under Lecture 16 below.

\(\bullet\) For questions 4 - 8, a vast majority of students made two mistakes. First, the distributions of the sample mean \(\bar X\) and the sum of the observations \(S_n\) are approximately normal, but NOT exactly normal. Second, as a consequence, the probability is approximately equal, but not equal, to the probability given by the z-score. That is the trickiest part of the CLT and it is worth practicing. See the Homework 7 comments and solution, and come to my office hours for details.

Extra Examples for Hypothesis Testing:

  1. Suppose the nutrition label on apple cider says that the cider contains an average sugar concentration of 90 g/liter, with standard deviation 10 g/liter. We want to know whether this claim is true, so we bought 50 liters of apple cider as a sample and found that it contained 5050 g of sugar in total. Based on this information, perform a hypothesis test to answer whether the claim on the nutrition label is true, at significance level \(\alpha = 0.05\).

  2. With the same setting, now we only know that the average sugar concentration in apple cider is 90 g/liter. This time we bought 10 liters of apple cider instead, and found the sugar concentrations for each liter as follows: 95.3, 101.2, 92.3, 90.1, 96.4, 99.2, 110.3, 103.4, 91.2, 89.9. Based on this information, perform a hypothesis test to answer whether the claim on the nutrition label is true, at significance level \(\alpha = 0.05\).
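Here is my own R sketch of how the two tests could be carried out (not an official solution): in Example 1, \(\sigma = 10\) is known and \(n = 50\), so a z-test applies; in Example 2, \(\sigma\) is unknown and the sample is small, so a t-test applies.

```r
## Example 1: z-test (sigma = 10 known, sample mean = 5050 / 50 = 101)
xbar <- 5050 / 50
z <- (xbar - 90) / (10 / sqrt(50))   # test statistic
p_z <- 2 * (1 - pnorm(abs(z)))       # two-sided p-value

## Example 2: t-test (sigma unknown, n = 10)
x <- c(95.3, 101.2, 92.3, 90.1, 96.4, 99.2, 110.3, 103.4, 91.2, 89.9)
t.test(x, mu = 90, alternative = "two.sided")

c(z = z, p_value = p_z)
```

In both examples the p-value comes out well below 0.05, so we would reject \(H_0\) and conclude there is sufficient evidence that the true mean concentration differs from 90 g/liter.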

Key Points during Lecture 15:

Critical values for t and z:

At the same significance level, the t critical value decreases as the sample size increases, but it is always slightly larger than the z critical value for that level. In particular, as the sample size goes to infinity, the t-value converges to the z-value. In other words, the standard normal distribution is the limiting case of the t-distribution.
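You can see this convergence directly in R (the degrees of freedom below are chosen arbitrarily for illustration):

```r
# 97.5th percentile (two-sided alpha = 0.05) for increasing degrees of freedom
sapply(c(5, 10, 30, 100, 1000), function(df) qt(0.975, df))

# The z critical value the t-values converge to
qnorm(0.975)   # approximately 1.96
```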

t-distribution:

To use the t-table, round the degrees of freedom down. In the table, \(\alpha\) represents the area to the right. A t statistic (in absolute value) greater than the value tabulated for \(\alpha=0.01\), for example, indicates that the p-value is less than 0.01.
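The table can only bracket the p-value; `pt` in R gives it exactly (the statistic and degrees of freedom below are illustrative):

```r
# Suppose the observed t statistic is 2.9 with 14 degrees of freedom
t_obs <- 2.9
df <- 14

# Exact right-tail p-value, which the t-table can only bracket
1 - pt(t_obs, df)

# Two-sided p-value
2 * (1 - pt(abs(t_obs), df))
```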

p-value:

We officially introduced the definition of the p-value as follows: a p-value is the (conditional) probability of observing a test statistic as extreme (or weird) as, or more extreme than, what we observed, assuming that the null hypothesis (\(H_0\), the hypothesis that we would like to find evidence to reject) is true.

Report p-values:

It is always good to report p-values, because everyone may define a “small” p-value differently. Usually, people choose either \(\alpha = 0.05\) or \(\alpha = 0.01\). This “selected” level is called the significance level.

Formal Statement for Conclusion:

\(\bullet\) With a p-value lower than the significance level (such as 0.05 or 0.01), we reject \(H_0\), the null hypothesis, and we say: there is sufficient evidence to suggest that the population mean of …..what  question  say…….. is (different from/greater than/smaller than) …the  number….

\(\bullet\) With a p-value greater than the significance level (such as 0.05 or 0.01), we fail to reject \(H_0\), the null hypothesis, and we say: there is not sufficient evidence to suggest that the population mean of …..what  question  say…….. is (different from/greater than/smaller than) …the  number….

Derive Confidence Interval:

We typically find \(z_{\frac{\alpha}{2}}\) such that \(P(Z>z_{\frac{\alpha}{2}}) = \frac{\alpha}{2}\), then set \(z_{\frac{\alpha}{2}} = \frac{c}{\sigma/\sqrt{n}}\), which gives the confidence interval for the population mean \(\mu\) as \([\bar X - z_{\frac{\alpha}{2}} \cdot \frac{\sigma}{\sqrt{n}}, \bar X + z_{\frac{\alpha}{2}} \cdot \frac{\sigma}{\sqrt{n}}]\). The number \(c = z_{\frac{\alpha}{2}} \cdot \frac{\sigma}{\sqrt{n}}\) is called the margin of error. The confidence interval gives a plausible range for the true population parameter, and it becomes narrower (more precise) as the sample size increases.
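A minimal R sketch of this z interval (the values \(\bar X = 101\), \(\sigma = 10\), \(n = 50\) are illustrative, borrowed from the cider example):

```r
xbar <- 101; sigma <- 10; n <- 50
alpha <- 0.05

# z_{alpha/2}: the value with upper-tail area alpha/2
z <- qnorm(1 - alpha / 2)

# Margin of error c = z * sigma / sqrt(n)
c_me <- z * sigma / sqrt(n)

# 95% confidence interval for the population mean
c(lower = xbar - c_me, upper = xbar + c_me)
```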

How to calculate the quantile or the probability in t-distribution:

Student’s t distribution introduces the concept of degrees of freedom, so the R code for the t distribution involves supplying the degrees of freedom as the second argument (for a one-sample problem with sample size \(n\), the degrees of freedom are \(n-1\)). The R code to compute the probability \(P(t<a)\) with df degrees of freedom has the format pt(a, df). The R code to compute the \((100 \times a) \ ^{th}\) quantile with df degrees of freedom has the format qt(a, df).

pt(2,200) # The probability P(t<2) with 200 degrees of freedom. 
## [1] 0.9765734
qt(0.95,2000) # The 95th percentile of Student's t distribution with 2000 degrees of freedom
## [1] 1.645616

Key Points during Lecture 16:

Steps for constructing a confidence interval for the population mean:

\(\bullet\) First, check whether the sample is random. If the problem says so, great! Just move on! If not, you can either explain with your own reasoning why you think the collected sample is (or is not) random, or draw a scatterplot if the data are given (if you have a good feel for the data, a clear explanation in plain words works too). This is important because we need a random sample to perform either a t-test or a z-test.

Then, we check normality: \(\bullet\) 1. Whether the underlying population distribution is normal: check whether we are told that the distribution of the population from which the measurements are taken is normal.

\(\bullet\) 2. If condition 1 fails, check whether the sample size is large enough (\(n\ge 30\)) to employ the central limit theorem, which says that the sampling distribution of the sample mean of the measurements is approximately normal.

\(\bullet\) 3. If neither 1 nor 2 holds, we may use a QQ-plot (or perform a normality test) on the data to see whether the population seems to be normally distributed.

\(\bullet\) 4. If the data are not given and the sample size is small (\(n<30\)), we rely on the robustness of the t procedures against violations of normality. In short, we should check (not PROVE) randomness and normality.

Then you can construct the confidence interval as follows:

\(\bullet\) If \(\sigma\) is given, skip this part. Otherwise, compute the sample standard deviation \(s\), the square root of the sample variance, and use it in place of \(\sigma\).

\(\bullet\) Find the t-score or z-score by referencing the table at the specified significance level. For two tails (keywords: between, \(\pm\)), we find \(t_{\frac{\alpha}{2}}\) or \(z_{\frac{\alpha}{2}}\); for one tail (keywords: no greater than, no less than), we find the confidence upper bound (CUB) or confidence lower bound (CLB) by finding \(t_{\alpha}\) or \(z_{\alpha}\).
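These steps can be sketched in R, reusing the ten cider measurements from the extra example as an illustrative sample (a two-sided 95% t interval, since \(\sigma\) is unknown and \(n = 10\)):

```r
x <- c(95.3, 101.2, 92.3, 90.1, 96.4, 99.2, 110.3, 103.4, 91.2, 89.9)
n <- length(x)
alpha <- 0.05

s <- sd(x)                          # sample standard deviation
t_crit <- qt(1 - alpha / 2, n - 1)  # t_{alpha/2} with n - 1 degrees of freedom
me <- t_crit * s / sqrt(n)          # margin of error

c(lower = mean(x) - me, upper = mean(x) + me)

# The same interval is reported by t.test:
t.test(x, conf.level = 0.95)$conf.int
```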

Example of Formal Statement for Conclusion for confidence interval:

\(\bullet\) We are approximately (95%/99%) confident that the population mean of …..what  question  say…….. is (between /no greater than/no smaller than) …the  confidence  interval….

\(\bullet\) With a p-value greater than the significance level (such as 0.05 or 0.01), we fail to reject \(H_0\), the null hypothesis, and we say: there is not sufficient evidence to suggest that the population mean of …..what  question  say…….. is (different from/greater than/smaller than) …the  number….

Clarification for the caveat:

The caveat says the t-distribution tolerates violations of normality due to small sample size, so we can apply the t procedures whenever the sample is random. However, the caveat does not mean the t-distribution also tolerates outliers, which may make the sample deviate badly from the (approximately) normal distribution.

Using t-score or z-score:

During office hours there were questions about when to use the z-score versus the t-score. My explanation is that, after checking randomness:

\(\bullet\) If the sample size is small (\(n<30\)), use t-test.

\(\bullet\) If the population standard deviation \(\sigma\) is not known, use t-test.

\(\bullet\) If the underlying distribution is given normal and the population standard deviation is known, use z-test.

\(\bullet\) If the sample size is large (\(n \ge 30\)) and the population standard deviation is known, we employ the Central Limit Theorem and use the z-test with the approximation \(P(\bar X>t) \approx P\left(Z>\frac{t-\mu}{\sigma/\sqrt{n}}\right)\).

Formal Statement for Conclusion for hypothesis testing:

\(\bullet\) With a p-value lower than the significance level (such as 0.05 or 0.01), we reject \(H_0\), the null hypothesis, and we say: there is sufficient evidence to suggest that the population mean of …..what  question  say…….. is (different from/greater than/smaller than) …the  number….

\(\bullet\) With a p-value greater than the significance level (such as 0.05 or 0.01), we fail to reject \(H_0\), the null hypothesis, and we say: there is not sufficient evidence to suggest that the population mean of …..what  question  say…….. is (different from/greater than/smaller than) …the  number….

\(\bullet\) The confidence interval contains exactly the values for which we fail to reject the null hypothesis at the designated significance level. For example, a 99% confidence interval contains all the values at which we fail to reject the null hypothesis at significance level \(\alpha = 0.01\).
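A small R check of this duality, using the illustrative cider sample again: null values inside the 99% interval should give p-values above 0.01, and values outside should give p-values below it.

```r
x <- c(95.3, 101.2, 92.3, 90.1, 96.4, 99.2, 110.3, 103.4, 91.2, 89.9)

ci <- t.test(x, conf.level = 0.99)$conf.int  # 99% confidence interval
ci

# p-value for a null value inside the interval (the center): above 0.01
t.test(x, mu = mean(ci))$p.value

# p-value for a null value well outside the interval: below 0.01
t.test(x, mu = ci[1] - 5)$p.value
```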

Difference between parameter and statistics:

A statistic is a numerical value computed from a sample, so it describes just a fraction of the population. We typically use statistics to estimate parameters.

So what is a parameter? A parameter is a fixed numerical value, the true value for the population; it reflects the aggregate of all population members under consideration. The difference between the two is described in detail on the following website.

Reference: https://keydifferences.com/difference-between-statistic-and-parameter.html

t-distribution:

To use the t-table, round the degrees of freedom down. In the table, \(\alpha\) represents the area to the right. A t statistic (in absolute value) greater than the value tabulated for \(\alpha=0.01\), for example, indicates that the p-value is less than 0.01.

Last Comment:

Please let me know if you find typos or grammatical mistakes so I can fix them. It is great writing practice and I appreciate your help!