Test your knowledge of the material on discrete distributions in the following quiz to see how much you learned. This is entirely private for you---no records are kept of your performance.
1. What are the assumptions necessary for a variable to be distributed as a binomial distribution?
A binomial distribution requires ALL of these assumptions: the variable must be binary (success/failure), discrete (counts), and have a fixed number of trials, along with independent trials and constant probability.
2. How does the skewness of a binomial variable relate to its parameters n and p?
The skewness of a binomial distribution is approximately p(1-p)/n. This shows that skewness decreases as n increases, and is minimized when p=0.5 (symmetric distribution).
3. When would you use a negative binomial distribution over a Poisson distribution?
The key difference is overdispersion: use negative binomial when variance > mean. The Poisson distribution assumes mean = variance, which is often violated in real count data.
4. How does the mean and variance of a Poisson distribution relate to its parameter 位?
For the Poisson distribution, both the mean and variance equal the parameter 位. This equidispersion property (mean = variance) is a key characteristic of the Poisson distribution.
5. Which example is most likely to follow a log-series distribution?
The log-series distribution is best for highly skewed count data where there are many low-frequency events and few high-frequency events, such as word frequencies in text (following Zipf's law). Option (a) would be Poisson, (c) would be binomial, and (d) would be geometric.
6. Which of the following characterize the negative binomial distribution and the geometric distribution?
Both statements are correct. The negative binomial distribution counts failures before achieving r successes, and the geometric distribution is the special case where r=1, counting failures before the first success.
7. In what type of data might you use a negative binomial distribution?
The negative binomial distribution is used for count data (non-negative integers), particularly when the data show overdispersion. It is not appropriate for binary data (that would be binomial or Bernoulli) or continuous data.
8. What is the mean and variance of a variable distributed as a geometric distribution?
For the geometric distribution with parameter p, the mean is (1-p)/p and variance is (1-p)/p^2. Note: There are two parameterizations of the geometric; DDAR uses parameter p rather than 位. This question is problematic as noted in the original quiz.