
Standard Errors of Fitted Category Probabilities by the Delta Method for the Nested Logit Model
John Fox
2023-05-20
Source:vignettes/standard-errors.Rmd
standard-errors.RmdThis document uses the delta method (Fox, 2021, sec. 6.3.5) to derive approximations to the variances of estimated probabilities for dichotomous logit models, and from these, for the nested logit model. The standard errors of these estimated probabilities are the square-roots of their respective variances.
Notation
Let represent the probability that the dichotomous response in the th nested dichotomous logit model is (i.e., a “success”), , where is the number of response categories for the polytomy. Then is the probability that (i.e., a “failure”). I assume that the regression coefficients and their covariance matrix for each dichotomous logit model are estimated in the usual manner.
Let represent the (estimated) logit (log-odds) for the th dichotomous logit model, with variance (see below).
Let , represent the probability that the polytomous response is .
Let and represent the estimates of these probabilities.
In the sequel, which involves only the estimates of these and other parameters, I’ll omit the hats so as to simplify the notation.
In the nested logit model, the polytomous probabilities are each products of probabilities or for ; that is is the subset of the dichotomous logit models that enter into . Let represent either or , as appropriate for category of the polytomous response. Then for .
Finally, the individual-category probabilities can be converted into logits, . The estimates of these logits should approach asymptotic normality more rapidly than the estimates of the corresponding probabilities.
An Example
I’ll use the following example to illustrate the results in this document: Suppose that we have a three-category response variable with categories , , and , and define the two nested dichotomies coded 0 or 1 for categories and , respectively, and coded 0 and 1 for categories and . Then ; . As well, and . Consequently, Here, I abuse the notation slightly in the interest of clarity, using letters rather than numbers for the response categories, so the index of response categories, , takes on the values , , and , rather than 1, 2, and 3.
Variances of the Estimated Probabilities
For the Dichotomous Logit Models
The estimated probability of success for the th dichotomous logit model is Then is a function of the regression coefficients, where (an arbitrary vector of values of the regressors) and . The probability of failure is The variance of the logit is ,
The derivatives of and with respect to are
By the univariate delta method,
For the Nested Logit Model
The variances of the estimated response-category probabilities for the polytomous response can be obtained similarly by the multivariate delta method, recognizing that these probabilities are products of the dichotomous probabilities. The result is greatly simplified because the dichotomies are independent, and so the covariance matrix of the estimated dichotomous probabilities is diagonal.
The required derivatives are for and . Here, denotes set difference. Because , it’s always the case that , and so for .
Applying these results to the example, recall, first, that and so the set for the product , that is, , is empty. In this case, the product is taken = 1, and . That makes intuitive sense, because, as noted previously, .
Proceeding with and , . Consequently, each product and has only one term, for in the case of or in the case of :
Yet another application of the delta method produces approximate variances for the individual-category logits. The relevant derivative is for , and so
Acknowledgment
I’m grateful to Georges Monette of York University for a close reading of an earlier version of this document, and in particular for his suggested simplification of the notation employed.