Property 1: The maximum value of the log-likelihood function LL occurs when the following r+k-1 equations hold: for h = 1 to r-1,
or alternatively
and for j = 1 to k
where
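For orientation, since the displayed equations are images in the original, here is the form these equations take under the usual cumulative logit parameterization. This is an assumption about the notation: $p_{ih} = P(y_i \le h)$ with $p_{i0} = 0$ and $p_{ir} = 1$, $y_{ih} = 1$ exactly when observation i (of n) falls in category h, $x_{ij}$ the value of the j-th independent variable for observation i, and $LL = \sum_{i=1}^{n} \sum_{h=1}^{r} y_{ih} \ln(p_{ih} - p_{i,h-1})$:

\[ p_{ih} = \frac{1}{1 + e^{-(a_h + \sum_{j=1}^{k} b_j x_{ij})}}, \qquad h = 1, \dots, r-1 \]

\[ \frac{\partial LL}{\partial a_h} = \sum_{i=1}^{n} \left( \frac{y_{ih}}{p_{ih} - p_{i,h-1}} - \frac{y_{i,h+1}}{p_{i,h+1} - p_{ih}} \right) p_{ih}(1 - p_{ih}) = 0 \]

\[ \frac{\partial LL}{\partial b_j} = \sum_{i=1}^{n} \sum_{h=1}^{r} \frac{y_{ih} \left( p_{ih}(1 - p_{ih}) - p_{i,h-1}(1 - p_{i,h-1}) \right)}{p_{ih} - p_{i,h-1}} \, x_{ij} = 0 \]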
Proof
The likelihood function is maximized when the first partial derivatives of LL with respect to the $a_h$ are zero for h = 1 to r-1 and the first partial derivatives of LL with respect to the $b_j$ are also zero for j = 1 to k. We define the following:
Thus
We also note that for 1 < h < r
from which it follows that
For h = 1 and h = r, we have
We next calculate various partial derivatives
For 1 < h < r, we have
For h = 1 and h = r, we get
Since $p_{i0} = 0$ and $p_{ir} = 1$, the same formula applies even when h = 1 or h = r, and so this result holds for all 1 ≤ h ≤ r.
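In the notation assumed earlier, the identity being extended to the boundary cases here is presumably

\[ \frac{\partial}{\partial a_l} \left( p_{ih} - p_{i,h-1} \right) = \delta_{hl} \, p_{ih}(1 - p_{ih}) - \delta_{h-1,l} \, p_{i,h-1}(1 - p_{i,h-1}), \qquad 1 \le l \le r-1 \]

where $\delta$ is the Kronecker delta; it covers h = 1 and h = r precisely because $p_{i0}(1 - p_{i0}) = 0$ and $p_{ir}(1 - p_{ir}) = 0$.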
For 1 < h < r, we have
For h = 1, we have
and so for 1 ≤ h ≤ r-1
We next consider
which is valid for 0 < h < r-1. It is also valid for h = r-1 as can be seen from
We also note that
Finally, we calculate the partial derivatives of LL using the results described above.
Since
it follows that for 0 < h < r
which completes the proof.
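To make Property 1 concrete, here is a minimal numerical sketch in Python. It assumes the cumulative logit parameterization sketched after the statement of Property 1; the helper names, the one-hot data layout, and the simulated data are illustrative, not from the original. It computes LL and the score vector analytically and checks the latter against a finite-difference gradient.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loglik(a, b, X, Y):
    """LL = sum_i sum_h y_ih ln(p_ih - p_i,h-1) under the assumed model.

    a: (r-1,) cut points, b: (k,) slopes, X: (n,k), Y: (n,r) one-hot."""
    n = Y.shape[0]
    Z = a[None, :] + (X @ b)[:, None]         # logits of p_ih = P(y_i <= h)
    P = np.hstack([np.zeros((n, 1)), sigmoid(Z), np.ones((n, 1))])
    Q = np.diff(P, axis=1)                    # category probabilities
    return float(np.sum(Y * np.log(Q)))

def score(a, b, X, Y):
    """Analytic gradient of LL: the F vector of Property 1."""
    n = Y.shape[0]
    Z = a[None, :] + (X @ b)[:, None]
    P = np.hstack([np.zeros((n, 1)), sigmoid(Z), np.ones((n, 1))])
    Q = np.diff(P, axis=1)
    W = P * (1.0 - P)                         # p_ih (1 - p_ih); 0 at h = 0, r
    R = Y / Q
    Fa = ((R[:, :-1] - R[:, 1:]) * W[:, 1:-1]).sum(axis=0)  # dLL/da_h
    Fb = X.T @ (R * np.diff(W, axis=1)).sum(axis=1)         # dLL/db_j
    return np.concatenate([Fa, Fb])

# check the analytic score against central finite differences
rng = np.random.default_rng(0)
n, k, r = 200, 2, 4
X = rng.normal(size=(n, k))
Y = np.eye(r)[rng.integers(0, r, size=n)]     # simulated one-hot outcomes
a0, b0 = np.array([-1.0, 0.0, 1.0]), 0.3 * rng.normal(size=k)
theta = np.concatenate([a0, b0])
f = lambda t: loglik(t[:r-1], t[r-1:], X, Y)
num = np.array([(f(theta + e) - f(theta - e)) / 2e-6
                for e in 1e-6 * np.eye(r - 1 + k)])
assert np.allclose(score(a0, b0, X, Y), num, atol=1e-4)
```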
Property A: The matrix of second partial derivatives of LL (its Hessian, which is also the Jacobian of the vector of first partial derivatives) is a (k+r-1) × (k+r-1) symmetric matrix of the form
where $C = [c_{hl}]$ is an (r-1) × (r-1) matrix, $D = [d_{jg}]$ is a k × k matrix and $U = [u_{jh}]$ is a k × (r-1) matrix, consisting of the following elements:
where
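The element formulas displayed here are images in the original; what the stated dimensions and the symmetry of J do pin down is the block layout, with the r-1 rows and columns for the $a_h$ coefficients first and the k rows and columns for the $b_j$ coefficients last:

\[ J = \begin{bmatrix} C & U^T \\ U & D \end{bmatrix} \]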
Proof
We calculate the various second partial derivatives of LL using the properties established in the proof of Property 1. We start with the first group of second-derivative terms.
Next, we calculate the second group of terms.
Finally, we have the $\partial^2 LL / \partial a_h \, \partial a_l$ case. First, we note that
which is also true in the following two cases:
We now start with the case where l = h.
If l = h + t where t ≥ 2, then
and so
Finally, we consider the remaining case, where l = h + 1 and 0 < h < r-1, but first we note that
It now follows that
This completes the proof.
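The element formulas for $c_{hl}$, $d_{jg}$ and $u_{jh}$ are images in the original, but the claimed structure of J can be checked numerically under the same assumed model, reusing score, X, Y, theta, r and k from the sketch after Property 1: the matrix of second partials is symmetric, and the vanishing of the cross terms for l = h + t with t ≥ 2 makes the C block tridiagonal.

```python
def num_hessian(theta, eps=1e-5):
    """Matrix of second partials of LL: the Jacobian of the score vector."""
    m = len(theta)
    s = lambda t: score(t[:r-1], t[r-1:], X, Y)
    J = np.zeros((m, m))
    for i in range(m):
        e = np.zeros(m)
        e[i] = eps
        J[:, i] = (s(theta + e) - s(theta - e)) / (2 * eps)
    return J

J = num_hessian(theta)
assert np.allclose(J, J.T, atol=1e-4)        # symmetric, as Property A claims
C = J[:r-1, :r-1]                            # (r-1) × (r-1) block for the a_h
D = J[r-1:, r-1:]                            # k × k block for the b_j
U = J[r-1:, :r-1]                            # k × (r-1) mixed block
# c_hl vanishes whenever |h - l| >= 2, so C is tridiagonal
assert np.allclose(C, np.triu(np.tril(C, 1), -1), atol=1e-4)
```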
Property 2 (Newton’s method)
Proof: Let B = $(a_1, \dots, a_{r-1}, b_1, \dots, b_k)^T$ be the column vector of the r+k-1 coefficients, let F be the column vector of the first partial derivatives of LL from Property 1, let J be the matrix of second partial derivatives of LL from Property A, and define $B^* = B - J^{-1}F$.
Using Newton’s method, we conclude that B* is a better estimate of the regression coefficients than B (i.e. it produces a larger value of LL). The sequence B, B*, B**, B***, etc. therefore converges to the coefficient vector that maximizes LL, and at convergence $-J^{-1}$, the negative of the inverse of the Hessian matrix, is a good estimate of the covariance matrix for the regression coefficients.
From Property 1, we see that the F vector can be expressed as
where
and by Property A, we see that the Jacobian matrix can be expressed as
where $C = [c_{hl}]$, $D = [d_{jg}]$ and $U = [u_{jh}]$ are as described in Property A. This completes the proof.
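Putting Properties 1, 2 and A together, here is a minimal sketch of the resulting fitting loop under the same assumptions, reusing score and num_hessian from the earlier sketches. A real implementation would build J from the closed-form $c_{hl}$, $d_{jg}$ and $u_{jh}$ rather than by numerical differentiation.

```python
def fit_newton(B, tol=1e-8, max_iter=50):
    """Newton's method: iterate B* = B - J^{-1} F until the step is tiny."""
    for _ in range(max_iter):
        F = score(B[:r-1], B[r-1:], X, Y)   # gradient vector (Property 1)
        J = num_hessian(B)                  # second partials (Property A)
        step = np.linalg.solve(J, F)        # J^{-1} F
        B = B - step
        if np.max(np.abs(step)) < tol:
            break
    # covariance estimate: negative inverse Hessian at the maximum
    return B, -np.linalg.inv(num_hessian(B))

# start from the marginal cumulative frequencies, with zero slopes
cum = np.cumsum(Y.mean(axis=0))[:-1]
B0 = np.concatenate([np.log(cum / (1 - cum)), np.zeros(k)])
B_hat, cov = fit_newton(B0)
se = np.sqrt(np.diag(cov))                  # coefficient standard errors
```

For well-behaved data the loop converges in a handful of iterations; starting from the marginal cumulative frequencies keeps the cut points ordered, so every category probability is positive at the first step.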