What happens when p > n? How do we estimate
models when this occurs?
How about statistical inference of parameters and
p-values? What happens?
When there are more parameters than observations
[say in genomics], we need to be careful when fitting models.
We need to invoke the pseudoinverse’s correct
formula ie:
|
Which we can once again solve via Cholesky
Factorization.
We saw in the Linear Regression optimization notes
that using the modified LM algorithm is necessary for
ill conditioned matrices.
|
Likewise by extending this to underdetermined systems:
|
However, for GLMs, we use to have that:
|
For underdetermined systems then we have:
|
However, for underdetermined systems, we MUST use the modified LM
algorithm for GLMs,
otherwise non convergence is seen ie:
|
So in steps:
|
(c) Copyright Protected: Daniel Han-Chen 2020
License: All content on this page
is for educational and personal purposes only.
Usage of material, concepts, equations,
methods, and all intellectual property on any page in this publication
is forbidden for any commercial
purpose, be it promotional or revenue generating. I also claim no
liability
from any damages caused by my material.
Knowledge and methods summarized from various sources like
papers, YouTube videos and other
mediums are protected under the original publishers licensing arrangements.