Suppose we have total
subjects from
transplant centers with
records from center
().
Coefficients of risk factors are denoted by
=
,
and let
denote the effect of centers.
Generalized Linear Models
Let
denotes the outcome variable for record
)
of center
,
and let
be the corresponding
vector of risk factors. We assume that given the linear predictor
,
the outcome
follows a distribution in the exponential family. Incorporate with
penalty terms, our problem of interest is estimating
by minimizing:
where
is the regularization parameter for
.
Two-layer iterative update procedure
outer layer: update the provider effect
Given the previous iterationβs estimate of
(denoted as
),
consider
that is defined as
Since the score function of
is separable for
and the fisher information matrix is diagonal,
can be updated separately using only a subset of the entire data by a
one-step Newton procedure:
inner layer: update the covariate coefficient
Based on the updated value of
(i.e.Β the
that was updated previously), consider
where
.
The coordinate-wise updating function of
utilizing the sub-differential calculus is given by:
where
and
Let
be a
vector of risk factors that are divided into
non-overlapping groups
(
denotes the length of group
).
The coefficients of risk factors are denoted by
=
,
β¦,
.
Given the observed data
,
our problem of interest becomes estimating
by minimizing
where
is the loss function, and
is the
norm of
.
is the regularization parameter on group
with default
.
In the outer layer of each iteration, the center effects
βs
are updated the same way as we discussed previously.
In the inner layer, we use the Majorize-Minimization (MM) algorithm
to improve the efficiency of our algorithm for binary outcomes. Based on
the current value of
,
the subdifferential-based group-wise updating function of the objective
function is given by
where
and
.
is the most recently updated value of
but set
to
,
for continuous outcome and
for binary outcome.
Discrete Survival Logistic Model
Let
represent the underlying uncensored failure time and
be the censoring time of individual
of center
.
Let
denote the
vector of risk factors of
individual from center
,
is the center indicator, and
be the corresponding observed failure or censor time with
,
where
is the discrete failure times indexed by
.
We assume that
is independent with
given
and
.
Let
be the hazard for the individual with risk factor
and from center
at time
,
and let
and
be the set of indices attached to individuals from center
failing and censoring at
,
respectively.
The full likelihood function is given by
where
is the survival function at time
corresponding to individual from center
with covariate
.
Let
be the discrete baseline hazard function at time
,
then the hazard relationship for the discrete-time logistic model is
defined as:
Let
,
then our problem of interest is estimating
by minimizing:
Three-layer iterative update procedure
Given the previous iterationβs estimate of center effect
and coefficient of risk factors
,
we consider
.
The Newton method allows us to update
separately by:
where
is the current value of
,
and $$U_{\lambda}(\tilde{\alpha}_k) = -
\frac{1}{n} \sum_{i = 1}^m \{\sum_{j = 1}^{n_i} \delta_{ij}
\left(t_{k}\right) - \sum_{j|T_{ij} \geq t_k} \frac{e^{\tilde{\alpha}_k
+ \tilde{\gamma}_i + \boldsymbol{Z}_{ij} \tilde{\boldsymbol{\beta}}}}{1
+ e^{\tilde{\alpha}_k + \tilde{\gamma}_i + \boldsymbol{Z}_{ij}
\tilde{\boldsymbol{\beta}}}}\} \\
I_{\lambda}(\tilde{\alpha}_k) = - \frac{1}{n} \sum_{i = 1}^m
\sum_{j|T_{ij} \geq t_k} \frac{e^{\tilde{\alpha}_k + \tilde{\gamma}_i
+ \boldsymbol{Z}_{ij} \tilde{\boldsymbol{\beta}}}}{(1 +
e^{\tilde{\alpha}_k + \tilde{\gamma}_i + \boldsymbol{Z}_{ij}
\tilde{\boldsymbol{\beta}}})^2}.$$
middle layer: update the provider effect
Given the
updated above and the most recently updated
,
we consider
.
Use a similar one-step Newton method we should have
where
represent the most recently updated value of effect of center
,
and $$U_{\lambda}(\tilde{\gamma}_i) = -
\frac{1}{n} \sum_{j = 1}^{n_i} \sum_{k = 1}^{k_{ij}} \{\delta_{ij}
\left(t_{k}\right) - \frac{e^{\hat{\alpha}_k + \tilde{\gamma}_i
+ \boldsymbol{Z}_{ij} \tilde{\boldsymbol{\beta}}}}{1 +
e^{\hat{\alpha}_k + \tilde{\gamma}_i + \boldsymbol{Z}_{ij}
\tilde{\boldsymbol{\beta}}}}\} \\
I_{\lambda}(\tilde{\gamma}_i) = - \frac{1}{n} \sum_{j = 1}^{n_i}
\sum_{k = 1}^{k_{ij}} \frac{e^{\hat{\alpha}_k + \tilde{\gamma}_i
+ \boldsymbol{Z}_{ij} \tilde{\boldsymbol{\beta}}}}{(1 +
e^{\hat{\alpha}_k + \tilde{\gamma}_i + \boldsymbol{Z}_{ij}
\tilde{\boldsymbol{\beta}}})^2}.$$
It should be noted that the effect of the first provider is set to
zero (as the reference group) to prevent issues of
multicollinearity.
inner layer: update the covariate coefficient
Based on the
and
updated from the previous two steps, define
and consider
Denote
and
then the coordinate-wise updating function will be given by:
where
is the same soft-thresholding operator defined above.
MM algorithm can also be applied for solving this problem since
.
The majorizing surrogate function is constructed based on
and the corresponding updating function is given by