When multiple failures occur at the same recorded time point—such as
with daily event recording, grouped clinical visit schedules, or
discretized follow-up times—the underlying risk-set contribution becomes
substantially more complex, and the associated conditional-experiments
structure as well as the stratum-specific probability mass functions
must be modified accordingly. We consider two standard approaches for
handling ties: Cox’s exact method (Cox
1972) and the Breslow approximation (Breslow 1974). Because these two approaches
construct the probability mass in fundamentally different ways, we
discuss them separately.
Cox’s Exact Method
In stratum
,
suppose that at time
,
subjects
fail. Let
denote the at-risk set with cardinality
,
and let
denote the observed failure (tie) set of size
.
Let
denote the collection of all subsets of size
drawn from
,
with cardinality
.
Probabilistic Framework
Under Cox’s exact method, for stratum
and event time
,
let
denote a candidate failure subset of size
,
and define the event
to indicate that the subjects in
fail in the interval
.
Let
denote all censoring and failure information up to
,
together with the information that exactly
failures occur in that interval. Then
remains a well-defined sequence of conditional experiments. At each
event time
,
the internal working model specifies the conditional density as
.
In contrast to the no-ties case where the support consists of
individual subjects, under Cox’s exact method the support is given by
the
candidate failure subsets of size
.
Internal and External Probability Mass Functions
The stratum-specific probability mass at time
under the internal model is
where
denotes the sum of internal risk scores over subjects in the
candidate failure subset
.
Replacing the internal risk scores with the external risk scores
,
the probability mass under the external model is
where
denotes the sum of external risk scores over subjects in
.
KL Divergence
The KL divergence between the external and internal models at time
in stratum
is
Substituting the expressions above and accumulating over all strata
and failure times yields
Integrated Objective Function
Proposition (Exact Ties). Under the stratified Cox
model with exact ties, the integrated objective
admits the representation
where
.
Furthermore, under the linear specification
,
define
Then the objective simplifies to
Breslow Approximation
Although Cox’s exact method provides a precise treatment of tied
failure times, its computational cost grows combinatorially with the
number of ties at each event time: the size of the candidate failure set
becomes prohibitively large whenever
is non-negligible relative to
.
The Breslow approximation (Breslow 1974)
addresses this limitation by replacing the exact combinatorial
denominator with a computationally tractable surrogate, while retaining
the same support
of all
size-
subsets of the risk set.
Approximation of the Combinatorial Denominator
Under Cox’s exact method, the denominator of
involves the elementary symmetric polynomial of degree
in the individual exponentiated risk scores,
which enumerates all possible subsets of size
drawn without replacement from
.
The Breslow approximation replaces this without-replacement enumeration
by treating the
failures as
independent draws with replacement from
,
yielding the approximation
which becomes increasingly accurate as
,
since the probability of selecting the same subject twice becomes
negligible. Crucially, the support of the distribution remains
unchanged: both the exact and Breslow formulations are defined over all
candidate failure subsets
.
What changes is solely the denominator used to normalize the probability
mass.
Internal and External Probability Mass Functions
Substituting this approximation, the Breslow approximation to the
internal probability mass at time
in stratum
is
and correspondingly, the Breslow approximation to the
external probability mass is
where
and
as before. Note that
and
are not proper probability distributions over
in general, since the Breslow denominator does not equal the true
normalizing constant and hence the masses do not sum to one.
Nevertheless, they serve as well-defined surrogates within which the
KL-divergence framework can be applied in an approximate sense.
KL Divergence and Simplified Weight
The approximate KL divergence follows the same derivation as in the
exact-ties setting. Under the Breslow approximation, the combinatorial
sum over all subsets
collapses under the with-replacement structure. Specifically, exchanging
the order of summation and noting that the marginal probability of
subject
appearing in any selected subset equals
times its single-draw softmax probability, one obtains
with no combinatorial enumeration required.
Integrated Objective Function
Proposition (Breslow Approximation). Under the
stratified Cox model with the Breslow approximation for ties, the
integrated objective
under the linear specification
admits the representation
where
and
As in the exact-ties setting,
depends only on the prior risk score
and can be computed once in a preprocessing step. In contrast to the
exact method, however,
under the Breslow approximation requires no combinatorial enumeration:
it reduces to a softmax-weighted average of covariates over the risk set
,
scaled by
,
with computational cost
per event time. The resulting objective is structurally identical to the
standard Breslow partial likelihood, with the observed covariate sum
replaced by the blended term
,
with no additional computational burden introduced by the KL integration
term.