
Slight differences in CoxPHFitter and CoxTimeVaryingFitter test cases #1599

Open
benslack19 opened this issue Feb 23, 2024 · 0 comments
This is likely very minor in most cases, but I still don't understand why there would be a difference. This is a result of comparing standard errors between the CoxPHFitter and CoxTimeVaryingFitter models when the data is equivalent (only one time period per subject). It originally stemmed from this discussion about left truncation.

I was using cluster_col in the CoxPHFitter and saw in the documentation that the sandwich estimator gets used, which is why the SE changes compared to the CoxTimeVaryingFitter model. When I attempted to match the robust SEs exactly (along the way I discovered issue #544 and created issue #1598), I could not match summary values past 3 decimal points.

Here's a reproducible example with my comments:

import numpy.testing as npt
import pandas as pd
from lifelines import CoxPHFitter, CoxTimeVaryingFitter
from lifelines.datasets import load_stanford_heart_transplants
from lifelines.utils import to_long_format

stanford = load_stanford_heart_transplants()

# Keep only the last record for each subject, drop all covariate columns except age to simplify data
stanford_last = (
    stanford.groupby("id")
    .tail(1)
    .drop(["year", "surgery", "transplant"], axis="columns")
)
stanford_last.head()

[screenshot: stanford_last.head() output]

# Format the data for CPH model
stanford_last_cph_wid = stanford_last.rename(
    columns={"start": "W", "stop": "T", "event": "E"}
)
stanford_last_cph_wid.head()

[screenshot: stanford_last_cph_wid.head() output]

The best I can do to match the standard errors between the CPH and CTV models is to not use cluster_col with the CPH model and to use id_col in the CTV model. But now the coefficient is slightly off (0.03616 vs. 0.36163).

[screenshot: model summaries showing the coefficient difference]

When using npt.assert_array_almost_equal, I could not match summary values past 3 decimal points. Why would this difference be observed?
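For anyone reproducing this, the check I ran has this shape (the numbers below are illustrative placeholders, not the actual summary values):

```python
import numpy as np
import numpy.testing as npt

# Illustrative SEs that agree to 3 decimal places but diverge at the 4th.
se_cph = np.array([0.01412])
se_ctv = np.array([0.01417])

# Passes: |difference| of 5e-5 is within the 1.5e-3 tolerance at decimal=3.
npt.assert_array_almost_equal(se_cph, se_ctv, decimal=3)

# Fails: 5e-5 exceeds the 1.5e-5 tolerance at decimal=5.
try:
    npt.assert_array_almost_equal(se_cph, se_ctv, decimal=5)
except AssertionError:
    print("matches to 3 decimals, not beyond")
```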

lifelines version: 0.27.8
