includes RMST, difference in RMST and confidence intervals #1526

bayesfactor · 2023-05-22T04:51:06Z

Implements RMST, difference in RMST from the R package: https://cran.r-project.org/package=survRM2

Includes RMST of one population (point estimate and confidence interval), difference in RMST of 2 populations (point estimate, p-value, and confidence intervals). Addresses #821

Implements RMST, difference in RMST from the R package: https://cran.r-project.org/package=survRM2 Includes RMST of one population (point estimate and confidence interval), difference in RMST of 2 populations (point estimate, p-value, and confidence intervals). Addresses CamDavidsonPilon#821

bayesfactor · 2023-05-22T04:52:58Z

@CamDavidsonPilon, I would very much appreciate your review (and approval, conditional on acceptance). My organization is really struggling without a sound method to generate confidence intervals on a difference in 2 survival populations.

add check on point_in_time argument for difference in RMST

CamDavidsonPilon · 2023-06-08T13:21:10Z

Hi @bayesfactor! Thanks for the PR (sorry about the delay).

We have an lifelines.utils.restricted_mean_survival_time function now - have you compared against that? We don't have a difference_ though, so that helps.

bayesfactor · 2023-06-08T16:53:40Z

@CamDavidsonPilon, thank you for your response. I'm sorry, I overlooked lifelines.utils.restricted_mean_survival_time. I now validated that the 2 implementations produce the same point estimate on a test set. My newer implementation provides an estimate of standard error, and lower/upper confidence intervals based on that standard error, which I think is a nice feature. Perhaps you or I should put SE, LCI, and UCI into your original implementation of RMST. If you agree, please let me know if you would rather do it or ask me to.

As you said, the main point of this pull request is to provide a confidence interval on difference in RMST, which is a new feature as far as I know.

bayesfactor · 2023-06-14T17:01:37Z

@CamDavidsonPilon, I'm hoping to hear back from you about whether you or I should include the other outputs from RMST, and therefore how to proceed with the difference in RMST (including confidence intervals) functionality. Can you please let me know what is your preference?

Changes: 1. fixed an issue of returning NaN confidence intervals if the followup time `point_in_time` is greater than the last observed event. Using `.replace(np.inf, 0)`, we correct the issue and return correct confidence intervals for that edge case. 2. cosmetic refactoring to do less recalculation and rely more on pre-calculated values stored in `fitterA`

bayesfactor · 2023-11-12T22:31:59Z

I fixed an issue with RMST where, if the followup interval is greater than the last observed event, the confidence intervals were NaN. The latest commit resolves that issue and makes some minor refactoring for efficiency.

@CamDavidsonPilon, I'm hoping to hear back from you about whether you or I should include the other outputs from RMST, and therefore how to proceed with the difference in RMST (including confidence intervals) functionality. Can you please let me know what is your preference?

bayesfactor · 2023-11-12T22:32:47Z

lifelines/statistics.py

-    wk_var = wk_var.tolist() + [0]
-    rmst_var = sum((np.flip(areas[1:])).cumsum() ** 2 * np.flip(wk_var)[1:])
+    wk_var = wk_n_event.observed / (wk_n_risk * (wk_n_risk - wk_n_event.observed))
+    wk_var = wk_var.replace(np.inf, 0).tolist()[1:] + [0]


this is the part that fixes the NaN issue. By adding .replace(np.inf, 0), the confidence intervals are not NaN now.

CamDavidsonPilon · 2023-11-14T12:35:15Z

Hi @bayesfactor,

Thanks for keeping up with this! Can you explain how this works for parametric fitters? In my head, to compute the AUC of a parametric fitter, some scipy.integrate.quad procedure is necessary.

bayesfactor · 2023-11-14T20:22:18Z

Great point, this won't work for all fitters. The current implementation depends on a fitter.event_table, so it only works for fitters that have an event_table property. Using data from event_table, the code does a numerical integration:
https://github.com/bayesfactor/lifelines/blob/107da27a637cc2ae4125666b8ea2b3e217f2c699/lifelines/statistics.py#L392

I don't know how often people use other fitters; I could
a) modify the code to check if event_table exists, and if not, compute it
b) modify the existing utils.restricted_mean_survival_time to optionally return the RMST variance needed for the calculation of a confidence interval in the difference in RMST

lifelines/lifelines/utils/__init__.py

Line 209 in c9b136b

def restricted_mean_survival_time(

CamDavidsonPilon · 2023-11-15T13:11:54Z

I think we can combine the functions! Have a global restricted_mean_survival_time that, based on the fitter, chooses an implementation. The current implementation for computing the variance for KMF's is imprecise compared to yours.

bayesfactor · 2023-11-19T14:10:50Z

upon further thought, should we push one step further upstream and give all fitters an event_table? It seems like a handy attribute and that making fitters more similar would be beneficial.

…ers get an event_table

bayesfactor · 2023-11-23T19:29:05Z

Unless I'm missing something, my last commit moved the event_table property into the BaseFitter class so that all fits get an event_table property. This passes tests -- is it correct and kosher?

Update statistics.py

f447b26

add check on point_in_time argument for difference in RMST

bayesfactor commented Nov 12, 2023

View reviewed changes

moved the event_table property into the base fitter class so all fitt…

514e898

…ers get an event_table

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

includes RMST, difference in RMST and confidence intervals #1526

includes RMST, difference in RMST and confidence intervals #1526

bayesfactor commented May 22, 2023

bayesfactor commented May 22, 2023

CamDavidsonPilon commented Jun 8, 2023

bayesfactor commented Jun 8, 2023

bayesfactor commented Jun 14, 2023

bayesfactor commented Nov 12, 2023

bayesfactor Nov 12, 2023

CamDavidsonPilon commented Nov 14, 2023

bayesfactor commented Nov 14, 2023

CamDavidsonPilon commented Nov 15, 2023 •

edited

Loading

bayesfactor commented Nov 19, 2023

bayesfactor commented Nov 23, 2023

includes RMST, difference in RMST and confidence intervals #1526

Are you sure you want to change the base?

includes RMST, difference in RMST and confidence intervals #1526

Conversation

bayesfactor commented May 22, 2023

bayesfactor commented May 22, 2023

CamDavidsonPilon commented Jun 8, 2023

bayesfactor commented Jun 8, 2023

bayesfactor commented Jun 14, 2023

bayesfactor commented Nov 12, 2023

bayesfactor Nov 12, 2023

Choose a reason for hiding this comment

CamDavidsonPilon commented Nov 14, 2023

bayesfactor commented Nov 14, 2023

CamDavidsonPilon commented Nov 15, 2023 • edited Loading

bayesfactor commented Nov 19, 2023

bayesfactor commented Nov 23, 2023

CamDavidsonPilon commented Nov 15, 2023 •

edited

Loading