wrong result with CoxPHFitter #1486

takra · 2023-01-31T05:23:58Z

takra
Jan 31, 2023

Hello

I'm beginner in lifelines, so hope you will be indulgent. As a beginner I start to test Cox model in lifelines on known dataset of two groups of leukemia patients: one group of 21 persons has received a certain treatment; the other group of 21 persons has received a placebo. The data come from Freireich et al., Blood, 1963. Results of Cox model for this dataset is fully observed in David G. Kleinbaum, Mitchel Klein auth. Survival Analysis A Self-Learning Text.

In this book for single feature 'group' Cox coefficient is 1.509. But using lifelines I get coefficient 1.57 and I can't find reason for this. Besides, I solved the same problem in Matlab and in scikit-survival and got the same coefficient 1.509 as in book. Could you help me please to find my mistake in lifelines?

kleinbaum.zip

Answered by CamDavidsonPilon

Jan 31, 2023

My guess is that the other methods are not converging "enough". For example, in lifelines and in R, a partial-log-lik or -85.01 is achieved. In the screenshot, a partial-log-lik of -86.38 is achieved. We want to maximize this value, so I suspect that the other methods are halting convergence too early.

I see the same situation happen in model 2: lifelines and R achieve a larger partial-log-lik, hence should be considered more accurate.

Can you change the convergence parameters in the Matlab code?

View full answer

CamDavidsonPilon · 2023-01-31T14:07:07Z

CamDavidsonPilon
Jan 31, 2023
Maintainer

Hi @takra, I downloaded your data and I got the same result in lifelines, 1.57. In R's survival, I also got 1.57. Can you post your Matlab and Python code for sci-kit survival? And also report what the final partial log-likelihood output is for those inferences?

For inference in such a simple dataset (small data, no covariates), things should be almost identical.

1 reply

takra Jan 31, 2023
Author

I have corrected value from book and in matlab - 1.509 in first post. Besides I add matlab calculations and page from book. Look at model 1. For model 2 you will observe difference too.

kleinbaum.zip

CamDavidsonPilon · 2023-01-31T18:38:12Z

CamDavidsonPilon
Jan 31, 2023
Maintainer

My guess is that the other methods are not converging "enough". For example, in lifelines and in R, a partial-log-lik or -85.01 is achieved. In the screenshot, a partial-log-lik of -86.38 is achieved. We want to maximize this value, so I suspect that the other methods are halting convergence too early.

I see the same situation happen in model 2: lifelines and R achieve a larger partial-log-lik, hence should be considered more accurate.

Can you change the convergence parameters in the Matlab code?

4 replies

takra Jan 31, 2023
Author

Thank you for idea,
I have increased
coxphopt.MaxIter = 500;
coxphopt.TolX = 1e-9;
coxphopt.TolFun = 1e-9;
coxphopt.MaxFunEvals = 500 but loglikelihood doesn't change at all.

CamDavidsonPilon Jan 31, 2023
Maintainer

hm hm hm. Those seem to be enough. It could be how ties are handled. Do you know if Matlab uses Breslow or Efron method for handling ties? ~~I know that scikit survival doesn't use Breslow (which lifelines and R does)~~ - that might be it!

EDIT: oops got that backwards: lifelines and R use Efron. scikit uses breslow

CamDavidsonPilon Jan 31, 2023
Maintainer

yeaaa I just tested method="breslow"in R, and got the 1.5092 value.

So it's how ties are handled. Efron is considered the most accurate (vs performance), but take some time to read up and decide which tie-breaking method you want.

takra Jan 31, 2023
Author

By default matlab use breslow ties, after change to Efron get 1.57. Thank you very much!

Uh oh!

wrong result with CoxPHFitter #1486

Uh oh!

Uh oh!

takra Jan 31, 2023

Replies: 2 comments · 5 replies

Uh oh!

Uh oh!

CamDavidsonPilon Jan 31, 2023 Maintainer

Uh oh!

Uh oh!

takra Jan 31, 2023 Author

Uh oh!

CamDavidsonPilon Jan 31, 2023 Maintainer

Uh oh!

takra Jan 31, 2023 Author

Uh oh!

Uh oh!

CamDavidsonPilon Jan 31, 2023 Maintainer

Uh oh!

CamDavidsonPilon Jan 31, 2023 Maintainer

Uh oh!

takra Jan 31, 2023 Author

takra
Jan 31, 2023

Replies: 2 comments 5 replies

CamDavidsonPilon
Jan 31, 2023
Maintainer

takra Jan 31, 2023
Author

CamDavidsonPilon
Jan 31, 2023
Maintainer

takra Jan 31, 2023
Author

CamDavidsonPilon Jan 31, 2023
Maintainer

CamDavidsonPilon Jan 31, 2023
Maintainer

takra Jan 31, 2023
Author