Assumption-Lean Cox Regression

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Assumption-Lean Cox Regression. / Vansteelandt, Stijn; Dukes, Oliver; Van Lancker, Kelly; Martinussen, Torben.

In: Journal of the American Statistical Association, Vol. 119, No. 545, 2024, p. 475-484.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Vansteelandt, S, Dukes, O, Van Lancker, K & Martinussen, T 2024, 'Assumption-Lean Cox Regression', Journal of the American Statistical Association, vol. 119, no. 545, pp. 475-484. https://doi.org/10.1080/01621459.2022.2126362

APA

Vansteelandt, S., Dukes, O., Van Lancker, K., & Martinussen, T. (2024). Assumption-Lean Cox Regression. Journal of the American Statistical Association, 119(545), 475-484. https://doi.org/10.1080/01621459.2022.2126362

Vancouver

Vansteelandt S, Dukes O, Van Lancker K, Martinussen T. Assumption-Lean Cox Regression. Journal of the American Statistical Association. 2024;119(545):475-484. https://doi.org/10.1080/01621459.2022.2126362

Author

Vansteelandt, Stijn ; Dukes, Oliver ; Van Lancker, Kelly ; Martinussen, Torben. / Assumption-Lean Cox Regression. In: Journal of the American Statistical Association. 2024 ; Vol. 119, No. 545. pp. 475-484.

Bibtex

@article{79cc2f9bbfb2423bbe6a74037e70bff6,
title = "Assumption-Lean Cox Regression",
abstract = "Inference for the conditional association between an exposure and a time-to-event endpoint, given covariates, is routinely based on partial likelihood estimators for hazard ratios indexing Cox proportional hazards models. This approach is flexible and makes testing straightforward, but is nonetheless not entirely satisfactory. First, there is no good understanding of what it infers when the model is misspecified. Second, it is common to employ variable selection procedures when deciding which model to use. However, the bias and uncertainty that imperfect variable selection adds to the analysis is rarely acknowledged, rendering standard inferences biased and overly optimistic. To remedy this, we propose a nonparametric estimand which reduces to the main exposure effect parameter in a (partially linear) Cox model when that model is correct, but continues to capture the (conditional) association of interest in a well understood way, even when this model is misspecified in an arbitrary manner. We achieve an assumption-lean inference for this estimand based on its influence function under the nonparametric model. This has the further advantage that it makes the proposed approach amenable to the use of data-adaptive procedures (e.g., variable selection, machine learning), which we find to work well in simulation studies and a data analysis. for this article are available online.",
keywords = "Conditional treatment effect, Debiased machine learning, Estimand, Hazard ratio, Model misspecification, Post-selection inference, CAUSAL INFERENCE, MODELS, HAZARDS",
author = "Stijn Vansteelandt and Oliver Dukes and {Van Lancker}, Kelly and Torben Martinussen",
year = "2024",
doi = "10.1080/01621459.2022.2126362",
language = "English",
volume = "119",
pages = "475--484",
journal = "Journal of the American Statistical Association",
issn = "0162-1459",
publisher = "Taylor & Francis",
number = "545",

}

RIS

TY - JOUR

T1 - Assumption-Lean Cox Regression

AU - Vansteelandt, Stijn

AU - Dukes, Oliver

AU - Van Lancker, Kelly

AU - Martinussen, Torben

PY - 2024

Y1 - 2024

N2 - Inference for the conditional association between an exposure and a time-to-event endpoint, given covariates, is routinely based on partial likelihood estimators for hazard ratios indexing Cox proportional hazards models. This approach is flexible and makes testing straightforward, but is nonetheless not entirely satisfactory. First, there is no good understanding of what it infers when the model is misspecified. Second, it is common to employ variable selection procedures when deciding which model to use. However, the bias and uncertainty that imperfect variable selection adds to the analysis is rarely acknowledged, rendering standard inferences biased and overly optimistic. To remedy this, we propose a nonparametric estimand which reduces to the main exposure effect parameter in a (partially linear) Cox model when that model is correct, but continues to capture the (conditional) association of interest in a well understood way, even when this model is misspecified in an arbitrary manner. We achieve an assumption-lean inference for this estimand based on its influence function under the nonparametric model. This has the further advantage that it makes the proposed approach amenable to the use of data-adaptive procedures (e.g., variable selection, machine learning), which we find to work well in simulation studies and a data analysis. for this article are available online.

AB - Inference for the conditional association between an exposure and a time-to-event endpoint, given covariates, is routinely based on partial likelihood estimators for hazard ratios indexing Cox proportional hazards models. This approach is flexible and makes testing straightforward, but is nonetheless not entirely satisfactory. First, there is no good understanding of what it infers when the model is misspecified. Second, it is common to employ variable selection procedures when deciding which model to use. However, the bias and uncertainty that imperfect variable selection adds to the analysis is rarely acknowledged, rendering standard inferences biased and overly optimistic. To remedy this, we propose a nonparametric estimand which reduces to the main exposure effect parameter in a (partially linear) Cox model when that model is correct, but continues to capture the (conditional) association of interest in a well understood way, even when this model is misspecified in an arbitrary manner. We achieve an assumption-lean inference for this estimand based on its influence function under the nonparametric model. This has the further advantage that it makes the proposed approach amenable to the use of data-adaptive procedures (e.g., variable selection, machine learning), which we find to work well in simulation studies and a data analysis. for this article are available online.

KW - Conditional treatment effect

KW - Debiased machine learning

KW - Estimand

KW - Hazard ratio

KW - Model misspecification

KW - Post-selection inference

KW - CAUSAL INFERENCE

KW - MODELS

KW - HAZARDS

U2 - 10.1080/01621459.2022.2126362

DO - 10.1080/01621459.2022.2126362

M3 - Journal article

VL - 119

SP - 475

EP - 484

JO - Journal of the American Statistical Association

JF - Journal of the American Statistical Association

SN - 0162-1459

IS - 545

ER -

ID: 325327418