Learning Survival Models with Right-Censored Reporting Delays

· Source: stat.ML updates on arXiv.org · Field: Science & Research — Mathematics & Computational Sciences, Research Methodology & Innovation, Actuarial Science & Insurance Risk Management · Depth: Expert, extended

Summary

This study introduces a novel survival analysis framework designed to address right-censored reporting delays, a critical challenge in industries like insurance. The method jointly models parametric hazard functions for event occurrences, such as accidents, and their corresponding report timings. It specifically tackles the issue of estimating hazard rates for newly enrolled cohorts, which often have limited follow-up due to administrative censoring. The proposed approach includes an estimator with asymptotic consistency and an expectation-maximization (EM) algorithm for computation. A two-stage estimation procedure, based on a parametric proportional hazards model, is developed to evaluate observations under administrative censoring by transferring parameters from a source domain. Experimental results, using datasets like Dialysis (6805 individuals) and Support2 (9105 individuals) with synthetic reporting delays (e.g., λ=0.5, 5, 50), demonstrate that this method significantly improves the timeliness of risk evaluation for new cohorts.

Key takeaway

For Data Scientists or Actuaries evaluating insurance risk for new cohorts, this framework offers a robust solution to overcome the challenges of right-censored reporting delays and administrative censoring. By implementing the two-stage estimation procedure and transferring parameters from existing, uncensored data, you can achieve more timely and accurate individual risk assessments. This enables faster, data-driven decisions on repricing, underwriting, or product discontinuation for high-uncertainty segments, enhancing financial stability and market reach.

Key insights

Jointly modeling event occurrence and report timing addresses right-censored reporting delays for timely risk evaluation.

Principles

Method

A two-stage estimation procedure uses an EM algorithm to jointly model event and report hazard functions, transferring parameters from a source domain to a target domain with administrative censoring.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.