Skip to content
Social Complexity Lab edited this page Dec 22, 2023 · 19 revisions

This paper has gone viral and much of the coverage is not so accurate. In this FAQ, we try to explain what the paper actually says.

Predicting death

Q: Is your algorithm really able to predict people's day of death, age when you die, or anything like that?

A: No! Let us explain.

First, let's explain what the number 78.8% accuracy (that has been widely reported) actually means.

  • We look at a subset of individuals aged between 35 and 65. This is because it is particularly challenging to make survival predictions in this cohort. The vast majority of individuals who pass away are older. And young have extremely low probability of dying.
  • That dataset we split up in two parts
    • Training data: Used to teach the model which correlations are in the data. The training data is the vast majority of the data.
    • Test data: We used the test data to understand how well the model is doing.
  • We now train the model on the training data
  • In the training data, the model learns from information in the years 2008-2015 to tell the difference between actual life/death outcomes for people in the training data during 2016-2020.
  • The trained model is then run on the test data (100000 individuals). Here the model sees the 2008-2015 data and makes a prediction. We then check against actual outcomes if it got it right.

So far so good.

  • There is one final wrinkle. Accuracy is defined as (# correct guesses)/(total guesses). Because our cohort is very young, almost everyone survives (more than 95%).
    • This means that if we created an algorithm that always predicted “survive”, it would get a very high accuracy (over 95%).
    • To address the issue, we balance the dataset, equivalent of 50000 with survive outcome and 50000 with death outcome.
    • (In this balanced dataset a random guess would get 50% accuracy.)
    • When we run our algorithm on that balanced dataset, we get 78.8% accuracy.

Some important consequences.

  • We don’t make predictions for everyone in Denmark, only the test data.
  • We’re not predicting how long people will live. Rather we test mortality over the next 4 years for a young cohort of individuals. A key theme of the paper is about the factors that contribute to such early mortality.

Access to the algorithm

Q: Can you download the software and try this out?

A: No! The dataset and model contain sensitive data and both are safely stored at Statistics Denmark. They cannot be accessed via the internet.

Some follow-ups:

  • We have heard that there are websites that claim to implement life2vec. Those are fraudulent, so be careful.
  • We are working on ways to share the model with the wider research communities, but as LLMs are known to potentially leak data, we have to do further research before we can do this.
  • We have not yet studied how our results generalize to other countries/contexts, but are actively investigating this topic.

The point of the study

Q: But if you're not obsessed with predicting death, what's the study about as you see it?

A: Transformer models (the technology we use) have been developed to find patterns in language. By structuring lives in sequences, we can identify very complicated patterns in life-events.   (Just as in language, where the ordering of words is very important, so is the ordering of events in human lives. In a US context, for example, it matters if you get a job with health care and then get sick, rather than first getting sick without having the healthcare.)   Those patterns and relationships between life-events are encoded in mathematical spaces (called them embedding spaces). We learn the structure of those spaces by processing life-sequences for many millions of individuals. Based on these spaces, we can then make accurate prediction of many kinds of life events.

For that reason, we are not focused on the individual predictions :)

We know it may sound strange, but we chose the topic of predicting death, because it is a problem so many people have worked on. (For example, due to insurance companies, and so on). That means that we know more about what to expect … and that if we are very good at it, it is in competition with many other algorithms.   The arguments in the paper come in the following order  

  • First we show that the algorithm is very good at making diverse predictions (death, personality).
  • Since it is very good, we know that it is capturing interesting patterns in the data.
  • That means it makes sense to study the embedding spaces that capture those patterns in the data.   It is this last part that we are really excited about. Working to understand what new things we can learn about human beings, human behavior, and societies based on the structure of the embedding spaces.

Clone this wiki locally