First, we import some modules:

import numpy as np
import pandas as pd
from scipy.stats import lognorm, genpareto

We then write a function to simulate getting the maximum value out of n samples from a given distribution:

def max_out_of_n_from_dist(dist, out_of_n=1e9):
    manageable_n = 100000
    if out_of_n <= manageable_n:
        return dist.rvs(out_of_n).max()
    else:
        top_percentiles = \
            np.random.rand(manageable_n) * manageable_n / out_of_n
        return dist.isf(top_percentiles.min())

We then write a function to calculate the "skeptic's probability" - the probability of a given "skeptic's distribution" generating a Jesus-level resurrection report. The various parameters fed into the function determines the specific form of the "skeptic's distribution".

def calculate_p_skeptic(
    dist_type, #genpareto or lognorm
    shape_params_dist, #np.logspace or np.geomspace 
    sample_size, #1e9 or 1e10
    greater_by, #24 or 50
    n_outliers, #50 or 250
    n_max_draws=10000,
):
    
    if dist_type == genpareto:
        shape_limits = [0.02, 2,1]
    elif dist_type == lognorm:
        shape_limits = [0.2, 10.0]
        min_shape = 0.2, 
        max_shape = 10.0
    if shape_params_dist == np.logspace:
        shape_limits = [np.log10(x) for x in shape_limits]
    shape_params_list = shape_params_dist(
        shape_limits[0], shape_limits[1], 105)
        
    shape_params = []
    p_max_greater_by = []
    n_outliers_estimation = []
    
    for shape_param in shape_params_list:
        dist = dist_type(shape_param, scale=1, loc=0)
        for i in range(n_max_draws):
            shape_params.append(shape_param)
            max_val = max_out_of_n_from_dist(dist, sample_size)
            p_max_greater_by.append(dist.sf(max_val * greater_by))
            p_outlier = \
                (dist.sf(max_val * 0.2) - dist.sf(max_val)) \
                / dist.cdf(max_val)
            n_outliers_estimation.append(
                int(round(p_outlier * sample_size)))

    result_df = pd.DataFrame({
        "shape_params":shape_params, 
        "p_max_greater_by":p_max_greater_by,
        "n_outliers":n_outliers_estimation,
    })
    
    match_df = result_df[result_df["n_outliers"] == n_outliers]

    if match_df.shape[0] < 50:
        print "warning: match_df.shape = ", match_df.shape
    if match_df["shape_params"].max() == shape_params_list.max():
        print "warning: maxed out shape_param"
    
    p_skeptic = match_df["p_max_greater_by"].mean()
    
    return p_skeptic

Now, let us explore some of the different possible forms of the "skeptic's distribution", and calculate their "skeptic's probability".

Here's one we looked at before. It uses the most pro-skeptical assumptions possible to generate the maximum possible "skeptic's probability".

calculate_p_skeptic(
    dist_type=genpareto,
    shape_params_dist=np.linspace,
    sample_size=int(1e9),
    greater_by=24,
    n_outliers=50,
)

4.139635934580683e-12

Here's another possibility, only changing the most questionable parameters to the edges of their likely values. Here are the changes we're making:

The prior distribution of the shape parameters: from being uniform in linear space to uniform in logarithmic space.

The sample size (that is, the number of reportable deaths in world history): from 1e9 to 1e10.

The number of "outliers" (That is, the number of reports of a "resurrection", with at least a "some people say..." level of evidence): from 50 to 200.

All of these changes are almostly certainly true to at least that extent. The actual truth may be even more extreme - for example, the number of outliers may actually be in the thousands.

This gives a very conservative answer for how small the "skeptic's probability" may be.

calculate_p_skeptic(
    dist_type=genpareto,
    shape_params_dist=np.logspace,
    sample_size=int(1e10),
    greater_by=24,
    n_outliers=200,
)

1.866911316237698e-14

Here we've changed a few more parameters. The distribution type has been changed to lognormal, we've increased the factor by which the Jesus-level of evidence exceeds the maximum, and the number of "outliers" has been increased. The "skeptic's probability" calculated here may perhaps be called "likely".

calculate_p_skeptic(
    dist_type=lognorm,
    shape_params_dist=np.logspace,
    sample_size=int(1e10),
    greater_by=50,
    n_outliers=300,
)

3.855529808259304e-16

Here is another combination of parmeters which may be called "likely".

calculate_p_skeptic(
    dist_type=lognorm,
    shape_params_dist=np.logspace,
    sample_size=int(3e10),
    greater_by=70,
    n_outliers=200,
)

6.095046577090992e-17

NaClhv

Bayesian evaluation for the likelihood of Christ's resurrection (Part 43)

You may next want to read:

Show/hide comments(No Comments)

Leave a Reply Cancel reply

Pages

Post Importance

Post Category