Why do p-values over-estimate first-order risk?

The short answer is:

👉 Because a p-value is computed conditional on the null hypothesis being true, it does not represent the probability of making a Type I error in the situation you are actually in. When it is interpreted as such, it systematically overstates (over-estimates) the “first-order risk”.

Below is the precise reasoning.


1. What “first-order risk” really is

The Type I error rate (first-order risk) is:

α = P(reject H0 | H0 is true)

This is a long-run, pre-specified property of a decision rule (e.g. “reject if p<0.05”).

It is not a probability about the current experiment.
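This long-run property can be checked directly by simulation. The sketch below assumes a simple model (n = 30 standard-normal observations, known σ = 1, two-sided z-test) and the rule "reject if p < 0.05"; across many repeated experiments generated under H0, the rejection rate settles near 5%.

```python
import math
import random

random.seed(0)

def p_value_two_sided(z):
    """Two-sided p-value for a standard-normal test statistic."""
    phi = 0.5 * (1 + math.erf(abs(z) / math.sqrt(2)))  # P(Z <= |z|)
    return 2 * (1 - phi)

n, n_experiments, alpha = 30, 50_000, 0.05
rejections = 0
for _ in range(n_experiments):
    # Generate data under H0: n i.i.d. N(0, 1) observations
    sample = [random.gauss(0, 1) for _ in range(n)]
    z = (sum(sample) / n) * math.sqrt(n)  # z-statistic (sigma = 1 known)
    if p_value_two_sided(z) < alpha:
        rejections += 1

print(rejections / n_experiments)  # long-run Type I error rate, close to 0.05
```

Note that 0.05 emerges only as a frequency over the whole collection of experiments; no single run carries a "5% chance of error" label.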


2. What a p-value actually is

A p-value is:

p = P(T ≥ t_obs | H0), where T is the test statistic and t_obs its observed value.

Key points:

  • It is conditional on H0 being true

  • It is not P(H0 | data)

  • It is not P(Type I error)
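This definition can be made concrete by Monte Carlo: simulate the null distribution of the test statistic and count how often it is at least as extreme as the observed value. The value t_obs = 2.1 below is a hypothetical observation, and the null distribution is taken to be |Z| with Z standard normal (a two-sided test), purely for illustration.

```python
import random

random.seed(1)

# Hypothetical observed value of the test statistic
t_obs = 2.1

# By definition, p = P(T >= t_obs | H0). Estimate it by simulating the
# null distribution of T (here: |Z| with Z standard normal, two-sided).
n_sims = 200_000
count = sum(1 for _ in range(n_sims) if abs(random.gauss(0, 1)) >= t_obs)
p_hat = count / n_sims
print(p_hat)  # close to the analytic value 2 * (1 - Phi(2.1)) ≈ 0.036
```

Every draw in this loop comes from the null model; nowhere does the computation involve P(H0 | data), which is exactly the point.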


3. Where the over-estimation comes from

The common (incorrect) interpretation

“If p=0.03, there is a 3% risk that I am making a Type I error.”

This is false.

Why it over-estimates first-order risk

To make a Type I error in this experiment, two things must both be true:

  1. H0 is true

  2. You rejected H0

But the p-value already assumes (1) with probability 1.

The actual probability of a Type I error is:

P(H0 | data) × P(reject | H0, data)

Since:

P(H0 | data) < 1

the p-value necessarily exaggerates the chance of being wrong.
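Plugging hypothetical numbers into this decomposition makes the gap concrete. The posterior value 0.3 below is an assumption chosen for illustration; it is precisely the quantity the p-value does not provide.

```python
# Hypothetical numbers for illustration only
p_value = 0.03   # what the test reported
post_h0 = 0.30   # assumed P(H0 | data); NOT derivable from the p-value alone
reject = 1.0     # P(reject | H0, data): the decision has already been made

type1_prob = post_h0 * reject
print(type1_prob)  # 0.3 — ten times the naive "3% risk" reading
```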


4. A simple Bayesian illustration

Suppose:

  • Prior probability that H0 is true: 0.5

  • Observed p-value: 0.05

Under reasonable assumptions, the posterior probability that H0 is true is often much larger than 0.05, typically 20–40%.

So:

  • p-value = 0.05

  • Actual probability of Type I error ≫ 5%

This is sometimes called the “p-value fallacy”; it is closely related to the false positive risk (Colquhoun).
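The arithmetic behind this illustration can be reproduced by simulation. The sketch below assumes a prior P(H0) = 0.5, n = 30 normal observations, and a hypothetical true effect of 0.2 standard deviations under H1 (i.e. a modestly powered study); under these assumptions, roughly 20% of the results reaching p < 0.05 come from true nulls. Conditioning on p ≈ 0.05 exactly, as Colquhoun does, pushes the figure higher still.

```python
import math
import random

random.seed(2)

def p_value_two_sided(z):
    """Two-sided p-value for a standard-normal test statistic."""
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

n, n_experiments, alpha = 30, 100_000, 0.05
effect = 0.2  # assumed true effect size under H1 (hypothetical choice)

false_pos = true_pos = 0
for _ in range(n_experiments):
    h0_true = random.random() < 0.5          # prior P(H0) = 0.5
    mu = 0.0 if h0_true else effect
    mean = sum(random.gauss(mu, 1) for _ in range(n)) / n
    if p_value_two_sided(mean * math.sqrt(n)) < alpha:  # "significant"
        if h0_true:
            false_pos += 1
        else:
            true_pos += 1

# Fraction of significant results that are actually Type I errors
fpr = false_pos / (false_pos + true_pos)
print(fpr)  # roughly 0.2 under these assumptions, far above the nominal 5%
```

Increasing the assumed effect size (i.e. the power) shrinks this fraction, while a lower prior P(H1) inflates it, which is why the 20–40% range quoted above depends on the assumptions made.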


5. Why this is unavoidable in frequentist testing

  • The frequentist framework does not assign probabilities to hypotheses

  • It only controls error rates before seeing the data

  • Once the data are observed, the p-value has no direct decision-theoretic meaning

So the p-value is not wrong, but its interpretation is routinely wrong.


6. Correct interpretation

✔ Correct:

“If the null hypothesis were true, data at least this extreme would occur with probability p.”

❌ Incorrect:

“There is a p probability that I am making a Type I error.”


7. Key takeaway

p-values over-estimate first-order risk because they:

  • condition on H0 being true,

  • ignore the probability that H0 is false,

  • are mistaken for posterior probabilities.

This is why:

  • very small p-values are needed for strong evidence,

  • replication matters,

  • Bayesian or likelihood-based measures are often more informative.
