You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/cmdstan-guide/stansummary.qmd
+10-11Lines changed: 10 additions & 11 deletions
Original file line number
Diff line number
Diff line change
@@ -12,22 +12,20 @@ diagnostic statistics on the sampler chains, reported in the following order:
12
12
13
13
- Mean - sample mean
14
14
- MCSE - Monte Carlo Standard Error, a measure of the amount of noise in the sample
15
-
- StdDev - sample standard deviation - the variance around the sample mean.
16
-
- MAD - Median Absolute Deviation - the variance around the sample median.
15
+
- StdDev - sample standard deviation - the standard deviation around the sample mean.
16
+
- MAD - Median Absolute Deviation - the median absolute deviation around the sample median.
17
17
- Quantiles - default 5%, 50%, 95%
18
18
- ESS_bulk
19
19
- ESS_tail
20
-
- R_hat - $\hat{R}$ statistic, a measure of chain equilibrium, must be within $0.05$ of $1.0$.
20
+
- R_hat - $\hat{R}$ statistic, a MCMC convergence diagnostic
21
21
22
22
When reviewing the `stansummary` output, it is important to check the final three
23
-
output columns first - these are the diagnostic statistics on chain convergence and
24
-
number of independent draws in the sample.
25
-
A $\hat{R}$ statistic of greater than $1.01$ indicates that the chain has not converged and
26
-
therefore the sample is not drawn from the posterior, thus the estimates of the mean and
27
-
all other summary statistics are invalid.
23
+
output columns first - these are the diagnostic statistics on MCMC convergence and
24
+
effective sample size.
25
+
A $\hat{R}$ statistic of greater than $1$ indicates potential convergence problems and that the sample is not presentative of the target posterior, thus the estimates of the mean and all other summary statistics are likely to be invalid. A value $1.01$ can be used as generic threshold to decide whether more iterations or further convergence analysis is needed, but other thresholds can be used depending on the specific use case.
28
26
29
27
Estimation by sampling produces an approximate value for the model parameters;
30
-
the MCSE statistic indicates the amount of noise in the estimate.
28
+
the MCSE statistic indicates the amount of uncertainty in the estimate.
31
29
Therefore MCSE column is placed next to the sample mean column,
32
30
in order to make it easy to compare this sample with others.
33
31
@@ -37,8 +35,9 @@ chapter of the Stan Reference Manual which describes both the theory and practic
37
35
estimation techniques.
38
36
39
37
The statistics - Mean, StdDev, MAD, and Quantiles - are computed directly from all draws across all chains.
40
-
The diagnostic statistics - MCSE, ESS_bulk, ESS_tail, and R_hat are computed from the rank-normalized,
41
-
folded chains according to the definitions in @Vehtari+etal:2021:Rhat.
38
+
The diagnostic statistics - ESS_bulk, ESS_tail, and R_hat are computed from the rank-normalized,
39
+
folded, and splitted chains according to the definitions by @Vehtari+etal:2021:Rhat.
40
+
the MCSE statistic is computed using split chain R_hat and autocorrelations.
42
41
The summary statistics and the algorithms used to compute them are described in sections
43
42
[Notation for samples](https://mc-stan.org/docs/reference-manual/analysis.html#notation-for-samples-chains-and-draws)
0 commit comments