You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/cmdstan-guide/stansummary.qmd
+44-40Lines changed: 44 additions & 40 deletions
Original file line number
Diff line number
Diff line change
@@ -12,34 +12,38 @@ diagnostic statistics on the sampler chains, reported in the following order:
12
12
13
13
- Mean - sample mean
14
14
- MCSE - Monte Carlo Standard Error, a measure of the amount of noise in the sample
15
-
- StdDev - sample standard deviation
15
+
- StdDev - sample standard deviation - the standard deviation around the sample mean.
16
+
- MAD - Median Absolute Deviation - the median absolute deviation around the sample median.
16
17
- Quantiles - default 5%, 50%, 95%
17
-
-N_eff - effective sample size - the number of independent draws in the sample
18
-
-N_eff/S - the number of independent draws per second
19
-
- R_hat - $\hat{R}$ statistic, a measure of chain equilibrium, must be within $0.05$ of $1.0$.
18
+
-ESS_bulk
19
+
-ESS_tail
20
+
- R_hat - $\hat{R}$ statistic, a MCMC convergence diagnostic
20
21
21
22
When reviewing the `stansummary` output, it is important to check the final three
22
-
output columns first - these are the diagnostic statistics on chain convergence and
23
-
number of independent draws in the sample.
24
-
A $\hat{R}$ statistic of greater than $1.05$ indicates that the chain has not converged and
25
-
therefore the sample is not drawn from the posterior, thus the estimates of the mean and
26
-
all other summary statistics are invalid.
23
+
output columns first - these are the diagnostic statistics on MCMC convergence and
24
+
effective sample size.
25
+
A $\hat{R}$ statistic of greater than $1$ indicates potential convergence problems and that the sample is not presentative of the target posterior, thus the estimates of the mean and all other summary statistics are likely to be invalid. A value $1.01$ can be used as generic threshold to decide whether more iterations or further convergence analysis is needed, but other thresholds can be used depending on the specific use case.
27
26
28
27
Estimation by sampling produces an approximate value for the model parameters;
29
-
the MCSE statistic indicates the amount of noise in the estimate.
28
+
the MCSE statistic indicates the amount of uncertainty in the estimate.
30
29
Therefore MCSE column is placed next to the sample mean column,
31
30
in order to make it easy to compare this sample with others.
0 commit comments