Case Study Pack: Intro Stats
This “starter case study pack” is a tiny mini-course inside the Workbook:
one dataset,
one story,
and a short sequence of scripts you can run and check.
It’s designed for absolute beginners who want a concrete, repeatable workflow:
Run → Inspect → Check.
The story
A teacher pilots a new study strategy and wants to know if it improves exam scores.
Two groups of students take the same final exam:
control — normal study habits
treatment — new strategy
We’ll use the same dataset in five short scripts, each building intuition:
descriptives
simulation (bootstrap)
distributions + outliers
confidence intervals
hypothesis testing by simulation + effect size
What you get
Inside your workbook folder (created by pystatsv1 workbook init):
- Dataset
data/intro_stats_scores.csv
- Scripts
scripts/intro_stats_01_descriptives.pyscripts/intro_stats_02_simulation.pyscripts/intro_stats_03_distributions_outliers.pyscripts/intro_stats_04_confidence_intervals.pyscripts/intro_stats_05_hypothesis_testing.py
- Write-up template
writeups/intro_stats_interpretation_template.md
- Tests
tests/test_intro_stats_case_study.py
- Outputs go to
outputs/case_studies/intro_stats/
Read the mini-textbook pages (recommended)
If you are new to statistics, read these short pages as you run each script. Each page explains what the script is doing, what to look for in the outputs, and how to talk about the result in plain language.
Intro Stats mini-course
Run → Inspect → Check
From inside your workbook folder:
# Part 1: descriptives (means/SDs + a quick histogram)
pystatsv1 workbook run intro_stats_01_descriptives
# Part 2: bootstrap simulation for the mean difference
pystatsv1 workbook run intro_stats_02_simulation
# Part 3: distributions + outliers (IQR rule) + plots
pystatsv1 workbook run intro_stats_03_distributions_outliers
# Part 4: 95% confidence intervals (t-based) + plot
pystatsv1 workbook run intro_stats_04_confidence_intervals
# Part 5: permutation test (p-value by simulation) + effect size
pystatsv1 workbook run intro_stats_05_hypothesis_testing
# Check: confirms the dataset shape + the expected effect direction
pystatsv1 workbook check intro_stats
Inspect your outputs
Open the output folder in File Explorer:
explorer outputs/case_studies/intro_stats
Start with these files:
group_summary.csv(group means and SDs)bootstrap_mean_diff.csv+bootstrap_mean_diff.pngdistributions_summary.csv+outliers_iqr.csv+score_distributions.pngci_mean_diff_welch_95.csv+ci_group_means_95.pngpermutation_test_summary.csv+permutation_null_distribution.pngeffect_size.csv
Then (optional) write up your interpretation
The pack includes a tiny write-up template. Copy it and fill in the blanks:
cp writeups/intro_stats_interpretation_template.md writeups/intro_stats_writeup.md
Then open writeups/intro_stats_writeup.md in Notepad (or your editor) and answer the questions.
What you should see
The treatment group should have a higher mean score than the control group.
The bootstrap distribution of the mean difference should be mostly above 0.
The permutation test should usually report a small p-value (because the simulated data was generated with a real group difference).
The effect size (Cohen’s d) should be in the small-to-medium range.
Notes
If you’re short on time, Parts 1–2 are the minimum “vibe check”.
The scripts are intentionally simple and readable — open them and explore!