As mentioned on the page [When and why to run tests?](why_test.qmd), you should run tests regularly **after any code or data changes**, as catching errors earlier makes them easier to fix. This practice of re-running tests is called **regression testing**, and it ensures recent changes haven't introduced errors.

GitHub Actions can be a great tool to support this.

## GitHub Actions

GitHub is widely used for hosting research code and managing version control. We have a [tutorial on setting up a repository](https://pythonhealthdatascience.github.io/des_rap_book/pages/guide/setup/version.html) if you are new to GitHub.

**GitHub Actions** is a built-in automation system that runs workflows directly in your repository. You can access it from the **Actions** tab on your GitHub repository page:

Workflows are defined using YAML files stored in `.github/workflows/` in your repository. Each workflow can be triggered by one or more events, with common triggers including:

* `push`: run tests on every push to a branch.
* `push: branches: ["main"]`: run tests on every push to the `main` branch.
* `pull_request`: run tests when a pull request is opened or updated.
* `workflow_dispatch`: allows manual runs from the "Actions" tab.
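For instance, a workflow combining several of these triggers might begin with an `on:` block like this (an illustrative fragment, not the case study's file):

```{.yaml}
on:
  push:
    branches: ["main"]
  pull_request:
  workflow_dispatch:
```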
## Workflow to run tests

This workflow will run the tests from our case study via GitHub Actions. We explain it step-by-step below.
```{bash}
#| eval: false
#| file: ../.github/workflows/python_tests.yaml
```
### Explaining the workflow

```{.yaml}
name: python_tests
run-name: Run python tests
```
The beginning of the YAML sets the workflow's name and how it appears in the Actions tab.

* `name` is the name of the workflow, shown in the Actions sidebar.
* `run-name` is what is displayed when a run appears in the Actions history.
```{.yaml}
on:
  push:
    branches: [main]
  workflow_dispatch:
```
Next, we define when the workflow is triggered. Here we have chosen:

* `push`: automatically run on pushes to the `main` branch.
* `workflow_dispatch`: allows you to trigger the workflow manually from the GitHub Actions interface (*note: it only becomes available after first having been pushed to main*).
```{.yaml}
jobs:
  tests:
    runs-on: ${{ matrix.os }}
    strategy:
      fail-fast: false
      matrix:
        include:
          - os: ubuntu-latest
            python-version: '3.11'
          - os: ubuntu-latest
            python-version: '3.12'
          - os: windows-latest
            python-version: '3.12'
          - os: macos-latest
            python-version: '3.12'
```
Now we start to define the job that runs our tests. We use matrix testing, which allows us to check our code across multiple operating systems and Python versions. In this case the tests will run on:

* Python 3.11 (Linux)
* Python 3.12 (Linux, Windows, macOS)

This is good as it allows you to spot any bugs or issues related to specific operating systems or Python versions. The jobs run in parallel, keeping the whole workflow fast.
```{.yaml}
    steps:
      - name: Check out repository
        uses: actions/checkout@v4
```
Now we start defining the steps executed within our test job. The first step is typically to check out your repository, so the workflow can access your code.
```{.yaml}
      - name: Install python and dependencies
        uses: actions/setup-python@v4
        with:
          python-version: ${{ matrix.python-version }}
          cache: 'pip'
```
Next we install the version of Python specified in the matrix and enable pip caching to speed up future runs.

Depending on the operating system, the command syntax for installing dependencies differs slightly. We use `requirements-test.txt` instead of `environment.yaml` as we want to use different Python versions. To reduce runtime, our requirements file only contains packages needed for running tests (e.g., it excludes our linting packages).

Finally, we run the tests! We call `pytest` on our case study (`examples/python_package/`). If all tests pass, you'll see green ticks for each environment - confirming that your code works consistently across Python versions and operating systems.
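A minimal sketch of what these remaining install-and-test steps might look like (the step names and exact commands here are assumptions - the full workflow file shown at the top of this page contains the real steps):

```{.yaml}
      # Install only the packages needed for testing, to keep runs fast
      - name: Install dependencies
        run: python -m pip install -r requirements-test.txt

      # Run the case study's tests with pytest
      - name: Run tests
        run: pytest examples/python_package/
```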
### See GitHub Actions, in action!
The video below demonstrates this workflow running in GitHub Actions. For the demo, the workflow is triggered manually using `workflow_dispatch`, but it would also run automatically whenever you push changes to `main`.
<!-- Test coverage - what it means, how to generate, use, etc. -->
**Coverage** refers to the percentage of your code that is executed when you run your tests. It can help you spot parts of your code that are not included in any tests.
## `pytest-cov`

The [pytest-cov](https://github.com/pytest-dev/pytest-cov) package can be used to run coverage calculations easily alongside `pytest`. You can install it from PyPI or conda:

```{.bash}
pip install pytest-cov
```

```{.bash}
conda install pytest-cov
```

To calculate coverage, you can then simply run tests with the `--cov` flag:
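For example, from the repository root (an illustrative invocation):

```{.bash}
# Run the tests with coverage measurement enabled;
# "--cov" with no value measures all code imported by the tests,
# while "--cov=PACKAGE" restricts the report to one package
pytest --cov
```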
You can see we get nearly 100% coverage. But what does this actually mean?
## Interpreting coverage

Coverage tells you whether code was **executed** during testing - but not necessarily whether it has been tested well. A function could run as part of another test without its results or behaviour being properly checked by assertions.
::: {.box-grey}

**Coverage tells you what code ran, not whether it worked correctly**.

:::
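To illustrate with a hypothetical function (not from the case study): both tests below execute `apply_discount`, so both produce identical coverage - but only the second would actually catch a bug in the calculation.

```{.python}
def apply_discount(price, rate):
    """Return price reduced by the given rate (e.g. 0.2 for 20% off)."""
    return price * (1 - rate)


def test_runs_but_checks_nothing():
    # Executes the function - counts towards coverage - but asserts nothing
    apply_discount(100, 0.2)


def test_checks_behaviour():
    # Same coverage, but verifies the result is correct
    assert apply_discount(100, 0.2) == 80
```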
It's mostly useful for finding code that **isn't covered by tests at all**. Parts of your code having no or low coverage can mean:

* They're not imported or run by any tests.
* They're only used in rare branches or failure conditions.
* They were added recently but have not yet been incorporated into tests.
Rather than trying to achieve 100% coverage, you should aim to meaningfully test all your code: every important path, decision and behaviour should be tested at least once.