A Python package for managing project templates, metadata, and file organization using flexible YAML schemas. Designed for research and data projects that require consistent file naming, metadata, and directory structures.
pip install appm- Template-driven project structure: Define project layouts, file naming conventions, and metadata in YAML.
- Automatic project initialization: Create new projects with standardized folders and metadata files.
- File placement and matching: Automatically determine where files belong based on their names and template rules.
- Extensible and validated: Uses Pydantic for schema validation and ruamel.yaml for YAML parsing. Installation Or for development:
- Define a Template
Create a YAML template describing your project's structure, naming conventions, and file formats. See examples/template.yaml for the default template.
- Initialize a Project
from appm import ProjectManager
pm = ProjectManager.from_template(
root="/mnt/tpa_field_data/",
template="examples/template.yaml",
project="2025_OzBarley",
year=2024,
platform="phenomate_1",
site="roseworthy",
summary="Wheat yield trial",
internal=True,
researcherName="Jane Doe",
organisationName="Adelaide University",
)
print(str(pm.location)) # This prints the directory that will be created for the project data.
pm.init_project() # This creates the directory and writes a metadata.yaml file (for reloading the project)- Add Files
Files are automatically placed in the a directory based on the template used.
An example template.yaml file:
version: 0.1.1
naming_convention:
# Use `sep: "/"` to specify elements as separate
# directories : <root>/organisationName/project/site/platform
# or any other string e.g. `sep: "_"` to concatenate all elements into
# a single directory name : <root>/organisationName_project_site_platform
sep: "/"
structure: ['organisationName', 'project', 'site', 'platform']
layout:
structure: [ 'date', 'procLevel', 'sensor' ]
mapping:
procLevel:
raw: 'T0-raw'
proc: 'T1-proc'
trait: 'T2-trait'
date_convert:
base_timezone: 'UTC'
output_timezone: 'Australia/Adelaide'
input_format: '%Y-%m-%d %H-%M-%S' # concatenated file components: 'date' and 'time'
output_format: '%Y%m%d%z'
file:
# Individual processing is availble for specific file extensions (except json, which is a special case)
"bin":
sep: "_"
default:
procLevel: raw
components:
- sep: "_"
components:
- ['date', '\d{4}-\d{2}-\d{2}']
- ['time', '\d{2}-\d{2}-\d{2}']
- ['ms', '\d{6}']
- name: 'timezone'
pattern: '[+-]\d{4}'
required: false
- ['trial', '[^_.]+']
- ['sensor', '[^_.]+']
- name: 'procLevel'
pattern: 'T0-raw|T1-proc|T2-trait|raw|proc|trait'
required: false
Using an input file named: 2025-08-14_06-30-03_393242_+1030_extra-site-details_jai1.bin the above
template will output files to the following directory:
/mnt/tpa_field_data/adelaide-university/2025_ozbarley/roseworthy/phenomate_1/20250814+1030/T0-raw/jai1
as per the layout format specified in the template.yaml file:
structure: ['organisationName', 'project', 'site', 'platform']
N.B. Input strings for path creation are standardized by converting to lowercase and substituting spaces with dashes.
The extracted files (from JAI phenomate-core processing) will have the name:
2025-08-14_06-30-03_393242_+1030_extra-site-details_jai1_preproc-<timestamp>.tiff
Programmatically this is done using the following method:
pm.copy_file("<src_dir>/2025-08-14_06-30-03_393242_+1030_extra-site-details_jai1.bin")Version numbers follow the standard pattern of: MAJOR.MINOR.PATCH and the project
has been configured to use the Python libray bump-my-version to help automate the
change of version numbers that are used in the files within the project.
The following proceedures outline its use:
Make sure mump-my-version is installed
uv pip install bump-my-version
# add to pyproject.toml
uv add --dev bump-my-version
This tool uses the file .bumpmyversion.toml for configuring what files get modified.
N.B. If files are added to the project that use an explicit version number, then add the files
to .bumpmyversion.toml along with the rules.
Use the tool as follows:
- set the current version in
.bumpmyversion.tomle.g.
current_version = "0.1.1"
Set the bumpwhat value and run the bump-my-version command:
# uv run bump-my-version -h
export bumpwhat=major | minor | patch
uv run bump-my-version bump $bumpwhat
N.B. For YAML files used in testing, it is easier to modify them using sed
# Check the current version:
find ./tests/fixtures -type f \( -name "*.yaml" -o -name "*.yml" \) -exec grep "version" {} +
# set FINDVERSION to be the version number found in files above:
export FINDVERSION=0.0.10
# set the replacement string:
export REPVERSION=0.2.0
find ./tests/fixtures -type f \( -name "*.yaml" -o -name "*.yml" \) -exec sed -i -e "s/$FINDVERSION/$REPVERSION/g" {} +
After a version update the package can be published to PyPi:
rm -fr ./dist
uv build
uv publish # requires a token from PyPi - see .pypirc filegit add --all
git commit -m "Release version 0.2.1 - adds extra tests for file system errors"
git tag v0.2.1 -m "Release version 0.2.1 - adds extra tests for file system errors"
git push origin master
git push origin v0.2.1
Now setup the Phenomate project repository telling it about the new version -
- Edit
pyproject.tomland change the "appm>=X.Y.Z" dependency to the latest version. - Then run:
uv lock
N.B. If installing into the Docker application, first comment out the local installation
path in pyproject.toml
#[tool.uv.sources]
# phenomate-core = { path = "../phenomate-core" }
# appm = { path = "../appn-project-manager" }
and then rebuild the the docker container:
docker compose up -d --force-recreate --build celery_worker
Otherwise, just reinstall the new package into the uv virtual environemt:
make install-local-appm # this runs uv pip install ${LOCAL_APPM}- appm – Core package (template parsing, project management, utilities)
- examples – Example YAML templates
- schema – JSON schema for template validation
- tests – Unit tests and fixtures
- Python 3.11+
- Pydantic
- ruamel.yaml
- pytest for testing
pytest