Pseudovirus deep mutational scanning of A/Massachusetts/18/2022 (H3N2) hemagglutinin

Study by Timothy Yu and Jesse Bloom. See Yu et al (2025) for details on the study.

This repo contains data and code from deep mutational scanning experiments on H3 hemagglutinin.

For user-friendly links to interactive visualization of the data and key numerical results, see https://dms-vep.org/Flu_H3_Massachusetts2022_DMS/.

Organization of this repo

`dms-vep-pipeline-3` submodule

Most of the analysis is done by the dms-vep-pipeline-3, which was added as a git submodule to this pipeline via:

git submodule add https://github.com/dms-vep/dms-vep-pipeline-3

This added the file .gitmodules and the submodule dms-vep-pipeline-3, which was then committed to the repo. Note that if you want a specific commit or tag of dms-vep-pipeline-3 or to update to a new commit, follow the steps here, basically:

cd dms-vep-pipeline-3
git checkout <commit>

and then cd ../ back to the top-level directory, and add and commit the updated dms-vep-pipeline-3 submodule. You can also make changes to the dms-vep-pipeline-3 that you commit back to that repo.

Code and configuration

The snakemake pipeline itself is run by dms-vep-pipeline-3/Snakefile which reads its configuration from config.yaml. The conda environment used by the pipeline is that specified in the environment.yml file in dms-vep-pipeline-3.

Data

Input data utilized by the pipeline are located in ./data/.

Results and documentation

The results of running the pipeline are placed in ./results/. Due to space, only some results are tracked. For those that are not, see the .gitignore document.

The pipeline builds HTML documentation for the pipeline in ./docs/. These docs are rendered for viewing at https://dms-vep.org/Flu_H3_Massachusetts2022_DMS/.

Non-pipeline analyses

All other non-pipeline analyses are contained in ./analysis/ and ./validations/. The notebooks in this directory are not part of the main pipeline but have been used to generate files used as input for the pipeline.

Running the pipeline

To run the pipeline, build the conda environment dms-vep-pipeline-3 in the environment.yml file of dms-vep-pipeline-3, activate it, and run snakemake, such as:

conda activate dms-vep-pipeline-3
snakemake -j 32 --use-conda -s dms-vep-pipeline-3/Snakefile

To run on the Hutch cluster via slurm, you can run the file run_Hutch_cluster.bash:

sbatch -c 32 run_Hutch_cluster.bash

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.github/workflows		.github/workflows
analysis		analysis
data		data
dms-vep-pipeline-3 @ f498f3f		dms-vep-pipeline-3 @ f498f3f
docs		docs
homepage		homepage
results		results
sra_upload		sra_upload
validations		validations
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
custom_rules.smk		custom_rules.smk
package-lock.json		package-lock.json
package.json		package.json
run_Hutch_cluster.bash		run_Hutch_cluster.bash

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pseudovirus deep mutational scanning of A/Massachusetts/18/2022 (H3N2) hemagglutinin

Organization of this repo

`dms-vep-pipeline-3` submodule

Code and configuration

Data

Results and documentation

Non-pipeline analyses

Running the pipeline

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Pseudovirus deep mutational scanning of A/Massachusetts/18/2022 (H3N2) hemagglutinin

Organization of this repo

dms-vep-pipeline-3 submodule

Code and configuration

Data

Results and documentation

Non-pipeline analyses

Running the pipeline

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`dms-vep-pipeline-3` submodule

Packages