Skip to content

Alphafold

Franklin has a deployment of deepmind's alphafold as well as its databases compiled from source1. The databases are located at /share/databases/alphafold; this directory is exported as $ALPHAFOLD_DB_ROOT when the module is loaded.

This is not a docker deployment. As such, many of the examples provided online need to be slightly modified. The main script supplied by the alphafold package is run_alphafold.py, which is the script that the docker container calls internally. All the same command line arguments that can be passed to alphafold's run_docker.py script can be passed to run_alphafold.py, but the latter requires all the database locations be supplied:

run_alphafold.py \
--data_dir="$ALPHAFOLD_DB_ROOT" \
--uniref90_database_path=$ALPHAFOLD_DB_ROOT/uniref90/uniref90.fasta \
--mgnify_database_path=$ALPHAFOLD_DB_ROOT/mgnify/mgy_clusters_2022_05.fa \
--template_mmcif_dir=$ALPHAFOLD_DB_ROOT/pdb_mmcif/mmcif_files \
--obsolete_pdbs_path=$ALPHAFOLD_DB_ROOT/pdb_mmcif/obsolete.dat \
--bfd_database_path=$ALPHAFOLD_DB_ROOT/bfd/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt \
--uniref30_database_path=$ALPHAFOLD_DB_ROOT/uniref30/UniRef30_2021_03 \
--pdb70_database_path=$ALPHAFOLD_DB_ROOT/pdb70/pdb70 \
[OTHER ARGS]

Because this is annoying, we have supplied a wrapper script named alphafold-wrapped with our module that passes these common options for you. Any of the arguments not passed above will be passed along to the run_alphafold.py script; for example:

alphafold-wrapped \
--output_dir=[OUTPUTS] \
--fasta_paths=[FASTA INPUTS] \
--max_template_date=2021-11-01 \
--use_gpu_relax=true

  1. Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., … Hassabis, D. (2021). Highly accurate protein structure prediction with AlphaFold. Springer Science and Business Media LLC. https://doi.org/10.1038/s41586-021-03819-2