Configuration Reference¶
All QuantumPDB parameters are set in a YAML configuration file passed to
the CLI with -c config.yaml. Parameters are organized into three
sections corresponding to the three CLI commands.
Only include the parameters you need — all parameters have sensible defaults.
RUN Parameters¶
These parameters control qp run (structure preparation, protonation, and
cluster extraction).
Pipeline control:
Parameter |
Default |
Description |
|---|---|---|
|
|
Path to input CSV, single PDB code, local PDB path, or list of PDB
codes (e.g., |
|
|
Directory where all output files are saved. |
|
|
Enable Modeller to rebuild missing atoms, residues, and loops. |
|
|
Enable Protoss to assign protonation states via the web server. |
|
|
Enable QM cluster model generation. |
|
|
Skip existing results. |
|
|
Modeller refinement level: |
|
|
Maximum iterations of the Modeller–Protoss feedback loop for resolving steric clashes. |
Cluster model parameters:
Parameter |
Default |
Description |
|---|---|---|
|
|
Three-letter residue code(s) for the cluster center. Use YAML list
syntax: |
|
|
List of three-letter residue codes to force into the first interaction
sphere (e.g., |
|
|
Number of interaction spheres to construct. More spheres capture more of the protein environment at increased computational cost. |
|
|
Distance cutoff in angstroms for the first (distance-based) sphere. The default of 4.0 Å works well for most systems. Increase for large ligands or weak interactions. |
|
|
Ligand inclusion level: |
|
|
Maximum number of atoms in the QM cluster. When set, the most distant residues are pruned until the count is below the threshold. Capping atoms are not counted toward this limit. |
|
|
Distance in angstroms for merging nearby center residues into a single center. Set slightly larger than the metal–metal distance. Examples: 4.0 for MMO diiron (~3.4 Å), 12.0 for PHM dicopper (~11 Å). |
|
|
How to cap chain breaks: |
|
|
Voronoi regularization method: |
Output control:
Parameter |
Default |
Description |
|---|---|---|
|
|
Compute per-residue charges from protonation states and ligand properties. |
|
|
Count the number of residues in each interaction sphere and write
|
|
|
Write XYZ coordinate files alongside PDB cluster files. |
SUBMIT Parameters¶
These parameters control qp submit (QM job creation and submission).
QM calculation settings:
Parameter |
Default |
Description |
|---|---|---|
|
|
|
|
|
DFT functional. TeraChem supports: wpbeh, b3lyp, ub3lyp, pbe0, etc. |
|
|
Basis set. Common choices: |
|
|
Initial wavefunction guess method. |
|
|
Number of GPUs for TeraChem calculations. |
|
|
Memory allocation per job. |
|
|
Dielectric constant for implicit solvent (PCM). Use |
|
|
Enable PCM implicit solvent (COSMO). Defaults to |
|
|
Include MM point charges around the QM cluster. By default uses
ff14SB partial charges (also TIP3P for water, standard charges for
common ions). The charge source can be customized with
|
|
|
Distance cutoff in angstroms for MM point charges, measured from the QM cluster centroid. If any atom of a residue falls within this distance, all atoms of that residue are included to preserve integer per-residue charge contributions. |
|
|
Path to a JSON file with custom partial charges, keyed by residue
name then atom name (e.g., |
|
|
Job scheduler: |
|
|
Path to a custom PCM radii file (TeraChem-specific). |
Job management:
Parameter |
Default |
Description |
|---|---|---|
|
|
Generate QM input files for each cluster. |
|
|
Submit generated jobs to the scheduler. |
|
|
Maximum number of concurrent jobs on the scheduler. |
ANALYZE Parameters¶
These parameters control qp analyze (job monitoring and post-processing).
Parameter |
Default |
Description |
|---|---|---|
|
|
Check the status of all QM jobs and generate summary reports. |
|
|
Delete |
|
|
Path to the Multiwfn executable. If Multiwfn is on your |
|
|
Charge scheme for partial charge analysis. Options: |
|
|
Run Multiwfn to compute partial atomic charges. |
|
|
Run Multiwfn to compute substrate dipole moments using the center of mass as the reference point. |
Minimal Configuration Examples¶
Cluster generation only (most common starting point):
input: 1OS7
output_dir: output
modeller: true
protoss: true
coordination: true
center_residues: [FE]
Full pipeline (generation + QM + analysis):
# RUN
input: proteins.csv
output_dir: dataset
modeller: true
protoss: true
coordination: true
center_residues: [FE]
number_of_spheres: 2
# SUBMIT
method: wpbeh
basis: lacvps_ecp
dielectric: 10
create_jobs: true
submit_jobs: true
# ANALYZE
charge_scheme: Hirshfeld
calc_charge_schemes: true