GitHub | GitLab mirror | Preprint

Applications - Case Studies¶

This page details practical applications of CABS-flex in exploring protein structure, dynamics, and function, based on the publication: CABS-flex applications review (Protein Science, 2024).

1. Modeling the flexibility of globular protein structures¶

⬆ Back to top

CABS-flex provides a computationally efficient alternative to all-atom Molecular Dynamics (MD) for simulating the fluctuation profiles of folded globular proteins. Its coarse-grained methodology produces flexibility patterns that align well with both MD simulations and NMR ensembles. In a comparative analysis of AlphaFold2 (AF2) predictions versus NMR structures, CABS-flex simulations were utilized to validate structural accuracy against Random Coil Index (RCI) data. For example, in the case of PDB ID 2ct5, the flexibility profile generated by CABS-flex for the AF2 model showed a higher correlation with experimental RCI data than the corresponding NMR ensemble, particularly in correctly identifying flexible residues 18-28. This highlights the utility of CABS-flex dynamics as an independent validation metric for structural models, capable of distinguishing between model artifacts and true biological flexibility (CABS-flex NMR validation / Bioinformatics, 2014). Flexibility profiles computed from RCI compared with CABS-flex simulations for NMR and AlphaFold2 models are shown in the figure below. The top panel shows PDB ID 2ct5 NMR model 3 and AF2 model (UniProt: O96006) while the bottom panel shows PDB ID 2y4q NMR model 3 and AF2 model (UniProt: Q13563). In the right panels, the color and the thickness of the tube representation are proportional to the flexibility of each residue. RCI stands for random coil index.

Comparison of flexibility profiles

2. Structure–function analysis of protein complexes: Interactions and mutational dynamics¶

⬆ Back to top

Protein-protein interactions are often governed by dynamic conformational changes that are not evident in static crystal structures. CABS-flex allows for the modeling of these dynamics to elucidate binding mechanisms and the functional impact of mutations. A prominent application involves the SARS-CoV-2 Spike (S) protein, shown in the figure below in both its receptor-accessible "up" conformation (panel a, PDB ID: 7a98 bound to ACE2) and its receptor-inaccessible "down" conformation (panel b, PDB ID: 7df3 without binding to ACE2). The trimeric structure consists of an N-terminal S1 subunit—comprising the NTD (blue), RBD (tv_red), CTD1 (yellow), and CTD2 (cyan)—and a C-terminal S2 subunit (raspberry). Interaction with the ACE2 receptor occurs specifically through the light magenta residues on the RBD and wheat residues on the receptor (panel c). CABS-flex simulations revealed significant flexibility in the distal ends of the binding motif, providing insights into the enhanced affinity of variants like Omicron (Pathogens, 2022). Furthermore, the trajectories identified residue clusters (441–445, 477–484) where high flexibility suggests a significant role in viral evolution (Int J Biol Macromol, 2022), as well as specific stabilizing mutations like V503D (BBRC, 2022) that enhance the complex's stability. This demonstrates how CABS-flex complements static analysis by mapping dynamic hotspots crucial for understanding virus-host interactions.

SARS-CoV-2 Spike protein dynamics

3. Exploring allosteric mechanisms through integrative computational modeling¶

⬆ Back to top

Allosteric regulation involves signal transmission across a protein structure, often mediated by subtle conformational changes and hinge motions. CABS-flex identifies regions of significant inherent flexibility that frequently correspond to allosteric sites. In studies of the Hsp90 chaperone system, CABS-flex was integrated with atomistic simulations to map conformational landscapes, identifying dynamic regions critical for the recruitment and integration of client proteins (J Phys Chem B, 2022). Similarly, in Plasmodium falciparum Hsp70s, CABS-flex highlighted flexibility patterns indicative of allosteric modulation sites (Int J Mol Sci, 2019), which were subsequently verified by extensive MD simulations. By modeling the "up/down" state transitions of the SARS-CoV-2 Spike protein RBD, CABS-flex also helped map potential allosteric communication pathways (J Phys Chem B, 2021), demonstrating its value in exploring conformational landscapes in complex systems.

4. Structure-based prediction of aggregation-prone regions (APRs)¶

⬆ Back to top

Protein aggregation is often initiated by local structural fluctuations that expose hydrophobic regions which are buried in the native state. CABS-flex serves as the simulation engine for the "Dynamic Mode" of the Aggrescan3D (A3D) method (Nucleic Acids Res, 2019). Unlike static analysis, which evaluates aggregation propensity based solely on a single structure, the dynamic approach accounts for conformational fluctuations. For the human p53 DNA-binding domain (PDB ID: 2fej), static analysis failed to reveal all potential APRs. However, incorporating CABS-flex dynamics unveiled hidden APRs that match experimental aggregation data (Nucleic Acids Res, 2015). This dynamic profiling is crucial for the rational design of soluble protein variants and for predicting aggregation risks in therapeutic proteins. The Aggrescan3D "dynamic mode" pipeline is shown in the figure below (panel a), where the input structure's flexibility is simulated using CABS-flex to generate a conformational ensemble; these models are then scored based on aggregation propensity, with the highest-scoring model presented as the final prediction. Panel (b) compares aggregation propensity predictions for the human p53 DNA-binding domain (PDB ID: 2fej) using "static mode" (left) versus "dynamic mode" (right), where residues are colored from blue (solubility) to red (aggregation), with key aggregation-prone regions encircled.

Aggrescan3D dynamic mode

5. Structure-based prediction of S-nitrosylation sites¶

⬆ Back to top

S-nitrosylation (SNO) is a reversible post-translational modification of cysteine residues that plays a key role in cellular signaling. The susceptibility of a cysteine to SNO depends not only on its sequence context but also on its spatial accessibility and proximity to other thiol groups. SNO susceptibilities are also strongly influenced by the local dynamics of cysteine-containing regions. CABS-flex ensembles provide a description of the structural environment of cysteine residues. In the case of the mitochondrial chaperone TRAP1, an ensemble of 20 CABS-flex models was generated to analyze the spatial proximity of Cys501 and Cys527 (Cell Death Dis, 2023). The analysis revealed that structural fluctuations bring these cysteines into sufficient proximity to form a disulfide bridge, a fluctuation-dependent property that regulates the protein's function. This demonstrates how CABS-flex ensembles can facilitate the structure-based prediction of PTM sites. The workflow (panel a) integrates SNOfinder for identifying susceptible proteins, CABS-flex for generating 20-model structural ensembles, and SNOmodel for calculating key biochemical parameters to enhance the understanding of S-nitrosylation mechanisms. Panel (b) illustrates the conformational dynamics of the mitochondrial chaperone TRAP1 (deep teal) with models from the CABS-flex ensemble shown in white; stick-and-ball representations highlight the S-nitrosylated cysteine (SNO, red) and the proximal cysteine (marine), alongside the fluctuating Sγ-Sγ distances between them. S-nitrosylation prediction workflow

6. Structure prediction of linear and cyclic peptides¶

⬆ Back to top

While deep learning methods like AlphaFold have revolutionized protein structure prediction, they can sometimes struggle with peptides that exhibit diverse conformational behaviors or fall outside the training distribution. CABS-flex offers a physics-based sampling approach for the de novo structure prediction of linear and cyclic peptides (Linear and cyclic peptide modeling / Briefings in Bioinformatics, 2024). In a benchmark of 159 peptides, CABS-flex demonstrated the ability to effectively navigate the structural diversity of short peptides. In specific instances, particularly for flexible linear peptides, CABS-flex predicted conformations that were closer to experimental structures than those generated by AlphaFold2. This establishes CABS-flex as a robust tool for peptide modeling, requiring only the amino acid sequence as input. The figure below (panel a) shows example CABS-flex predictions (best out of 10,000 structures in blue, best of top 10 ranked structures in yellow) superimposed on experimental PDB structures in red. Panel (b) provides a direct comparison between CABS-flex predictions (yellow) and AlphaFold2 predictions (cyan) for flexible cases where the coarse-grained method identified conformations closer to the experimental data (red) than the deep learning baseline.

Peptide structure prediction

7. AlphaFold's pLDDT: Bridging predictive confidence and protein flexibility¶

⬆ Back to top

AlphaFold's pLDDT score is a metric of local structural confidence, but it is often inversely correlated with structural flexibility. CABS-flex provides a complementary, physics-based perspective on protein dynamics. For protein 6o2v_A, a high correlation (0.86) was observed between CABS-flex RMSF profiles and standard MD simulations. Interestingly, the pLDDT scores inversely mirrored these RMSF profiles, confirming that regions of low predictive confidence often correspond to biologically relevant flexibility. In cases like protein 1d3y_B, where experimental data was missing for certain loops, low pLDDT scores indicated uncertainty. CABS-flex simulations correctly modeled the physical flexibility of these loops, allowing researchers to distinguish between model uncertainty and intrinsic structural dynamics (MD data from (Nucleic Acids Res, 2024)). The figure below illustrates protein structure fluctuations from CABS-flex vs. MD runs alongside pLDDT confidence scores for proteins 6o2v_A (panel a) and 1d3y_B (panel b). On the right, pLDDT scores are visualized directly on the protein structures, highlighting how regions of low predictive confidence often map to the most flexible segments.

pLDDT and CABS-flex comparison

← Examples | ⬆ Back to top | Next: Visualization Guide