<p>DNA-binding proteins (DBPs) regulate and repair genes. It is therefore important to understand their dynamics. DBPs find their target sites by combining three-dimensional diffusion and one-dimensional scanning of the DNA. Here, we study the one-dimensional diffusion and DNA binding of the dimeric <em>lac</em> repressor (LacI) using coarse-grained molecular dynamic simulations and compare the results to experimental data. This study supports that linear diffusion along DNA combines tight rotation-coupled groove tracking and rotation-decoupled hopping, where the protein briefly dissociates and re-associates just a few base-pairs away. Tight groove tracking is crucial for target-site recognition, while hopping speeds up the overall search process. We show how the flexibility of LacI’s hinge regions ensures agility on DNA as well as faithful groove tracking. Based on our additional study of different encounter complexes, we argue that the conformational change in LacI and DNA occur simultaneously. </p>
<p><br></p>
<p>The content of the database can be split into Starting structures, original trajectories, processed data, data for visualization, movies in 3D space (to be used in e.g. pymol) and code.</p>
<p><br></p>
<p>The Starting structures contain .pdb files with all-atom models and .dat files with coarse-grained models.</p>
<p><br></p>
<p>The trajectories can be found in the folders starting with diffusion_ for monomer, dimer and full-length LacI. Additionally there are trajectories of the different encounter complexes with straight and bent DNA and the two protein conformations.</p>
<p><br></p>
<p>The processed data contains the position of the center of mass of the proteins recognition region relative to the DNA. The data is split into the different systems we studied: the full-length proteins, dimers and monomers of the search and recognition conformations as well as encounter complexes with A- and B-forms DNA. All these systems have been studied at different salt concentrations.</p>
<p><br></p>
<p>The code <strong>CG-analysis-rackham</strong> contains the code that was used for plotting the data for the figure in the publication as it was downloaded from github on November 22 2022. This code contains jupyter notebooks that analyse the processed data and produce the figures in the publication. It also contains <strong>pipeline_trajectory_analysis </strong>which produces the processed data from the trajectories. The processed data contains the position of the protein relative to the DNA (position along and around the DNA and distance from the DNA), which can be obtained from the trajectory using the Spiral package contained in the <strong>pipeline_trajectory_analysis </strong> folder and the <a href="https://github.com/mallu2/CG_analysis/blob/rackham/pipeline_trajectory_analysis/Ex_spiral1.py" target="_blank"><u>Ex_spiral1.py</u></a> script of <strong>CG_analysis-rackham</strong>. </p>
<p>The preprosessed trajetcory data can the be plotted with the notebook <strong>plotting_CG_sim.ipynb</strong> (Figure 2 of the paper).</p>
<p>The diffusion can be analysed and plotted with <strong>msd_diffusion_coefficient.ipynb</strong> (Figure 3 of the paper).</p>
<p>The trajectory data can also be split into 1D and 3D diffusion and into groove tracking/sliding motions on the DNA with <strong>analysis_sliding_and_hopping.ipynb</strong> (Figure 4 of the paper).</p>
<p>Interaction profiles of the protein on DNA can be plotted using <strong>interaction_profiles.ipynb </strong>(Fig. 5A).</p>
<p>Finally different energies obtained from the simulation and bonds formed between protein and DNA of different conformations can be analysed using the script <a href="https://github.com/mallu2/CG_analysis/blob/rackham/pipeline_trajectory_analysis/Ex_Bind_Occ.py" target="_blank"><u>Ex_Bind_Occ.py</u></a> and <strong>CG_energies_analysis.ipynb</strong> (Fig. 5 C and D).</p>
<p><br></p>
<p>Each zip archive contains a README with further descriptions of the subfolder structure and the files contained within. The same goes for the code. </p>
<p><br></p>
<p><br></p>
Funding
Knut and Alice Wallenberg Foundation: 2016.0077
Knut and Alice Wallenberg Foundation: 2017.0291
Knut and Alice Wallenberg Foundation: 2019.0439
SNIC 2.0: Swedish National Infrastructure for Computing