Skip to content

Commit 42a5e3c

Browse files
authored
Update README.md
1 parent 453a364 commit 42a5e3c

File tree

1 file changed

+6
-3
lines changed

1 file changed

+6
-3
lines changed

README.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ To keep the data files of Cryo2StructData permanent, we published all data to th
2323
The Cryo2StructData Dataverse comprises the Full Cryo2StructData (https://doi.org/10.7910/DVN/FCDG0W) along with its associated trained deep transformer model and data split (https://doi.org/10.7910/DVN/SXNYRE). Similarly, within the Cryo2StructData Dataverse, you will find the Small Subsample (https://doi.org/10.7910/DVN/CGUENL) of the complete Cryo2StructData companied by its respective trained deep transformer model and data splits (https://doi.org/10.7910/DVN/DTV4JF). Finally, the test dataset can be access here: https://doi.org/10.7910/DVN/2GSSC9 .
2424

2525
## Description of the dataset
26-
The dataset can be accessed using the above dataset download link. The protein structures and cryo-EM density maps can be visualized using tools such as: [UCSF ChimeraX](https://www.cgl.ucsf.edu/chimerax/index.html). The dataset follows the format described below:
26+
The dataset can be accessed using the above dataset download link. The protein structures and cryo-EM density maps can be visualized using tools such as: [UCSF ChimeraX](https://www.cgl.ucsf.edu/chimerax/index.html). The dataset follows the format described below, and please read the subsequent section describing what these data mean.
2727
```text
2828
cryo2struct
2929
├── metadata.csv
@@ -172,10 +172,13 @@ The atoms and amino acid labeling program can be run as:
172172
python3 get_atoms_label.py <absolute input path>
173173
python3 get_amino_labels.py <absolute input path>
174174
```
175-
The secondary structure labeling program first extracts coils, helices, and strands using [UCSF ChimeraX](https://www.cgl.ucsf.edu/chimerax/index.html) in no-GUI mode and saves them as seperate ``.pdb`` files. Then, the program labels density map with the saved ``.pdb`` files.
176-
175+
176+
Run the below if `{emd_id}_coil.pdb`, `{emd_id}_helix.pdb`, and `{emd_id}_strand.pdb` are not present in the files downloaded from Cryo2StructData dataverse. The below program extracts coils, helices, and strands using [UCSF ChimeraX](https://www.cgl.ucsf.edu/chimerax/index.html) in no-GUI mode and saves them as seperate ``.pdb`` files.
177177
```
178178
python3 get_secondary_pdb.py <absolute input path>
179+
```
180+
Run the below to generate the label mask maps for secondary structure. The programs below use the above generated `{emd_id}_coil.pdb`, `{emd_id}_helix.pdb`, and `{emd_id}_strand.pdb` files to generate a single labled MRC file.
181+
```
179182
python3 get_sec_stru_coil_label.py <absolute input path>
180183
python3 get_sec_stru_helix_label.py <absolute input path>
181184
python3 get_sec_stru_strand_label.py <absolute input path>

0 commit comments

Comments
 (0)