Skip to content

Commit 95b4a82

Browse files
authored
Bug fixes and improvements (#4)
* Change from minimap2 to winnowmap - winnowmap performs better in repeat regions as shown in the paper https://doi.org/10.1093/bioinformatics/btaa435 * Add new error catch * Add script to check if magnipore positions are within annotated regions * add (0-based) information to readmes * - changed default parameters for winnowmap - fixed bug that doubled the read counts - minor bug fixes
1 parent 81548a7 commit 95b4a82

File tree

8 files changed

+186
-405
lines changed

8 files changed

+186
-405
lines changed

README.md

Lines changed: 8 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -31,14 +31,6 @@ mamba create -n magnipore -c jannessp magnipore
3131
conda activate magnipore
3232
```
3333

34-
*Alternatively you can create a conda environment using the [conda_env.yml](conda.recipe/conda_env.yml) and mamba.*
35-
```bash
36-
conda install mamba
37-
mamba env create -f conda/conda_env.yml
38-
conda activate magnipore
39-
git clone https://github.com/JannesSP/magnipore.git
40-
```
41-
4234
If you want to basecall your ONT data you also need a Guppy version from [Oxford Nanopore Technologies](https://community.nanoporetech.com).
4335

4436
---
@@ -67,7 +59,7 @@ Conda Dependencies
6759
- matplotlib>=3.6.2
6860
- numpy>=1.23
6961
- scipy>=1.9
70-
- minimap2>=2.24
62+
- winnowmap>=2.0
7163
- pandas>=1.5
7264
- seaborn>=0.12
7365
- psutil>=5.9
@@ -173,7 +165,7 @@ The .magnipore file is a TSV containing the following columns.
173165
- bayesian_p : p-value for the signal comparison
174166
- signal_type : classification into "mod" for modification and "mut" for mutation
175167
- ref_1 : contig name of sample 1
176-
- pos_1 : position in contig of sample 1
168+
- pos_1 : position in contig of sample 1 (0-based)
177169
- base_1 : base at the position of sample 1
178170
- motif_1 : motif around the base at the position of sample 1
179171
- signal_mean_1 : mean of the signal distribution at the position of sample 1
@@ -205,31 +197,34 @@ Errors of first sample:
205197
- 119: Cannot basecall .slow5/.blow5 with guppy
206198
- 120: Could not find raw data or unknown file format
207199
- 121: Guppy basecalling failed
208-
- 122: minimap2 mapping failed
200+
- 122: mapping failed
209201
- 123: Samtools indexing failed
210202
- 124: f5c index failed
211203
- 125: f5c eventalign failed
212204
- 126: Could not find provided fastq files
205+
- 127: f5c eventalign file is empty
213206
---
214207
Errors of second sample
215208
- 219: Cannot basecall .slow5/.blow5 with guppy
216209
- 220: Could not find raw data or unknown file format
217210
- 221: Guppy basecalling failed
218-
- 222: minimap2 mapping failed
211+
- 222: mapping failed
219212
- 223: Samtools indexing failed
220213
- 224: f5c index failed
221214
- 225: f5c eventalign failed
222215
- 226: Could not find provided fastq files
216+
- 227: f5c eventalign file is empty
223217

224218
### If Subscript Nanosherlock is Executed Separately
225219

226220
The -e parameter of nanosherlock specifies the leading number of the error code. Default is 0.
227221
- 019: Cannot basecall .slow5/.blow5 with guppy
228222
- 020: Could not find raw data or unknown file format
229223
- 021: Guppy basecalling failed
230-
- 022: minimap2 mapping failed
224+
- 022: mapping failed
231225
- 023: Samtools indexing failed
232226
- 024: f5c index failed
233227
- 025: f5c eventalign failed
234228
- 026: Could not find provided fastq files
229+
- 027: f5c eventalign file is empty
235230
</details>

README.rst

Lines changed: 9 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -10,16 +10,6 @@ for **linux-64 and osx-64**.
1010
mamba create -n magnipore -c jannessp magnipore
1111
conda activate magnipore
1212
13-
Alternatively you can create a conda environment using
14-
the `conda_env.yml <conda.recipe/conda_env.yml>`__ and mamba.
15-
16-
.. code:: bash
17-
18-
conda install mamba
19-
mamba env create -f conda/conda_env.yml
20-
conda activate magnipore
21-
git clone https://github.com/JannesSP/magnipore.git
22-
2313
If you want to basecall your ONT data you also need a Guppy version from
2414
`Oxford Nanopore Technologies <https://community.nanoporetech.com>`__.
2515

@@ -54,7 +44,7 @@ Conda Dependencies:
5444
- matplotlib>=3.6.2
5545
- numpy>=1.23
5646
- scipy>=1.9
57-
- minimap2>=2.24
47+
- winnowmap>=2.0
5848
- pandas>=1.5
5949
- seaborn>=0.12
6050
- psutil>=5.9
@@ -169,7 +159,7 @@ The .magnipore file is a TSV containing the following columns.
169159
- signal_type : classification into “mod” for modification and “mut”
170160
for mutation
171161
- ref_1 : contig name of sample 1
172-
- pos_1 : position in contig of sample 1
162+
- pos_1 : position in contig of sample 1 (0-based)
173163
- base_1 : base at the position of sample 1
174164
- motif_1 : motif around the base at the position of sample 1
175165
- signal_mean_1 : mean of the signal distribution at the position of
@@ -208,22 +198,24 @@ Error Codes Explanation
208198
- 119: Cannot basecall other .slow5/.blow5 with guppy
209199
- 120: Could not find raw data or unknown file format
210200
- 121: Guppy basecalling failed
211-
- 122: minimap2 mapping failed
201+
- 122: mapping failed
212202
- 123: Samtools indexing failed
213203
- 124: f5c index failed
214204
- 125: f5c eventalign failed
215205
- 126: Could not find provided fastq files
206+
- 127: f5c eventalign file is empty
216207

217208
Errors of second sample
218209

219210
- 219: Cannot basecall other .slow5/.blow5 with guppy
220211
- 220: Could not find raw data or unknown file format
221212
- 221: Guppy basecalling failed
222-
- 222: minimap2 mapping failed
213+
- 222: mapping failed
223214
- 223: Samtools indexing failed
224215
- 224: f5c index failed
225216
- 225: f5c eventalign failed
226217
- 226: Could not find provided fastq files
218+
- 227: f5c eventalign file is empty
227219

228220
If Subscript Nanosherlock is Executed Separately
229221
------------------------------------------------
@@ -234,8 +226,9 @@ error code. Default is 0.
234226
- 019: Cannot basecall other .slow5/.blow5 with guppy
235227
- 020: Could not find raw data or unknown file format
236228
- 021: Guppy basecalling failed
237-
- 022: minimap2 mapping failed
229+
- 022: mapping failed
238230
- 023: Samtools indexing failed
239231
- 024: f5c index failed
240232
- 025: f5c eventalign failed
241-
- 026: Could not find provided fastq files
233+
- 026: Could not find provided fastq files
234+
- 027: f5c eventalign file is empty

conda.recipe/conda_env.yml

Lines changed: 0 additions & 235 deletions
This file was deleted.

0 commit comments

Comments
 (0)