You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Big change is the addition of a `SemiBin2` script, which is still experimental, but should be a slightly nicer interface.
USER-VISIBLE IMPROVEMENTS SINCE v1.4.0
- Added a new option for ORF finding, called `fast-naive` which is an internal very fast implementation.
- Added the possibility of bypassing ORF finding altogether by providing prodigal outputs directly (or any other gene prediction in the right format)
- Command line argument checking is more exhaustive instead of exiting at first error
- Added `--quiet` flag to reduce the amount of output printed
- Better `--help` (group required arguments separately)
- Add `--output-compression` option to compress outputs
- Add `--tag-output` option which allows for control of the output filenames (and also makes the anvi'o compatible — see discussion at [#123](#123).
- Add contig->bin mapping table ([#123](#123))
- `SemiBin.main.main1` and `SemiBin.main.main2` can now be called as a function with command line arguments (`main1` corresponds to _SemiBin1_ and `main2` corresponds to _SemiBin2_)
```python
import SemiBin.main
...
SemiBin.main.main2(['single_easy_bin', '--input-fasta', ...])
```
Copy file name to clipboardExpand all lines: docs/semibin2.md
+16-3Lines changed: 16 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -7,13 +7,25 @@ They have the same functionality, but slightly different interfaces. The exact
7
7
interface to `SemiBin2` should be considered as unstable (while we will strive
8
8
to maintain backwards compatibility if you call the `SemiBin` script).
9
9
10
-
# Differences between SemiBin2 and SemiBin1
10
+
## Upgrading to SemiBin2
11
+
12
+
1. If you are using the `easy_*` workflows, then they will probably continue to
13
+
work exactly the same (except that you will get better results faster).
14
+
2. Outputs are now **always** in a directory called `output_bins`.
15
+
3. By default, bins are in file named as `SemiBin_{label}.fa.gz` (and
16
+
compressed with _gzip_ as the name indicates).
17
+
18
+
Points `2` and `3` may require some minor modifications to wrapper scripts.
19
+
20
+
## Longer list of differences between SemiBin2 and SemiBin1
11
21
12
22
The biggest different is that the default training mode is self-supervised mode.
13
23
14
24
- Output bins are now **always** in a directory called `output_bins` (in
15
-
- Output filenames are now anvi'o compatible (effectively, the default value of `--tag-output` is `SemiBin`) (see discussion in [#123](https://github.com/BigDataBiology/SemiBin/issues/123))
16
25
_SemiBin1_, it actually depended on which parameters were used)
26
+
- Output filenames are now anvi'o compatible (effectively, the default value of
- Add `--output-compression` option to compress outputs
13
-
- Add `--tag-output` option which allows for control of the output filenames (and also makes the anvi'o compatible)
18
+
- Add `--tag-output` option which allows for control of the output filenames (and also makes the anvi'o compatible — see discussion at [#123](https://github.com/BigDataBiology/SemiBin/issues/123).
-`SemiBin.main.main1` and `SemiBin.main.main2` can now be called as a function with command line arguments (`main1` corresponds to _SemiBin1_ and `main2` corresponds to _SemiBin2_)
0 commit comments