You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A singularity image could be built via singularity recipe donated by Eric Deveaud.
89
-
The recipe (`recipe.singularity`) is located in the singularity folder of this repository.
90
-
The docker image [README section](https://hub.docker.com/repository/docker/kbessonov/mob_suite) also has instructions on how to create singularity image from a docker image.
99
+
A singularity image could be built locally via Singularity recipe donated by Eric Deveaud.
100
+
The recipe (`recipe.singularity`) is located in the `singularity` folder of this repository and installs MOB-Suite via `conda`.
Alternatively, Singularity image can be pulled from [BioContainers repository](https://biocontainers.pro/tools/mob_suite) where `<version>` is
112
+
the desired version (e.g. `3.0.3--py_0`)
113
+
114
+
```bash
115
+
% singularity run https://depot.galaxyproject.org/singularity/mob_suite:<version>
116
+
```
117
+
96
118
## Using MOB-typer to perform replicon and relaxase typing of complete plasmids and to predict mobility and replicative plasmid host-range
97
119
98
120
### Setuptools
@@ -106,7 +128,7 @@ Clone this repository and install via setuptools.
106
128
107
129
## Using MOB-typer to perform replicon and relaxase typing of complete plasmids and predict mobility
108
130
109
-
You can perform plasmid typing using a fasta formated file containing a single plasmid represented by one or more contigs or it can treat all of the sequences in the fasta file as independant. The default behaviour is to treat all sequences in a file as from one plasmid, do not include multiple unrelated plasmids in the file without specifying --multi as they will be treated as a single plasmid.
131
+
You can perform plasmid typing using a fasta formated file containing a single plasmid represented by one or more contigs or it can treat all of the sequences in the fasta file as independent. The default behaviour is to treat all sequences in a file as from one plasmid, so do not include multiple unrelated plasmids in the file without specifying --multi as they will be treated as a single plasmid.
110
132
111
133
112
134
```
@@ -126,7 +148,7 @@ unicycler is used, then the circularity information can be parsed directly from
As of v. 3.0.0, we have added the ability of users to provide their own specific set of sequences to remove from plasmid reconstruction. This should be performed with caution and with the knowlede of your organism. Sequences which are frequently of plasmid origin but are not in your organism is the primary use case we envision for this feature.
151
+
As of v. 3.0.0, we have added the ability of users to provide their own specific set of sequences to remove from plasmid reconstruction. This should be performed with caution and with the knowledge of your organism. Filtering of sequences which are frequently of plasmid origin but are not in your organism is the primary use case we envision for this feature.
130
152
131
153
```
132
154
### User sequence mask
@@ -135,14 +157,14 @@ As of v. 3.0.0, we have added the ability of users to provide their own specific
135
157
136
158
As of v. 3.0.0, we have provided the ability to use a collection of closed genomes which will be quickly checked using Mash for genomes which are genetically close and limit blast searches to those chromosomes. This more nuanced and automatic approach is recommended for users where there are sequences which should be filtered in one genomic context but not another. We provide as an optional download as set of closed Enterobacteriacea genomes from NCBI which can be used to provide added accuracy for some organisms such as E. coli and Klebsiella where there are sequences which switch between chromosome and plasmids.
137
159
<br><br>
138
-
If reconstructed plasmids exceed the Mash distance for primary cluster assignment, then they will get assigned a name in the format novel_{md5} where the md5 hash is calculated based on all of the sequences belonging to that reconstructed plasmid. This will provide a unique name for them but any change will result in a changed in the md5 hash. It is inadvised to use these groups for further analyses. Rather they should be highlighted as cases where targeted long read sequencing is required to obtain a closer database representitive of that plasmid.
160
+
If reconstructed plasmids exceed the Mash distance for primary cluster assignment, then they will be assigned a name in the format novel_{md5} where the md5 hash is calculated based on all of the sequences belonging to that reconstructed plasmid. This will provide a unique name for the plasmids but any change will result in a corresponding change in the md5 hash. It is therefore not advised to use these assigned names for further analyses. Rather they should be highlighted as cases where targeted long read sequencing is required to obtain a closer database representative of that plasmid.
Use this tool only to update the plasmid databases or build a new one and should only be completed with closed high quality plasmids. If you add in poor quality data it can severely impact MOB-recon. As od v. 3.0.0, MOB-cluster has been re-written to utilize the output from MOB-typer to greatly speed up the process of updating and builing plasmid databases by using pre-computed results. Clusters generated from earlier versions of MOB-suite are not compatibile with the new clusters. We have povided a mapping file of previous cluster assignments and their new cluster accessions. Each cluster code is unique and will not be re-used.
167
+
Use this tool only to update the plasmid databases or build a new one, however MOB-cluster should only be run with closed high quality plasmids. If you add in poor quality data it can severely impact MOB-recon. As of v3.0.0, MOB-cluster has been re-written to utilize the output from MOB-typer to greatly speed up the process of updating and building plasmid databases by using pre-computed results. Clusters generated from earlier versions of MOB-suite are not compatible with the new clusters. We have provided a mapping file of previous cluster assignments and their new cluster accessions. Each cluster code is unique and will not be re-used.
146
168
147
169
```
148
170
### Build a new database
@@ -177,7 +199,7 @@ Use this tool only to update the plasmid databases or build a new one and should
177
199
# MOB-recon contig report format
178
200
| field | Description |
179
201
| --------- | --------- |
180
-
| sample_id | Sample ID specified by user or deault to filename |
202
+
| sample_id | Sample ID specified by user or default to filename |
181
203
| molecule_type | Plasmid or Chromosome |
182
204
| primary_cluster_id | primary MOB-cluster id of neighbor |
183
205
| secondary_cluster_id | secondary MOB-cluster id of neighbor |
@@ -205,12 +227,12 @@ Use this tool only to update the plasmid databases or build a new one and should
205
227
# MOB-typer report file format
206
228
| field | Description |
207
229
| --------- | --------- |
208
-
| sample_id | Sample ID specified by user or deault to filename |
230
+
| sample_id | Sample ID specified by user or default to filename |
209
231
| num_contigs | Number of sequences belonging to plasmid |
0 commit comments