You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -67,7 +67,7 @@ This module is used to construct structural variant databases from vcf files. Th
67
67
68
68
69
69
## Export
70
-
This module is used to export the variants of the SVDB sqlite database. The variants of the sqlite svdb database is clustered using one out of three algorihms, overlap or DBSCAN.
70
+
This module is used to export the variants of the SVDB sqlite database. The variants of the sqlite svdb database is clustered using one out of three algorithms, overlap or DBSCAN.
71
71
72
72
print a help message
73
73
svdb --export --help
@@ -81,9 +81,9 @@ This module is used to export the variants of the SVDB sqlite database. The vari
81
81
82
82
--overlap OVERLAP the overlap required to merge two events(0 means anything that touches will be merged, 1 means that two events must be identical to be merged), default = 0.8
83
83
84
-
--DBSCAN use dbscan to cluster the variants, overides the overlap based clustering algoritm
84
+
--DBSCAN use dbscan to cluster the variants, overides the overlap based clustering algorithm
85
85
86
-
--epsilon EPSILON used together with --DBSCAN; sets the epsilon paramter(default = 500bp)
86
+
--epsilon EPSILON used together with --DBSCAN; sets the epsilon parameter(default = 500bp)
87
87
88
88
--min_pts MIN_PTS the min_pts parameter(default = 2
89
89
@@ -92,30 +92,34 @@ This module is used to export the variants of the SVDB sqlite database. The vari
92
92
--memory load the database into memory: increases the memory requirements, but lowers the time consumption
93
93
94
94
## Query
95
-
The query module is used to query a structural variant database. Typically a database is constructed using the build module. However, since this module utilize the genotype field of the sructural variant database vcf to compute the frequency of structural variants, a wide range of files could be used as database. The query module requires a query vcf, as well as a database file(either multisample vcf or SVDB sqlite database):
95
+
The query module is used to query one or more structural variant databases. Typically a database is constructed using the build module. However, since this module utilize the genotype field of the structural variant database vcf to compute the frequency of structural variants, a wide range of files could be used as database. The query module requires a query vcf, as well as a database file(either multisample vcf or SVDB sqlite database):
96
96
97
97
print a help message
98
98
svdb --query --help
99
99
Query a structural variant database, using a vcf file as query:
--bedpedb BEDPEDB path to a SV database of the following format chrA-posA-chrB-posB-type-count-frequency
110
-
--in_occ IN_OCC The allele count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AN or OCC)
111
-
--in_frq IN_FRQ The frequency count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AF or FRQ)
112
-
--out_occ OUT_OCC the allle count tag, as annotated by SVDB variant(defualt=OCC)
113
-
--out_frq OUT_FRQ the tag used to describe the frequency of the variant(defualt=FRQ)
114
-
--prefix PREFIX the prefix of the output file, default = print to stdout
111
+
--db DB path to a db vcf, or a comma separated list of vcfs
112
+
--sqdb SQDB path to a SVDB sqlite db, or a comma separated list of dbs
113
+
--bedpedb BEDPEDB path to a SV database of the following format chrA-posA-chrB-posB-type-count-frequency, or a comma separated list of files
114
+
--in_occ IN_OCC The allele count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AN or OCC). This parameter is required if multiple databases are queried.
115
+
--in_frq IN_FRQ The frequency count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AF or FRQ). This parameter is required if multiple databases are queried.
116
+
--out_occ OUT_OCC the allele count tag, as annotated by SVDB variant(default=OCC). This parameter is required if multiple databases are queried.
117
+
--out_frq OUT_FRQ the tag used to describe the frequency of the variant(default=FRQ). This parameter is required if multiple databases are queried.
118
+
--prefix PREFIX the prefix of the output file, default = print to stdout. Required if multiple databases are queried.
115
119
--bnd_distance BND_DISTANCE the maximum distance between two similar breakpoints(default = 10000)
116
120
--overlap OVERLAP the overlap required to merge two events(0 means anything that touches will be merged, 1 means that two events must be identical to be merged), default = 0.6
117
121
--memory load the database into memory: increases the memory requirements, but lowers the time consumption(may only be used with sqdb)
118
-
--no_var count overlaping variants of different type as hits in the db
122
+
--no_var count overlapping variants of different type as hits in the db
print("please ensure that both count and frequency tags are specified for all samples")
35
+
else:
36
+
query_module.main(args)
5
37
6
38
defmain():
7
-
version="2.5.1"
39
+
version="2.5.2"
8
40
parser=argparse.ArgumentParser(
9
41
"""SVDB-{}, use the build module to construct databases, use the query module to query the database usign vcf files, or use the hist module to generate histograms""".format(version), add_help=False)
10
42
parser.add_argument('--build', help="create a db",
@@ -22,22 +54,22 @@ def main():
22
54
"""SVDB.{}: query module""".format(version))
23
55
parser.add_argument('--query', help="query a db", required=False, action="store_true")
24
56
parser.add_argument('--query_vcf', type=str, help="a vcf used to query the db", required=True)
25
-
parser.add_argument('--db', type=str, help="path to a SVDB db vcf ")
26
-
parser.add_argument('--sqdb', type=str, help="path to a SVDB sqlite db")
57
+
parser.add_argument('--db', type=str, help="path to a SVDB db vcf or a comma separated list of vcfs")
58
+
parser.add_argument('--sqdb', type=str, help="path to a SVDB sqlite db or a comma separated list of dbs")
27
59
parser.add_argument('--bedpedb', type=str,
28
-
help="path to a SV database of the following format chrA-posA-chrB-posB-type-count-frequency")
60
+
help="path to a SV database of the following format chrA-posA-chrB-posB-type-count-frequency, or a or a comma separated list of dbs")
29
61
parser.add_argument('--in_occ', type=str,
30
-
help="The allele count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AC or OCC)")
62
+
help="The allele count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AC or OCC), required if multiple databases are queried. Use default (as shown in the example in README) if you'd like to use default tag for a specific database")
31
63
parser.add_argument('--in_frq', type=str,
32
-
help="The frequency count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AF or FRQ)")
64
+
help="The frequency count tag, if used, this tag must be present in the INFO column of the input DB(usually set to AF or FRQ), required if multiple databases are queried. Use default (as shown in the example in README) if you'd like to use default tag for a specific database")
0 commit comments