You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A directory called `reference/mm39` will be created and you will use the `mm39` as the species name in other peakScout operations.
99
+
A directory called `reference/mm39` will be created and should be used as the `ref_dir` argument for downstream peakScout operations.
92
100
93
101
### Finding Nearest Genes
94
102
95
103
Once a reference GTF has been decomposed, you can use the decomposition to find the nearest genes to your peaks. Peak files can be MACS2, SEACR outputs, or standard BED6 format files and can be Excel sheets or BED files.
96
104
97
-
Run the following command to create an Excel sheet containing the nearest k genes to your peaks
98
-
```bash
99
-
peakScout peak2gene --peak_file /path/to/peak/file --peak_type MACS2/SEACR/BED6 --species species of gtf --k number of nearest genes --ref_dir /path/to/reference/directory --output_name name of output file --o /path/to/save/output --output_type csv/xslx
|`output_type`|`str`| Output type (csv file or xlsx file). |
114
+
|`species_genome`|`str`| Species of the reference genome. |
115
+
|`option`|`str`| Option for defining start and end positions of peaks. Default native_peak_bounaries. |
116
+
|`boundary`|`int`| Boundary for artificial peak boundary option. `None` if other options. |
117
+
|`up_bound`|`int`| Maximum allowed distance between peak and upstream feature. Default `None`. |
118
+
|`down_bound`|`int`| Maximum allowed distance between peak and downstream feature. Default `None`. |
119
+
|`consensus`|`bool`| Whether to use consensus peaks. Default `False`. |
120
+
|`drop_columns`|`bool`| Whether to drop unnecessary columns from the original file. Default `False`. |
121
+
|`view_window`|`float`| Proportion of the peak region in entire genome browser window. Default `0.2`. |
103
122
123
+
Run the following command to create an Excel sheet containing the nearest k genes to your peaks
104
124
```bash
105
125
peakScout peak2gene \
106
-
--peak_file test/test_MACS2.bed \
107
-
--peak_type MACS2 \
108
-
--species mm39 \
109
-
--k 2 \
110
-
--ref_dir reference/mm39 \
111
-
--output_name peakScout_test_MACS2 \
112
-
--o my_output_dir \
113
-
--output_type xslx
126
+
--peak_file /path/to/peak/file \
127
+
--peak_type MACS2/SEACR/BED6 \
128
+
--species_genome UCSC-defined species of gtf \
129
+
--k number of nearest genes \
130
+
--ref_dir /path/to/reference/directory \
131
+
--output_name name of output file \
132
+
--o /path/to/save/output \
133
+
--output_type csv/xslx
114
134
```
115
135
116
136
Specific example:
@@ -119,7 +139,7 @@ Specific example:
119
139
peakScout peak2gene \
120
140
--peak_file test/test_MACS2.bed \
121
141
--peak_type MACS2 \
122
-
--species mm39 \
142
+
--species_genome mm39 \
123
143
--k 2 \
124
144
--ref_dir reference/mm39 \
125
145
--output_name peakScout_test_MACS2 \
@@ -129,16 +149,46 @@ peakScout peak2gene \
129
149
130
150
### Finding Nearest Peaks
131
151
132
-
Once a reference GTF has been decomposed, you can use the decomposition to find the nearest peaks to a set of genes. Peak files can be MACS2, SEACR outputs, or standard BED6 format files and can be Excel sheets or BED files. Gene names should be in a single column CSV file with no header.
152
+
Once a reference GTF has been decomposed, you can also use the decomposition to find the nearest peaks to a set of genes. Peak files can be MACS2, SEACR outputs, or standard BED6 format files and can be Excel sheets or BED files. Gene names should be in a single column CSV or txt file with no header.
|`output_type`|`str`| Output type (csv file or xlsx file). |
164
+
|`option`|`str`| Option for defining start and end positions of peaks. Default native_peak_boundaries. |
165
+
|`boundary`|`int`| Boundary for artificial peak boundary option. `None`å if other options. |
166
+
|`consensus`|`bool`| Whether to use consensus peaks. Default `False`. |
133
167
134
168
Run the following command to create an Excel sheet containing the nearest k peaks to your genes
135
169
```bash
136
-
peakScout gene2peak --peak_file /path/to/peak/file --peak_type MACS2/SEACR/BED6 --gene_file /path/to/gene/file --species species of gtf --k number of nearest peaks --ref_dir /path/to/reference/directory --output_name name of output file --o /path/to/save/output --output_type csv/xslx
170
+
peakScout gene2peak \
171
+
--peak_file /path/to/peak/file \
172
+
--peak_type MACS2/SEACR/BED6 \
173
+
--gene_file /path/to/gene/file \
174
+
--k number of nearest peaks \
175
+
--ref_dir /path/to/reference/directory \
176
+
--output_name name of output file \
177
+
--o /path/to/save/output \
178
+
--output_type csv/xslx
137
179
```
138
180
139
181
Specific example:
140
182
```bash
141
-
peakScout gene2peak --peak_file /path/to/peak/file --peak_type MACS2/SEACR/BED6 --gene_file /path/to/gene/file --species species of gtf --k number of nearest peaks --ref_dir /path/to/reference/directory --output_name name of output file --o /path/to/save/output --output_type csv/xslx
183
+
peakScout gene2peak \
184
+
--peak_file test/test_MACS2.bed \
185
+
--peak_type MACS2 \
186
+
--gene_file test/test_genes.txt \
187
+
--k 3 \
188
+
--ref_dir reference/mm39 \
189
+
--output_name test_gene2peak_MACS2 \
190
+
--o my_output_dir \
191
+
--output_type csv
142
192
```
143
193
144
194
## peakScout ready-made references for common organisms
0 commit comments