Skip to content

Commit 320be6f

Browse files
committed
update Vignettes
1 parent 1564eb0 commit 320be6f

File tree

2 files changed

+27
-21
lines changed

2 files changed

+27
-21
lines changed

inst/extdata/motif_consistency.png

83.8 KB
Loading

vignettes/TFregulomeR.Rmd

Lines changed: 27 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -43,21 +43,27 @@ source, TF source ID, number of peaks and number of peaks with motif.
4343
In TFregulomeR project, we
4444
used MEME-ChIP to perform motif de novo discovery in each ChIP-seq. Highly and centrally enriched
4545
motifs were selected and compared with the existing TFBS databases, such as HOCOMOCO and JASPAR.
46-
Around 6% of highly enriched motifs were not consistent with the TFBS databases, and it might
47-
be due to the fact that in those cell types, the given TFs are indirectly recruited to genome.
48-
Besides, approximately 9% of motifs were not recorded for their corresponding TFs in the databases.
49-
In order to confirm the reliability of these 15% motifs, we used HOMER to perform a de novo motif
46+
89 highly enriched motifs were not consistent with the TFBS databases. it might
47+
be due to the fact that in those cell types, the given TFs are indirectly recruited to genome,
48+
and/or that highly abundant presence of cohesion and polycomb group proteins masks the motif
49+
enrichment of the ChIP’ed TF.
50+
Besides, 136 motifs were not recorded for their corresponding TFs in the databases.
51+
In order to confirm the reliability of these 225 motifs, we used HOMER to perform a de novo motif
5052
discovery again. Motif results by HOMER were compared with those by MEME-ChIP and their similarity
51-
were measured by normalised Pearson correlation coefficient using the formula: Ncor = cor * w / w_smaller,
53+
were measured by normalised Pearson correlation coefficient using compare-matrices function in
54+
RSAT with the formula: Ncor = cor * w / w_smaller,
5255
where cor is raw Pearson correlation coefficient, w is the alignment width of two matrices from
5356
MEME-ChIP and HOMER (the minimum value of w was set as 5), and w_smaller is the width of smaller
5457
motifs from MEME-ChIP and HOMER. We found that majority of those PWM matrices generated by MEME-ChIP,
5558
a combined algorithm suite of expectation maximization and regular expressions, were able to be
56-
recapitulated by HOMER, which takes advantage of hypergeometric enrichment. We have added the
57-
information into the last two columns of `dataBrowser` output (from v1.2.0).
59+
recapitulated by HOMER, which takes advantage of hypergeometric enrichment (Figure 1). We have added the information into the last two columns of `dataBrowser` output (from v1.2.0).
5860

5961
In particular, if no input is given for the function, all records in TFregulomeR compendium will be returned.
6062

63+
```{r echo=FALSE, fig.cap="Figure 1. Similarity of de novo enriched motifs by MEME-ChIP and HOMER. The beeswarm and violin plots show the normalised Pearson correlation coefficient of de novo motifs called by MEME-ChIP and HOMER, and the red dash denotes normalised Pearson correlation coefficient value 0.7.", out.width = '80%',fig.align="center"}
64+
knitr::include_graphics("../inst/extdata/motif_consistency.png")
65+
```
66+
6167
```{r eval=FALSE}
6268
library(TFregulomeR)
6369
# browse all records in TFregulomeR TFBS compendium
@@ -158,7 +164,7 @@ class(K562_CEBPB)
158164
#> [1] "TFregulomeR"
159165
```
160166

161-
After obtaining a MethMotif matrix, user can use `plotLogo` to plot logo as below (Figure 1).
167+
After obtaining a MethMotif matrix, user can use `plotLogo` to plot logo as below (Figure 2).
162168
If the TFBS source is MethMotif, then a MethMotif logo will be saved. Two options are available for motif logo,
163169
"entropy" and "frequency", and also different methylation levels
164170
("all", "methylated" and "unmethylated") can be opted for methylation bar charts. However,
@@ -194,7 +200,7 @@ plotLogo(HL60_CEBPB, logo_type = "entropy")
194200
#> Success: a PDF named 'GTRD-EXP040801_HSA_HL-60_CEBPB-logo-entropy.pdf' has been saved!
195201
```
196202

197-
```{r echo=FALSE, fig.cap="Figure 1. CEBPB (Meth)Motif logos in K562 and HL-60", out.width = '80%',fig.align="center"}
203+
```{r echo=FALSE, fig.cap="Figure 2. CEBPB (Meth)Motif logos in K562 and HL-60", out.width = '80%',fig.align="center"}
198204
knitr::include_graphics("../inst/extdata/plotLogo.png")
199205
```
200206

@@ -252,7 +258,7 @@ head(K562_CEBPB_peaks)
252258
## Study TFBS propensity
253259

254260
### Common peak regions
255-
```{r echo=FALSE, fig.cap="Figure 2. functionality in TFregulomeR for common peak analysis", out.width = '100%',fig.align="center"}
261+
```{r echo=FALSE, fig.cap="Figure 3. functionality in TFregulomeR for common peak analysis", out.width = '100%',fig.align="center"}
256262
knitr::include_graphics("../inst/extdata/commonPeaks_tutorial.png")
257263
```
258264

@@ -430,13 +436,13 @@ methylation_profile$HCT116_CEBPB_common_peaks
430436
#> [1,] NA
431437
```
432438

433-
```{r echo=FALSE, fig.cap="Figure 3. MethMotif logo of K562 CEBPB common peaks", out.width = '40%', fig.align="center"}
439+
```{r echo=FALSE, fig.cap="Figure 4. MethMotif logo of K562 CEBPB common peaks", out.width = '40%', fig.align="center"}
434440
knitr::include_graphics("../inst/extdata/MM1_HSA_K562_CEBPB_common_peaks-logo-entropy.png")
435441
```
436442

437443
### Exclusive peak regions
438444

439-
```{r echo=FALSE, fig.cap="Figure 4. functionality in TFregulomeR for exclusive peak analysis", out.width = '100%',fig.align="center"}
445+
```{r echo=FALSE, fig.cap="Figure 5. functionality in TFregulomeR for exclusive peak analysis", out.width = '100%',fig.align="center"}
440446
knitr::include_graphics("../inst/extdata/exclusivePeaks_tutorial.png")
441447
```
442448

@@ -589,13 +595,13 @@ methylation_profile$MM1_HSA_K562_CEBPB_exclusive_peaks
589595
#> 90-100% 40 #40 CpG methylation scores are more than 0.9 ((homogeneously methylated)) in +/-100bp window around exclusive peak summits
590596
```
591597

592-
```{r echo=FALSE, fig.cap="Figure 5. MethMotif logo of K562 CEBPB exclusive peaks", out.width = '40%', fig.align="center"}
598+
```{r echo=FALSE, fig.cap="Figure 6. MethMotif logo of K562 CEBPB exclusive peaks", out.width = '40%', fig.align="center"}
593599
knitr::include_graphics("../inst/extdata/MM1_HSA_K562_CEBPB_exclusive_peaks-logo-entropy.png")
594600
```
595601

596602
### Intersected peak matrix
597603

598-
```{r echo=FALSE, fig.cap="Figure 6. functionality in TFregulomeR for cofactor and interactome analysis", out.width = '100%',fig.align="center"}
604+
```{r echo=FALSE, fig.cap="Figure 7. functionality in TFregulomeR for cofactor and interactome analysis", out.width = '100%',fig.align="center"}
599605
knitr::include_graphics("../inst/extdata/intersectPeakMatrix_tutorial.png")
600606
```
601607

@@ -969,15 +975,15 @@ cofactorReport(intersectPeakMatrix = intersectMatrix_exclusive)
969975
#> ... ... ... Cofactor report for id 'MM1_HSA_K562_CEBPB' has been saved as MM1_HSA_K562_CEBPB_cofactor_report.pdf
970976
```
971977

972-
```{r echo=FALSE, fig.cap="Figure 7. MethMotif logo of K562 CEBPB common peaks intersected with K562 CEBPD peaks", out.width = '40%', fig.align="center"}
978+
```{r echo=FALSE, fig.cap="Figure 8. MethMotif logo of K562 CEBPB common peaks intersected with K562 CEBPD peaks", out.width = '40%', fig.align="center"}
973979
knitr::include_graphics("../inst/extdata/MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_CEBPD-logo-entropy.png")
974980
```
975981

976-
```{r echo=FALSE, fig.cap="Figure 8. MethMotif logo of K562 CEBPB exclusive peaks intersected with K562 ATF4 peaks", out.width = '40%', fig.align="center"}
982+
```{r echo=FALSE, fig.cap="Figure 9. MethMotif logo of K562 CEBPB exclusive peaks intersected with K562 ATF4 peaks", out.width = '40%', fig.align="center"}
977983
knitr::include_graphics("../inst/extdata/MM1_HSA_K562_CEBPB_overlapped_with_MM1_HSA_K562_ATF4-logo-entropy.png")
978984
```
979985

980-
```{r echo=FALSE, fig.cap="Figure 9. cofactorReport output", out.width = '100%', fig.align="center"}
986+
```{r echo=FALSE, fig.cap="Figure 10. cofactorReport output", out.width = '100%', fig.align="center"}
981987
knitr::include_graphics("../inst/extdata/cofactor_report.png")
982988
```
983989

@@ -1071,7 +1077,7 @@ plotDistrib(motifDistrib = motifDistrib_output)
10711077
#> Distribution of motif MM1_HSA_K562_CEBPB in peak set my_peak has been saved!
10721078
```
10731079

1074-
```{r echo=FALSE, fig.cap="Figure 10. Motif distribution", out.width = '100%', fig.align="center"}
1080+
```{r echo=FALSE, fig.cap="Figure 11. Motif distribution", out.width = '100%', fig.align="center"}
10751081
knitr::include_graphics("../inst/extdata/motif_distribution.png")
10761082
```
10771083

@@ -1124,13 +1130,13 @@ head(K562_CEBPB_exclusivePeak_loc)
11241130
#> 6 77;117;73
11251131
```
11261132

1127-
```{r echo=FALSE, fig.cap="Figure 11. HTML annotation report of the genomic locations of K562 CEBPB exclusive peak", out.width = '100%', fig.align="center"}
1133+
```{r echo=FALSE, fig.cap="Figure 12. HTML annotation report of the genomic locations of K562 CEBPB exclusive peak", out.width = '100%', fig.align="center"}
11281134
knitr::include_graphics("../inst/extdata/genomeAnnotate.png")
11291135
```
11301136

11311137
## Annotate TFBS functions
11321138

1133-
The key function of transcription factors is to regulate gene expression. By working with Genomic Regions Enrichment of Annotations Tool (GREAT), TFregulomeR allows users to annotate the functions of TFBSs using `greatAnnotate`. Given that GREAT server doesn't support hg38, liftOver R package has been incorporated in TFregulomeR to convert hg38 to hg19. The annotation output of `greatAnnotate` is intuitive, not only will a data.frame containing annotation results be returned, but also an HTML report will be saved. The HTML report takes advantage of `rbokeh` package, which presents a vivid and dynamic interface (Figure 12).
1139+
The key function of transcription factors is to regulate gene expression. By working with Genomic Regions Enrichment of Annotations Tool (GREAT), TFregulomeR allows users to annotate the functions of TFBSs using `greatAnnotate`. Given that GREAT server doesn't support hg38, liftOver R package has been incorporated in TFregulomeR to convert hg38 to hg19. The annotation output of `greatAnnotate` is intuitive, not only will a data.frame containing annotation results be returned, but also an HTML report will be saved. The HTML report takes advantage of `rbokeh` package, which presents a vivid and dynamic interface (Figure 13).
11341140

11351141
```{r eval=FALSE}
11361142
# annotate the functions of K562 CEBPB exclusive peaks
@@ -1172,7 +1178,7 @@ head(K562_CEBPB_exclusivePeak_func)
11721178
```
11731179

11741180

1175-
```{r echo=FALSE, fig.cap="Figure 12. HTML annotation report of the genes targeted by K562 CEBPB exclusive peak", out.width = '100%', fig.align="center"}
1181+
```{r echo=FALSE, fig.cap="Figure 13. HTML annotation report of the genes targeted by K562 CEBPB exclusive peak", out.width = '100%', fig.align="center"}
11761182
knitr::include_graphics("../inst/extdata/greatAnnotate.png")
11771183
```
11781184

0 commit comments

Comments
 (0)