Published August 19, 2019 | Version v2
Dataset
Open
BPNet manuscript data 1
Creators
Description
This repository holds the trained BPNet models and all output files produced with them, including TF-MoDISco files and motif instances.
Files:
- output.tar.gz
  - Full output directory containing trained BPNet models, contribution scores, TF-MoDISco results, motif instances, etc.
  - See https://github.com/kundajelab/bpnet-manuscript for more information (archived at https://zenodo.org/record/4294814 with DOI: https://doi.org/10.5281/zenodo.4294813).
- bpnet.model.h5
  - BPNet Keras model. This model can be used directly via the Kipoi model repository: http://kipoi.org/models/BPNet-OSKN
- regions.bed
  - ChIP-nexus peak regions (resized to 1 kb width) on which BPNet was trained and interpreted.
  - The third column denotes the TF for which the peak was called.
- motif-instances.bed
  - BPNet motif instances for 11 representative motifs.
  - Stored as a BED file with the following columns:
    - Chromosome
    - Start
    - End
    - Motif name
    - 'match' score: similarity to the CWM, computed using the continuous Jaccard distance metric between the CWM and the L1-normalized contribution scores
    - Strand
    - 'contrib' score: computed as the L1 norm of the contribution scores at the motif instance position
    - The log-odds score with respect to the PWM derived from the PFM (i.e. the classical PWM score)
- <TF>.preds.<strand>.bw
  - Model predictions in `regions.bed` for a particular TF in {Oct4, Sox2, Nanog, Klf4} and strand in {pos, neg}.
- <TF>.importance.counts.bw
  - Total count contribution scores for a particular TF.
- <TF>.importance.profile.bw
  - Profile contribution scores. These were used to run TF-MoDISco and downstream analysis.
- PWM,CWM,ChIP-nexus-profiles.tar.gz
  - Directory containing the PFM, CWM and aggregated ChIP-nexus footprint for 11 representative motifs.
- Figure-5d-periodicity.csv
  - Raw data corresponding to Figure 5d, where the 10 bp periodicity of contribution scores was computed for each motif.
- dfabf.Oct4-Sox2-subset.parq
  - Parquet file (read with `pd.read_parquet`) of genomic motif instance pairs along with the BPNet-predicted interaction score.
  - See 08-motif-interactions.genomic.ipynb for more information on how to generate plots for Figure 5 from this data.
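
The column layout listed for motif-instances.bed can be read with a few lines of Python. The following is a minimal sketch, not the authors' loading code. It assumes the file is tab-separated with no header line, and the column names (`chrom`, `match_score`, and so on) are illustrative labels chosen here to follow the listed column order. The two sample rows are made-up values used only to exercise the parser.

```python
import csv
import io
from collections import Counter

# Illustrative column names (not taken from the file itself); the order
# follows the column list given in the description of motif-instances.bed.
MOTIF_COLUMNS = [
    "chrom", "start", "end", "motif",
    "match_score", "strand", "contrib_score", "pwm_log_odds",
]


def read_motif_instances(handle):
    """Parse a headerless, tab-separated motif-instance file into dicts.

    Assumption: one motif instance per line, eight tab-separated fields.
    """
    reader = csv.DictReader(handle, fieldnames=MOTIF_COLUMNS, delimiter="\t")
    rows = []
    for row in reader:
        row["start"] = int(row["start"])
        row["end"] = int(row["end"])
        for key in ("match_score", "contrib_score", "pwm_log_odds"):
            row[key] = float(row[key])
        rows.append(row)
    return rows


# Made-up example rows; real values come from motif-instances.bed.
sample = io.StringIO(
    "chr1\t100\t115\tOct4-Sox2\t0.52\t+\t1.8\t7.1\n"
    "chr2\t200\t209\tNanog\t0.41\t-\t0.9\t5.3\n"
)
instances = read_motif_instances(sample)

# Count instances per motif name.
per_motif = Counter(row["motif"] for row in instances)
```

To read the actual file, pass an open file handle instead of the in-memory sample, for example `open("motif-instances.bed", newline="")`.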
Files
(22.0 GB)
| Name | Size |
|---|---|
| md5:bbe883baef261877bfad07d05feb627d | 1.8 MB |
| md5:574990ec95c1a76b2d416cfa5daf0579 | 3.0 GB |
| md5:a04953eb9292ef36e1ed091532350178 | 7.6 kB |
| md5:ab68ce31fac4c20e320cd3d8d0bfe750 | 515.7 MB |
| md5:d50d400649a756fff473734ad8caa0cb | 518.2 MB |
| md5:fc7cd41060d628cc2788a61607ebe5cd | 424.2 MB |
| md5:a14c07bdd16747414ab250d79124cd7f | 502.9 MB |
| md5:534b5f426a7b681611f630431d0a616b | 21.7 MB |
| md5:a900b478a72476dc6b0dcd3c4dc16092 | 516.8 MB |
| md5:9a65f5b05158added2809c16817455de | 519.9 MB |
| md5:7fbece49a0947b3fc4ccfac82885a913 | 427.6 MB |
| md5:c51c4eeee39c4af7787efd597f92e2d9 | 505.7 MB |
| md5:22ed0f43fb9a201a0463acbb41691a59 | 513.1 MB |
| md5:89e117c702d66b60e4265c0a48e44540 | 515.3 MB |
| md5:b9d4d7b2a32fe0efed95251a57f4c39c | 423.8 MB |
| md5:b93222b5b3f6e3ece9d0f6a7aab2aeb5 | 502.4 MB |
| md5:f7efea1f8b0d19e908551e79780466e3 | 11.1 GB |
| md5:ce7002737f556ad61272c6d6794ce967 | 54.0 kB |
| md5:388574886e6dc522979ac030103216dc | 4.3 MB |
| md5:eae18518f61a74b7a43dad4ccc9356c2 | 514.2 MB |
| md5:087a8d2d25decc7dbb09acda2ac4c2cf | 515.8 MB |
| md5:af843ea9a7e4cb36e2c798bd1a8dd6b3 | 421.1 MB |
| md5:ac0cbbaa7c9d4fcef76aeea59211e744 | 500.6 MB |
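
Because several files are hundreds of megabytes or larger, it can be worth checking a download against the `md5:` values listed above. The sketch below uses only Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listing` are hypothetical, and the demonstration hashes a small temporary file rather than one of the dataset files.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_value):
    """Compare a file's digest with a listed value such as 'md5:abc123...'."""
    expected = listed_value.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Demonstration on a throwaway file (not part of the dataset).
payload = b"example payload"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    demo_path = tmp.name

demo_digest = md5_of_file(demo_path)
demo_ok = matches_listing(demo_path, "md5:" + hashlib.md5(payload).hexdigest())
os.remove(demo_path)
```

For a real check, call `matches_listing` with the path of the downloaded file and the corresponding `md5:` entry copied from the table.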