Prerequisites, data summary and availability

Software

In terms of software, we are mainly using deeptools2, which is available to you by loading the “gencore” and “gencore_variant_detection” modules. We will also explore an alternative way of generating the CHiP-seq plots in R and Rstudio using the bioconductor package CoverageView.

Data

As mentioned earlier, our exercise will focus on the data generated as part of the paper in (Xin et al). The data represents 2 conditions, knockout (KO) and wildtype (WT), with 2 different modifications, Brg1 and H3K9Me3. In addition, we have 1 Input for each of our conditions. So in total we have the 6 samples below,

Sample_KO_Brg1_Chip
Sample_KO_H3K9Me3_Chip
Sample_KO_Input
Sample_WT_Brg1_Chip
Sample_WT_H3K9Me3_Chip
Sample_WT_Input

The data can be found at,

/scratch/gencore/datasets/chipseq_dataset.tar.gz

Getting the data and creating your analysis directory

Open up a terminal
Log in to your HPC (Dalma)
```
ssh yournetid@dalma.abudhabi.nyu.edu
```
Change into your scratch directory
```
cd $SCRATCH
```

Copy the datasets

cp /scratch/gencore/datasets/chipseq_dataset.tar.gz .

Untar the dataset and cd to the analysis directory.

tar -xvf chipseq_dataset.tar.gz
cd chipseq_analysis

Change into the bigWigs directory
```
ls -lah bigWigs
```