Description

During this workshop participants will be introduced to GenomicDistributions package. At first, we will go over a general introduction to GenomicDistributions, where we will learn about the purpose, design, and advantages of GenomicDistributions. Followed by instructor-led demo, we will go through all of the functions included in GenomicDistributions, where participants will be able to try the functions on provided example datasets. The participants will learn, how to calculate and plot summary statistics extracted from the example datasets and how to further customize plots generated with GenomicDistributions. Some of the GenomicDistributions functions require genome annotations (for hg19, hg38, and mm10 these are all included in the GenomicDistributionsData package and passed to a subset of functions ending with the suffix “Ref”), the participants will be taught how to easily create and use these annotations. At the end of the workshop, all participants will be able to extract following summary statistics about their regions of interest: distribution over chromosomes, distances from genomic features (e.g., TSSs), distribution over annotation classes, signal summary in regions, region widths, distances to the nearest neighbor, GC content, and dinucleotide frequencies.

Pre-requisites

Helpful but not required:

  • Basic knowledge of R syntax
  • Familiarity with the GenomicRanges class

Anyone interested in getting familiar with GenomicDistributions before the workshop is welcome to that that by reading the GenomicDistributions publication or vignettes provided with the package:

Participation

Participants will be able to test all of the functions provided in GenomicDistributions on example datasets, learn how to customize plots and create genome annotations required by some of the functions.

R / Bioconductor packages used

  • GenomicDistributions
  • GenomicDistributionsData

Time outline

Activity Time
Introduction 10m
Package demo 25m
Questions 10m