COCOA is a method for understanding epigenetic variation among samples. It can be used with any epigenetic data that includes genomic coordinates and an epigenetic signal, such as DNA methylation or chromatin accessibility data. At a high level, COCOA quantifies inter-sample variation with either a supervised or an unsupervised technique, then uses a database of "region sets" to annotate that variation. A region set is a set of genomic regions that share a biological annotation, for instance transcription factor (TF) binding regions, histone modification regions, or open chromatin regions. By identifying the region sets associated with epigenetic variation between samples, COCOA helps you interpret the variation in your data.
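The two steps described above (quantify inter-sample variation, then annotate it with a region set) can be illustrated with a toy sketch in base R. This is not COCOA's API. The simulated methylation matrix, the hypothetical `region_set`, the choice of PCA as the unsupervised technique, and the mean-absolute-loading score are all assumptions made for illustration; the package vignette shows the actual functions.

```r
# Toy illustration only: base R, NOT the COCOA package API.
set.seed(1)
n_samples <- 10
n_sites   <- 200

# Hypothetical genomic coordinates of the measured sites (one chromosome).
site_pos <- sort(sample.int(1e6, n_sites))

# Simulated DNA methylation signal (sites x samples), values in [0.4, 0.6].
meth <- matrix(runif(n_sites * n_samples, 0.4, 0.6), nrow = n_sites)

# Hypothetical region set, e.g. binding regions of one transcription factor.
region_set <- data.frame(start = c(100000, 500000),
                         end   = c(200000, 600000))

# Flag the sites that fall inside any region of the region set.
in_set <- vapply(
  site_pos,
  function(p) any(p >= region_set$start & p <= region_set$end),
  logical(1)
)

# Plant inter-sample variation at those sites: the second half of the
# samples gains 0.3 methylation inside the region set.
group <- rep(c(0, 1), each = n_samples / 2)
meth[in_set, ] <- sweep(meth[in_set, , drop = FALSE], 2, 0.3 * group, "+")

# Step 1 (unsupervised): quantify inter-sample variation with PCA.
# Samples are rows, sites are columns; each site gets a loading on PC1.
pca <- prcomp(t(meth), center = TRUE, scale. = FALSE)
loadings_pc1 <- abs(pca$rotation[, 1])

# Step 2: annotate the variation by scoring the region set as the mean
# absolute PC1 loading of its sites, compared with all other sites.
score_in  <- mean(loadings_pc1[in_set])
score_out <- mean(loadings_pc1[!in_set])
cat("in set:", score_in, " outside set:", score_out, "\n")
```

Because the planted variation is confined to the region set, its sites carry most of the PC1 loading, so the region set scores higher than the background. With real data you would repeat the scoring for every region set in a database and rank them.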
To install from Bioconductor (recommended):
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")
BiocManager::install("COCOA")
COCOA may also be installed from GitHub:
devtools::install_github("databio/COCOA")
or locally after downloading/cloning the source code:
install.packages("path/to/COCOA/directory", repos=NULL, type="source")
The package includes a vignette that gives an overview of COCOA and walks you through multiple analysis scenarios with code.