8000 GitHub - lrylaarsdam/amethyst: An analysis toolkit for single-cell methylation sequencing data
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

lrylaarsdam/amethyst

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Amethyst: A METHyl-c Single-cell analysis Toolkit

Amethyst

Single-cell sequencing technologies have revolutionized biomedical research by enabling deconvolution of cell type-specific properties in highly heterogeneous tissue. While robust tools have been developed to handle bioinformatic challenges posed by single-cell RNA and ATAC data, options for emergent modalities such methylation are much more limited, impeding the utility of results. Here we present Amethyst, the first comprehensive R package for atlas-scale single-cell methylation sequencing data analysis. Amethyst takes base-level methylation calls and facilitates batch integration, doublet detection, dimensionality reduction, clustering, cell type annotation, differentially methylated region calling, and interpretation of results all in one streamlined platform. Versatile visualization functions mediate rapid interaction with the data in a local environment. Efforts like Amethyst will increase accessibility to single-cell methylation data interpretation, accelerating progress in understanding principles of this critical epigenetic modification across diverse contexts. To learn more, see our preprint!

Installation

Installation of Amethyst can be done using devtools:

library(devtools)
devtools::install_github("lrylaarsdam/amethyst")

You will likely need to install one or more dependencies:

if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install()

library(BiocManager)
BiocManager::install(c("caret", "devtools", "data.table", "dplyr", "furrr", "future", "future.apply",
  "ggplot2", "grDevices", "gridExtra", "igraph", "irlba", "janitor", "Matrix", "methods", 
  "plotly", "plyr", "purrr", "randomForest", "rhdf5", "rtracklayer", "scales", "stats", "stringr", 
  "tibble", "tidyr", "umap", "utils"))

devtools::install_github("JinmiaoChenLab/Rphenograph")
devtools::install_github("KrishnaswamyLab/MAGIC/Rmagic")

Getting Started

Amethyst begins with base-level methylation calls per cell wrapped into .h5 files. If you need to generate this file from your sequencing data, scripts for initial processing of reads are available at the Adey Lab Premethyst repo. Please see vignettes for example Premethyst outputs and subsequent analysis steps.

If you are using the Scale Biosciences pipeline, we have written a helper function to load the output into an Amethyst object. createScaleObject automatically populates the metadata and h5path slots for you. In its most basic form, all that is needed is the directory path:

obj <- createScaleObject(directory = "path/to/scalebio/output/folder")

You may also wish to load any pre-generated matrices, which would allow one to skip past the makeWindows step in the vignette. Below is an example of how to load the "CG.score" matrix. Double-check your computational resources are capable of handling the entire matrix size first.

obj <- createScaleObject(directory = "path/to/scalebio/output/folder", genomeMatrices = c("CG.score"))

Vignettes

To become familiar with the Amethyst workflow, we recommend beginning with the pbmc vignette, which is focused on CG methylation and applicable to any tissue.

Certain tissues - such as the brain and stem cells - also contain high levels of non-CG methylation and necessitate a very different workflow. After completing the pbmc vignette, we recommend going over the brain vignette for CH-specific analysis.

In addition to these general workflow examples, we have specific vignettes for:

Issues

Amethyst is still a work in progress. Please let us know if any issues come up.

Updates

  • Aug 27 2024: Added compatibility for more genome builds Affected functions: makeRef, fetchMarkers, makeWindows, calcSmoothedWindows
  • Nov 1 2024: Added flexibility with visualization parameters
    • Affected functions: histograM, heatMap, dotM
    • Examples:
      • histograM baseline can either be 0 or mean methylation (credit: Ryan Mulqueen, PhD)
      • heatMap color scale and max value can be adjusted
      • dotM can be further faceted by a variable in the metadata
  • Nov 1 2024: switched to logFC = log2(mean_1 / mean_2) (credit: Joe Verity-Legg)
    • Affected functions: findClusterMarkers
  • Nov 14: Vignettes added

License

Amethyst is distributed under the MIT License. Please see LICENSE.txt for further information.

About

An analysis toolkit for single-cell methylation sequencing data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  
0