# SCEVAN **Repository Path**: joyeric_admin_admin/SCEVAN ## Basic Information - **Project Name**: SCEVAN - **Description**: A fast variational algorithm to detect the clonal copy number substructure of tumors from single-cell data - **Primary Language**: Unknown - **License**: GPL-3.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2022-09-06 - **Last Updated**: 2022-09-07 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Single CEll Variational Aneuploidy aNalysis (SCEVAN)

Preprint Link: [A fast variational algorithm to detect the clonal copy number substructure of tumors from single-cell data](https://www.biorxiv.org/content/10.1101/2021.11.20.469390v1) ## Introduction SCEVAN is an R package that starting from the raw count matrix of scRNA data automatically classifies the cells present in the biopsy by segregating non-malignant cells of tumor microenviroment from the malignant cells and also characterizes the clonal structure of these malignant cells. It identfies cell subpopulations with different copy number architecture and reports g the specific and shared alterations of each subpopulation. The aim of the tool is to automate the entire analysis by allowing it to be performed in a very simple and completely unsupervised way. Analyses carried out on 106 samples and 93332 cells show better classification with an F1 score for all samples of 0.90 compared to 0.63 obtained with the state-of-the-art tools. It also explits a greedy multichannel segmentation algorithms making it particularly fast even for large datasets. ## Installation ``` library(devtools) install_github("miccec/yaGST") install_github("AntonioDeFalco/SCEVAN") library(SCEVAN) ``` ## Usage ### Single-sample analysis A single call (pipelineCNA) allows the execution of the entire analysis of classification and characterization of clonal structure. - ***count_mtx*** : Count matrix with genes on rows (both Gene Symbol or Ensembl ID are allowed) and cells on columns. - ***sample*** : Sample name to save results (optional) - ***par_cores*** : Number of cores to run the pipeline (optional - default 20) - ***norm_cells*** : vectors of normal cells if the classification is already known and you are only interested in the clonal structure (optional) - ***SUBCLONES*** : Boolean value TRUE if you are interested in analysing the clonal structure and FALSE if you are only interested in the classification of malignant and non-malignant cells (optional - default 20) - ***beta_vega*** : Specifies beta parameter for segmentation, higher beta for more coarse-grained segmentation. (optional - default 0.5) - ***ClonalCN*** : Get clonal CN profile inference from all tumour cells (optional) - ***plotTree*** : Plot Phylogenetic tree (optional - default FALSE) - ***AdditionalGeneSets*** : list of additional signatures of normal cell types (optional) - ***SCEVANsignatures*** : FALSE if you only want to use only the signatures specified in AdditionalGeneSets(optional - default TRUE) - ***organism*** : Organism to be analysed (optional - default human) ``` results <- pipelineCNA(count_mtx) ``` ### Multi-sample analysis A single call (multiSampleComparisonClonalCN) allows the comparison of clonal profiles of multiple samples. - ***listCountMtx*** : Named list of raw count matrix of samples to be analysed - ***analysisName*** : Name of the analysis (optional) - ***organism*** : Organism to be analysed (default human) - ***par_cores*** : Number of cores (default 20) ``` multiSampleComparisonClonalCN(listCountMtx) ``` ### Integration with Seurat Integration of information obtained with SCEVAN (aneuploidy/diploid, subclones) into Seurat object. - ***count_mtx*** : Count matrix of sample. ``` results <- pipelineCNA(count_mtx) #Create Seurat Object with SCEVAN info seurObj <- Seurat::CreateSeuratObject(count_mtx, meta.data = results) #or add SCEVAN info to an existing Seurat object seurObj <-Seurat::AddMetaData(seurObj, metadata = results) ``` ## Usage examples (vignettes) - [Intratumoral heterogeneity](http://htmlpreview.github.io/?https://github.com/AntonioDeFalco/SCEVAN/blob/main/vignettes/IntratumoralHeterogeneityInGlioblastoma.html) - [Multi-sample analysis (2 samples)](http://htmlpreview.github.io/?https://github.com/AntonioDeFalco/SCEVAN/blob/main/vignettes/multiSamplesHeadNeck.html) - [Multi-sample analysis (3 samples)](http://htmlpreview.github.io/?https://github.com/AntonioDeFalco/SCEVAN/blob/main/vignettes/multiSamples.html) ## Sample Datasets We provide some pre-processed samples used in the examples (vignettes): - ***MGH106.RData*** and ***listCountMtx.RData***: scRNA data of MGH102, MGH104, MGH105 and MGH106 samples from the public dataset of Gliobastoma (GSE131928), you can download the pre-processed data from [here](https://www.dropbox.com/s/b9udpvhnc2ez9pc/MGH106_data.RData?dl=0) and [here](https://www.dropbox.com/s/esqvnltucdqajg1/listCountMtx.RData?dl=0) - ***HNSCC26.RData*** : scRNA data of HNSCC26 Primary and HNSCC26 Lymph Node sample from the public dataset of Head&Neck cancer (GSE10332), you can download the pre-processed data from [here](https://www.dropbox.com/s/6zns12amobs39g8/HNSCC26_data.RData?dl=0) ## Citation > >@article {De Falco2021.11.20.469390,\ > author = {De Falco, Antonio and Caruso, Francesca P and Su, Xiao Dong and Iavarone, Antonio and Ceccarelli, Michele},\ > title = {A fast variational algorithm to detect the clonal copy number substructure of tumors from single-cell data},\ > elocation-id = {2021.11.20.469390},\ > year = {2021},\ > doi = {10.1101/2021.11.20.469390},\ > publisher = {Cold Spring Harbor Laboratory},\ URL = { https://www.biorxiv.org/content/early/2021/11/22/2021.11.20.469390 }, eprint = { https://www.biorxiv.org/content/early/2021/11/22/2021.11.20.469390.full.pdf }, \ > journal = {bioRxiv}\ >}