# sedef
**Repository Path**: wang_shuai_1996/sedef
## Basic Information
- **Project Name**: sedef
- **Description**: 生信分析软件啊啊啊啊啊
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2022-12-20
- **Last Updated**: 2022-12-20
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# 🔴 ⚠️ SEDEF has been deprecated. Please use [BISER](https://github.com/0xTCG/biser/) (SEDEF's successor) instead. ⚠️ 🔴
# SEDEF: Segmental Duplication Evaluation Framework
SEDEF is a tool for quick detection of segmental duplications in a genome.
## Paper
SEDEF has been presented at [ECCB 2018](http://eccb18.org) (DOI [10.1093/bioinformatics/bty586](https://doi.org/10.1093/bioinformatics/bty586)).
Preprint is [available here](https://arxiv.org/abs/1807.00205).
Get the final paper [here](https://academic.oup.com/bioinformatics/article/34/17/i706/5093240).
### Results
| 👨🎨 Human (hg38) | 👨🎨 Human (hg19) | 🐭 Mouse (mm8) |
|-----|-----|-----|
| [Final calls](http://alkanlab.org/share/sedef/hg38.bed) | [Final calls](http://cb.csail.mit.edu/cb/sedef/hg19.bed) | [Final calls](http://cb.csail.mit.edu/cb/sedef/mm8.bed) |
The experiment pipeline from the paper is described [in this Jupyter notebook](paper/experiments.ipynb).
## How to compile
Simple! Do this:
```bash
git clone https://github.com/vpc-ccg/sedef
cd sedef
make -j release
```
By default, SEDEF uses Intel C++ compiler. If you are using g++, build with:
```bash
make -j release CXX=g++
```
If you are using Clang on macOS, compile as
```bash
brew install libomp
make -j release OPENMP="-Xpreprocessor -fopenmp" CXX=clang++
```
> You need at least g++ 5.1.0 (C++14) to compile SEDEF. Clang should work fine as well.
SEDEF requires Boost libraries in order to compile. In case you installed Boost in a non-standard directory, you can still compile as follows:
```bash
CPATH={path_to_boost} make -j release
```
## How to run
The genome assembly **must be soft-masked** (i.e. all common and tandem repeats should be converted to lower-case letters) and **indexed**.
Suppose that our genome is `hg19.fa` (we use UCSC hg19 genome with 24 standard chromosomes that does not contain patches (unGl) or random strains (chrXX_random)).
### Automatic transmission
Just go to `sedef` directory and run
```bash
./sedef.sh -o