Publication

EDIR

Journal Contribution - Journal Article

Subtitle: Exome Database of Interspersed Repeats

MOTIVATION: Intragenic exonic deletions are known to contribute to genetic diseases and are often flanked by regions of homology.

RESULTS: To obtain a clearer view of these interspersed repeats encompassing a coding sequence, we have developed EDIR (Exome Database of Interspersed Repeats), which contains the positions of these structures within the human exome. EDIR was computed using an inductive strategy rather than a brute-force approach, and can be queried through an R/Bioconductor package or a web interface, allowing rapid per-gene extraction of homology-flanked sequences throughout the exome.

AVAILABILITY AND IMPLEMENTATION: The code used to compile EDIR can be found at https://github.com/lauravongoc/EDIR. The full EDIR dataset can be queried via an Rshiny application at http://193.70.34.71:3857/edir/. The R package for querying EDIR is called 'EDIRquery' and is available on Bioconductor. The full EDIR dataset can be downloaded from https://osf.io/m3gvx/ or http://193.70.34.71/EDIR.tar.gz.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Journal: Bioinformatics
ISSN: 1367-4803
Issue: 1
Volume: 39
Publication year: 2023
Keywords: EDIR, Database, interspersed repeats
Accessibility: Open