Download PDFOpen PDF in browser

Signed Rearrangement Distances Considering Repeated Genes and Intergenic Regions

12 pagesPublished: March 22, 2022

Abstract

Genome Rearrangement distance problems are used to infer the evolutionary distance between genomes. These problems look at the number of mutations called rearrangement events necessary to transform one genome into another. Two commonly studied rearrange- ments are the reversal, which inverts a sequence of genes, and the transposition, which exchanges two consecutive sequences of genes. Seminal works on that topic looked only at the sequence of genes and assumed that no gene has more than one copy. More realistic models have been assuming multiple copies of a gene or have been taking the number of nucleotides between intergenic regions into account. This work combines these two generalizations defining the Signed Intergenic Reversal Distance (SIRD) and the Signed Intergenic Reversal and Transposition Distance (SIRTD) problems. Using a relationship with a problem called Signed Minimum Common Intergenic String Partition, we show Θ(k)-approximation algorithms for the SIRD and the SIRTD problems, where k is the maximum number of copies of a gene in the genomes. Our experimental tests on simulated genomes show that the algorithms tend to find low distances despite the high theorical approximation factor.

Keyphrases: approximation algorithm, genome rearrangement, intergenic regions, reversal, transposition

In: Hisham Al-Mubaid, Tamer Aldwairi and Oliver Eulenstein (editors). Proceedings of 14th International Conference on Bioinformatics and Computational Biology, vol 83, pages 31-42.

BibTeX entry
@inproceedings{BICOB2022:Signed_Rearrangement_Distances_Considering,
  author    = {Gabriel Siqueira and Alexsandro Oliveira Alexandrino and Zanoni Dias},
  title     = {Signed Rearrangement Distances Considering Repeated Genes and Intergenic Regions},
  booktitle = {Proceedings of 14th International Conference on Bioinformatics and Computational Biology},
  editor    = {Hisham Al-Mubaid and Tamer Aldwairi and Oliver Eulenstein},
  series    = {EPiC Series in Computing},
  volume    = {83},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2398-7340},
  url       = {/publications/paper/jnqB},
  doi       = {10.29007/np96},
  pages     = {31-42},
  year      = {2022}}
Download PDFOpen PDF in browser