Repository logo
Research Data
Publications
Projects
Persons
Organizations
English
Français
Log In(current)
  1. Home
  2. Publications
  3. Thèse de doctorat (doctoral thesis)
  4. Frequentist estimation of evolutionary history of sequences with substitutions & indels

Frequentist estimation of evolutionary history of sequences with substitutions & indels

Author(s)
Jowkar, Gholam-Hossein  
Faculté des sciences  
Editor(s)
Croll, Daniel  
Laboratoire de génétique évolutive  
Publisher
Neuchâtel : Université de Neuchâtel
Date issued
2024
Number of pages
134
Subjects
deletion insertion joint ancestral sequence reconstruction Poisson indel process uncertainty of inferred ancestral sequences
Abstract
Estimation of the evolutionary history of molecules is mainly done by reconstructing the ancestral sequences given present-day sequences and phylogeny information. Biological sequence data is a result of evolution by mutational events such as character substitutions (or point mutations), insertions and deletions (indels). Inference of the evolutionary history of sequences with substitution and indels can be used in various biomedical applications, from tracking the origin of pandemic viruses to studies of the cause of visual impairment.
Indels are among the most important sources of genomic variation and carry sound evolutionary signals; however, well-known ancestral sequence reconstruction (ASR) methods ignore or mistreat them. ASR with indels is a big challenge from both computational and statistical viewpoints. This research proposed a novel solution to infer the ancestral sequences, while accounting for the evolutionary indel process.
First, I used an evolutionary model of substitution and indel for ASR and implemented it in the ARPIP program. ARPIP implemented a novel empirical Bayes method, which allows us to reconstruct ancestral sequences with indels under the Poisson indel process (PIP). While PIP is a continuous-time Markov chain (CTMC) model that assumes single-character indels, and has important computational advantages. I showed that ARPIP reconstructed biologically reasonable indels.
Second, it is difficult to model multiple-character (or "long") indels since most evolutionary CTMC models assume site-independence. Thus, I investigated whether a single-character indel assumption was detrimental for ASR. Analysis of real and simulated data showed that the single-character indel model could be used for ASR. ARPIP preserved gap length distribution in multiple sequence alignment, including regions with long indels. Moreover, the indel variation
in six eutherian mammalian orthologous proteins was studied to explore the evolutionary dynamics of insertions and deletions.
Finally, ASR, similar to other inferences, is affected by uncertainty. To account for it, a posterior probability profile method was devised. In collaboration with an experimental lab to study properties of ancestral proteins, the approach was applied to reflect the variation in ASR inference on neural retina leucine zipper transcription factor of selected vertebrates. Moreover, an alternative reconstruction for the ambiguous regions was introduced.
Notes
Membres du jury :
Prof. Dr. Daniel Croll, University of Neuchâtel, Switzerland (Co-chair)
Prof. Dr. Maria Anisimova, Zürich University of Applied Sciences, Switzerland (Co-chair)
Prof. Dr. Pilar Eugenia Junier, University of Neuchâtel, Switzerland (Internal expert)
Prof. Dr. Ziheng Yang, University College London, UK (External expert)
Publication type
doctoral thesis
Identifiers
https://libra.unine.ch/handle/20.500.14713/31547
DOI
10.35662/unine-thesis-3096
-
https://libra.unine.ch/handle/123456789/33339
File(s)
Loading...
Thumbnail Image
Download
Name

00003096.pdf

Type

Main Article

Size

2.62 MB

Format

Adobe PDF

Checksum

(MD5):69b9d585f5f4bccffc0053a16b07dcf6

Université de Neuchâtel logo

Service information scientifique & bibliothèques

Rue Emile-Argand 11

2000 Neuchâtel

contact.libra@unine.ch

Service informatique et télématique

Rue Emile-Argand 11

Bâtiment B, rez-de-chaussée

Powered by DSpace-CRIS

v2.0.0

© 2025 Université de Neuchâtel

Portal overviewUser guideOpen Access strategyOpen Access directive Research at UniNE Open Access ORCIDWhat's new