IRIS UniCatt

The identification of ancestry tracks is a powerful tool to assist the inference of evolutionary events in the genomes of animals and plants. However, algorithms for ancestry track detection typically require labelled reference population data. This dependency prevents the analysis of genomic data lacking prior information on genetic structure, and may produce classification bias when samples in the reference data are inadvertently admixed. We combined heuristics with K-means clustering to deploy a method that can detect ancestry tracks without the provision of lineage labels for reference population data. The resulting algorithm uses phased genotypes to infer individual ancestry proportions and local ancestry. By piling up ancestry tracks across individuals, our method also allows for mapping loci with excess or deficit ancestry from specific lineages. Using both simulated and real genomic data, we found that the proposed method was accurate in inferring genetic structure, assigning chromosomal segments to lineages and estimating individual ancestry, especially in cases where ancestry tracks resulted from recent admixture of highly divergent lineages. The method is implemented as part of the v2 release of the GHap r package (available at https://cran.r-project.org/package=GHap and https://bitbucket.org/marcomilanesi/ghap/src/master/)

Tani Utsunomiya, Y., Milanesi, M., Barbato, M., Taiti Harth Utsunomiya, A., Sölkner, J., Ajmone Marsan, P., Fernando Garcia, J., Unsupervised detection of ancestry tracks with the GHap R package, <<METHODS IN ECOLOGY AND EVOLUTION>>, 2020; 11 (11): 1448-1454. [doi:10.1111/2041-210x.13467] [http://hdl.handle.net/10807/167143]

Unsupervised detection of ancestry tracks with the GHap R package

Yuri Tani Utsunomiya;Marco Milanesi;Barbato, Mario;Adam Taiti Harth Utsunomiya;Johann Sölkner;Ajmone Marsan, Paolo;José Fernando Garcia

2020

Abstract

The identification of ancestry tracks is a powerful tool to assist the inference of evolutionary events in the genomes of animals and plants. However, algorithms for ancestry track detection typically require labelled reference population data. This dependency prevents the analysis of genomic data lacking prior information on genetic structure, and may produce classification bias when samples in the reference data are inadvertently admixed. We combined heuristics with K-means clustering to deploy a method that can detect ancestry tracks without the provision of lineage labels for reference population data. The resulting algorithm uses phased genotypes to infer individual ancestry proportions and local ancestry. By piling up ancestry tracks across individuals, our method also allows for mapping loci with excess or deficit ancestry from specific lineages. Using both simulated and real genomic data, we found that the proposed method was accurate in inferring genetic structure, assigning chromosomal segments to lineages and estimating individual ancestry, especially in cases where ancestry tracks resulted from recent admixture of highly divergent lineages. The method is implemented as part of the v2 release of the GHap r package (available at https://cran.r-project.org/package=GHap and https://bitbucket.org/marcomilanesi/ghap/src/master/)

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2020
			
	Lingua del contenuto
	
				Inglese
			
	Nome del periodico
	
				METHODS IN ECOLOGY AND EVOLUTION
			
	DOI del contributo
	
				https://dx.doi.org/10.1111/2041-210x.13467
			
	Citazione
	
				Tani Utsunomiya, Y., Milanesi, M., Barbato, M., Taiti Harth Utsunomiya, A., Sölkner, J., Ajmone Marsan, P., Fernando Garcia, J., Unsupervised detection of ancestry tracks with the GHap R package, <<METHODS IN ECOLOGY AND EVOLUTION>>, 2020;  11 (11): 1448-1454. [doi:10.1111/2041-210x.13467] [http://hdl.handle.net/10807/167143]
			
	Appare nelle tipologie:
	
				Articolo in rivista, Nota a sentenza

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/167143

Citazioni

ND

6

6

social impact