Background: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies. Results: We identified 2,906 single nucleotide polymorphism (SNP) markers presenting unexpected LD decay behavior in 626 putative misassembled contigs, which comprised less than 1 % of the whole genome. Although this represents a small fraction of the reference sequence, these poorly assembled segments can lead to severe implications to local genome context. For instance, we showed that one of the misassembled regions mapped to the POLL locus, which affected the annotation of positional candidate genes in a GWAS case study for polledness in Nellore (Bos indicus beef cattle). Additionally, we found that poorly performing markers in imputation mapped to putative misassembled regions, and that correction of marker positions based on LD was capable to recover imputation accuracy. Conclusions: This heuristic approach can be useful to cross validate reference assemblies and to filter out markers located at low confidence genomic regions before conducting downstream analyses.

Utsunomiya, A. T. H., Santos, D. J. A., Boison, S. A., Utsunomiya, Y. T., Milanesi, M., Bickhart, D. M., Ajmone Marsan, P., Sölkner, J., Garcia, J. F., Da Fonseca, R., Da Silva, M. V. G. B., Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan, <<BMC GENOMICS>>, 2016; 17 (1): N/A-N/A. [doi:10.1186/s12864-016-3049-8] [http://hdl.handle.net/10807/92028]

Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan

Milanesi, Marco;Ajmone Marsan, Paolo;
2016

Abstract

Background: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies. Results: We identified 2,906 single nucleotide polymorphism (SNP) markers presenting unexpected LD decay behavior in 626 putative misassembled contigs, which comprised less than 1 % of the whole genome. Although this represents a small fraction of the reference sequence, these poorly assembled segments can lead to severe implications to local genome context. For instance, we showed that one of the misassembled regions mapped to the POLL locus, which affected the annotation of positional candidate genes in a GWAS case study for polledness in Nellore (Bos indicus beef cattle). Additionally, we found that poorly performing markers in imputation mapped to putative misassembled regions, and that correction of marker positions based on LD was capable to recover imputation accuracy. Conclusions: This heuristic approach can be useful to cross validate reference assemblies and to filter out markers located at low confidence genomic regions before conducting downstream analyses.
2016
Inglese
Utsunomiya, A. T. H., Santos, D. J. A., Boison, S. A., Utsunomiya, Y. T., Milanesi, M., Bickhart, D. M., Ajmone Marsan, P., Sölkner, J., Garcia, J. F., Da Fonseca, R., Da Silva, M. V. G. B., Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan, <<BMC GENOMICS>>, 2016; 17 (1): N/A-N/A. [doi:10.1186/s12864-016-3049-8] [http://hdl.handle.net/10807/92028]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/92028
Citazioni
  • ???jsp.display-item.citation.pmc??? 12
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 21
social impact