TY - JOUR
T1 - Construction and integration of three de novo Japanese human genome assemblies toward a population-specific reference
AU - Takayama, Jun
AU - Tadaka, Shu
AU - Yano, Kenji
AU - Katsuoka, Fumiki
AU - Gocho, Chinatsu
AU - Funayama, Takamitsu
AU - Makino, Satoshi
AU - Okamura, Yasunobu
AU - Kikuchi, Atsuo
AU - Sugimoto, Sachiyo
AU - Kawashima, Junko
AU - Otsuki, Akihito
AU - Sakurai-Yageta, Mika
AU - Yasuda, Jun
AU - Kure, Shigeo
AU - Kinoshita, Kengo
AU - Yamamoto, Masayuki
AU - Tamiya, Gen
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12/1
Y1 - 2021/12/1
N2 - The complete human genome sequence is used as a reference for next-generation sequencing analyses. However, some ethnic ancestries are under-represented in the reference genome (e.g., GRCh37) due to its bias toward European and African ancestries. Here, we perform de novo assembly of three Japanese male genomes using > 100× Pacific Biosciences long reads and Bionano Genomics optical maps per sample. We integrate the genomes using the major allele for consensus and anchor the scaffolds using genetic and radiation hybrid maps to reconstruct each chromosome. The resulting genome sequence, JG1, is contiguous, accurate, and carries the Japanese major allele at most loci. We adopt JG1 as the reference for confirmatory exome re-analyses of seven rare-disease Japanese families and find that re-analysis using JG1 reduces total candidate variant calls versus GRCh37 while retaining disease-causing variants. These results suggest that integrating multiple genomes from a single population can aid genome analyses of that population.
AB - The complete human genome sequence is used as a reference for next-generation sequencing analyses. However, some ethnic ancestries are under-represented in the reference genome (e.g., GRCh37) due to its bias toward European and African ancestries. Here, we perform de novo assembly of three Japanese male genomes using > 100× Pacific Biosciences long reads and Bionano Genomics optical maps per sample. We integrate the genomes using the major allele for consensus and anchor the scaffolds using genetic and radiation hybrid maps to reconstruct each chromosome. The resulting genome sequence, JG1, is contiguous, accurate, and carries the Japanese major allele at most loci. We adopt JG1 as the reference for confirmatory exome re-analyses of seven rare-disease Japanese families and find that re-analysis using JG1 reduces total candidate variant calls versus GRCh37 while retaining disease-causing variants. These results suggest that integrating multiple genomes from a single population can aid genome analyses of that population.
UR - https://www.scopus.com/pages/publications/85099248106
UR - https://www.scopus.com/pages/publications/85099248106#tab=citedBy
U2 - 10.1038/s41467-020-20146-8
DO - 10.1038/s41467-020-20146-8
M3 - Article
C2 - 33431880
AN - SCOPUS:85099248106
SN - 2041-1723
VL - 12
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 226
ER -