TY - JOUR
T1 - A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing
AU - Mizuguchi, Takeshi
AU - Suzuki, Takeshi
AU - Abe, Chihiro
AU - Umemura, Ayako
AU - Tokunaga, Katsushi
AU - Kawai, Yosuke
AU - Nakamura, Minoru
AU - Nagasaki, Masao
AU - Kinoshita, Kengo
AU - Okamura, Yasunobu
AU - Miyatake, Satoko
AU - Miyake, Noriko
AU - Matsumoto, Naomichi
N1 - Funding Information:
CLN6 deletion was supported by three PacBio reads without any read supporting the reference allele (Fig. 1d). The findings showed that the deletion spanned 12,433 bp, including complete deletion of exon 1 of CLN6 (chr15: 68,518,038–68,530,471). We initially evaluated the relative DNA copy number using real-time quantitative PCR. By comparing the amplification signal of exon 1 (deleted) with that of exon 2 (not deleted), we confirmed homozygous and heterozygous deletion in the affected individuals (II-2 and II-3) and their parents (I-1 and I-2), respectively (Supplementary Fig. S4). To further validate the size and position of the deletion, we performed Southern blot analysis. A 23.9-kb EcoRV fragment of the wild type and an 11.5-kb deletion variant (23.9–12.4 kb) were detected, indicating the reliable size estimate obtained by PacBio reads (Fig. 1a). This deletion variant cosegregated with disease based on autosomal recessive inheritance. Alternative ATG codon usage is possible, but RT-PCR analysis revealed a lack of CLN6 expression, confirming that the variant was pathogenic (Supplementary Fig. S5).
Publisher Copyright:
© 2019, The Author(s) under exclusive licence to The Japan Society of Human Genetics.
PY - 2019/5/1
Y1 - 2019/5/1
N2 - We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.
AB - We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.
UR - http://www.scopus.com/inward/record.url?scp=85061484254&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061484254&partnerID=8YFLogxK
U2 - 10.1038/s10038-019-0569-5
DO - 10.1038/s10038-019-0569-5
M3 - Article
C2 - 30760880
AN - SCOPUS:85061484254
SN - 1434-5161
VL - 64
SP - 359
EP - 368
JO - Journal of Human Genetics
JF - Journal of Human Genetics
IS - 5
ER -