Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing

Cited by: 9
Authors
Yang, Shuaibing [1]
Zhao, Qianqian [2]
Tang, Lihua [3]
Chen, Zejia [3]
Wu, Zhaoting [3]
Li, Kaixin [4]
Lin, Ruoru [4]
Chen, Yang [4]
Ou, Danlin [4]
Zhou, Li [3]
Xu, Jianzhen [2]
Qin, Qingsong [1,5,6]
Affiliations
[1] Shantou Univ, Lab Human Virol & Oncol, Med Coll, Shantou, Peoples R China
[2] Shantou Univ, Dept Bioinformat, Computat Syst Biol Lab, Med Coll, Shantou, Peoples R China
[3] Shantou Univ, Dept Gynecol Oncol, Canc Hosp, Med Coll, Shantou, Peoples R China
[4] Shantou Univ, Undergrad Program Innovat & Entrepreneurship, Med Coll, Shantou, Peoples R China
[5] Guangdong Prov Key Lab Infect Dis & Mol Immunopat, Shantou, Peoples R China
[6] Guangdong Prov Key Lab Diag & Treatment Breast Ca, Shantou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
HPV; nanopore sequencing; cervical cancer; integration; episomal genome; CERVICAL INTRAEPITHELIAL NEOPLASIA; VIRAL LOAD; INTEGRATION SITES; PHYSICAL STATUS; HPV INTEGRATION; VIRUS GENOMES; CANCER; DNA; EXPRESSION; CARCINOMA;
DOI
10.3389/fgene.2021.798608
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
Human papillomavirus (HPV) is a causal agent of most cervical cancers. The physical status of the HPV genome in these cancers can be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, has been used to detect integrated HPV genomes in cervical cancer, it faces challenges in analyzing long repeats and translocated sequences. In contrast, Oxford Nanopore sequencing technology can generate ultra-long reads, which could make it a very useful tool for determining the HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole-genome sequencing of a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of the HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7,904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3,857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular-viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool for identifying HPV sequences and would shed light on the physical status of the HPV genome in its associated cancers.
Pages: 11