Tool and Algorithm for the Determination of Aptamers in Nanopore Sequencing Data: AptaLong

Authors

DOI:

https://doi.org/10.14529/jsfi240306

Keywords:

nanopore sequencing, aptamer, SELEX, primer, sequence alignment

Abstract

Nanopore sequencing is a third generation sequencing technology that allows direct, real-time sequencing of individual DNA or RNA molecules. It utilizes a nanopore – an extremely small pore – in a membrane to pass a single strand DNA or RNA. As the sequence passes through the nanopore, changes in electrical current are detected and used to determine the nucleotide sequence. Nanopore sequencing has several advantages. It offers long read lengths, allowing for the sequencing of difficult regions of the genome, such as repetitive regions. It also enables real-time sequencing, providing immediate data generation without the need for extensive library preparation. Many bioinformatics pipelines and tools have been developed specifically for nanopore sequencing data analysis, addressing the unique characteristics and challenges of this technology, while dealing with non-standard long reads, derived from the ligation process of shorter oligonucleotides, might be challenging. In this research we present a new algorithm that extracts an aptamer sequence from the results of nanopore sequencing of several SELEX experiments with single-stranded DNA. The algorithm is based on statistical methods, based on known primer sequences and length of searching aptamer. We used step-by-step displacement of the reference sequence with positional alignment and calculated the positional frequencies of each nucleotide. As a result, the nucleotide frequencies obtained at each step are averaged, and thus, we find the sequence that is more likely to represent the aptamer.

References

Berkovich, A., Pyshkina, O., Zorina, A., et al.: Direct determination of the structure of single biopolymer molecules using nanopore sequencing. Biochemistry (Moscow) 89, S234–S248 (2024). https://doi.org/10.1134/S000629792414013X

Khrenova, M., Nikiforova, L.A., Grabovenko, F., et al.: Highly specific aptamer for SARSCoV-2 Spike protein from the authentic strain. Org. Biomol. Chem. 22, 5936–5947 (2024). https://doi.org/10.1039/D4OB00645C

Kohlberger, M., Gadermaier, G.: SELEX: Critical factors and optimization strategies for successful aptamer selection. Biotechnol. Appl. Biochem. 69(5), 1771–1792 (2022). https://doi.org/10.1002/bab.2244

Kumar Kulabhusan, P., Hussain, B., Y¨uce, M.: Current perspectives on aptamers as diagnostic tools and therapeutic agents. Pharmaceutics 12(7), 646 (2020). https://doi.org/10.3390/pharmaceutics12070646

Nimjee, S.M., White, R.R., Becker, R.C., et al.: Aptamers as therapeutics. Annu. Rev. Pharmacol. Toxicol. 57, 61–79 (2017). https://doi.org/https://doi.org/10.1146/annurev-pharmtox-010716-104558

Rhoads, A., Au, K.F.: PacBio Sequencing and its Applications. Genomics Proteomics Bioinformatics 13(5), 278–289 (2015). https://doi.org/10.1016/j.gpb.2015.08.002

Subach, M.F., Khrenova, M.G., Zvereva, M.I.: Modern methods of aptamer chemical modification and principles of aptamer library selection. Moscow Univ. Chem. Bull 79, 79–85 (2024). https://doi.org/10.3103/S002713142470010X

Voevodin, V.V., Antonov, A.S., Nikitenko, D.A., Shvets, P.A., Sobolev, S.I., Sidorov, I.Y., Stefanov, K.S., Voevodin, V.V., Zhumatiy, S.A.: Supercomputer Lomonosov-2: Large Scale, Deep Monitoring and Fine Analytics for the User Community. Supercomput. Front. Innov. 6(2), 4–11 (2019). https://doi.org/10.14529/jsfi190201

Zhou, M.Y., Gomez-Sanchez, C.E.: Universal ta cloning. Curr. Issues Mol. Biol. 2(1), 1–7 (2000). https://doi.org/10.21775/cimb.002.001

Downloads

Published

2024-10-25

How to Cite

Grigoryeva, M. A., Khrenova, M. G., Subach, M. F., Voevodin, V. V., & Zvereva, M. I. (2024). Tool and Algorithm for the Determination of Aptamers in Nanopore Sequencing Data: AptaLong. Supercomputing Frontiers and Innovations, 11(3), 93–106. https://doi.org/10.14529/jsfi240306

Most read articles by the same author(s)