Introduction to Bioinformatics and Basic Concepts
- Ege Altun
- Apr 28
- 14 min read
Bioinformatics is a multidisciplinary branch of science that deals with the analysis, storage and processing of biological data (especially genome and proteome data) using computer technology and mathematical approaches. Bioinformatics is at the intersection of branches of biology such as genetics, proteomics and molecular biology and technical disciplines such as computer science and statistics (1). Bioinformatics is used to better understand biological systems and obtain meaningful results from complex data. For example, it interprets DNA and RNA sequences obtained from genome sequencing projects, studies cellular mechanisms by modeling protein structures and functions, and performs data analysis to understand disease mechanisms or discover biomarkers. Bioinformatics operates in various subfields. Sequence analysis covers the alignment and comparison of DNA, RNA and protein sequences, while structural bioinformatics focuses on the prediction and analysis of three-dimensional structures of proteins and nucleic acids. Genomics and epigenomics analyze the organization of genetic material and the functions of epigenetic changes, while transcriptomics examines gene expression profiles using technologies such as RNA-Seq (2). These subfields diversify the approach of bioinformatics to biological data and provide solutions to many scientific questions. Bioinformatics has a wide range of applications. In medical genetics, the analysis of genetic variations associated with diseases and the development of personalized treatment methods are provided. In cancer research, it is used to examine tumor genomes and discover biomarkers. In agricultural biotechnology, disease-resistant and productive plant species are developed in light of genomic data. In addition, environmental microbial communities are analyzed through metagenomic studies and potential drug candidates are identified by modeling protein-ligand interactions in drug discovery processes (3).

History of Bioinformatics
The first algorithm that could be used to compare protein or DNA sequences was published by Needleman and Wunsch in 1970 (4). In 1977, the genome of the bacteriophage ΦX174 was first completely sequenced (5). The term bioinformatics was first coined in 1978 and defined as "Bioinformatics for the Study of Computational Processes in Biotic Systems" (6). In 1981, an algorithm for local sequence alignment was published by Smith and Waterman (7). In 1986, the polymerase chain reaction (PCR) technique was developed (8). This technique made it possible to amplify a specific region of DNA millions of times, revolutionizing genetic research. In the same year, the term genomics was coined by Tom Roderick to describe the large-scale study of genes and genetic material (9). The Human Genome Project was launched in 1990 and greatly encouraged the development and use of bioinformatics tools (10). In 1995, the complete genome sequence of the bacterium Haemophilus influenzae was published, the first time that a living organism’s genome was completely sequenced (11). By 2000, the Human Genome Project was largely complete, and bioinformatics had become a discipline that forms the basis of fields such as genomics, proteomics, and systems biology (10). In particular, databases (e.g., GenBank) and analysis tools (e.g., BLAST) have enabled scientists to easily analyze large biological data sets. In the following years, bioinformatics has been further integrated with technologies such as artificial intelligence and machine learning. Bioinformatics has also begun to play a key role in the field of genetic engineering, especially with the development of gene editing technologies such as CRISPR-Cas9 (12). In addition, personalized medicine and big data analytics continue to expand the application areas of bioinformatics.
Some Basic Terms Used in Bioinformatics
Base Pair (bp): It refers to the pairing of two mutual nucleotides found in the structure of the DNA molecule. It is usually used to express the length of DNA sequences; for example, 200bp is a DNA sequence that is 200 base pairs long (13). Kilobase (kb) refers to 1000 base pairs. It is used to indicate the length of large DNA or RNA segments in genomic and genetic studies (14). Megabase (Mb) refers to 1 million base pairs. It is used to describe genetic lengths at the chromosome level (15). Gigabase (Gb) refers to 1 billion base pairs. It is used to describe entire genome lengths (16).
Codon: In DNA or RNA, units consisting of three nucleotides that code for a specific amino acid. For example, the AUG codon codes for the amino acid methionine and initiates protein synthesis (17).
Anticodon: A triple nucleotide sequence found in the structure of tRNA that matches the codon of mRNA. It plays an important role in transporting amino acids to the ribosome during protein synthesis (18).
ORF (Open Reading Frame): A DNA or RNA sequence that starts at a start codon (e.g., AUG) and ends at a stop codon (e.g., UAA, UAG, UGA) (19).
Promoter: A region of DNA that initiates gene expression. Promoter regions allow RNA polymerase to bind and control the start of transcription (20).
Introns: Sections of a gene that do not code for protein but are cut out of RNA (21).
Exons: Sections of a gene that code for protein or are converted into a functional RNA molecule. After transcription, exons are combined to form mature RNA (22).
Genome: Refers to the entire genetic material of an organism. The genome includes both coding (contributing to protein synthesis) and non-coding DNA regions. The branch of science that studies the structure, function, and evolution of genomes is called genomics (25).
Proteome: The total of all proteins in an organism. The proteome includes all proteins expressed at a given time or under a given condition. The branch of science that studies the proteome is called proteomics (26).
Transcriptome: The total of all RNA molecules expressed in a cell at a given time. The transcriptome is critical for understanding gene expression levels and gene regulation. The branch of science that analyzes transcriptomes is called transcriptomics (27).
Metabolomics: It deals with the analysis of all small molecules (metabolites) found in a cell, tissue, or organism. Metabolomics is used to understand the functioning of metabolic pathways and the organism's response to environmental changes (28).
Metagenomics: Analyzes the genetic material of all microorganisms (such as bacteria, viruses, and archaea) in an environmental sample. Metagenomics is used to study microbial diversity and the functional properties of ecosystems (29).
Viromics: Analyzes the genetic material of all viruses found in an organism or the environment. Viromics is used to understand viral diversity and discover new virus species (30).
Indel: The addition (insertion) or deletion (deletion) of one or more bases in a DNA sequence (23).
Gene: A unit of genetic information located in an organism's DNA that is responsible for the production of a protein or RNA molecule. Genes carry the codes necessary for the synthesis of proteins and form the basic building blocks of biological functions (24).
Epigenomics: Examines how gene expression is regulated by analyzing DNA methylation, histone modifications, and other epigenetic changes (31).
Phylogenetics: The branch of science that studies the evolutionary relationships between species or genes. Phylogenetic analyses use phylogenetic trees to understand the evolutionary history of organisms (32).

Sequence Alignment: The process of comparing DNA, RNA, or protein sequences to determine similarities (34).
Global Alignment: Compares the entire sequence (e.g., Needleman-Wunsch algorithm) (35).
Local Alignment: Focuses only on similar regions (e.g., Smith-Waterman algorithm) (36).
Homology: Describes the similarities between two biological sequences (DNA, RNA, or protein) that result from a common ancestor. Homologous sequences usually have similar functions (37).
SNP (Single Nucleotide Polymorphism): Genetic variations resulting from a change in a single nucleotide in the DNA sequence. SNPs are important sources of genetic diversity and are used to determine disease risks (25).
NGS (Next Generation Sequencing): Next generation sequencing technologies allow DNA or RNA sequences to be read quickly and with high accuracy. Examples of NGS technologies include Illumina, Oxford Nanopore, Ion Torrent, etc. (38).
Coverage Depth: In NGS projects, it refers to the number of times a genome region is sequenced. High coverage depth provides more accurate sequencing. For example, 30x coverage means that each region of the genome is sequenced an average of 30 times (39).
Database: These are systems where genetic and biological information is stored in bioinformatics. GenBank can be given as an example (40).
Some Software and Tools Used in Bioinformatics
Alignment and Genomic Analysis Tools
BLAST: It is one of the basic bioinformatics tools used to perform similarity analysis by comparing DNA, RNA or protein sequences (41).
Clustal Omega: It is used for multiple sequence alignment (42).
BWA (Burrows-Wheeler Aligner): It is a fast tool used for aligning short genomic sequences with long reference genomes (43).
Analysis of Transcriptomic Data
tRNAscan-SE: Used to find and analyze tRNA genes (44).
HISAT2: A rapid tool for aligning RNA sequences (RNA-Seq) to reference genomes (45).
Cufflinks: Used to estimate gene expression levels from RNA-Seq data (46).
StringTie: Provides transcript-level gene expression analysis and combined transcript annotation (47).
Phylogenetic and Evolutionary Analysis
MEGA (Molecular Evolutionary Genetics Analysis): Used in phylogenetic analyses and molecular evolution studies (48).
PhyML: Creates phylogenetic trees using the maximum likelihood method (49).
RAxML: Used in phylogenetic analyses of large data sets (50).
Metagenomic Analysis
QIIME: A toolkit used to analyze microbial samples (51).
Kraken: A software used for rapid and accurate metagenomic sequencing classification (52).
Metabat: Used to predict individual genomes from metagenomic sequences (53).
Some Databases Used in Bioinformatics
NCBI (National Center for Biotechnology Information): The official source of tools and databases such as BLAST, GenBank (54).
ENSEMBL: ENSEMBL focuses specifically on vertebrate genomes and provides information such as gene predictions, variations, and regulatory elements. It also provides tools for comparative analysis of genomes (55).
UCSC Genome Browser: It presents genome maps in an interactive manner and allows users to examine data such as genes, transcripts, variations, and regulatory regions. It is one of the most commonly used browsers for the human genome (56).
KEGG (Kyoto Encyclopedia of Genes and Genomes): KEGG focuses specifically on biochemical pathways (metabolic pathways, signal transduction pathways, etc.) and focuses on the functional analysis of genes, proteins, and metabolites (57).
REFERENCES
1. Selzer, P. M., Marhöfer, R. J., Koch, O. (2018). Applied Bioinformatics: An Introduction. Germany: Springer International Publishing. https://doi.org/10.1007/978-3-319-68301-0
2. BIOINFORMATICS, FIFTH EDITION: METHODS AND APPLICATIONS - GENOMICS, PROTEOMICS AND DRUG DISCOVERY. (2022). (n.p.): PHI Learning Pvt. Ltd.
3. Gupta, O. P., & Rani, S. (2010). Bioinformatics applications and tools: An overview. CiiT-International Journal of Biometrics and Bioinformatics, 3(3), 107-110.
4. Needleman, S. B., & Wunsch, C. D. (1970). A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of molecular biology, 48(3), 443-453. https://doi.org/10.1016/0022-2836(70)90057-4
5. Sanger, F., Coulson, A. R., Friedmann, T., Air, G. M., Barrell, B. G., Brown, N. L., ... & Smith, M. (1978). The nucleotide sequence of bacteriophage φX174. Journal of molecular biology, 125(2), 225-246. https://doi.org/10.1016/0022-2836(78)90346-7
6. Hogeweg, P. (2011). The roots of bioinformatics in theoretical biology. PLoS computational biology, 7(3), e1002021. https://doi.org/10.1371/journal.pcbi.1002021
7. Smith, T. F., & Waterman, M. S. (1981). Identification of common molecular subsequences. Journal of molecular biology, 147(1), 195-197. https://doi.org/10.1016/0022-2836(81)90087-5
8. Mullis, K., Faloona, F., Scharf, S., Saiki, R., Horn, G., & Erlich, H. (1986, January). Specific enzymatic amplification of DNA in vitro: the polymerase chain reaction. In Cold Spring Harbor symposia on quantitative biology (Vol. 51, pp. 263-273). Cold Spring Harbor Laboratory Press. https://doi.org/10.1101/sqb.1986.051.01.032
9. Yadav, S. P. (2007). The wholeness in suffix-omics,-omes, and the word om. Journal of biomolecular techniques: JBT, 18(5), 277.
10. Human Genome Project Timeline. (2025). Retrieved 3 January 2025, from https://www.genome.gov/human-genome-project/timeline
11. Fleischmann, R. D., Adams, M. D., White, O., Clayton, R. A., Kirkness, E. F., Kerlavage, A. R., Bult, C. J., Tomb, J. F., Dougherty, B. A., & Merrick, J. M. (1995). Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science (New York, N.Y.), 269(5223), 496–512. https://doi.org/10.1126/science.7542800
12. Jiang, F., & Doudna, J. A. (2017). CRISPR–Cas9 structures and mechanisms. Annual review of biophysics, 46(1), 505-529. https://doi.org/10.1146/annurev-biophys-062215-010822
13. Varani, G., & McClain, W. H. (2000). The G· U wobble base pair. A fundamental building block of RNA structure crucial to RNA function in diverse biological systems. EMBO reports, 1(1), 18-23. https://doi.org/10.1093/embo-reports/kvd001
14. KILOBASE (KB). (2025). Retrieved 3 January 2025, from https://www.genome.gov/genetics-glossary/Kilobase-kb
15. MEGABASE (MB). (2025). Retrieved 3 January 2025, from https://www.genome.gov/genetics-glossary/Megabase-Mb.
16. GIGABASE (GB). (2025). Retrieved 3 January 2025, from https://www.genome.gov/genetics-glossary/Gigabase-Gb.
17. codon. (2025). Retrieved 3 January 2025, from https://www.cancer.gov/publications/dictionaries/genetics-dictionary/def/codon.
18. Kaufmann, G. (2000). Anticodon nucleases. Trends in biochemical sciences, 25(2), 70-74. https://doi.org/10.1016/s0968-0004(99)01525-x
19. Sieber, P., Platzer, M., & Schuster, S. (2018). The definition of open reading frame revisited. Trends in Genetics, 34(3), 167-170. https://doi.org/10.1016/j.tig.2017.12.009
20. Cartharius, K., Frech, K., Grote, K., Klocke, B., Haltmeier, M., Klingenhoff, A., ... & Werner, T. (2005). MatInspector and beyond: promoter analysis based on transcription factor binding sites. Bioinformatics, 21(13), 2933-2942. https://doi.org/10.1093/bioinformatics/bti473
21. Chorev, M., & Carmel, L. (2012). The function of introns. Frontiers in genetics, 3, 55. https://doi.org/10.3389/fgene.2012.00055
22. Keren, H., Lev-Maor, G., & Ast, G. (2010). Alternative splicing and evolution: diversification, exon definition and function. Nature Reviews Genetics, 11(5), 345-355. https://doi.org/10.1038/nrg2776
23. Redelings, B. D., Holmes, I., Lunter, G., Pupko, T., & Anisimova, M. (2024). Insertions and deletions: Computational methods, evolutionary dynamics, and biological applications. Molecular biology and evolution, 41(9), msae177. https://doi.org/10.1093/molbev/msae177
24. GENE. (2025). Retrieved 3 January 2025, from https://www.genome.gov/genetics-glossary/Gene.
25. Roth, S. C. (2019). What is genomic medicine?. Journal of the Medical Library Association: JMLA, 107(3), 442. https://doi.org/10.5195/jmla.2019.604
26. Anderson, J. D., Johansson, H. J., Graham, C. S., Vesterlund, M., Pham, M. T., Bramlett, C. S., Montgomery, E. N., Mellema, M. S., Bardini, R. L., Contreras, Z., Hoon, M., Bauer, G., Fink, K. D., Fury, B., Hendrix, K. J., Chedin, F., El-Andaloussi, S., Hwang, B., Mulligan, M. S., Lehtiö, J., … Nolta, J. A. (2016). Comprehensive Proteomic Analysis of Mesenchymal Stem Cell Exosomes Reveals Modulation of Angiogenesis via Nuclear Factor-KappaB Signaling. Stem cells (Dayton, Ohio), 34(3), 601–613. https://doi.org/10.1002/stem.2298
27. Pertea, M. (2012). The human transcriptome: an unfinished story. Genes, 3(3), 344-360. https://doi.org/10.3390/genes3030344
28. Clish C. B. (2015). Metabolomics: an emerging but powerful tool for precision medicine. Cold Spring Harbor molecular case studies, 1(1), a000588. https://doi.org/10.1101/mcs.a000588
29. National Research Council (US) Committee on Metagenomics: Challenges and Functional Applications. The New Science of Metagenomics: Revealing the Secrets of Our Microbial Planet. Washington (DC): National Academies Press (US); 2007. 1, Why Metagenomics? Available from: https://www.ncbi.nlm.nih.gov/books/NBK54011/
30. Ramamurthy, M., Sankar, S., Kannangai, R., Nandagopal, B., & Sridharan, G. (2017). Application of viromics: a new approach to the understanding of viral infections in humans. Virusdisease, 28(4), 349–359. https://doi.org/10.1007/s13337-017-0415-3
31. Choi, M. Y., Fritzler, M. J., & Mahler, M. (2021). Development of multi-omics approach in autoimmune diseases. In Precision Medicine and Artificial Intelligence (pp. 189-201). Academic Press. https://doi.org/10.1016/B978-0-12-820239-5.00004-8
32. Semple, C., & Steel, M. (2003). Phylogenetics (Vol. 24). Oxford University Press on Demand.
33. Nair P. (2012). Woese and Fox: Life, rearranged. Proceedings of the National Academy of Sciences of the United States of America, 109(4), 1019–1021. https://doi.org/10.1073/pnas.1120749109
34. Mount, D. W. (2004). Bioinformatics: Sequence and Genome Analysis. Tayland: Cold Spring Harbor Laboratory Press.
35. Huang X. (1994). On global sequence alignment. Computer applications in the biosciences : CABIOS, 10(3), 227–235. https://doi.org/10.1093/bioinformatics/10.3.227
36. Altschul, S. F., Gish, W., Miller, W., Myers, E. W., & Lipman, D. J. (1990). Basic local alignment search tool. Journal of molecular biology, 215(3), 403–410. https://doi.org/10.1016/S0022-2836(05)80360-2
37. Brigandt, Ingo, "Essay: Homology". Embryo Project Encyclopedia ( 2011-11-23 ). ISSN: 1940-5030 https://hdl.handle.net/10776/1754
38. Hu, T., Chitnis, N., Monos, D., & Dinh, A. (2021). Next-generation sequencing technologies: An overview. Human Immunology, 82(11), 801-811. https://doi.org/10.1016/j.humimm.2021.02.012
39. Sims, D., Sudbery, I., Ilott, N. E., Heger, A., & Ponting, C. P. (2014). Sequencing depth and coverage: key considerations in genomic analyses. Nature Reviews Genetics, 15(2), 121-132. https://doi.org/10.1038/nrg3642
40. Sayers, E. W., Cavanaugh, M., Clark, K., Ostell, J., Pruitt, K. D., & Karsch-Mizrachi, I. (2020). GenBank. Nucleic acids research, 48(D1), D84–D86. https://doi.org/10.1093/nar/gkz956
41. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J. (1990) “Basic local alignment search tool.” J. Mol. Biol. 215:403-410. https://doi.org/10.1016/S0022-2836(05)80360-2
42. Madeira, F., Madhusoodanan, N., Lee, J., Eusebi, A., Niewielska, A., Tivey, A. R., ... & Butcher, S. (2024). The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Research, gkae241. https://doi.org/10.1093/nar/gkae241
43. Li H. and Durbin R. (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics, 25:1754-60. https://doi.org/10.1093/bioinformatics/btp324
44. Lowe, T. M., & Chan, P. P. (2016). tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic acids research, 44(W1), W54–W57. https://doi.org/10.1093/nar/gkw413
45. Kim, D., Langmead, B., & Salzberg, S. L. (2015). HISAT: a fast spliced aligner with low memory requirements. Nature methods, 12(4), 357–360. https://doi.org/10.1038/nmeth.3317
46. Trapnell, C., Williams, B. A., Pertea, G., Mortazavi, A., Kwan, G., van Baren, M. J., Salzberg, S. L., Wold, B. J., & Pachter, L. (2010). Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature biotechnology, 28(5), 511–515. https://doi.org/10.1038/nbt.1621
47. Pertea, M., Pertea, G. M., Antonescu, C. M., Chang, T. C., Mendell, J. T., & Salzberg, S. L. (2015). StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature biotechnology, 33(3), 290-295. https://doi.org/10.1038/nbt.3122
48. Tamura, K., Stecher, G., & Kumar, S. (2021). MEGA11: molecular evolutionary genetics analysis version 11. Molecular biology and evolution, 38(7), 3022-3027. https://doi.org/10.1093/molbev/msab120
49. Guindon, S., Dufayard, J. F., Lefort, V., Anisimova, M., Hordijk, W., & Gascuel, O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Systematic biology, 59(3), 307-321. https://doi.org/10.1093/sysbio/syq010
50. Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics, 30(9), 1312-1313. https://doi.org/10.1093/bioinformatics/btu033
51. Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F, Bai Y, Bisanz JE, Bittinger K, Brejnrod A, Brislawn CJ, Brown CT, Callahan BJ, Caraballo-Rodríguez AM, Chase J, Cope EK, Da Silva R, Diener C, Dorrestein PC, Douglas GM, Durall DM, Duvallet C, Edwardson CF, Ernst M, Estaki M, Fouquier J, Gauglitz JM, Gibbons SM, Gibson DL, Gonzalez A, Gorlick K, Guo J, Hillmann B, Holmes S, Holste H, Huttenhower C, Huttley GA, Janssen S, Jarmusch AK, Jiang L, Kaehler BD, Kang KB, Keefe CR, Keim P, Kelley ST, Knights D, Koester I, Kosciolek T, Kreps J, Langille MGI, Lee J, Ley R, Liu YX, Loftfield E, Lozupone C, Maher M, Marotz C, Martin BD, McDonald D, McIver LJ, Melnik AV, Metcalf JL, Morgan SC, Morton JT, Naimey AT, Navas-Molina JA, Nothias LF, Orchanian SB, Pearson T, Peoples SL, Petras D, Preuss ML, Pruesse E, Rasmussen LB, Rivers A, Robeson MS, Rosenthal P, Segata N, Shaffer M, Shiffer A, Sinha R, Song SJ, Spear JR, Swafford AD, Thompson LR, Torres PJ, Trinh P, Tripathi A, Turnbaugh PJ, Ul-Hasan S, van der Hooft JJJ, Vargas F, Vázquez-Baeza Y, Vogtmann E, von Hippel M, Walters W, Wan Y, Wang M, Warren J, Weber KC, Williamson CHD, Willis AD, Xu ZZ, Zaneveld JR, Zhang Y, Zhu Q, Knight R, and Caporaso JG. 2019. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nature Biotechnology 37: 852–857. https://doi.org/10.1038/s41587-019-0209-9.
52. Wood, D. E., & Salzberg, S. L. (2014). Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome biology, 15, 1-12. https://doi.org/10.1186/gb-2014-15-3-r46
53. Kang, D. D., Froula, J., Egan, R., & Wang, Z. (2015). MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ, 3, e1165. https://doi.org/10.7717/peerj.1165
54. Sayers, E. W., Bolton, E. E., Brister, J. R., Canese, K., Chan, J., Comeau, D. C., Connor, R., Funk, K., Kelly, C., Kim, S., Madej, T., Marchler-Bauer, A., Lanczycki, C., Lathrop, S., Lu, Z., Thibaud-Nissen, F., Murphy, T., Phan, L., Skripchenko, Y., Tse, T., … Sherry, S. T. (2022). Database resources of the national center for biotechnology information. Nucleic acids research, 50(D1), D20–D26. https://doi.org/10.1093/nar/gkab1112
55. Peter W Harrison, M Ridwan Amode, Olanrewaju Austine-Orimoloye, Andrey G Azov, Matthieu Barba, If Barnes, Arne Becker, Ruth Bennett, Andrew Berry, Jyothish Bhai, Simarpreet Kaur Bhurji, Sanjay Boddu, Paulo R Branco Lins, Lucy Brooks, Shashank Budhanuru Ramaraju, Lahcen I Campbell, Manuel Carbajo Martinez, Mehrnaz Charkhchi, Kapeel Chougule, Alexander Cockburn, Claire Davidson, Nishadi H De Silva, Kamalkumar Dodiya, Sarah Donaldson, Bilal El Houdaigui, Tamara El Naboulsi, Reham Fatima, Carlos Garcia Giron, Thiago Genez, Dionysios Grigoriadis, Gurpreet S Ghattaoraya, Jose Gonzalez Martinez, Tatiana A Gurbich, Matthew Hardy, Zoe Hollis, Thibaut Hourlier, Toby Hunt, Mike Kay, Vinay Kaykala, Tuan Le, Diana Lemos, Disha Lodha, Diego Marques-Coelho, Gareth Maslen, Gabriela Alejandra Merino, Louisse Paola Mirabueno, Aleena Mushtaq, Syed Nakib Hossain, Denye N Ogeh, Manoj Pandian Sakthivel, Anne Parker, Malcolm Perry, Ivana Piližota, Daniel Poppleton, Irina Prosovetskaia, Shriya Raj, José G Pérez-Silva, Ahamed Imran Abdul Salam, Shradha Saraf, Nuno Saraiva-Agostinho, Dan Sheppard, Swati Sinha, Botond Sipos, Vasily Sitnik, William Stark, Emily Steed, Marie-Marthe Suner, Likhitha Surapaneni, Kyösti Sutinen, Francesca Floriana Tricomi, David Urbina-Gómez, Andres Veidenberg, Thomas A Walsh, Doreen Ware, Elizabeth Wass, Natalie L Willhoft, Jamie Allen, Jorge Alvarez-Jarreta, Marc Chakiachvili, Bethany Flint, Stefano Giorgetti, Leanne Haggerty, Garth R Ilsley, Jon Keatley, Jane E Loveland, Benjamin Moore, Jonathan M Mudge, Guy Naamati, John Tate, Stephen J Trevanion, Andrea Winterbottom, Adam Frankish, Sarah E Hunt, Fiona Cunningham, Sarah Dyer, Robert D Finn, Fergal J Martin, and Andrew D Yates Ensembl 2024Nucleic Acids Res. 2024, 52(D1):D891–D899PMID: 37953337 https://doi.org/10.1093/nar/gkad1049
56. Perez G, Barber GP, Benet-Pages A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee CM, Nassar LR, Raney BJ, Speir ML, van Baren MJ, Vaske CJ, Haussler D, Kent WJ, Haeussler M. (2024). The UCSC Genome Browser database: 2025 update. Nucleic Acids Res. https://doi.org/10.1093/nar/gkae974
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M., & Ishiguro-Watanabe, M. (2023). KEGG for taxonomy-based analysis of pathways and genomes. Nucleic acids research, 51(D1), D587–D592. https://doi.org/10.1093/nar/gkac963
Комментарии