Telefonica Tech · Blog · Sonia Doblado

Sonia Doblado

Genomcore's bioinformatician specializes in genomics. I manage tenders and opportunities related to precision medicine. I want to improve the quality of life for patients through tools like Genomcore BIMS, so that healthcare professionals can focus on improving patient care. In my free time I cross swords with my friends (I'm a sabreur).

AI & Data

The breakthrough technological leap in DNA sequencing

In this series of posts on precision medicine, we have constantly mentioned DNA, DNA sequencing, genome sequencing, or massive sequencing (NGS), but what does this concept really mean? What is DNA sequencing Simply put, sequencing is the technique that converts a biological sample (such as a biopsy, blood, or saliva) into data that can be analyzed by a computer. We go from having a piece of tissue to obtaining a text file that contains the "instruction manual" of that person's cells, that is, the exact sequence of their DNA. This sequence, which we can think of as a plain text file, is what we will use in the methods and procedures for genomic testing for precision medicine that we discussed in our previous article. This time we will talk about the technical differences we can find to sequence that genomic material. This will enable us to analyze it and find patterns or variants that help us advance in precision medicine. DNA, or deoxyribonucleic acid, is the molecule that contains all living beings' genetic information. It consists of four nucleotides, each represented by a letter: A (adenine), T (thymine), C (cytosine), and G (guanine). These letters are organized in a double helix structure, where each nucleotide on one strand pairs with its complementary nucleotide on the other: A with T and C with G, forming the so-called base pairs. Deciphering the sequence of these letters in DNA allows us to understand how genes work, how they are regulated, and how they influence health and disease. The ability to sequence DNA has led to great advances in the knowledge of genome organization and function. Thanks to these techniques, scientists have identified genes responsible for hereditary diseases, developed targeted therapies in precision medicine, and reconstructed species evolution over time. But not all sequencing technologies are the same: for example, depending on the technique used, longer or shorter DNA fragments can be obtained, which influences the accuracy and usefulness of the analysis. Below, we will explore the different generations of sequencing techniques. First-generation sequencing: the Sanger method Historically, the most used method for sequencing DNA has been the one developed by Sanger and his team. In this procedure, the DNA molecule whose sequence is to be determined is converted into single strands, which are used as templates to synthesize a series of complementary strands. Each of these strands randomly ends in a different specific nucleotide. This produces a series of DNA fragments separated electrophoretically, and whose analysis reveals the DNA sequence. In the first step of this reaction, DNA is heated to denature, forming single strands. The single-stranded DNA is mixed with primers that hybridize to the 3' end of this DNA. The single-stranded DNA sample bound to the primer is distributed in four tubes. In the next step, DNA polymerase and the four deoxyribonucleotide triphosphates (dATP, dCTP, dGTP, and dTTP) are added to each tube. In addition, each tube also receives a small amount of a modified deoxyribonucleotide, called a dideoxynucleotide (ddATP, ddCTP, ddGTP, ddTTP). Dideoxynucleotides have a 3'-H group instead of a 3'-OH group. To analyze the sequence, one of the deoxyribonucleotides or the primer is radioactively labeled. DNA polymerase is added to each tube, and the primer is elongated in the 5'-3' direction, forming a complementary strand to the template. During DNA synthesis, DNA polymerase occasionally inserts a dideoxynucleotide instead of a deoxyribonucleotide into the growing DNA chain. Since the dideoxynucleotide lacks a 3'-OH group, it cannot form a 3' bond with any other nucleotide, and DNA synthesis stops. 🧬 For example, in the tube to which ddATP has been added, the polymerase inserts ddATP instead of dATP,causing chainn elongation to stop. In the other tubes, the reactions end in a C, a G, or a T respectively. The DNA fragments from each reaction tube (one for each dideoxynucleotide) are separated in adjacent lanes by gel electrophoresis. The result is a series of bands that form a ladder pattern that is visualized by exposing the film to the gel. The nucleotide sequence is read directly from the base to the top of the gel, which corresponds to the 5'-3' sequence of the DNA strand complementary to the template. DNA sequencing in large-scale genome sequencing projects has been automated and uses machines that can sequence several hundred thousand nucleotides each day. In this procedure, each of the four dideoxynucleotide analogs is labeled with a fluorescent dye of a different color. This is so that the chains that end in adenosine are labeled with one color, those ending in cytosine with another, and so on. All four labeled dideoxynucleotides are added to the same tube. After primer extension by DNA polymerase, the reaction products are loaded into one lane of a gel. The gel is scanned by a laser, causing each band to emit fluorescence of a different color. The sequencing machine has a detector that reads the color of each band and determines whether it represents an A, a T, a C, or a G. This data is represented as colored peaks, each corresponding to a nucleotide in the sequence. Image of the Sanger sequencing flow. Source 🧬 This is the simplest and most universal sequencing method. In second- and third-generation sequencers, sequencing techniques differ among different manufacturers, so we won't go into detail here (you can breathe easy). From traditional sequencing to second-generation sequencing (NGS): the great technological leap Before the advent of Next-Generation Sequencing (NGS), DNA sequencing was performed using the aforementioned Sanger method, which is ultimately a slow and expensive process that allows reading relatively short DNA fragments sequentially. The arrival of the second generation of sequencers marked a before and after, as it introduced the possibility of sequencing millions of DNA fragments simultaneously (massive parallel sequencing). This allowed the DNA of entire organisms to be deciphered in record time, accelerating the development of projects such as the Human Genome Project and making genetic sequencing more accessible. The great feature of NGS is that it fragments DNA into small pieces called short reads, typically measuring between 50 and 600 base pairs (bp), which will be copied many times to amplify the material to be read. Subsequently, these pieces are computationally reconstructed to obtain the complete sequence, in what we call pipelines or bioinformatics workflows. 🧬 This works very well for identifying point mutations and small changes in DNA and giving themmeaning. For example,s variants that can influence hereditary diseases. However, when it comes to analyzing complex regions of DNA, such as highly repetitive ones or those containing large rearrangements, reconstructing the complete sequence can be complicated and prone to errors. Third-generation sequencing: longer reads and real-time analysis While NGS remains the most widely used technology today, third-generation sequencers have brought significant improvements. Instead of splitting DNA into small fragments, these new technologies allow reading much longer fragments, ranging from 10,000 to 100,000 base pairs (long reads), making genome assembly easier and identifying complex structural changes. Another key point is that third-generation sequencing is performed in real-time and does not require prior amplification of DNA, reducing errors introduced by genetic duplication. Additionally, these techniques can detect epigenetic modifications, such as DNA methylation, without the need for additional steps in the process. 🧬 This has significant implications for studies of diseases like cancer, where the regulation of gene expression, that is, which genes are "activated" and when they do, plays a fundamental role. Short reads or long reads? Which is better? The choice between second- and third-generation technologies depends on the specific application. While short reads (NGS) are extremely accurate and allow analyzing large volumes of data at low cost, long reads offer greater capacity to detect structural variants and assemble complex genomes without complicated computational reconstructions. Short reads (NGS, second generation) ✅ High precision for detecting point mutations and small variations. ✅ Lower cost per sequence base. ✅ Ideal for studies of genetic diseases, cancer, and transcriptomics. ❌ Difficulties in assembling complete genomes due to fragmentation. ❌ Limitations to detecting large structural variants. Long reads (third generation) ✅ Long reads that allow assembling complete genomes more easily. ✅ Ability to detect structural variants and repeats in DNA. ✅ Does not require amplification, avoiding biases in the process. ❌ Higher error rate compared to NGS (although it can be corrected with additional coverage). ❌ Higher cost per sequencing. Trends in the use of these technologies for DNA sequencing In recent years, third-generation sequencing has gained ground, especially in studies where it is essential to have a complete view of the genome without interruptions. For example, long-read sequencing has been key to identifying structural variants in diseases such as autism and certain forms of epilepsy (Chaisson et al., 2019). Additionally, in 2022, the Telomere-to-Telomere (T2T) Consortium managed to sequence the entire human genome without gaps for the first time, thanks to the combination of third-generation technologies (Nurk et al., 2022). However, NGS remains the most widely used option in hospitals and diagnostic laboratories, due to its low cost and high precision in detecting individual mutations. In many research cases, the current trend is to combine both technologies in hybrid studies, where short reads offer precision and long reads allow resolving complex regions of DNA. In the clinical setting, third-generation sequencing is still received with some skepticism, which makes sense if we think that an error in sequencing can lead to a diagnostic error for a patient, and therefore directly affect their health. Sequencing technologies are constantly evolving, and it is likely that in the coming years we will see even greater integration between second- and third-generation techniques. It is foreseeable that third-generation sequencing will become widespread both in biomedical research and in the clinic as costs decrease and accuracy improves. Third generation will become increasingly common in hospitals. Regardless of the technology used, the impact of DNA sequencing on personalized medicine, the identification of rare diseases, and cancer research will continue to grow, bringing us ever closer to treatments specifically designed for each person's profile. Cyber Security Cloud Connectivity & IoT IA & Data Healthcare's digital transformation: challenges, needs, and benefits December 18, 2024

March 31, 2025

AI & Data

Methods and procedures in genomic testing for precision medicine

We have talked a lot in previous articles about genomic testing and its importance in the revolution of personalized precision medicine. But what are they? What exactly are we talking about? Genomic tests, sometimes also called genetic tests if we are talking about the study of inherited diseases, are those that sequence (“read”) the patient's genome, or a part of it, to identify biomarkers; i.e. specific signals within that genomic sequence, that are related to diseases. How useful is this? Well, knowing the presence or absence of these biomarkers can help doctors to detect the risk of suffering from certain diseases, such as hereditary diseases, or the specific genetic variation related to a disease that is already being suffered, as in the case of oncological diseases. Genomic tests sequence the genome to identify biomarkers that help detect risks for certain diseases. In this last case, knowing which mutation or genetic variation is causing the cancer will help the physician to decide which treatment has the best chance of success, thus personalizing the therapy to the patient. In this article we will find out what genomic tests are, what they mean for us as patients, and what are the most common types of tests. What is genomic or genetic testing? Genomic or genetic testing begins with taking a sample from the patient. In germline genetic testing or genomic testing, i.e., with markers present in all cells of the body, as in diseases inherited from the parents, samples are usually obtained from saliva or blood. This is due to the ease of obtaining this type of sample, with minimally invasive procedures such as a blood draw or saliva swab extraction. Why is blood still used if DNA can be obtained from saliva by simply “swabbing” a swab? Among other reasons, it must be taken into account that in the mouth we have an enormous amount of symbiont or commensal bacteria, which coexist with us and which have their own genetic material. Therefore, they can contaminate to some extent the result of the sequencing of that sample, lowering its quality. Knowing which genetic mutation or variation is causing cancer will help the physician decide which treatment has the best chance of success, thus personalizing the therapy to the patient. This does not mean that the results are useless, but for certain genetic markers that need to be detected with very high precision, the quality may not be high enough for the physician to consider the results as decisive. In the case of looking for genomic variations not in the germline, but in the somatic line, the sampling process is slightly different. Somatic mutations are those inherent to oncological diseases, for example, since the biomarkers we are looking for will be found in some cells of the body and not in others. That is to say, in a tumor cancer, for instance, we will look for the mutation or genomic variation in the tumor cells (“diseased”) in relation to the rest of the body cells (“healthy” or control cells). We will then normally need two samples, one as described above from saliva, or healthy tissue, and another from the tumor biopsy. This process is invasive and, depending on the patient's state of health and the location of the tumor, can be dangerous. Transforming biological samples into data There are liquid biopsies to minimize these risks, in which, through a blood sample, circulating tumor DNA is located in the bloodstream. Although this type of biopsy is not currently useful for all oncological diseases, it is being researched and developed at a dizzying pace due to its enormous practical usefulness in improving the quality of care. Samples are generally obtained from saliva or blood, due to the ease of obtaining this type of sample, with minimally invasive procedures. Once the sample of interest has been obtained, the genetic material of interest (DNA or RNA) will be extracted and prepared for the chosen sequencing technique. The prepared sample will then be introduced into the so-called sequencers, which will be the machines in charge of converting these biological samples into data, into text. This text will be used by the bioinformaticians to search for genetic variations compared to a reference sequence of a human genome, or of the control sample in the case of somatic mutations and give them meaning. Those mutations that have clinical weight in relation to the test requested by the physician will normally be communicated to the physician through the production of a report. From this point on, it will be the specialist physician, who has or should have all the information concerning the patient, who will assess the value of these detected variations and make the best decisions for the patient's health. IA & Data Omics sciences: redefining health October 29, 2024 Genetic testing types for precision medicine In this post we will focus on molecular tests related to the study of DNA sequencing. Apart from these, we can have other types of tests related to genetics within the world of precision medicine, including: Biochemical tests to measure protein activity. Expression tests to study which genes are activated. Chromosomal tests that look at changes in genetic material at the chromosomal level, on a large scale. Molecular tests, which are those that look for the aforementioned genetic variations, include whole genome sequencing (WGS) and whole exome sequencing (WES) and targeted sequencing, as in panels. Within healthcare practice, this translates into the following highlighted tests: Neonatal screening This is the most common genetic test; we are all familiar with the heel prick test. Through this systematic test, newborns can be screened for the possibility of suffering from some type of genetic disease before the adults around them can see the symptoms. ■ It may be too late in diseases such as glutaric aciduria type I, by the time the baby shows symptoms, and can have lifelong consequences. On the other hand, if detected early, the onset of symptoms can be prevented “simply” by adjusting the diet. Carrier screening or carrier-screening This type of testing is often used to help future parents to know if they are carriers of certain diseases, thus knowing if there is a risk of transmitting them to their offspring. ■ These tests are performed on people who have a family history and therefore have a high chance of being transmitters of inherited diseases. Prenatal diagnosis These are tests performed on the fetus to detect early modifications in its genes and chromosomes, so that both physicians and parents can make an informed decision. This type of test is performed, as before, in cases in which the parents may be carriers of hereditary diseases due to family history. However, given the universalization of these tests, they are also usually performed in pregnancies suspected of being at risk or that have presented some type of problem during their development. ■ In this case, the sample is obtained either by amniocentesis or by chorionic villus sampling (placental sampling). Genetic predisposition testing Panels sequence very specific parts of the genome as they target specific genes, which can be between one gene, or a few, to several thousand. They are used when it is known in which gene or genes the mutation should be sought. The best-known case might be the breast and ovarian cancer panel, which includes the BRCA1 and BRCA2 genes as the main indicators of the risk of developing breast and ovarian cancer in the future. But these are not the only genes that may be involved in this type of cancer; most current tests already sequence around 30 genes. Why perform a panel and not a WGS or WES? Well, because by reducing the size of the genetic material to be sequenced the technique is more cost-effective and the data produced are much easier for professionals to handle, so that waiting times for results are reduced. ■ These panels can be used both to determine genetic predisposition and to help determine patient treatment. Conclusion Genetic testing has revolutionized modern medicine by enabling early detection of inherited diseases and prediction of genetic predispositions. From testing for prospective parents, prenatal diagnostics to genetic predisposition panels, these tools provide key information to make informed and personalized health decisions. The universalization and continuous improvement of these tests ensure that more and more people can benefit from more accurate and effective medical care. Genetics thereby opens a window into the future of preventive and personalized medicine. With the development of more affordable and accurate tests thanks to digital technologies, more and more people will be able to know their risks and take proactive measures to maintain their health. In addition, the integration of genetics into clinical practice is not only improving the diagnosis and treatment of diseases but also providing a more holistic and patient-centered approach.

January 23, 2025

Búsquedas recomendadas

Sonia Doblado

Find out more about us