GET THE APP

Variant Detection Using Clinical Oncology and Biological Complexity
..

Journal of Oncology Medicine & Practice

ISSN: 2576-3857

Open Access

Perspective - (2023) Volume 8, Issue 1

Variant Detection Using Clinical Oncology and Biological Complexity

David Chien*
*Correspondence: David Chien, Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy, Email:
Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy

Received: 17-Dec-2022, Manuscript No. jomp-23-87832; Editor assigned: 19-Dec-2022, Pre QC No. P-87832; Reviewed: 02-Jan-2023, QC No. Q-87832; Revised: 09-Jan-2023, Manuscript No. R-87832; Published: 18-Jan-2023 , DOI: 10.37421/2576-3857.2023.08.183
Citation: Chien, David. “Variant Detection Using Clinical Oncology and Biological Complexity.” J Oncol Med & Pract 8 (2023): 183.
Copyright: © 2023 Chien D. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Introduction

The most significant of these obstacles is understanding how genomic changes might affect a patient's health and how this information can be used to fine-tune personalized therapies. We will talk about important technologies, computational methods, and models that can be used with NGS data, from Whole Genome to Targeted Sequencing, to deal with the biological complexity of cancer. In the concluding section, we will examine the opportunities and challenges that bioinformatics for precision medicine will face in the future, both on the molecular and clinical levels. We will focus specifically on homologous recombination deficiency as a newly discovered biomarker that can be used in clinical practice. Sequence variation detection, epigenetic and transcriptional regulation, chromatin conformation, its three-dimensional architecture, and the interaction of these phenomena are just a few of the numerous potential applications in diagnostic and research settings. When applied to a large number of loci per subject, this procedure results in the generation of an enormous amount of sequence data. The primary topics of this review will be the process of identifying variants and its preprocessing phases. A reference genome sequence that has already been assembled or input reads concatenated using "de novo" techniques, which use no reference sequence at all, are used to align small DNA or RNA fragments.

Various methodologies are optimized for genomic study at various scales, from a few genes to the entire genome. To investigate previously unknown genetic changes, whole genome sequencing, which covers the entire genome but requires more time and money, is utilized. Whole genome sequences may only cover protein-coding genes, which make up 3% of the genome, but at a lower cost because protein-associated mutations frequently affect genome regulation. However, the difficulty of data interpretation severely limits the application of WES in clinical practice and research. As a result, targeted sequencing was developed to investigate specific genomic mutational hotspots. Genomic changes that cause diseases and are either known or suspected to be pathogenic are identified through this approach. A typical NGS workflow includes stages like sample pretreatment, library preparation, sequencing, and bioinformatics analysis. Each stage is crucial and may conceal inaccuracies that could have an impact on the outcome. Currently, the well-established methods for variation detection and annotation are carried out using the following stages.

Description

As high throughput sequencing technology developed, biomedical machine learning techniques found widespread application. New approaches to patient stratification, diagnostic tools, and medication discovery methods were developed as a result [1]. In addition to the substantial quantity of consortium-produced NGS data that is accessible to the public. There has been an increase in demand for ML-based software that is computationally more efficient, accurate, and reusable as a result of the widespread availability of inexpensive sequencing technologies [2]. One of the most challenging challenges for ML in clinical genomics is modeling how genetic variations and their interactions affect cell growth and fate, leading to cancer transformation. Although traditional inference methods can be very adaptable and interpretable in terms of causation, linearity or model-based assumptions frequently limit them, offering aggregate. To predict the likelihood of variation for each genomic locus, conventional methods like GATK rely heavily on various statistical models and heuristics based on calling accuracy, allelic coverage, and sequencing coverage [3]. The presence of only partially controllable sequencing artifacts, such as those caused by DNA synthesis dephasing and efficiency, low-complexity and repeated genomic sequences, and polymerase chain reaction errors, significantly hinders this work.

Additionally, sequencing data has a high dimensionality by definition; the same phenotype can be caused by multiple combinations of genomic changes, and typically, only a small percentage of people have the variant that is linked to the observed disease. Due to their capacity to represent a very large number of characteristics and parameters, deep neural networks have been extensively utilized for variant discovery. The fundamental principle of convolutional neural networks (CNNs) is to transform aligned data collections into image patterns, resulting in variant clusters with potentially pathogenic connections. Examples of CNN-based algorithms include the general-purpose program Deep Variant, the specialist program Clairvoyant, and NeuSomatic [4]. For single-molecule technologies, somatic variations, and structural variants, respectively, these algorithms were developed. Gains in predicted accuracy have frequently been achieved by employing ensemble approaches, in which a variety of integrated models are used in the learning process. CNNScoreVariants, for instance, makes use of GATK's pre-trained models in order to locate SNVs and indels from short-read sequencing data. One of the main limitations of many deep learning methods is the possibility of information biases in their training sets. The goal of variant discovery is to locate genomic regions that are causally linked to the breakdown of one or more biological activities and pathways [5]. Because they are more likely to alter the structure and function of the protein that is encoded, pathogenic modifications to coding DNA make it much simpler to connect a disease phenotype to these modifications. The majority of diagnostic methods in clinical and cancer genomics are, as a result, based on WES or panels of a small number of exons. However, many of the negative aspects of disease are caused by noncoding variants that are likely to be found at regulatory elements.

Conclusion

In conclusion, we talk about important computational algorithms, models, and technologies that can be used with NGS data from Whole Genome to Targeted Sequencing to find complicated biomarkers associated with cancer. In addition, we focus on a new complex biomarker like homologous recombination deficiency as we investigate the future perspectives and challenges of bioinformatics for precision medicine at the molecular and clinical levels.

Conflicts of Interest

None.

References

  1. Park, Jay J.H, Grace Hsu, Ellie G. Siden and Kristian Thorlund, et al. “An overview of precision oncology basket and umbrella trials for clinicians.” CA Cancer J Clin 70 (2020): 125-137.
  2. Google Scholar, Crossref, Indexed at

  3. Prelaj, Arsela, Rebecca Tay, Roberto Ferrara and Nathalie Chaput, et al. “Predictive biomarkers of response for immune checkpoint inhibitors in non-small-cell lung cancer.” Eur J Cancer 106 (2019): 144-159.
  4. Google Scholar, Crossref, Indexed at

  5. Rizzo, Alessandro, Angela Dalia Ricci and Giovanni Brandi, et al. “PD-L1, TMB, MSI, and Other Predictors of Response to Immune Checkpoint Inhibitors in Biliary Tract Cancer.” Cancers 13 (2021): 558.
  6. Google Scholar, Crossref, Indexed at

  7. Vestergaard, Lau K, Douglas N.P. Oliveira, Tim S. Poulsen and Claus K. Høgdall, et al. “Oncomine comprehensive assay v3 vs. oncomine comprehensive assay plus.” Cancers 13 (2021): 5230.
  8. Google Scholar, Crossref, Indexed at

  9. Dehghani, Mehdi, Kevin P. Rosenblatt, Lei Li and Mrudula Rakhade, et al. “Validation and clinical applications of a comprehensive next generation sequencing system for molecular characterization of solid cancer tissues.” Front Mol Biosci 6 (2019): 82.
  10. Google Scholar, Crossref, Indexed at

arrow_upward arrow_upward