key: cord-0065790-3q6wotiw authors: Zeng, Tao; Huang, Tao; Lu, Chuan title: Editorial: Cross-Domain Analysis for “All of Us” Precision Medicine date: 2021-07-01 journal: Front Genet DOI: 10.3389/fgene.2021.713771 sha: e95df2b2f65afab0eb987c9a30682beaf3154d53 doc_id: 65790 cord_uid: 3q6wotiw nan Precision medicine aims to combat complex human diseases by providing preventative, diagnostic, and treatment tools for each patient rather than each cohort. Along with the development of precision medicine (formerly called personalized medicine) (Aronson and Rehm, 2015) , the recent "All of Us" initiative seeks to gather data related to "lifestyle, environment, and biology" from a wide range of people in order to better understand individual differences and improve precision medicine (All of Us Research Program Investigators et al., 2019) . "All of Us" precision medicine is seeking easy, low-cost, non-invasive, and effective early clinical detection approaches, which include novel biological technology (BT) and information technology (IT) in precision healthcare research. Importantly, many types/domains of data from new technologies can be utilized, such as voice/text recording, image scanning, behavioral monitoring, and feature phenotyping, in addition to omics data from high-throughput technologies (Zeng, 2021) . These consist of cross-domain data, where the micro cross-domain can cover conventional omics data and the macro crossdomain can bridge the omics with other data types. Therefore, the cross-domain analysis is a key to understanding "All of Us" precision medicine. In particular, the artificial intelligence (AI) model and method (e.g., deep learning) has achieved great success in analyzing image data in many fields and is bringing new opportunities to clinical research . However, for cross-domain analysis in "All of Us" precision medicine, several challenges still exist: the numbers of variables in different domains are unbalanced, and the bias coming from the dominant domain should be tackled (Zhang et al., 2020) ; the number of observations is small or not large enough in clinical or biological fields, thus small-sample modeling is required (Lai et al., 2020) ; the samples in different domains may be not matched, so that heterogeneous integration should be considered ; and many data organizations exist, so data without a vector or matrix form needs a reasonable and powerful transformation/representation (Wang et al., 2021) . This Research Topic serves as an update of our previous topic (Zeng et al., 2019) , "Machine Learning Advanced Dynamic Omics Data Analysis for Precision Medicine, " with a focus on the application of artificial intelligence with cross-domain data for further developing "All of Us" precision medicine. At present, we do not have a clear understanding of the molecular pathogenesis of patent ductus arteriosus (PDA). Chen et al. screened the rare copy number variants of 39 PDA patients and 100 healthy children by whole exome sequencing. This study was narrowed down by network and enrichment based analysis, and finally selected 7 candidate genes significantly expressed in human embryonic heart, which include CHRNA4 as potential genetic cause of PDA. Yan et al. investigated the potential of serum miRNAs as signatures for pancreatic cancer (PC). First, they analyzed the expression differences of miRNAs between PC patients and healthy controls by feature selection approaches, obtaining a group of PC signatures. Next they compared miRNAs' expressions between inoperable and operable PC patients, identifying lots of operability miRNAs. These findings revealed a non-invasive way on the diagnosis and therapy of pancreatic cancer. Xu et al. applied network-based bioinformatics analysis for reanalyzing temporal gene expression data related to Infantile hemangioma (IH), with the purpose of distinguishing genes for proliferative and regressive IH lesions. This analysis pipeline included Short Time-series Expression Miner, weighted gene co-expression network analysis, and area under curve ranking, which determined key hub genes which were verified using qRTPCR. Restoration of altered transcriptional regulations in disease states should be an important action for the pathogen genes associated with disease diagnosis and prognosis (Lu et al., 2020) . The quantitative relation between transcriptional regulation and disease phenotypes can help detect such targeted genes for transcriptional modification. Duan et al. proposed a modelbased quantitative relationship measurement between mRNA (non-TF) genes and Transcription Factor (TF) genes. They further applied this method to detect quantitatively altered mRNA genes in lung cancer on the basis of one learning dataset and several independent evaluation datasets, which provided new candidates for lung cancer diagnosis, prognosis, and treatment. Complex diseases are determined by many heterogeneous biological factors (Su et al., 2020) , thus, omics integration analysis should help capture new prognostic signatures and models for disease prognosis. Ren et al. explored the key biomarkers of early-stage Lung adenocarcinoma and built a prognostic model on the combination of gene transcription and methylation. In the re-analysis of public RNA-seq and methylation microarray data, with a series of statistical, network-based and machine learning approaches, a prognostic index based on 17 methylation driver genes, and 4 CD8 T cell-related genes was established for predicting the survival risk of lung cancer patients in early-stage. Finally, we sincerely thank the reviewers; their great efforts ensure the high quality of all articles in this topic. This Research Topic should bring more attention to the important topics of "All of Us" precision medicine based on machine learning and cross-domain data. TZ drafted the manuscript. TZ, TH, and CL revised the manuscript. All authors contributed to the article and approved the submitted version. The "All of Us" research program Building the foundation for genomics in precision medicine Learning to learn adaptive classifier-predictor for few-shot learning Safety and feasibility of CRISPR-edited T cells in patients with refractory non-small-cell lung cancer Multi-omics resolves a sharp disease-state shift between mild and moderate COVID-19 scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses Multilevel heterogeneous omics data integration with kernel fusion Repurpose analysis expanding biomedical benefits by omics data integration Editorial: Machine learning advanced dynamic omics data analysis for precision medicine Applying artificial intelligence in the microbiome for gastrointestinal diseases: a review Towards class-imbalance aware multi-label learning The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.Copyright © 2021 Zeng, Huang and Lu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.