A machine learning framework that integrates multi-omics data predicts cancer-related LncRNAs

نویسندگان

چکیده

Abstract Background LncRNAs (Long non-coding RNAs) are a type of RNA molecule with transcript length longer than 200 nucleotides. LncRNA has been novel candidate biomarkers in cancer diagnosis and prognosis. However, it is difficult to discover the true association mechanism between lncRNAs complex diseases. The unprecedented enrichment multi-omics data rapid development machine learning technology provide us opportunity design framework study relationship Results In this article, we proposed new approach, namely LGDLDA (LncRNA-Gene-Disease networks based LncRNA-Disease Association prediction), for disease-related prediction data, methods neural network neighborhood information aggregation. Firstly, calculates similarity matrix lncRNA, gene disease respectively, through lncRNA expression profile matrix, lncRNA-miRNA interaction lncRNA-protein matrix. We obtain by calculating lncRNA-gene gene-disease ontology, disease-miRNA Gaussian kernel similarity. Secondly, integrates matrices using nonlinear feature network. Thirdly, uses embedded node representations approximate observed matrices. Finally, ranks lncRNA-disease pairs then selects potential lncRNAs. Conclusions Compared methods, our method takes into account more critical obtains performance improvement cancer-related predictions. Randomly split experiment results show that stability better IDHI-MIRW, NCPLDA, LncDisAP NCPHLDA. on different simulation sets can accurately effectively predict Furthermore, applied three real including gastric cancer, colorectal breast

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine learning methods for omics data integration

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

متن کامل

Breast Cancer Prognostics Using Multi-Omics Data

Breast cancer affects one in eight women in America and is a leading cause of death from cancer worldwide. In the current study, four types of Omics data including copy number variation, gene expression, proteome and phosphoproteome were collected from seventy-seven breast cancer patients. Individual types of Omics data were used to separately construct predictive models to predict ten-year sur...

متن کامل

Analysis of Cancer Omics Data In A Semantic Web Framework

Our work concerns the elucidation of the cancer (epi)genome, transcriptome and proteome to better understand the complex interplay between a cancer cell’s molecular state and its response to anti-cancer therapy. To study the problem, we have previously focused on data warehousing technologies and statistical data integration. In this paper, we present recent work on extending our analytical cap...

متن کامل

Omics Pipe: a community-based framework for reproducible multi-omics data analysis

MOTIVATION Omics Pipe (http://sulab.scripps.edu/omicspipe) is a computational framework that automates multi-omics data analysis pipelines on high performance compute clusters and in the cloud. It supports best practice published pipelines for RNA-seq, miRNA-seq, Exome-seq, Whole-Genome sequencing, ChIP-seq analyses and automatic processing of data from The Cancer Genome Atlas (TCGA). Omics Pip...

متن کامل

Multi-Omics Data Integration: A Modular Approach

Substantial extent of research has been carried out over past few decades to understand the molecular and genetic level information within the cell that help improving the methodologies in the fields of agricultural or medical research. However, often the research is focused at one particular area of expertise by an individual researcher or a work group and very less number of attempts were mad...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: BMC Bioinformatics

سال: 2021

ISSN: ['1471-2105']

DOI: https://doi.org/10.1186/s12859-021-04256-8