【摘 要】
:
In recent years,with the increasing application of highthroughput sequencing technology,researchers have obtained and accumulated a large amount of multi-omics data,making it possible to diagnose cancer at the gene expression level.The proliferation of va
【机 构】
:
School of Computer Science,Qufu Normal University,Rizhao 276826,China;School of Automation,Central S
论文部分内容阅读
In recent years,with the increasing application of highthroughput sequencing technology,researchers have obtained and accumulated a large amount of multi-omics data,making it possible to diagnose cancer at the gene expression level.The proliferation of various omics data can provide a large amount of biological information,which brings new opportunities and great challenges as well to cancer classification and diagnosis.Machine learning algorithms for early diagnosis of lung cancer have emerged that distinguish cancers of the early and late stages by using genomic features.Omics data are generally characterized with low sample size,high dimensionality and high noise.Therefore,simple direct application of common classification methods cannot achieve better performance and must be improved in a targeted manner.This paper puts forward a combined convolutional neural network and convolutional auto-encoders approach to construct a deep migratory learning classification model for early lung cancer diagnosis.First,the convolutional auto-encoders algorithm is used to reduce the dimensionality of the dataset in order to make it better meet the requirements of migration learning.Second,a neural network model is constructed with the original dataset and the existing labeled dataset,and the model migration rules are set as well.Finally,a small number of labeled target datasets are used in the training to complete the construction of the classification model.The proposed convolutional neural network method based on model migration and five other popular machine learning models are used to classify and predict the three lung cancer gene datasets and the integrated dataset.The experimental results show that such four evaluation metrics as accuracy,precision,recall,and f1-score with our proposed method have obtained better prediction performance,and the average area under curve result also shows our proposed method is optimal.
其他文献
With the expansion of data scale and the increase in data complexity,it is particularly important to accurately identify clusters and efficiently save clustering results.To address this,we propose a novel clustering algorithm,Shape clustering based on dat
Electromagnetic emissions from electrical information equipment may contain useful information and lead to information leakage. In order to detect the electromagnetic information leakage of digital vi
Vehicular ad hoe networks (VANETs) cre-ate an vital platform for communication between vehicles,which can realize accident warning,auxiliary driving,road traffic information query,passenger communication and other applications.While providing convenient s
Although the federated learning method has the ability to balance data and protect data privacy by means of model aggregation,while the existing methods are difficult to achieve the effectiveness of centralized learning under data sharing.The existing fed
Ultrasound computed tomography(UCT)is a promising approach for early breast cancer screening.However, current studies which use prone posture to collect breast ultrasonic data cause four problems, a l
Ionizing radiation effect and failure mechanism of Digital signal processor (DSP) is studied through test-board and automatic test equipment to find the relationship between system function failure and parameter degradation.Static bias is more sensitive t
This paper aims to solve the problem of low efficiency,high cost and instability in opportunistic network transmission in the process of mobile group intelligence perception task allocation.Two multi-task dynamic distribution methods based on Lowest cost
A four-stage Operational transconduc-tance amplifier (OTA) used in an infrared temperature sensor adopting the proposed Feed-forward Gm-stage and segmenting nested Miller compensation technique is presented.The purpose of the proposed segment compensation
Popularity prediction of online video is widely used in many different scenarios.It can not only help video service providers to schedule video web sites,but also bring considerable profits on investment for both providers and advertisers if popularity of
We focus on the problems of the accurate time delay estimation,the design of training pilots,and hybrid matrix optimization within the large-scale antenna array Terahertz (THz) broadband communication system.In contrast to the existing researches based on