Ingegneria Sismica

A migration learning-based approach to cross-domain model building for biological big data analytics

Author(s): Xinxin Gan¹, Linhui Wang²
¹School of Materials Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200082, China
²Department of Urology, Changhai Hospital, Naval Medical University, Shanghai, 200082, China
Gan, Xinxin, and Wang, Linhui. “A migration learning-based approach to cross-domain model building for biological big data analytics.” Ingegneria Sismica, Volume 43, Issue 2: 1-18, doi:10.65102/is2026610.

Abstract

Cross-domain models based on migration learning (transfer learning) can address two problems that are prevalent in biological data fields such as genomics, proteomics, and medical imaging: scarce annotated data and large distributional differences between domains. This paper proposes a deep transfer learning model based on multi-source domain integration (MUCT), building on conventional cross-domain transfer learning approaches. First, an end-to-end training mechanism is established based on deep neural networks. Second, high-confidence target samples selected by consistency filters are used for training, which provides supervisory information for the target domain. Finally, the classification results obtained from the multiple source domains and the target domain are combined by relative majority voting to improve the model's robustness. The approach performs well at recognizing medical entities in Chinese electronic medical records, reaching a strict F1 value of 85.4% on the CCKS 2018 review dataset. A typical case study shows that the migration method can effectively recognize entities such as personal information, disease symptoms, diagnosis and treatment, and drug use in patient question texts using only a small annotated corpus, making full use of existing data resources. This study provides an efficient knowledge migration paradigm for biological big data analysis, which is expected to promote the in-depth development of precision medicine and systems biology research.
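
The two decision rules named in the abstract, consistency filtering and relative majority voting, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 0.9 confidence threshold, the "all classifiers agree" criterion, and the tie-breaking rule are all assumptions made for illustration, since the abstract does not specify them.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple


def plurality_vote(labels: Sequence[str]) -> str:
    """Return the label with the most votes (relative majority).

    Ties go to the tied label that appears first in `labels`; this
    tie-break is an assumption, not something the abstract specifies.
    """
    if not labels:
        raise ValueError("labels must not be empty")
    counts = Counter(labels)
    best = max(counts.values())
    for label in labels:
        if counts[label] == best:
            return label
    raise AssertionError("unreachable")


def consistency_filter(
    predictions: Sequence[Tuple[str, float]],
    threshold: float = 0.9,  # hypothetical cut-off
) -> Optional[str]:
    """Return a pseudo-label for one target sample, or None to discard it.

    `predictions` holds one (label, confidence) pair per source-domain
    classifier. The sample is kept only if every classifier predicts the
    same label and each confidence reaches `threshold` (assumed criterion).
    """
    if not predictions:
        return None
    labels = {label for label, _ in predictions}
    lowest = min(conf for _, conf in predictions)
    if len(labels) == 1 and lowest >= threshold:
        return predictions[0][0]
    return None


if __name__ == "__main__":
    # Three classifiers vote on the entity type of one token.
    print(plurality_vote(["disease", "drug", "disease"]))
    # Two source classifiers agree with high confidence: keep as pseudo-label.
    print(consistency_filter([("drug", 0.95), ("drug", 0.92)]))
```

In this sketch, samples that pass `consistency_filter` would serve as pseudo-labeled target-domain training data, and `plurality_vote` would merge the per-classifier outputs at prediction time.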

Keywords
migration learning; cross-domain modeling; biological data; MUCT; deep neural network
