This GitHub repository shares the scripts in Julia and Python programming languages that are used for resolution-adaptive regression equation computation for SQL database construction, denoising, data extraction & transformation (the ETL process), feature selection, model selection, model development, and deployment for the work entitled "Physically Constrained Mass Spectrometry Data Binning on Multi-Platform Untargeted Metabolomics Data Reveals Generic Biomarkers for Hepatocellular Carcinoma".
https://bitbucket.org/hiulokngan/modelsHCCAMC/src/main/
Hiu-Lok Ngan,1,# Jialing Zhang,1,# Kenneth Kin-Leung Kwan,2,3 Jacinth Wing-Sum Cheu,2,3 Li Zhong,1 Yike Guo,4,5 Xian Yang,4,6 Carmen Chak-Lui Wong,2,3, Hong Yan,1, Zongwei Cai1,7, *** 1State Key Laboratory of Environmental and Biological Analysis, Department of Chemistry, Hong Kong Baptist University, Hong Kong, P. R. China 2State Key Laboratory of Liver Research, Department of Pathology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, P. R. China 3Centre for Oncology and Immunology, Hong Kong Science Park, Hong Kong, P. R. China 4Department of Computer Science, Hong Kong Baptist University, Hong Kong, P. R. China 5Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, P. R. China 6Alliance Manchester Business School, The University of Manchester, United Kingdom 7College of Science, Eastern Institute of Technology, Ningbo, China
#The authors contributed equally to this work. *Corresponding authors: Drs. Carmen Chak-Lui Wong, Hong Yan, and Zongwei Cai Emails: cclwong@hku.hk; hongyan@hkbu.edu.hk; zwcai@hkbu.edu.hk