由质谱分析数据建立肝癌病人与健康人血清的分类模型

Classification Model of HCC Patients and Healthy People Established from Mass Spectrometry Data

  • 摘要: 通过对质谱数据处理和分析,建立肝细胞癌(HCC)病人与健康人的分类模型,可以用来判别样本是否来源于肝癌病人。运用表面加强激光解析电离-飞行时间质谱(SELDI-TOF-MS)获取肝癌病人和健康人血清蛋白指纹图谱数据,并运用偏最小二乘(PLS)变量筛选法建立分类模型,最终得到分类模型的交叉检验相关系数达0.96以上,判别准确率大大提高。同时对模型进行分析,找出对肝癌病人和健康人的差异有重要影响的因素或变量。这些变量为30个质荷比区间内特定蛋白的峰强度值,反映这些质荷比区间内蛋白量的增加或减少与肝癌的形成有密切关系,可作为重要的生物标志物进一步加以研究。并且采用所得模型的拟合值等一些信息做分类图,能较好地表达回归模型的分类效果。

     

    Abstract: Classification models of hepatocellular carcinoma (HCC) patients and healthy people were built from mass spectrometry data, which could be used for the detection of HCC. Surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF-MS) technique was applied to get the data of serum protein from HCC patients and healthy people. Then PLS variable selection method was used to deal with the data and to establish the classification model. The cross validation relativity coefficients of the model cames to over 0.96. Furthermore the important factors or variables that discriminated HCC patients and healthy people were found by analyzing the model. The 30 variables were several peak intensities of protein from some m/z sections, which could express the up-regulation or down-regulation of protein in the sections. As potential biomarkers, the proteins may be closely related to the formation of HCC, which can be deeply studied. The classification figures constructed by the fitting value of the model in the article are clear and intuitive, and can express the discrimination effect of the model well.

     

/

返回文章
返回