基于组合算法改进的谱库检索算法

Spectral Library Search Algorithm Based on Improved Composite Algorithm

  • 摘要: 本工作对Stein和Scott提出的SS组合算法(SS)进行改进,采用Kim等研究得到的权值因子优化该算法中对应的权值因子,并重新分配了加权点积相似度算法和峰比例算法的系数。采用改进的SS组合算法,在NIST 11标准参考谱库(212 961张质谱图)中检索了查询库中的30 932张质谱图,使用气相色谱-质谱联用仪分析了8种不同的化合物样品,并且在NIST 11参考库中检索对应的质谱图。为了评价该算法的性能,分别利用2种组合算法分析查询谱图或实验样品的准确度和相似度。结果表明:与之前的SS组合算法相比,使用本方法后,查询谱图在参考谱库中匹配的准确度平均提高了1.15%,并且查询库中94.45%谱图的相似度得到了提高;通过气相色谱-质谱联用仪得到的样品质谱图在参考谱库中有着更高的命中率,并且谱图的相似度平均提高了3.56%。改进的组合算法能够较好地提高待测谱图在参考库中的准确度和相似度,同时也可以利用这种方法改进以SS组合算法为理论基础的其他算法。

     

    Abstract: The composite algorithm proposed by Stein and Scott was improved, whose weight factors were optimized by the weighting factors proposed by Kim et al, and whose coefficients of weighted dot-product similarity measure and peak ratio algorithm were redistributed. Using the improved composite algorithm, 30 932 mass spectra in the query library were retrieved in the Mass Spectral Library 2011 (NIST 11) main library (212 961 mass spectra) used as reference spectral library. In addition, 8 kinds of different compounds were analysed by gas chromatography-mass spectrometry (GC/MS), and the corresponding mass spectra were also retrieved in the NIST 11. In order to evaluate the performance of the algorithm, two different sets of experiments were carried out, and the accuracy and similarity of the query spectra or the experimental samples were analysed by using two combinatorial algorithms respectively. The results showed that compared with the previous composite algorithm, the accuracy of the query spectra matching in the reference spectral library was increased by 1.15%, and the similarity of the 94.45% of the mass spectra in the query library were improved. The spectrum of the sample through the GC/MS had higher hit rates in the reference spectral library, and the spectrum similarity increased 3.56% in average. Since the improved composite algorithm can improve the accuracy and similarity of the spectrum to be measured in the reference spectral library, it can also be used to improve other algorithm based on Stein and Scott’s composite algorithm.

     

/

返回文章
返回