Using combinations of principal component scores from different spectral ranges in near-infrared region to improve discrimination for samples of complex composition

Sanguk Lee, Hoeil Chung, Hangseok Choi, Kyungjoon Cha

Research output: Contribution to journalArticle

8 Scopus citations

Abstract

Principal component analysis (PCA) is widely used as an exploratory data analysis tool in the field of vibrational spectroscopy, particularly near-infrared (NIR) spectroscopy. PCA represents original spectral data containing large variables into a few feature-containing variables, or scores. Although multiple spectral ranges can be simultaneously used for PCA, only one series of scores generated by merging the selected spectral ranges is generally used for qualitative analysis. Alternatively, the combined use of an independent series of scores generated from separate spectral ranges has not been exploited. The aim of this study is to evaluate the use of PCA to discriminate between two geographical origins of sesame samples, when scores independently generated from separate spectral ranges are optimally combined. An accurate and rapid analytical method to determine the origin is essentially required for the correct value estimation and proper production distribution. Sesame is chosen in this study because it is difficult to visually discriminate the geographical origins and its composition is highly complex. For this purpose, we collected diffuse reflectance near-infrared (NIR) spectroscopic data from geographically diverse sesame samples over a period of eight years. The discrimination error obtained by applying linear discriminant analysis (LDA) was improved when separate scores from two spectral ranges were optimally combined, compared to the discrimination errors obtained when scores from singly merged two spectral ranges were used.

Original languageEnglish
Pages (from-to)96-101
Number of pages6
JournalMicrochemical Journal
Volume95
Issue number1
DOIs
Publication statusPublished - 2010 May 1

    Fingerprint

Keywords

  • Geographical origin
  • Linear discriminant analysis
  • Principal component score
  • Score combination
  • Sesame

Cite this