Auto-classification for confocal back-scattering micro-spectrum at single-cell scale using principal component analysis

Auto-classification for confocal back-scattering micro-spectrum at single-cell scale using principal component analysis

Optik 127 (2016) 1007–1010 Contents lists available at ScienceDirect Optik journal homepage: www.elsevier.de/ijleo Auto-classification for confocal ...

564KB Sizes 1 Downloads 63 Views

Optik 127 (2016) 1007–1010

Contents lists available at ScienceDirect

Optik journal homepage: www.elsevier.de/ijleo

Auto-classification for confocal back-scattering micro-spectrum at single-cell scale using principal component analysis Cheng Wang ∗ , Miao Wen, Lihong Bai, Tong Zhang Laboratory of Medical Optics & Optometry, School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology, Shanghai, PR China

a r t i c l e

i n f o

Article history: Received 11 December 2014 Accepted 9 October 2015 Keywords: Spectrum Single cell Principal component analysis (PCA) Classification

a b s t r a c t Early diagnosis and treatment for cancer is a key to reduce the mortality of cancer. Based on confocal back scatter micro-spectroscopic system, micro-spectra of normal stomach epithelial cells and cancerous cells were collected. In order to discriminate the two types of cells, the micro-spectra of cells were analyzed with principal component analysis in which the first two scores of the principal components were utilized. The results showed this method could clearly distinguish the confocal back scattering micro-spectra of the normal stomach epithelial cells and cancerous cells. This might provide a new method for early diagnosis of stomach cancer, which is fast, accurate and low-cost at single-cell scale. © 2015 Elsevier GmbH. All rights reserved.

1. Introduction China is the country with the highest incidence of stomach cancer, and accounts for over 40% of all new stomach cancer cases in the world [1]. Stomach cancer is the third leading cause of cancer mortality in China [2,3], and is approaching to young people. Early diagnosis and early treatment is a key approach to reduce the mortality of stomach cancer. Surgery for early stomach cancer has excellent clinical curative effect with the 5-year survival rate being up to 95% [4]. Therefore, it will hold great clinical significance to develop a fast, accurate and low cost diagnostic method. The elastic optical scattering is one of the basic forms of the interaction between light and tissue. The light-scattering signals from a nucleated cell contain cell size and morphological information [5,6]. Flow cytometry is the most representative scientific instrument to analyze cellular size and morphology by utilizing collected forward and side scattering light from a cell. Moreover, Understanding light-scattering in living cells is crucial to many modern diagnostic technologies, such as optical coherent tomography (OCT) [7], elastic scattering spectroscopy [8], and confocal microscopy [9]. These techniques are developed for non-invasive and real time optical diagnosis based on light scattering signal from the tissue. Light-scattering spectroscopy (LSS) [10] is a system that measures the scattering spectrum related to angle to obtain

∗ Corresponding author. Tel.: +86 2155271165. E-mail address: [email protected] (C. Wang). http://dx.doi.org/10.1016/j.ijleo.2015.10.066 0030-4026/© 2015 Elsevier GmbH. All rights reserved.

quantitative information of cell morphology. Wu et al. used the elastic scattering spectrum to study the light-scattering changes of uterine cancer cells treated with acetic acid [11]. Lin et al. reported an elastic back-scattering spectroscopy technique, which could obtain both the back-scattering spectrum and microscopic image on one pixel simultaneously [12]. Fang et al. developed a new technique, namely confocal light absorption and scattering spectroscopic (CLASS) microscopy, which could observe in vivo sub-micron cell structure [13,14]. Spectroscopic results are usually multiple dimensional and include large amounts of data. To classify or identify spectrum from different samples, these data would be processed with certain methods. For example, the smoothing algorithm is often used to preprocess the spectra, which can analyze characteristic spectrum line so that different spectrum in some wavebands can be compared with useful information obtained, e.g., relative peak intensity. However, if the characteristic spectra have the same or similar feature, they will be too hard to be identified. With improvement of the speed of collecting spectrum, it is necessary to accomplish allautomatic spectrum identification. Principal Component Analysis (PCA) [15] is a dimension-reduction tool that can be used to reduce a large set of variables to a small set that still contains most of the information in the large set. PCA is sensitive to the relative scaling of the original variables. In the field of biomedicine, the potential ability of PCA method has been demonstrated. For example, spectra from particles labeled with each of four different SERS tags could be distinguished by principal component analysis [16]. Identification of hepatitis based on normalized data of reflective spectrum

1008

C. Wang et al. / Optik 127 (2016) 1007–1010

Sciences. Cell culture methods and environment were described in literature [18]. 2.2. Spectra acquisition and preprocessing

Fig. 1. Schematic of the confocal back-scattering micro-spectrometer based on fiber.

from healthy adults and viral hepatitis suffers was processed by the method of PCA with BP neural network detected non-invasively on hepatitis patients [17]. This study aimed to use principal component analysis (PCA) for accomplishing automatic identification of micro-spectra of the normal cells (GES-1) and cancerous cells (NCI-N87), which was obtained from confocal back scattering micro-spectroscopy. This technique holds potential for early diagnosis of stomach cancer at cellular level, which is fast, accurate and low-cost. 2. Material and methods 2.1. Experiment setup and sample preparing The fiber-confocal back scattering micro-spectrometer was set up based on our previous work (Fig. 1) [18]. Briefly, broadband white light was coupled to a port of the fiber coupler with a 1 × 2 port and 50:50 splitting ratio via the SMA-905 standard interface. The fiber worked as light source and point detector and the fiber coupler worked as optical splitter to separate the light of illumination and that of the signal [19]. Moreover, the fiber coupler Port 3 replaced the pinhole in confocal system. The light was coupled by Port 3 to the optic probe, which was made up of an achromatic objective (NA = 0.25, 10×) as the collimator and an achromatic objective (NA = 0.6, 40×) as the objective. The lateral and axial resolutions of this system were 5 and 10 ␮m, respectively. The light from cell scattering in the objective NA angle range was all collected by the optic probe and transmitted through Port 2. The back-scattering light was transmitted through SMA-905 interface of the Port 2 to a CCD spectrograph (B&W Tek Inc., BRC112, USA) with a spectral resolution 4 nm. In order to ensure that the object in the field of view was a cell rather than impurity, the microspectrometer offered an observation function to help observe the location of the cell. Illuminating light was given by the same transmitted light through same optic probe. Broadband white light was coupled to a Port 1 and transmitted to Port 3. An achromatic objective (NA = 0.25, 10×) and an achromatic objective (NA = 0.6, 40×) were used for signal and image collection, respectively. The lateral and axial resolutions of this system were 5 and 10 ␮m, respectively. The back-scattering light from a cell in the objective NA angle range was all collected and transmitted through Port 2 to the fiber spectrometer. CCD was used to observe the cells to ensure a single cell was detected at one time. L1, L2, L3, optical lens; M1, mirror. A human normal stomach epithelial cell strain GES-1 and a human stomach carcinoma cell strain NCI-N87 were used in our study. They were purchased from The Cell Bank of Type Culture Collection, Shanghai Institute of Cell Biology, Chinese Academy of

In the experiments, the scattering light intensity from a cell was low. To improve the signal contrast for collecting cellular scattering light intensity, a silicon slice with reflectivity of 30% in both visual light and near infrared (NIR) light was used as a standard reflector in the cell back-scattering spectrum analysis. When cultured cells were steady and adhered to the petric dish, the culture medium was removed. Signals from a single cell were measured with the micro-spectrometer. Because cells of the same type have similar spectrum, twenty spectra were collected for each cell strain. All of spectra data were automatically converted and stored in ASCII format by spectrometer software. 2.3. Principal component analysis Principal component analysis (PCA) is used to reduce the dimensionality of spectroscopic data. Principal components are guaranteed to be independent if the data set is jointly normally distributed. The principal components are sorted according to their scores which represent the correlativity with raw spectra data. Automatic identification can be performed by utilizing the first few component principals whose accumulative scores are more than 85%. 3. Results 3.1. Cellular scattering micro-spectra Confocal back-scattering micro-spectra for normal cell (GES-1) and cancerous cell (NCI-N87) were smoothed with 50 sampling points average (Fig. 2). In Fig. 2, curve 1 represented the normal cell spectra and curve 2 represented cancerous cell spectra. The X axis was the spectroscopic range, and Y axis was the relative intensity of the cellular scattering with the reference of silicon reflective intensity. Fig. 2 showed that normal cell spectra had regular intensity changes from 800 nm to 1000 nm range. It was due to regular cell internal structure. Although heterogeneities were present in the cellular structure, they were not significant enough to affect the peaks resulting from the cytoplasm and nuclear boundaries. However, the internal structure and biochemical changes in cancerous cells were asymmetric, resulting in irregular intensity changes in the spectra [19]. Theoretic prediction had demonstrated that there

Fig. 2. Confocal back-scattering micro-spectra of normal cells (GES-1) and cancerous cells (NCI-N87), respectively. Normal cell spectra had regular intensity changes from 800 nm to 1000 nm range while cancer cells had irregular intensity changes in the spectra.

C. Wang et al. / Optik 127 (2016) 1007–1010 Table 1 The first 9 principal components and its scores as well as accumulative scores (%). Principal component

Score

Accumulative scores

PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 PC9

81.0643 6.9240 3.7463 3.5439 2.2706 1.5338 0.7519 0.1211 0.0440

81.0643 87.9883 91.7347 95.2786 97.5492 99.0830 99.8349 99.9560 100.0000

Fig. 3. The scattering plot of the first two principal components of confocal back scattering spectra of the normal cell (GES-1) and cancerous cell (NCI-N87). Normal cells mainly distributed in the negative half shaft of PC1, while cancer cells mainly distributed in the positive half shaft of PC1.

1009

size and refractive index of these cellular nuclei can be obtained from the spectral variations of the signal and then to identify different pathological cells. Either Spatial gate or polarize gate can be chosen, in order to obtain the single back scattering from multiply scattering light (diffusive background) coming from the tissue specimen or cell. In this paper, spatial gate based on confocal theory was used to obtain single scattering signal coming from a cell and filter interference of multiply scattering. The spectral variations can be automatically identified by a number of pattern recognition algorithm, such as principal component analysis, cluster analysis, discriminant analysis as well as neural network model. To enhance our ability to resolve micro-spectra more efficiently, we evaluated Principal Component Analysis (PCA), an approach often used to reduce the dimensionality of spectroscopic data. Principal components identified by PCA definitely enable the differentiation of microspectra of back scattering by normal cells (GES-1) and cancer cells (NCI-N87). Next, we will develop software based on PCA template, therefore accomplish totally automatic cellular scattering microspectral classification and improve speed for cellular analysis. In summary, we have developed a fiber-confocal back scattering micro-spectrometer system that can measure micro-spectra from individual cells. The feature-rich spectra of the normal stomach epithelial cells and cancerous cells along with the processed spectra by smoothing method were then subjected to PCA. Scores for the first two principal components are able to distinguish the normal stomach epithelial cells and cancerous cells. This will provide an excellent analysis tool for auto-recognizing cancerous cells and powerful equipment for early cancer diagnosis. Acknowledgement

were no distinct interference peaks distributed in back-scattering spectra from abnormal cells, although much fewer irregular structures in normal cells lead no break in the interference peaks between cytoplasm and cell border [20].

The project was financially supported by the grants of Ph.D. Programs Foundation of Ministry of Education of China (no. 20123120120012)

3.2. The principal component analysis of the micro-spectra

References

Spectra acquired from the confocal back scattering micro spectrometer were smoothed, and subjected to principal component analysis function, whose function name was “princomp” in MATLIB software [21]. The principal components and their scores were obtained and shown in Table 1. The accumulative scores of the first two principal components, PC1 and PC2, were more than 85%, indicating that information included in the initial spectra were extracted. PCA biplot with samples was plotted in two dimensions using their projections onto the first two principal components for confocal back-scattering micro-spectra of normal cells (GES-1) and cancerous cells (NCI-N87) (Fig. 3). Normal cells mainly distributed in the negative half shaft of PC1, while cancer cells mainly distributed in the positive half shaft of PC1. This indicated that normal cells (GES-1) and cancer cells (NCI-N87) were identified by using the first two PCs based on confocal back-scattering microspectrometer and PCA method. 4. Discussion and conclusion Changes in the nuclei of the epithelial cells are amongst the most important indicators for dysplasia [22]. The major diagnostic criteria include nuclear enlargement, increased variation in nuclear size and morphology (pleomorphism), and increased concentration of chromatin, leading to darker staining of the nuclear material (hyperchromasia). The angular and wavelength distributions of the light scattered by a cell depend on nuclear size and refractive index. Thus, if light directly backscattered from the cell is observed, the

[1] Z. Bu, Jf. Ji, A current view of gastric cancer in China, Transl. Gastrointest. Cancer 2 (Suppl. 1) (2013) 1–4. [2] G. Zhao, E.S. Ford, I.B. Ahluwalia, et al., Prevalence and trends of receipt of cancer screenings among US women with diagnosed diabetes, J. Gen. Intern. Med. 24 (2009) 270–275. [3] J. Liu, L.P. Sun, Y.H. Gong, et al., Risk factors of precancerous gastric lesions in a population at high risk of gastric cancer, Chin. J. Cancer Res. 22 (2010) 267–273. [4] C.X. Zheng, R.S. Zheng, W.Q. Chen, et al., An analysis of incidence and mortality of stomach cancer in China, 2009, China, Cancer 22 (5) (2013) 327–332. [5] Steven L. Jacques, Fractal nature of light scattering in tissues, J. Innov. Opt. Health Sci. 4 (1) (2011) 1–7. [6] R. Drezek, M. Guillaud, T. Collier, et al., Light scattering from cervical cells throughout neoplastic progression: influence of nuclear morphology, DNA content, and chromatin texture, J. Biomed. Opt. 8 (1) (2003) 7–16. [7] J.G. Fujimoto, Optical coherence tomography for ultrahigh resolution in vivo imaging, Nat. Biotechnol. 21 (2003) 1361–1367. [8] L. Lovat, S. Bown, Elastic scattering spectroscopy for detection of dysplasia in Barrett’s esophagus, Gastrointest. Endosc. Clin. N. Am. 14 (3) (2004) 507–517. [9] James Pawley, Handbook of Biological Confocal Microscopy, Springer, Madison, Wisconsin, 2010. [10] I. Georgakoudi, et al., Reflectance and light-scattering spectroscopy for evaluating dysplasia in patients with Barrett’s esophagus, Gastroenterology 120 (2001) 1620–1629. [11] Tao T. Wu, Jianan Y. Qu, Assessment of the relative contribution of cellular components to the acetowhitening effect in cell culture and suspensions using elastic light-scattering spectroscopy, Appl. Opt. 46 (21/20) (2007) 4834–4842. [12] Y. Liu, X. Li, Y.L. Kim, V. Backman, Elastic backscattering spectroscopic microscopy, Opt. Lett. 30 (18) (2005) 2445–2447. [13] H. Fang, L. Qiu, V. Edward, et al., Confocal light absorption and scattering spectroscopic microscopy, Appl. Opt. 46 (10) (2007) 1760–1769. [14] Irving Itzkan, L. Qiu, V. Edward, et al., Confocal light absorption and scattering spectroscopic microscopy monitors organelles in live cells with no exogenous labels, PNAS 103 (44) (2007) 17255–17260. [15] Nandakishore Kambhatla, Todd K. Leen, Dimension reduction by local principal component analysis, Neural Comput. 9 (7) (1997) 1493–1516. [16] D.A. Watson, L.O. Brown, D.F. Gaskill, et al., A flow cytometer for the measurement of Raman spectra, Cytometry A 73 (2) (2008) 119–128.

1010

C. Wang et al. / Optik 127 (2016) 1007–1010

[17] W. Yan, J. Zhang, J. Zhao, et al., Spectrum identification of hepatitis tongue diagnosis based on principal component analysis and BP neural network, J. Tianjin Univ. 4 (2011) 001–005. [18] C. Wang, X.D. Guo, et al., Study of back-scattering micro-spectrum for stomach cells at single cell scale, J. Biomed. Opt. 15 (4) (2010) 040505-1–040505-3. [19] C. Wang, B.Y. Fang, W.J. Cai, et al., Fiber confocal back-scattering microspectrum in single biology cellular scale, Chin. J. Laser 36 (10) (2009) 2636–2641.

[20] R. Drezek, D. Andrew, R.R. Kortum, A pulsed finited-difference time-domain (FDTD) method for calculating light scattering from biological cells over broad wavelength ranges, Opt. Express 6 (7) (2006) 147–156. [21] L.J.P. van der Maaten, E.O. Postma, H.J. van den Herik, Matlab Toolbox for Dimensionality Reduction, MICC, Maastricht University, Maastricht, 2007. [22] C.W. Boone, et al., Quantitiative grading of rat esophageal carcinogenesis using computer-assisted imaging tile analysis, Cancer Epidemiol. Biomarkers Prev. 9 (2000) 495–500.