An information theory of chromatography

An information theory of chromatography

462 (1989) 13-21 Elsevier Science Publishers B.V., Amsterdam Journal of Chromatography, Printed in The Netherlands CHROM. 20 990 AN INFORMATION I. ...

577KB Sizes 4 Downloads 93 Views

462 (1989) 13-21 Elsevier Science Publishers B.V., Amsterdam Journal of Chromatography,

Printed in The Netherlands

CHROM. 20 990

AN INFORMATION I. EVALUATION

THEORY

OF CHROMATOGRAPHY

OF ANALYTICAL

RIEKO MATSUDA, YUZURU

SYSTEMS BY MEANS OF FUMZ

HAYASHI*, MUM10 ISHIBASHI and YASUSHI TAKEDA

Division qf Drugs, National Institute of Hygienic Sciences, I& I, Kamiyoga

I-Chome,

Setagaya-ku,

Tokyo 158

(Japan) (First received April 7th, 1988; revised manuscript received September 19th, 1988)

SUMMARY

The application of the function of mutual information (FUME) to the logical evaluation of methods for chromatographic quantitation is described. FUMlprovides the Shannon mutual information of overlapped peaks with various resolutions. Hence an analytical method can be evaluated logically based on both the amount of information transmitted by overlapped peaks and the observation period of chromatography. As an example of the evaluation, a previously proposed chromatographic analysis consisting in a rapid but incomplete separation of naphthalene and diphenyl, and peak-deconvolution based on the Kalman filter, is considered. The “best” chromatogram that can transmit the maximal mutual information in unit time is given.

INTRODUCTION

A major aim in analytical chemistry is to elaborate or search for a method through which more information can be effectively transmitted from analytes (Sz) of interest. For high-performance liquid chromatography (HPLC), efforts are made towards the development of column structure, elution conditions, etc., and also data processing of signals. The development of methods often depends on trial and error, and may result from chance. Such new methods are evaluated quantitatively, even if empirically, by some chemometric strategies. A more desirable aim, however, is strict evaluation on a theoretical basis, without recourse to experience. The aid of information theory is desirable for the above-mentioned purpose. The information on analytes Q is quantitatively described as the Shannon information AQ] (ref. 1). In general, it is through observation and data processing of the raw data Y that we can actualy acquire knowledge about the analytes St. The total amount of information I[01 involved originally in the samples, however, cannot be obtained from the data y because of inevitable noise contamination, baseline drift, interference, etc., in the measurement process. Of more analytical importance is the available information called the mutual information I[W; fl between the analytes D and the data Y (ref. 0021-9673/89/$03.50

0

1989 Elsevier Science Publishers B.V.

R. MATSUDA

14

et al.

1). Excess peak overlap, for example, which often arises in rapid chromatography, induces a critical loss of the mutual information I[Q; Yj and makes ambiguous our knowledge about the samples of interest. The amount of mutual information I[@ Y], therefore, should underlie the logical evaluation of the whole analytical system. The simplefunction of mutual information (FUZZ), which describes the mutual information of chromatography, has recently been proposed’. The derivation is based on information theory and the theory of the Kalman filter*. Given the numerically expressed shape of each chromatographic peak and the noise level in the measurement process, FUMZ can provide mutual information about overlapped chromatographic peaks with various resolutions. The whole analytical system, therefore, can be evaluated by taking into account both the observation period of the chromatography and the information loss caused by the peak overlap. The best method is defined as one with the maximal flow of mutual information under the experinental conditions adopted, i.e., one that can transmit the largest amount of information in unit time. Our aim is to apply FUMZ to the logical evaluation of an HPLC method for quantitation. The method adopted here as an example concerns a rapid but incomplete HPLC separation of naphthalene and diphenyl and the mathematical processing of the overlapped peaks by means of the Kalman filter3. We examine the problems of whether or not the data processing of the Kalman filter can outweigh the information loss caused by the peak overlap in rapid chromatography and whether the whole system can provide the maximal information flow. Logical optimization of overlapped chromatograms is treated in Part II”. THEORETICAL

We give a brief review of FUMZ and some additions for its rational utilization. An approximate function Pi plays an important role in the derivation of FUA4Z2. Pl is derived from the strict Pk {the error variance involved in the Kalman filter algorithm) by mathematical induction’. It has the further theoretical importance that the correlation between the Kalman filter and the linear least-squares method can be elucidated clearly by Pl (ref. 2). For a single peak, FUMI represents the mutual information that we can collect through the filtering of the raw data ranging from a data point. i = 1 to k (k = 1, . . _, hy:

FUMZ

= -

; log (~1)

(1)

where Fi denotes the signal intensity of a peak at a data point k and mc is the variance of the contaminating noise (= constant). We see that larger peaks provide more information and that FUMZ increases as a new signal Fk+l appears, but never decreases. The mutual information for q peaks partially overlapped is given at the last

INFORMATION

THEORY

UF CHROMATOGRAPHY

15

1.

point N (the observation period of the chromatogram) sum of the individual peak information*:

and is simply described as the

where [i = k,(j) f 1, k&j’)]d enotes the region where the signals Fi(i) of the jth peak contribute to FUMI. The cutoff point k,(/) is specified to be the point where the signal FJ’j) first gains predominance over the noise PC. The filtering-off point k,G) is defined as the cutoff point k,(j + 1) of the following peak j + 1. Without peak overlap, the overall shape of every peak contributes to FUMZ. If two peaks weakly overlap, then the virtual peak lacking the tailing edge after k,(l) [ = k,(2)] is input in FUMI. The efficiency of the chromatograms is given as the mutual information in unit time:

ML Nl =

FUMI

(4)

7

This function denotes the averaged flow of mutual information through the chromatography and the tiltering of the whole data sequence FI--FN [IV = k,(g)]. For convenience to chromatographers, we shall describe the mutual information in terms of the relative standard deviation (R.S.D.) of the filtering error P:. The error variance PI is derived from the assumption that the amount (or concentration) indicated by the signals Fk of the peak is unity. Hence the R.S.D. of the error at a point k is given by R.S.D+ = (PI)“* . 100

(5)

R.S.D+ can be described by the minimum error Pf,i,. Let Pi be &2P~in: R.S.D.k = SI(PfniJl”

100

(6)

The coefficient a denotes the ratio R.S.D+ a = R.S.D.,i, The minimum error R.S.D.,i, or P&incorresponds to the maximum information I,,, (see eqn. 1). Usually, the observation period N is specified to be wide compared with the peak region and taken as a point that can give the maximum information (Imax = In). The information loss 61 is defined as

= -

1lOg(Pfnin) + t lOg(P~ina')

= log a

(9)

R. MATSUDA

16

where Ik denotes FUMI at k. The R.S.D. ratio @is given by the information a =

exp(6fl

et al.

loss: (11)

The filtering error R.S.D+ is described in the intelligible form R.S.D+ = exp(bZ) . exp(-

I,,,)

* 100

(12)

(see eqns. 1 and 6). R.S.D+ and 61denote the two measures of the excess error or the lost information when the data processing is stopped at k. For multi-component chromatograms, let us consider the information loss or the filtering error, which depends on the degree of peak overlap, the whole data sequence is analysed. We give a convenient method covering the worst condition that the loss is concentrated on a peak: only two peaks overlap. For simplicity, it is assumed that all the peaks move along the time scale (k) of the chromatogram without any changes in shape. The maximal information I,,, will be picked up from a chromatogram with peaks sufficiently or even well separated from each other. The error in the worst case is easily estimated by the information loss. The mean of ImaXis defined as (13) According to eqn. 12, the worst error is given by the information (R.S.D.)

= exp(SI) . exp( - (I,,,))

. 100

loss 61: (14)

We can easily obtain, using eqn. 14, an estimate of the error (R.S.D.) concentrated on a peak. If 61= 0, then the error of the filtering is equal to the minimum error and may be completely negligible, indicating that the selected experimental conditions are optimal. If 61 = 1, then the filtering error is e-fold more than the minimum error. It should be noted that R.S.D.k, (R.S.D.), PI, etc., are not referred to the total HPLC error involving the elution process, detection process, etc., but depend only on the error in the Kalman filtering’. EXPERIMENTAL

All the programs were written in BASIC. A PC-9801 VX desk-top computer (NEC) equipped with an Intel-80286 compatible CPU (8 MHz), a 640-kbyte RAM and two 5-in. floppy disk drives was used. The cutoff point k,(i) was specified to be the point of the fronting edge with 0.5% signal of the peak maximum. The chromatographic experiments were performed on a Model 655A- 11 liquid chromatograph system (Hitachi) with an Inertsil ODS column (250 mm x 4.6 mm I.D.) (Gasukuro Kogyo). The details have been described previously3. RESULTS

AND DISCUSSION

A chromatogram of naphthalene and diphenyl is shown in Fig. 1. The overlapped peaks have an apparent ratio of the height of the valley to that of the mean

INFORMATION

0

THEORY

OF CHROMATOGRAPHY.

I7

400

200

data

1.

point

Fig. 1. Chromatogram of a mixture of naphthalene (leading) and diphenyl (trailing). The abscissa denotes the number of the data acquired by an analogue-to-digital converter at 200.ms intervals, 420 s after injection. For details, see ref. 3

A

B

-L A’

4-----P-------

/

/

I

I

/

.-----

/ .

$.1

A”

Es”

I

‘L 0

200 data

400 point

0

200 data

400 point

Fig. 2. Chromatograms of (A) naphthalene and (B) diphenyl. (A’ and B’) the time (k) courses of the mutual information and (A” and B”) the filtering error. (A) I,,,,, = 9.68; (B) I,,,,, = 9.46. -, X, = 1; ~~~~, X, = IO; -. -. -, X, = 100. Experimental conditions as in Fig. 1.

R. MATSUDA et al.

18

peak maximum of ca. 40% and were analysed by the Kalman filter3. The analytical system was concluded to be satisfactory: the observed total error, obtained from five expriments, was less than 0.7% (R.S.D.) for both peaks3. The above system was originally evaluated quantitatively but empirically3; it has now been evaluated again but deductively based on FUMI. Fig. 2 shows the individual peaks of (A) naphthalene and (B) diphenyl in Fig. 1, (A'and B’) the time (k) course of FUMZ and the (A” and B”) R.S.D+ of the filtering error. As the observation proceeds, the mutual information increases and reaches a maximum around the peak maximum. The filtering error R.S.D., displays the opposite behaviour to FUMZ. In other words, our knowledge about the samples increases greatly or becomes more precise through observation and analysis until the vicinity of the peak maxima. The slight waving in the early region of FUME for the

A

20

--__-_-

2 2

/ 10

ii B

0.04 w cl 0.02

0 C

* :: 1% *I

5

0

0

200

400

Fig. 3. Influence of the peak overlap on (A) FliMI, (B) 1,[1. iVl and (C) (R.S.D.K) for a mixture of and diphenyl. The abscissa denotes the cutoff point k,(2) of the diphenyi peak. I,,, = 19.14 (= Z,, of naphthalene + I,,, of diphenyl). -, X, = 1: ---,X, = IO: -. X, = 100. N = k,(2) + 282.

naphthahe

INFORMATION

THEORY

OF CHROMATOGRAPHY.

I.

19

naphthalene peak seems to come from the conspicuous fronting of the peak3. We can see that after the peak maxima, no further appreciable amount of information can be obtained from the Kalman filtering of the chromatograms. This suggests that the most efficient chromatogram must consist of overlapped peaks, and not only baseline separation. Let us consider the relationship between the mutual information and the degree of peak overlap. In Fig. 3, (A) FUME, (B) the efficiency Z,[l, N] and (C) (R.S.D.) for the overlapped peaks are plotted as a function of the cutoff point k,(2) of the trailing peak. It is assumed that the trailing peak moves the time scale k without any changes in shape; the position of the leading peak is fixed. As the peaks are increasingly separated, the efftciency increases, reaches a maximum and decreases hyperbolically. If the peaks overlap completely [k,(l) = k,(2)], then no information can be obtained and the efficiency takes the least value. The hyperbolic decrease in Z,[l, N] is due to the saturation of the mutual information for both peaks. It should be noted that the last peak gives the saturated or maximal information according to our definition. The most efficient chromatogram with the maximum of Z,[ 1, IVj is simulated in Fig. 4C. The input signals FT (= Fi/‘Xs) is lOO-fold smaller than the actual signals Fib The peak suppression (Xs = 100) means that Zn[l , N] allows for the “unexpected” small peaks (for theoretical meanings, see below). This chromatogram is the “best” in that it can transmit the mutual information in the most efficient way through observation and filtering. In other words, the flow of information through the chromatogram is maximal. Fortuitously, it corresponds to the actual chromatogram shown in Fig. 1. The total errors of the small peaks (6% concentration) of naphthalene and diphenyl

I

I

I

I

A

I

I

1

:

0

500 data

point

0

500 data

point

.

J

0

500 data

point

Fig. 4. Simulated chromatograms with various amounts of mutual information. (A) Peak intervalp = 19, information loss SZ = 3.66. R.S.D.,,, = 0.27%. period of the chromatogram N = 436: (B) p = 61, &Z= 0.60, R.S.D.,i, = 0.013%. N = 478: (C)p = 73,6Z = 0.25, R.S.D.,, = 0.009%, N = 490; @)p = 119, 61 = 0.006, RL~D.,,,~, = 0.007%. N = 536; (E)p = 269. SI = 0.000, R.S.D.,,, = 0.007, IV = 686.

20

R. MATSUDA

et al.

were reported to be ea. 4% and 11% (R.S.D.), respectively, and larger than those for the 100% materials shown in Fig. 2A and B3. This result indicates the necessity to use the suppression factor X, in evaluating or designing versatile determination (see Mow). Some chromatograms with their own information are also shown in Fig. 4. For the more strongly overlapped peaks (A and B), the observation time is relatively short, but sufftcient information cannot always be obtained. For the chromatograms with weakly overlapped peaks (D and E) the opposite situation applies. FUMZand Za[1, k] have been derived on the assumption that the data processing involved in the calculation of the mutual information is the one-dimensional Kalman filter for peak resolution 5s6. If another type of data handling with inferior peakresolving power such as the commonly used perpendicular dropping is utilized, then the chromatograms with the weak peak overlap (D or E) would be the most efficient. The bias for the estimates provided by the common technique was shown to be not less than 10% for the overlapped peaks. whereas the Kalman filter gave a ca. 0.2% bias3. The superior data processing with the Kalman filter can provide a faster flow of mutual information than the commonly used technique, The information loss 61 is very important as it can serve to solve the problem of whether or not the rapidity of the chromatography shown in Fig. 1 can outweigh the incomplete peak separation in the elution process”. The loss blin Fig. 1 is shown to be negligibly small in comparison with the total HPLC error in the following way. The information loss of the leading peak is 0.25 and the corresponding filtering error (R.S.D.) is 0.009% for a 100% concentration. These values for the trailing peak are the most favourable. On the other hand, the total error in the whole HPLC system covering the elution, detection and filtering was observed to be ca. 0.7% (R.S.D.) for the overlapped peaks: the error or reproducibility of the whole system, measured with a solution of diphenyl, was 0.24%3. The filtering error is far smaller than the observed total HPLC error; the information loss 61 (= 3.538) corresponding to the HPLC error ( = 0.24%) is far larger than fil for the filtering ( = 0.25). We therefore conclude that the rapid analysis design comprising Kalman filtering of the overlapped peaks shown in Fig. 1 is excellent if the overall HPLC errors are acceptable. When the suppressed signals F? are input in FC’MZ, the values of the mutual information and the efficiency function decrease (see Figs. 2A’, 2B’ and 3B). The position of the efficiency maximum increases slightly with increasing X, (see Fig. 3B) and the degrees of peak overlap of the best chromatograms are varied accordingly. The factor X, seems to remain arbitrary in the function Zk. but is closely related to the linearity of the filter involved in FUMZ and of the signals of HPLC systems’. X, can be determined as characteristic of a particular HPLC system used. Incomplete linearity of HPLC signals has been observed in special cases and interfered with the successful application of the linear filter ” . This. is the case for FUMZ. If the HPLC signals completely satisfied the linearity postulate. there would be no need for the above consideration and Fig. 4B (X, = 1) would be given as the best in the ideal HPLC system. The factor X,, therefore, was introduced to bridge the gap between theory and practice, and should bc specified according to the HPLC linearity observance2. The necessity to use X, in proposing the best chromatogram (Fig. 4C) can be interpreted by the same situation: the small peaks that would give poor precision should be appropriately separated in the chromatogram for a successful determination. The relationship between the suppression factor X, and the peak-resolving powers of the one-dimensional Kalman filter for peak resolution was described previously’.

INFORMATION

THEORY

OF CHROMATOGRAPHY.

1

21

The best chromatogram presented here holds true for the adopted experimental conditions involving linear Kalman filtering and an HPLC system with limited linearity. Its elution pattern may vary according to the conditions adopted. If the analysis is carried out or designed under more ideal situations, then more complicated chromatograms will be provided as the best (see above). The experimental design should be performed so that the mutual information may be maximal or most efficiently transferred to real situations. CONCLUSION

The overall analytical system shown in Fig. 1 has been logically evaluated with the aid of FUMIand concluded to provide the maximal information flow. Information theory and the Kalman filter underlie the procedure proposed here and no experience has been utilized. FUMI is a simple, quantitative description of the Shannon mutual information in chromatography and will be useful in various areas of analytical chemistry. REFERENCES I 2 3 4 5 6

S. Y. Y. R. Y. Y.

Arimoto, Hayashi, Hayashi, Matsuda, Hayashi, Hayashi,

Information Theory, Kyoritsu Shuppan; Tokyo, 1976. S. Yoshioka and Y. Takeda. submitted for publication. T. Shibazaki. R. Matsuda and M. Uchiyama, J. Chromatogr., 407 (1987) 59. Y. Hayashi, M. Ishibashi and Y. Takeda, J Chromatogr., 462 (1989) 23. S. Yoshioka and Y. Take&. Anal. C&m. Acta. 212 (1988) 81. R. Matsuda. S. Yoshioka and Y. Takeda. Anal. C’him. A&a. 209 (1988) 45.