Two dimensional correlation analysis is a mathematical technique that is used to study changes in measured signals. As mostly spectroscopic signals are discussed, sometime also two dimensional correlation spectroscopy is used and refers to the same technique.
In 2D correlation analysis, a sample is subjected to an external perturbation while all other parameters of the system are kept at the same value. This perturbation can be a systematic and controlled change in temperature, pressure, pH, chemical composition of the system, or even time after a catalyst was added to a chemical mixture. As a result of the controlled change (the perturbation), the system will undergo variations which are measured by a chemical or physical detection method. The measured signals or spectra will show systematic variations that are processed with 2D correlation analysis for interpretation.
When one considers spectra that consist of few bands, it is quite obvious to determine which bands are subject to a changing intensity. Such a changing intensity can be caused for example by chemical reactions. However, the interpretation of the measured signal becomes more tricky when spectra are complex and bands are heavily overlapping. Two dimensional correlation analysis allows one to determine at which positions in such a measured signal there is a systematic change in a peak, either continuous rising or drop in intensity. 2D correlation analysis results in two complementary signals, which referred to as the 2D synchronous and 2D asynchronous spectrum. These signals allow amongst others[1] [2] [3]
2D correlation analysis originated from 2D NMR spectroscopy. Isao Noda developed perturbation based 2D spectroscopy in the 1980s. This technique required sinusoidal perturbations to the chemical system under investigation. This specific type of the applied perturbation severely limited its possible applications. Following research done by several groups of scientists, perturbation based 2D spectroscopy could be developed to a more extended and generalized broader base. Since the development of generalized 2D correlation analysis in 1993 based on Fourier transformation of the data, 2D correlation analysis gained widespread use. Alternative techniques that were simpler to calculate, for example the disrelation spectrum, were also developed simultaneously. Because of its computational efficiency and simplicity, the Hilbert transform is nowadays used for the calculation of the 2D spectra. To date, 2D correlation analysis is used for the interpretation of many types of spectroscopic data (including XRF, UV/VIS spectroscopy, fluorescence, infrared, and Raman spectra), although its application is not limited to spectroscopy.
2D correlation analysis is frequently used for its main advantage: increasing the spectral resolution by spreading overlapping peaks over two dimensions and as a result simplification of the interpretation of one-dimensional spectra that are otherwise visually indistinguishable from each other. Further advantages are its ease of application and the possibility to make the distinction between band shifts and band overlap.[3] Each type of spectral event, band shifting, overlapping bands of which the intensity changes in the opposite direction, band broadening, baseline change, etc. has a particular 2D pattern. See also the figure with the original dataset on the right and the corresponding 2D spectrum in the figure below.
2D synchronous and asynchronous spectra are basically 3D-datasets and are generally represented by contour plots. X- and y-axes are identical to the x-axis of the original dataset, whereas the different contours represent the magnitude of correlation between the spectral intensities. The 2D synchronous spectrum is symmetric relative to the main diagonal. The main diagonal thus contains positive peaks. As the peaks at (x,y) in the 2D synchronous spectrum are a measure for the correlation between the intensity changes at x and y in the original data, these main diagonal peaks are also called autopeaks and the main diagonal signal is referred to as autocorrelation signal. The off-diagonal cross-peaks can be either positive or negative. On the other hand, the asynchronous spectrum is asymmetric and never has peaks on the main diagonal.
Generally contour plots of 2D spectra are oriented with rising axes from left to right and top to down. Other orientations are possible, but interpretation has to be adapted accordingly.[4]
Suppose the original dataset D contains the n spectra in rows. The signals of the original dataset are generally preprocessed. The original spectra are compared to a reference spectrum. By subtracting a reference spectrum, often the average spectrum of the dataset, so called dynamic spectra are calculated which form the corresponding dynamic dataset E. The presence and interpretation may be dependent on the choice of reference spectrum. The equations below are valid for equally spaced measurements of the perturbation.
A 2D synchronous spectrum expresses the similarity between spectral of the data in the original dataset. In generalized 2D correlation spectroscopy this is mathematically expressed as covariance (or correlation).[5]
\phi(\nu1,\nu2)=
1 | |
n-1 |
T(\nu | |
y | |
1) |
.y(\nu2)
where:
Orthogonal spectra to the dynamic dataset E are obtained with the Hilbert-transform:
\psi(\nu1,\nu2)=
1 | |
n-1 |
T(\nu | |
y | |
1) |
.N.y(\nu2)
where:
The values of N, Nj, k are determined as follows:
1 | |
\pi(k-j) |
where:
Interpretation of two-dimensional correlation spectra can be considered to consist of several stages.[6]
As real measurement signals contain a certain level of noise, the derived 2D spectra are influenced and degraded with substantial higher amounts of noise. Hence, interpretation begins with studying the autocorrelation spectrum on the main diagonal of the 2D synchronous spectrum. In the 2D synchronous main diagonal signal on the right 4 peaks are visible at 10, 20, 30, and 40 (see also the 4 corresponding positive autopeaks in the 2D synchronous spectrum on the right). This indicates that in the original dataset 4 peaks of changing intensity are present. The intensity of peaks on the autocorrelation spectrum are directly proportional to the relative importance of the intensity change in the original spectra. Hence, if an intense band is present at position x, it is very likely that a true intensity change is occurring and the peak is not due to noise.
Additional techniques help to filter the peaks that can be seen in the 2D synchronous and asynchronous spectra.[7]
It is not always possible to unequivocally determine the direction of intensity change, such as is for example the case for highly overlapping signals next to each other and of which the intensity changes in the opposite direction. This is where the off diagonal peaks in the synchronous 2D spectrum are used for:
As can be seen in the 2D synchronous spectrum on the right, the intensity changes of the peaks at 10 and 30 are related and the intensity of the peak at 10 and 30 changes in the opposite direction (negative cross-peak at (10,30)). The same is true for the peaks at 20 and 40.
Most importantly, with the sequential order rules, also referred to as Noda's rules, the sequence of the intensity changes can be determined.[6] By carefully interpreting the signs of the 2D synchronous and asynchronous cross peaks with the following rules, the sequence of spectral events during the experiment can be determined:
where x and y are the positions on the x-xaxis of two bands in the original data that are subject to intensity changes.
Following the rules above. It can be derived that the changes at 10 and 30 occur simultaneously and the changes in intensity at 20 and 40 occur simultaneously as well. Because of the positive asynchronous cross-peak at (10, 20), the changes at 10 and 30 (predominantly) occur before the intensity changes at 20 and 40.
In some cases the Noda rules cannot be so readily implied, predominately when spectral features are not caused by simple intensity variations. This may occur when band shifts occur, or when a very erratic intensity variation is present in a given frequency range.