| Online Publication | Bioinformatics | Research Article | http://werner.yellowcouch.org/Papers/maldiart/ |
Werner Van Belle1* - werner@yellowcouch.org, werner.van.belle@gmail.com
Olav Mjaavatten2
1- Bioinformatics Group; Norut IT; Research Park; 9294 Tromsų; Norway
2- Proteomic Unit (Probe); University of Bergen; ; Bergen; Norway
* Corresponding author
Abstract: MALDI-TOF mass spectrometry is a well known and widely used technique to fingerprint and sequence proteins. A carefull investigation of the mass spectra output from unnamed machines shows a number of artefacts produced by the machines themselves. Because these artefacts complicate a number of procedures we present a number of preliminary techniques we developed to get rid of most of the artefacts.
Keywords: matrix assisten laser desorption ionisation MALDI time of flight TOF artefacts noise
Reference: Werner Van Belle, Olav Mjaavatten; Artefacts in the Mass Spectra Output from MALDI-TOF and MALDI-TOF/TOF Machines; Proceeding of the VIIth International Symposium of the Protein Society section proteomics, interactomics and protein networks; April 2005
Files: Maldi2004.pdf, Maldi2004.ps.gz
In MALDI-TOF (Matrix assistend laser desorption ionisation) a sample
is mixed with a matrix. When this mixture dries it forms crystals.
When such a crystalized mixture is targeted with a high energy laser
beam with the correct wavelength, the matrix itself will suddenly
absorb the incoming energy and heat up. This rapid heating causes
sublimation of the matrix and subsequent expansian of the molecules
co-crystalised within the matrix. The ions are then accelerated using
a strong electrical field and thus seperated based on their
ratio. The ions can then be detected at the end of the tube, or
reflected
and then be detected. This (optional) reflection phase increases the
accuracy of the technique substantially.
In a typical proteomics setup a mass spectrogram is taken, the peaks are selected and then used to fingerprint proteins. Some machines offer the possibility to use an advanced lift system which makes it possible to measure the mass of the (poly)peptides within a larger frament of a specific weight. This makes sequencing of proteins possible.
We performed a number of measurements on differnet mass spectrometers. Surprisingly, the output from these machines contains a number of artefacts, which were also present on machines located at other sites, such as the Flemish Biotechnology Centre and freely published online spectra.
We believe that these artefacts complicate a number of possible uses of those machines

The first experiment concerns the typical fingerprinting of a protein. In this experiment the reflection mode was turned on. The mass spectrum output consist of 158548 samples between 100.003 and 4019.170 Da. The window size of the SFFT is 2048 samples, which forms a good compromis between frequency-accuracy and position accuracy. In all the figures we present, both the m/z axis and the energy axis have been normalized. The frequency analsyis has also been normalized and is shown in dB.
This experiment
(figure
) clearly shows

In a second experiment we measured the lift of a peak using a
Maldi-TOF/TOF
machine. The mixture contained a proteinfragement which was to be
sequenced. The output from the machine ranges from 20.067 till
1264.626,
in 67873 samples. Again, the m/z, energy and frequency content are
all three normalized. The frequency analysis (figure
)
shows


In a third
experiment we measured the pure noise output of a Maldi
machine in lineair mode. The output shown in figures
and
covers 110296 samples between 40 kDa and 80kDa. During
the experiment, the laser was switched off, as such we measure only
the noise generated by the machine. The artefacts we now observed
were even more interesting then the previous ones.
To investigate the feasibility to obtain more data out of the spectra, we created a number of denoising and enhancing techniques which we briefly present below.
The first step is to remove the energy overhead in the measurments.
This is done by removing the baseline of the spectrum using a specific
filter technique. The result is shown in figure
.
In order to denoise the data we first tried the creation of a number
of digital notch filters. Because we don't want to shift the peaks
back or forth in time, such a filter was required to have a zero-phase
response over its entire spectrum. Also the impulse response of the
filter needed to be as small as possible because we did not want to
broaden the peaks, nor introduce unwelcome echos. A number of small
experiments indicates that the results of such a filter would not
be so very good. It became also clear that the chirp could not easily
be removed by such a time independent filter. Therefore we created
another technique of which you see the result in figure
.
A local closeup of the denoised data (figure
)
shows how the peaks are located at the same places, but now allow
for fully automatic detection (certainly
if you look at the
SFFT of the data), which makes its very attractive in high trhroughput
proteomics.
The accuracy of the algorithm we created is extremely high. It will retain position information exact. However the resolution of lower peaks wil be a little bit less than the higher peaks. This however should not form a problem because these peaks are still well differentiated. As can be seen in the previous pictures, accuracies far below 0.1 dalton can be achieved for smaller peaks.
Another experiment we performed was data enhancement of a linear mode
mass spectrum. The mass spectrum we present is the output from a sample
containing the cell-lysaat of Hela-cells. Clearly it is a relatively
bad sample to put into maldi heavy mass lineair mode. Not only are
these heavy masses difficult to get suspended, but also because the
noise level might suffocate what we actually want to measure. Figure
shows how data enhancing helps in filtering out
the noise.
The result of the
algorithm on a standard protein mixture is shown
in figure
. Important here is that certain peaks
which would normally not be selected if we simply look at the highest
value now show up. Whether some of these new peaks are important might
be interesting to investigate.
A phenomenon often used to detect important peaks is the fact that
isotopes will weigh different. For every ionised similar fragemetn
we will sometimes measure x dalton, sometimes we might measure x+1
dalton (if there is one neutron more), and so on. This knowledge can
be used to automatically detect important peaks as shown in figure
. The visualised graph is the autocorrelation graph
which
mainly measures whether a peak has 'echos'. If it has echos, then
it probably is a series of peaks of the same fragment.
In a simila way, if we measure the autocorrelation of the enhanced lineair mode experiment, then we clearly see vertical bands. Very likely the content of every band will allow us to detect which bands are important. However, this is merely an educated guess.
We have presented a number of artefacts we have encountered in Maldi Tof and Maldi Tof/Tof machines. These are
| http://werner.yellowcouch.org/ mailto:werner@yellowcouch.org | ![]() |