3. Learning and intelligent image and signal analysis (LIISA)

3.1 Adaptive image analysis

Abstract

This project utilised neural compution in image analysis problems in which the speed requirements are demanding, the objects are difficult to separate from the background, and the shapes to be recognized are irregular. Applications included the fault analysis of running paper web and optical character recognition of hand-written characters. The intention was to lift the automation level in such a way that shape-based object recognition become possible by learning, with both the separation of the the desired objects from the background and the recognition of the desired irregular shapes being learnt at the same time.

Results

The project has been commercially successful in that ABB Drives has started both an independent research project and a product development project based on the achieved results. The idea is to commercialize at least some of the results. In a scientific sense the project has also been successful and two Doctors of Technology have been produced, Jorma Laaksonen and Jukka Iivarinen. In addition, one Licenciate of Technology (Jukka Iivarinen) and two Master degrees (Katriina Heikkinen, Jarmo Hurri) have been awarded and 13 publications have been produced based on the project.

Project information

Participants

ABB Drives, Helsinki University of Technology, Tampere University of Technology, Joensuu University, Helsinki University Rolf Nevanlinna Institute and Infomarket were involved in the project during the first project year. At the beginning of the second project year the project was refocused on fault detection in running paper web and ABB Drives, Helsinki University of Technology, Tampere University of Technology were the only participants involved.

Project dates

1.3.1995 – 28.2.1998

Project volume

FIM 1.620.000

Project manager

Professor Ari Visa, Dr.Tech.
Lappeenranta University of Technology
Department of Information Technology
P.O. Box 20, FIN-53851 LAPPEENRANTA, FINLAND

Publications

Iivarinen, J., Rauhamaa, J., Visa, A., Unsupervised Segmentation of Surface Defects, in Proc. of 13th IAPR International Conference on Pattern Recognition, Vienna, Austria, August 26-29, pp. 356-360, 1996.

Iivarinen, J., Visa, A., Shape Recognition of Irregular Objects, in D. P. Casasent (Ed.), Intelligent Robots and Computer Vision XV: Algorithms, Techniques, Active vision, and Materials Handling, Proc. SPIE 2904, pp. 25-32, 1996.

Iivarinen, J., Rauhamaa, J., Visa, A., An Adaptive Approach to Segmentation of Surface Defects, Report A34 , Helsinki University of Technology, Faculty of Information Technology,  Laboratory of Computer and Information Science, March, 15 p., 1996.

Iivarinen, J., Rauhamaa, J., Visa, A., An Adaptive Two-Stage Approach to Classification of Surface Defects, in Proc. of the 10th Scandinavian Conference on Image Analysis, Lappeenranta, Finland, June 9-11, pp. 317-322, 1997.

Iivarinen, J., Peura, M., Särelä, J., Visa, A., Comparison of Combined Shape Descriptors for Irregular Objects, in Proc. of the 8th British Machine Vision Conference, University of Essex, UK, September 8-11, vol. 2, pp. 430-439, 1997.

Iivarinen, J., Visa, A., An Adaptive Texture and Shape Defect Classification, in Proc. of 14th IAPR International Conference on Pattern Recognition, Brisbane, Australia, August 16-20, vol. I, pp. 117-122, 1998.

Iivarinen, J., Visa, A., Unsupervised Image Segmentation with The Self-Organizing Map and Statistical Methods, in D. P. Casasent (Ed.), Intelligent Robots and Computer Vision XVII: Algorithms, Techniques, and Active vision, Proc. SPIE 3522, pp. 516-526, 1998.

Iivarinen, J., Rauhamaa, J., Surface Inspection of Web Materials Using The Self-Organizing Map, in D. P. Casasent (Ed.), Intelligent Robots and Computer Vision XVII: Algorithms, Techniques, and Active vision, Proc. SPIE 3522, pp. 96-103, 1998.

Heikkinen, K., Vuorimaa, P., Hardware Architecture of the SOM for Real-Time Applications, in Proc. Of the 5th European Congress on Intelligent Techniques and Soft Computing, vol. III, pp 2495-2499, Aachen, Germany, September 8-11, 1997.

Heikkinen K., Vuorimaa, P., Computation of two texture features in hardware, in proc. of the International Conference on Acoustics, Speech, and Signal Processing, pp. xxx-xxx, Phoenix, Arizona, USA, March 15-19, 1999.

Holmström, L., Koistinen, P., Laaksonen, J., Oja, E., Neural Network and Statistical Perspectives of Classification, in Proc. 13th International Conference on Pattern Recognition, Vienna, Austria, August 25-30, vol IV, pp 286-290, 1996.

Holmström, L., Koistinen, P., Laaksonen, J., Oja, E., Comparison of Neural and Statistical Classifiers – Theory and two Practice, University of Helsinki, Rolf Nevanlinna Institute, Research Reports A13, 37 p. 1996.

Holmström, L., Koistinen, P., Laaksonen, J., Oja, E., Neural and Statistical Classifiers – Taxonomy and two Case Studies, IEEE Transactions on Neural Networks, 8:5-17, 1997.

Laaksonen, J., Oja, E., Classification with Learning k-Nearest Neighbors, in Proc. International Conference on Neural Networks (ICNN’96), Washington DC, USA, June 3-6, vol. 3, pp. 1480-1483, 1996.

Laaksonen, J., Oja, E., Subspace Dimension Selection and Averaged Learning Subspace Method in Handwritten Digit Classification, in Proc. International Conference on Artificial Neural Networks (ICANN’96), Bochum, Germany, July 16-19, pp. 227-232, 1996.


3.2 Intelligent signal analysis and the required component technology

Abstract

This project aimed to develop signal processing methods based on neural computing as well as VLSI architectures for their implementation. The possible applications include a third-generation mobile phone system and a new type of heart rate monitor. The research has been carried out in three main areas: the development of methods for classification and synchronisation and the design of basic structures for CMOS realisation of a neuron.

Results

1. Classification

The purpose of the research was to examine the possibility of approximating aerobic fitness based on heart rate RR-measurements performed at rest. The idea of continuous aerobic fitness or maximal oxygen uptake approximation using neuro-calculation was raised due to the lack of unambiguous classification standards.

There are special clinics where accurate clinical methods can be used to measure the maximal oxygen uptake, but the measurements are expensive and the tests are time-consuming. The need for an inexpensive and reasonably accurate system for measuring aerobic fitness is therefore obvious. The purpose of this study was to create a method for measuring aerobic fitness from a relatively short time series of data (R-R intervals) without extensive physiological tests. The material for this study was obtained from the Merikoski Institute of Health Research and Rehabilitation, Oulu, Finland.

Fitness approximation
The research material was analysed and over 50 different statistical and logical features were identified and calculated during the course of the study. The features were analysed and the best ones were included in the input vector of the ANN structure. Figure 1 presents an overall view of the measurement method.

Figure 1. Block diagram of the method.

The Levenberg-Marguardt (LM) optimising algorithm was used to train the ANN. The LM method was superior to the other training algorithms, since the LM procedure produced the most optimal results for ANN simulation. The training results were verified with a cross-checking method. The results are based on simulations of the ANN structure created. The correlation of the simulation values compared to the accurate clinical measurements was 0.97 for the training material and 0.96 for the testing material.

Using the neural network developed here, it is possible to define the maximal oxygen uptake from a short R-R interval measurement without a testing protocol with high correlation (0.96) and high accuracy (mean error 6.5%). This is a new and easy way to determine and control personal aerobic fitness, even at home.

A prototype of the model was also constructed. The estimation program operates in a MATLAB environment. The code includes signal filtering, feature extraction and, ultimately, neurocalculation. The program was tested with several test persons, and the results were promising. The aerobic fitness approximation method will be implemented in a Windows 95 program. The purpose of the program is to validate the novel method in field tests in several research clinics.

A multi-structure solution for the context-sensitive feature selection problem
Fuzzy preclassifier
The purpose of the preclassifier is to identify similar clusters in the input/output classification relationships. In this way, the accuracy of classification can be maximized. Figure 2 shows the symmetric structure of the neuro-fuzzy classifier.

Figure 2. Operating principle of the fuzzy preclassifier.

Parallel-structured neural networks for classification
The presented multistructure solution is a modification of the network-boosting algorithm. The input patterns are statistically analysed and clustered. A preclassifier is then constructed based on the statistical clusters. The purpose of the preclassifier is to classify roughly similar patterns into different classes and, in that way, to select the final classification structure to be used. The use of several neural structures for the classification problem preserves the local characteristics of the features. In this way, the problem of context-sensitive features is solved and each structure has its own characteristics and weight space for calculating the final output of the hybrid system.

The training material is divided into clusters with the optimal principle. Some overlap is used in classification. The reason for this is that the preclassifier cannot have perfect accuracy in absolute terms, but the failed classes are, with high probability, in the cluster next to the proposed structure. This overlap requires the pattern classes to be sorted with some similarity criteria.

The final form of the each structure is created using a genetic-like algorithm. The starting weights are randomly generated, as is the network layer size. The network is trained with the given parameters and attributes. The outcome of the training is evaluated with cross-checking. The best results are recorded, including the size of the network and the weight information. This looping is done several thousand times to find the optimal size of the structure as well as the global minimum of the weight space. It should be pointed out that a genetic algorithm of this kind requires very intensive calculating power.

Human aerobic fitness classification
Three different neural network structures were trained using correct training and testing material. The structure ‘S’ was created using the fitness class examples 1 and 2, while the structure ‘L’ was created using the classes 4 and 5. The ‘M’ structure was created using all the classes. The reason for this arrangement was the nature of the preclassification. The preclassifier classifies correctly, but it does not identify every instance of S and L example.

The accuracy of the fuzzy preclassifier was 80.9 %, indicating low-level fitness samples, and 86.2 %, indicating samples of good fitness level. It should be pointed out that the classified samples were all correctly classified. The 20 % of the samples were not recognised as examples of a good or a poor fitness level.

For the sake of comparison, all the samples were classified using the M structure, which was implemented conventionally using a single neural network structure. The accuracy of the single structure classifier was 50 %.

When the whole chain of the proposed method was used, the following results were obtained. The accuracy of the S network was 89 %, that of the M network 45 % and that of the L network 70 %. This overall accuracy of the neuro-fuzzy system was 63.2 %. The accuracy was 13.2 % better than that obtained using a conventional neural network classifier.

The results were very promising. With this technique, it is possible to divide even a very large set of training data into smaller parts for different neural structures. This division improves accuracy and reduces the time needed for neural network training. In the test case, the resulting accuracy of 63.2 % was not optimal in view of the classification application. However, it is the best result that can be achieved using the biological features of the person in a resting state.

The generality of the method proposed is due to its modularity. In the literature, modular networks have been presented, but their structure selection has not been based on a fuzzy preclassifier. In our approach, modularity is created using statistical analysis of features and fuzzy analysis. The classification space is divided into smaller parts with significant similarity. A symmetric structure is then created, which requires measurement of the distance between the used classes. The classes and structures are sorted using the distances measured. In this way, the aerobic fitness classification obtained by using neuro-fuzzy approach and a modular network structure can be generalised to other real-world classification problems and applications.

Intensifying NN learning with dimension reduction
The number of variables is quite large compared to the number of subjects, and the approximation process should be accelerated. Therefore, dimension reduction is needed, and it can be done with principal component analysis.

Traditional PCA can only be used for linear data, and because the physiological functions of a human being are highly non-linear, the data have to be divided to internally linear groups. After this, PCA can be performed independently for each group.

Another possibility is to use non-linear principal components. They can be constructed with neural networks. An NLPCA network is an auto-associative network with three hidden layers: a mapping layer, a bottleneck layer and a de-mapping layer. The bottleneck layer will produce an effective but lower dimensional presentation of the original data.

The study population consisted of 237 healthy men and women aged 15-65 years, of whom 203 subjects were used as the training set and 34 subjects as an independent test set.

Both non-linear and linear PCA reduced the dimensions from 18 original variables to 8 principal components. Maximal oxygen uptake, which was normalised for the subject’s weight (ml/kg/min), was approximated with three different data sets (18 original parameters, 8 linear PCs, 8 NLPCs). MLP with the Levenberg-Marquardt optimisation algorithm was used in the approximation. The optimal learning result was selected by minimising the RMS error for the test set.

The best result was achieved with 8 non-linear principal components, and the correlations between the target and predicted values were 0.95 for the training set and 0.94 for the test set. The mean errors were 7% for the training set and 9% for the test set. The results of the three input sets did not differ much from each other in accuracy, but the training time was over twofold longer when the original variables were used instead of the non-linear principal components. On the other hand, no significant amount of information was lost due to the dimension reduction.

2. Synchronisation

The research was carried in two main areas: neural network-based implementation of a synchronising circuit in CDMA systems and adaptive filtering of cyclostationary interference from a speech signal.

A neural network-based synchroniser for a CDMA spread spectrum system was developed. The goal was to realise a fast-channel delay estimator when one or two signals are transmitted through the channel. The performance of the multilayer neural network as a channel delay estimator was studied with simulations. The results were compared to those of an ML estimator derived in the work. The neural network used in the simulations was a multilayer perceptron with a back-propagation learning algorithm and a complex-valued Kalman filter algorithm.

New digital filtering methods are being developed for filtering audio interference from the speech signal. A method based on timing information of the interference is used to control the filtering. The algorithms are designed to be adaptable, enabling the filter to learn from interference on the basis of samples. The filter is also able to detect speech and non-speech segments of the signal, and hence to control dynamic adaptation during the filtering process. A coherent subtraction method and an adaptive line enhancer filter bank approach were developed in 1997. Listening tests have been used to evaluate the results.

Neural network-based implementation of a synchronising circuit in CDMA systems
The objectives of the work were to determine the efficiency of neural network-based methods for multi-user identification in CDMA systems, to develop a neural network-based method for channel estimation, and to produce knowledge together with the local industry.

In this work, only one-user and two-user systems were examined. Thus, the duration of simulations in some cases was several weeks. One task was to examine how simulations could be shortened without sacrificing the quality of the results.

To reduce the complexity of the research problem, the effects of the complexity parameters were examined one by one. The most important parameters were: the phase shift of the carrier wave between two sent signals and the variation of relative signal strength in the case of two users. The near-far problem, where strong signals interfere with the reception of weaker signals, was especially considered.

Results with the one-user system
In a one-user system, channel delay was estimated almost as accurately by the neural network as by the ML estimator. The variance of the estimate was close to the Cramer-Rao lower bound, a lower bound for the variance of an unbiased estimate.

Envelope detectors were added to the system model in contradiction to the ML-estimator. The network was trained with noiseless training material. With noisy test signals, the performance of the network was inferior to that of a non-optimal estimator based on maximum search. The network learned to estimate the training signals accurately, but failed to do so with noisy signals. With noiseless test signals, the performance of the network was better than that of the non-optimal estimator.

Results with the two-user system
In a two-user system, the learning algorithm of the neural network was a complex-valued Kalman filter algorithm. A two-user system, with users A and B, was examined. The delay of user A was estimated, while user B produced an interfering signal component.

These parameters were used to generate the training data: the relative intensity of the signals (NFR in decibels), the delays of the users A and B, and the phase angles of the carrier waves of the users A and B.

The network did not learn to estimate the channel delay well in the noisy two-user system. In addition, the simulation times and the size of the training material increase exponentially with the number of users in the system. The results can be improved in simple cases by post-integrating the estimates, but in more complicated cases this has no effect. The present results indicate that the multi-user synchronisation problem should be divided into smaller sub-problems, some of which could be solved by a neural network.

Adaptive filtering of cyclostationary interference from a speech signal
The objective was to develop adaptive digital filtering techniques, which eliminate cyclostationary interference from speech signals. The algorithms were simulated in a MATLAB environment with authentic audio data. The filtering results were evaluated via hearing tests and error signal RMS values.

The interference originates the TDMA system’s frame rate. The amplitude modulation of the sending signal causes electromagnetic interference with audio-signals in the electrical circuits near the transmitter.

The interference has two important properties: timing effects of GSM data and external effects, e.g. movement of the receiver in relation to the terminal changes the amplitude of the interference. The timing properties of GSM data can be estimated from a speech signal by utilising knowledge of the known timing information of the interference. External effects cannot be predicted and the algorithm should therefore be adaptive.

Figure 3. Original (top) and filtered signal.

Results with speech filtering
New digital filtering methods were developed for filtering audio interference from the speech signal. A method based on timing information of the interference was used to control the filtering. The algorithms were designed to be adaptive, enabling the filter to learn interference on the basis of samples. The filter was also able to detect speech and non-speech segments of the signal and thus to control dynamic adaptation during the filtering process.

Based on the literature and previous experience, two different methods were developed: a coherent subtraction method and an adaptive line enhancer filter bank approach.

Listening tests were used to evaluate the results. With both methods, interference in a speech signal was reduced to a nearly acceptable level. A subtraction method in the time domain gave better speech quality, because filtering in the frequency domain also modifies the spectrum of the filtered speech signal.

A coherent subtraction method required interpolation and decimation to increase the sampling rate from 8 kHz to 9.1 kHz: otherwise, the interference drifted in relation to the sampling rate. Interpolation and decimation were overly heavy operations for implementation, and some work was therefore done to model the drifting effect of interference.

The time domain method is being further developed by creating an parameter model of the known behaviour of the interference and incorporating the adaptive features that have been developed.

3. VLSI implementation

The objective of this project has been to develop integrated analogue structures for adaptive and neural systems. The applications are in ambulatory biomedical measurements, especially in heart rate monitoring. In ECG measurements, the noise may overlap the signal spectrum, and adaptive noise cancellation may be more effective than frequency-selective filtering in noise reduction. Ambulatory heart rate recorders need to be small in size and operable with low power consumption. The main objective of this work was to develop integrated micropower cells that achieve large signal dynamic range simultaneously with low power consumption. These are contradictory requirements for analogue circuitry and hence not straightforward to achieve.

Neuron architectures
During the project period, synapse structures for a fully analogue neuron and for a mixed-signal neuron were designed. The analogue synapse consists of a linearized analogue multiplier and a weight cell implemented with SC techniques. The mixed-signal synapse consists of an analogue multiplier, but the weight cell is digital to prevent the leakage problems associated with the analogue one. The linearized analogue multiplier was fabricated in a commercial 1.2 um CMOS process, and it achieves a linear input range of 1 V with non-linearity corresponding to 2% total harmonic distortion. The current consumption is 1 uA and the linear range is achieved with the use of a multitude of parallel differential pairs.

During the project period a fully analogue cell for a Herault-Jutten network was designed and fabricated. A commercial 1.2 um CMOS technology was used, and the total current consumption of the chip that implements a two-input-two-output Herault-Jutten network is 10 uA. The chip is functional, and most of the cells operate as intended. However, some offset voltage problems associated with the weight cell have been encountered, and they prevent use in actual applications at the moment. Fig. 4 shows the layout of the integrated Herault-Jutten circuit.

The usability of a Herault-Jutten network in signal and noise separation was studied in simulations. Two-channel ECG signal recordings were made using different electrode orientations with respect to the heart to obtain test signals for the simulations. No noticeable signal separation or improvement in signal quality was seen in the simulations. However, by using principal component analysis, marked improvement was achieved, which is demonstrated in Fig. 5.

Figure 4. The Herault-Jutten chip.

Figure 5. ECG signal quality improvement by principal component analysis.

Project information

Participants

The research was carried out in cooperation with the Computer Engineering, Telecommunication and Electronics Laboratory of the University of Oulu and VTT Electronics.

The work was carried out in cooperation with the following industrial companies:

  • Fincitec Oy (Yrjö Mäkelä)
  • Polar Electro Oy (Ilkka Heikkilä)
  • Nokia Mobile Phones (Jorma Lilleberg)
  • Nokia Cellular Systems (Ilkka Keskitalo)

Project dates

The project was conducted during 1.3.1995-28.2.1998.

Project volume

The total budget of the project was about FIM 3 million.

Project manager

Professor Juha Röning
Infotech Oulu and Department of Electrical Engineering
University of Oulu
P.O. Box 4500, FIN-90014 University of Oulu, Finland
Phone: +358-8-5532794
Fax: +358-8-5532600
E-mail: juha.roning@oulu.fi

Publications

Seppo Nissilä, Juha Röning, Antti Ruha, Kauko Väinämö, Method and apparatus for measuring exertion endurance, FI961148, PCT/FI9700163, DE19781642T1, GB2326240A, US09/142,444, Patent application (Written Opinion).

Kauko Väinämö, Seppo Nissilä, Timo Mäkikallio, Mikko Tulppo, Juha Röning, “Artificial Neural Networks for aerobic fitness Approximation”, International Conference on Neural Networks (ICNN ‘96), Washington DC, USA, June 3-6, 1996.

Kauko Väinämö, Juha Röning, ”An artificial neural network -based rotation correction method for 3D-measurement using a single perspective view”, International Conference on Quality Control by Artificial Vision (QCAV ’97), Le Creusot, France, May 28-30, 1997.

Kauko Väinämö, Timo Mäkikallio, Mikko Tulppo, Juha Röning,”A Neuro-Fuzzy Approach to Aerobic Fitness Classification: a multistructure solution to the context-sensitive feature selection problem”, International Joint Conference on Neural Networks (IJCNN ‘98), Anchorage, USA, May 4-9, 1998.

Satu Lipponen, Kauko Väinämö, Timo Mäkikallio, Mikko Tulppo, Juha Röning, “Approximating aerobic fitness with neural networks”, Finsig, Pori, 1997.

Satu Lipponen, Timo Mäkikallio, Mikko Tulppo, Juha Röning, “Finding structure in fitness data”, Practical application in Knowledge Discovery and Data Mining (PADD98), London, UK, March 25-27, 1998.

Lasanen, K., J. Tervaluoto, J., A. Ruha, A., Röning, J., A Structure for Extending the Linear Input Voltage Range of a Differential Input Stage, The 5th International Conference on Electronics, Circuits and Systems (ICECS98), 7-10 September, 1998, Lissabon, Portugal, pp. 355-358.

Väinämö, K.,Neural Networks for Human Aerobic Fitness Approximation, University of Oulu, Department of Electrical Engineering, Diploma Thesis, 1996, 89 pp.

Mannerkoski, J., Neural Network based Synchronisation in a CDMA System, University of Oulu, Department of Electrical Engineering, Diploma Thesis, 1996, 1995, 60 pp. + appendices.

Mäkelä J. Signal Subspace Tracking in Multiuser Delay Estimation. Diploma Thesis (in Finnish), University of Oulu, 1997, 62 pp.

Lasanen, K.: Analog CMOS Implementation of an Adaptive Neuron Cell, Diploma Thesis (in Finnish), University of Oulu, 1997, 49 pp.

Tervaluoto, J.: Micropower Range Neuron Integration Using CMOS-technology, Diploma Thesis (in Finnish), University of Oulu, 1997, 71 pp.


3.3 Neuro-Fuzzy systems

Abstract

The aim of the subproject was to study the use of neural networks, fuzzy logic, and neuro-fuzzy systems in signal processing. New techniques and methodologies for certain application areas were also developed. Research in this project was focused on three application areas:

  • Modelling of measurements signals
  • Processing of unreliable information
  • Processing of video and audio signals (including hardware implementations)
In practice, the research was divided to smaller subprojects carried out in cooperation with the respective industrial partner. Therefore, details of the content of each application area are given in the following Results section.

Results

1. Modelling of measurements signals

Signal processing of a quality sensor with neural networks
The quality of the products obtained from industrial processes is important. In many cases, the problem in retaining quality is that it is impossible to measure properties of products continuously. The properties of products are henceforth called quality units. In order to determine the values of the quality units, samples have to be taken from the products for laboratory analyses. This type of assessment method takes a long time and it is not continuous. The electromagnetic radiation reflecting from products depends on their properties at that moment. In the research system, infra-red radiation was directed at a product and a reflection spectrum was obtained by measuring the reflected radiation. The objective of this project was to make a quality unit prediction model based on the reflection spectrum.

The values of the reflection spectrum channels strongly depend on each other. To compress the information of the channels, principal component analysis (PCA) was used. With PCA the original information can be expressed as a small number of variables. A model based on either principal component regression (PCR) or neural network (radial basis function, RBF) was established between the new variables and the values of the quality units. The t-statistic was used to select the most significant principal components for the model. As a result of the research it was concluded that it is possible to make prediction models for some of the quality units. With the model it is possible to measure the values of quality units continuously. During the project there was relatively little data available and the neural network models predicted the values of quality units only slightly better than the models containing simple linear structures. Prediction errors were generally equally good with the PCR and the neural networks.

Applying neural networks to on-line pulp quality measurements
The quality of produced paper is a very important issue in the paper mills. In order to produce good quality paper the pulp itself has to have a high quality. The quality of the pulp is mostly determined using slow and expensive laboratory tests. Quality control and continuous monitoring, however, increase the need for on-line quality measurements. This project has concentrated on measurement signal analysis of an on-line pulp quality measuring device, developed by Kajaani Electronics Ltd. For this device, traditional linear models have not been accurate enough and thus non-linearity was suspected.

A multilayer perceptron (MLP) network was the first method to try. A basic MLP network with one hidden layer, trained with Levenberg-Marguard algorithm, works properly most of the time but it sometimes gives very strange results when the test data is clearly different from the training data. It also usually saturates at a certain level and thus can not estimate the quality levels outside the range of the training data. Training the MLP network requires a lot of data, which is not usually possible during the installation and fine-tuning phase of the device. Further development of the algorithms or the network structures was not considered necessary since different methods seemed more promising. Self-organizing maps (SOM) were used to visualise the behaviour of the device in different conditions and to classify which kind of wood the pulp was made of. Learning vector quantization was also used in classification. We discovered, however, that there were two different kinds of data: in one group the data is strongly dependent on the different woods and another group is not. A first order Sugeno-style fuzzy inference system (FIS) has been the best method so far. FIS has been effective, especially using the wood-dependent data. It is also simple; a set of linear models for different situations connects smoothly. Fuzzy rules are easy to understand and there are no too many parameters to tune. One of the main problems was to identify the reason why the device behaves in a totally different way with some data sets. The device under study was found to be relatively linear and the new methods examined gave no distinct improvement in the performance.

Neural network modelling of the pumping process
In this work modelling of pumping process was studied. The test case was MC pump of Ahlstrom Pumps, which is used for mass transfer in the pulp and paper industry. Where not all of the measurable values of the pumping process have been instrumented, the model of the pumping process can be used to estimate the state of the pump and the values of missing measurements. The modelling of the pumping process is difficult, due to the multi-modal inverse problem. For inverse problems there exists a well-defined forward problem which is characterized by single-valued mapping. Often this corresponds to causality in a physical system.

In the case of the pumping process, the forward problem can be defined with characteristic curves.  In this work the modelling of the characteristic curves and solving the inverse problem with the MLP neural network was studied. The MLP neural network is a flexible method for nonlinear modelling. This work includes a review describing theory, characteristics, training and selection of model complexity of the MLP network. There are several methods to produce inverse MLP network models. This study included a review describing the most common methods and their pros and cons.

The modelling of the characteristic curves was tested with three different data sets using advanced methods for MLP network training and model complexity selection. Four different methods were tested for solving the inverse problem.

2. Processing of unreliable information

Applying neural network algorithms to estimate the position of the fabric edge
A neural network algorithm was developed for determining the position of the fabric edge. Both pre-processing and the network structure were implemented in C-language. The resulting algorithms were implemented in the measurement device. However, the limited computational efficiency and allocated memory for the algorithms did not permit the use of the developed methods as such. The computation of the exponential functions in particular had to be by-passed. Substituting functions were found and the neural network was implemented without impairing the performance. A graphical user interface in the MATLAB environment was developed. It enables the user to try different kinds of network structures, teach the network and approve or discharge the trained network. Several types of plots are shown to the user during the different phases of training. In the final phase of the research the use of these methods for measurement of the paper web position has been worked on.

Signal feature extraction using fuzzy gates
A new approach - fuzzy gates - was developed for signal feature detection. Fuzziness allows the definition of features to be approximate and thus gates are comparable to fuzzy membership functions. A MATLAB-based feature detection tool was created. The tool enables a quick way of performing feature detection from paper manufacturing process variables. The resulting indicator signal can be further used in web break analysis and process simulations.

Prediction model for the lime kiln process / SOM training and visualizing tool
A prediction model for the residual CaCO3 of a lime kiln was developed. A combination of SOM and PLS was found to give the most reliable prediction. This method was implemented by Valmet Automation Oy and is under test at the mill. Some software was also developed in MATLAB to train and visualize self organizing maps used to process variable analysis. It will be used as part of a larger modelling and analysis tool created by Valmet Automation Oy.

Adaptive equalization of binary data bursts
In telecommunication systems, transmitted data is often distorted during transmission. This means that some compensation needs to be carried out at the receiver end in order to determine the original transmitted data. This compensation is called equalization. In this work we studied different techniques for adaptive equalization of binary data bursts in a baseband digital communication system. These techniques include neural networks, linear equalizers and an adaptive clustering method. We studied the use of these equalization techniques in a channel which introduces both intersymbol interference (ISI) and additive noise to the transmitted signal. Usually the channel response is not known a priori and adaptive equalization methods therefore need to be used.

In our simulations, both MLP network and adaptive clustering method outperformed the linear equalizer, when bit error rates (BERs) were considered. Our simulation cases included simulations in both fixed and altering channels. The adaptive clustering method achieved the smallest BER. However, the BER of the MLP network was only slightly worse. The computational load of the adaptive clustering method and linear equalizer was very small compared to the MLP network. The disadvantage of the MLP network was that the training did not always converge to an adequate solution. The main reason for this was the random initialization of the network weights. To avoid this problem, an efficient weight initialization method, such as the maximum covariance (MC) initialization scheme, could he used. By applying MC initialization, the MLP network converges much faster than a randomly initialized network. Due to this the MLP network with MC initialization achieves better BERs than the randomly initialized network, if they are trained with the same amount of training epochs. Conversely, we can say that the BERs achieved by the randomly initialized MLP network can be reached much faster and/or with a smaller network by applying MC initialization. Since the MC initialization scheme is also computationally relatively light, we can actually significantly decrease the amount of computation needed in the network by applying MC initialization.

Chemical agent detection
Humans can recognise 2000 to 4000 different odours. The sense of smell can be fooled by creating gases which are beyond the recognition capability of the human sense of smell. Chemical warfare agents are a good example of this. Most of them are invisible, odourless, and poisonous. To patch up the weakness of the human sense of smell, the M90 chemical agent detector was developed by Environics Oy. The operation of M90 is closely related to the operation of the human sense of smell. The gas information is gathered and amplified by a sensor, based on ion mobility spectrometry, inputted into a neural network, and finally classified. The rough modelling of an associative memory can be performed with the Self-Organizing Map neural network.

The current chemical warfare agent recognition capability of the M90 is very good, but problems occur when multiple agents are mixed together. The neural network is taught to recognise pure gases and thus it cannot deduce the components of some gas mixtures, especially when the mixture components react with each other and create a totally new gas. Because of this, new methods were created to enable recognition of gas mixtures.

Anaesthetic machine fault diagnosis
Patient safety has always been a very important factor in hospital equipment design. The basic principle in the design of modern life-supporting devices in that no single fault can cause a risk of injury to the patient. The AS/3 Anaesthetic Delivery Unit (AS/3 ADU) is a new generation anaesthetic machine developed by Datex-Engström Division Instrumentarium Corporation. In this work, an fuzzy logic based expert system was designed and implemented for developing and testing sophisticated fault detection methods on the AS/3 ADU. An external fault detection system was implemented on a PC microcomputer. A serial adapter was built to connect a personal computer and the AS/3 ADU. The system is capable of real-time fault diagnosis. It can also be used to acquire and store data for off-line analysis. In addition, tools for data visualisation were created to study of ADU signals under incorrect operation.

Fuzzy classifier in defect diagnosis of solder joints
Due to the increasing use of surface-mount technology in the manufacturing of circuit boards, the importance of the inspection of solder joints has been especially emphasized when the method of wave soldering is employed. The large variation in the quality of solder joints caused by the wave soldering process cannot be taken into account by an inspection system in use. This causes a lot of unnecessary defect reports. In order to reduce the amount of these reports a classifier was developed by utilizing fuzzy logic methods. The fuzzy classifier of defect reports accomplishes the classification by using the information received from the solder joint inspection system. Defect reports are classified as being either justified or unnecessary.

The inspection results of nearly 900 circuit boards have been analysed to distinguish the essential features of defective and non-defective solder joints. The characters detected were then expressed by linguistic rules. In addition, the effect of the prevailing average amount of solder in the joint, which is one of the major factors affecting quality, has been considered. The simulation results of the behaviour of the classifier indicate the applicability of fuzzy logic in the decision-making process, which takes place in an environment of inaccurate information and where the reliability of the results obtained by traditional fault diagnosis methods is inadequate.

Optimization of electric load clipping
Many utilities use load clipping optimization methods whereby they directly control residential appliances and other devices in their service area. These include, for example, heaters, street lights and generators. Utilities have different objectives for load control, including production cost, system security and so on. The most common objectives are minimization of the peak load and minimization of the production cost. In this work optimization methods were developed for small power-delivering plants which have only limited or no electricity producing capabilities of their own.

Typical optimization methods include enumeration methods and methods based on dynamic programming. Many heuristical methods have also been reported. The enumeration and dynamic programming methods are optimal or nearly optimal but their major drawbacks are execution times. Heuristical greedy algorithms described in this work are fast when compared to those methods but not optimal. A dynamic successive programming method was developed to work with "time of use rates" (i.e. to work with hourly income and outcome rates for normal and peak loads, respectively). This method optimizes one load at a time. Time savings gained are 50-90% depending on how difficult the clipping situation is.

Neural networks in character recognition from x-ray images
Digital radiography makes it possible to transfer x-ray images quickly through communication channels. Digital images are also easier to handle and store compared with traditional x-ray films. A film storage system usually contains huge numbers of images and their digitization includes the feeding of patient records into a database. The purpose of this work was to find methods for the automatic recognition of an x-ray image subject’s patient records. Automatic recognition would considerably ease the digitization process.

Recognition consists of five phases, which are divided into the pre-processing stage and recognition stage. The pre-processing stage includes locating id-label, binarization, character segmentation and feature extraction. The recognition stage recognises characters from feature vectors. The binarization phase transforms the gray-level image into a black and white image. The segmentation phase locates individual characters from the id-label. The feature extraction phase calculates suitable features from character images. The recognition of feature vectors is performed with a multilayer perceptron neural network. Taking account the numerous variations and disturbances in x-ray images, the results are promising. The developed system was tested with 37 real images using resolutions of 120 dpi and 180 dpi. From 886 characters, 831 (94%) were correctly segmented and 79% of the segmented characters were correctly recognized.

Filtering ground reflection with neural networks
In this work a neural network method was used in filtering ground reflection while a signal is received under multipath conditions. Ground reflection disturbs the definition of the direction of an arriving signal by causing errors in the measured relative phases of signals. The error caused by the reflection changes nonlinearly as a function of the elevation angle, which makes the filtering of the reflection error very difficult. In addition, the behaviour of the error depends on the lengths of the antenna bases, the height of the antenna with respect to the ground level and the orientation of the antenna.

The effects of ground reflection were simulated by a theoretical model. Using the input-output data description generated by the theoretical model, multilayer perceptron neural networks were trained to respond to erroneous signals with correct elevation angles. The networks were then tested with an independent trajectory of a signal source. The theoretical results show that the disturbances can be filtered down to 0.4% from the original, when the number of the hidden layer neurons is around twenty. A disadvantage in the best simulations is the preprocessing of the signals that may be complex in practical applications. The results of the filtering simulations that did not demand the preprocessing of the signals were not as good.

3. Processing of video and audio signals (including hardware implementations)

Speech recognition
During 1995-1996 a pure neural network based recognition system was researched, which consisted of a self-organizing map and multilayer perceptron network. The recognizer was only used for speaker-independent recognition of eleven isolated English digits, which were spoken by male speakers in a moving car and had 15 dB signal-to-noise ratio. The idea of the recognizer was that SOM mapped the spoken digit to a constant dimension binary map which was classified to a digit by an MLP. This recognizer achieved 99.5% accuracy with a noisy test set. The initialization of MLP network was also enhanced such that it speeded the training of MLP by 65-74% depending on the training algorithm.

During 1996-1997 the goal of the research was to use neural networks and hidden Markov models for speaker-independent isolated and connected English digit recognition in noisy environments. The data set was spoken by 28 male speakers inside a car in three noise environments (motorway, city and a car park). In particular, the neural networks were researched as an alternative to the conventional conditional probability estimators such as mixtures of Gaussians. Hence, the purpose of the Multilayer perceptron was to estimate the emission probabilities for the conventional HMMs. Test set performance of the recognition system was 98.0% and 74.5%, respectively, in isolated and connected digit recognition at its best. The recognizer was tested with two training algorithms, embedded Viterbi (EV) and minimum classification error algorithm (MCE). The performances were slightly better for isolated digits when using MCE, while the digit strings were recognized a bit more accurately with EV training.

Speech coding: quantizations of LP coefficients
The main subject of the work was vector quantization in the context of speech coding. Topics included were estimation of linear predictive vector quantizers with different multimode structures and vector splitting schemes, evaluation of quantizers over noisy channels and the enhancement of speech encoding and decoding algorithms

The tasks of 1997 placed emphasis on the implementation and testing of a generic VQ design platform. A resulting software product has been delivered to Nokia Research Center. The environment has been used for the estimation of a wide range of different LP (Linear Predictive) quantizers for an ACELP speech coder. Functionality of the quantizers has been tested in a floating point environment and observations of the performance and error resilience characteristics of different models have been collected.

lmplementation and development of WI-speech coder
The object of this project was to develop a speech coding algorithm which meets today’s requirements for high speech quality at a low bit rate. To be able to apply novel ideas to speech coding, the basic speech coder structure has to be constructed first. A waveform interpolation type speech coder was selected for this project, because it is a relatively new coder and it has not yet been researched in detail. Furthermore, waveform interpolation coder was found to be a good test environment for our novel ideas which will be investigated during this project.

The waveform interpolation speech coder utilizes the fact that speech is almost periodic during voiced sections. Using these periodic waveforms it is possible to form a surface. This surface can be modified and downsampled in an encoder and correspondingly interpolated in the decoder. Using this basic idea it is possible to achieve a relatively low total bit rate, for example 2.4 or 4.0 kbit/s.

At the time of writing the coder is almost complete; only some parameters require fine-tuning. The speech quality achieved so far has been fairly good. This project continued in the IMPRESS project, and final results were reported in late 1999.

Video compression based on visual reception
An objective image quality metric was developed during this research project. The algorithm was implemented in an ANSI-C compliant source code which results a blockwise map of the perceptibility of the errors according to the masking by human visual system. This map can be used as an image quality metric in optimizing the adaptive coding of video sequences at very low bit rates (< 64 kbit/s): the most annoying blocks assessed by the system may be left unpredicted, i.e. they are coded separately. The level at which the error is assessed as annoying can be varied manually with a single parameter.

The maps of the perceptibility of the errors were quite well correlated with the error frames. However, it should be noted that the results were dependent on the parameter value mentioned in the abstract and the value can be varied according to the observers preferences. It has also been noticed that if the movement in the video sequence is very intense, the blocking effect can be seen around the new intra blocks (edges between original and predicted blocks are annoying). On the other hand, it must be noted that the algorithm showed that the erroneous blocks were very often situated next to the old intra blocks. This means that the algorithm was clearly working well, because normally the remaining errors are next to or very near to the old intra blocks.

Grid line removal from digitized x-ray images
When x-ray images which have been taken with a grid are digitized, some line noise appears in the digitized image. This noise creates interfence patterns when the image is magnified or reduced. This type of noise and interfence pattern makes it harder for doctors to make medical diagnoses from the images. The goal of this project has been to research and develop a method which can be used to remove line noise from the digitized images. The possibility of using nonlinear digital filters to resolve the problem has been investigated. Following some experiments, the research has focused on weighted median filters.

The filter which has been developed has proved to give good results. It causes some blurring of the images, but not too serious. Blurring can probably be removed by using a restoration technique, for example by Wiener filtering. This project continued in the IMPRESS project, and final results were reported in the late 1998.

Image quality analysis
The video coding method defined by the present H.263 and MPEG standards utilises motion compensated predictive coding and discrete cosine transform to reduce the redundant spatial and temporal information in digital video sequences. At very low bit-rates, video coding artefacts become visible, and image quality decreases considerably. A typical impairment is blocking artefact, which is produced by independent coding of adjacent image blocks.

In this work, a system for subjective assessment of blocking artefacts was developed. The assessment was obtained by using fuzzy reasoning and it was based on quantitative features that are computed from the image with simple algorithms. The aim was to develop a system which is computationally feasible for real-time implementation.

Fuzzy control of video encoding
The H.263 standard for low bit rate applications allows transmission of a reasonable quality motion pictures through the existing telephone network, wireless networks and other band-limited channels. The aim of this work was to improve the subjective image quality, while keeping the bit rate within set limits by means of fuzzy logic. This was performed in such a way that the most relevant areas in the video frame for the observer are identified by a fuzzy inference system, after which the encoding process is controlled by fuzzy control to increase the bit rate in the more relevant areas and decrease the bit rate in the less relevant areas. The relevant areas are identified by utilising the properties of the human visual system and the extraction of foreground and background regions.

The test demonstrated that the fuzzy control strategy did not overcome the conventional rate control strategy. Image quality was not greatly improved. There are two main reasons for this: first, the number of bits to be shared is really small in the low bit rate applications using the hybrid DPCM/DCT coding, and secondly, the coding standard H.263 sets limits for the control of the coding parameters. However, the bit rate can be reduced in the less relevant areas while maintaining the subjective image quality. The bit rate can be reduced by 10-15 % with a simple image analysis at low bit rates.

Vector quantization index assignment
In vector quantization, a source vector is encoded by assigning to it one codevector from a set of codevectors stored in the codebook. An index that identifies the selected codevector is transmitted across a communication channel in a binary form. At the receiving end, the index is decoded with the corresponding codevector from the codebook. However, bit errors occurring in the channel may result in an incorrectly received index. Thus, the resulting codevector may no longer be the best representation of the source vector and the quantization distortion may increase due the channel noise.

A genetic annealing algorithm was applied to the index assignment problem in vector quantization. The proposed combination of an annealing, selection and recombination mechanism is new and possesses an advantage with regard to implementation on a single processor machine because it gives satisfactory performance when using a reasonably small population. The assessment of the proposed method using a vector quantization of the line spectrum frequencies of a speech coder, the experiments with the vector quantizers trained on the Gaussian data as well as the test on the quadratic assignment problem benchmarks show the superiority of the proposed model.

Project information

Participants

The participating research laboratories of the project were:

  • Signal Processing, Tampere University of Technology, 1995-98
  • Measurement and Information Technology, Tampere University of Technology, 1995-98
  • Electronics, Tampere University of Technology, 1995-96
  • Machine Automation, VTT Automation, 1995-98
  • Computational Engineering, Helsinki University of Technology, 1997-98
The following industrial partners participated the project:

  • ABB Corporate Research, 1995-96
  • Bestsense Oy, 1995-96
  • Datex-Engström, 1995-97
  • Finnelpro Oy, 1996-98
  • Enermet Oy, 1995-96
  • Environics Oy, 1995-96
  • Kajaani Elektroniikka (Valmet Automation Kajaani), 1995-97
  • KCL Development Oy, 1996-98
  • Nokia Mobile Phones, 1996-98
  • Nokia Research Center, 1995-98
  • Rautaruukki Oy, 1995-97
  • Sanoma Oy, 1996
  • Vaisala Oy, 1995-97
  • Valmet Paperikoneet, 1995-98
  • Project dates

    The project was started on 1.3.95 and it ended on 28.2.98, thus, lasting a total of three years. The project had an prestudy period between 1.9.94-28.2.95. Some of the research topics of the project have continued as part of the IMPRESS project.

    Project volume

    Expenses        1995                1996                1997                Total

    Salaries            1 306 927        1 096 048        1 222 817        3 625 792

    Side costs         1 303 901        1 059 811        1 170 869        3 534 581

    Travel costs      128 903           82 253             93 397             304 553

    Consumption    89 757             31 753             22 637             144 147

    Equipment        60 763             13 341             0                      74 104

    Services           0                      11 870             21 042             32 912

    Other costs       44 641             47 924             10 929             103 494

    Total               2.934.892        2.343.000        2.541.691        7.819.583        (FIM)

    Project manager

    Prof. Petri Vuorimaa
    Telecommunications Software and Multimedia Laboratory
    Helsinki University of Technology
    P.0. Box 1100, FIN-02015 HUT, Finland
    Tel: +358-9-451 4794, Gsm: +358-9-3399 712
    Fax: +358-9-451 5014
    E-mail: Petri.Vuorimaa@hut.fi

    Publications

    Papers

    1. Heikkinen, J., Klapuri, H., Saarinen, J., Oksanen, H., Kastepohja, A. and Urpelainen, M., "Fuzzy Classifier in Defect Diagnosis of Solder Joints," Proc. Fifth IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'96), New Orleans, USA, Sept. 1996. 1. IEEE. pp. 113-117.

    2. A. Kantsila, M. Lehtokangas and J. Saarinen, "Adaptive Equalization of Binary Data Bursts Using Neural Networks", Proc. of 5th European Congress on Intelligent Techniques and Soft Computing, EUFIT'97, Aachen, Germany, Sep 8-11, 1997, Vol. 1, pp. 593-597.

    3. A. Kantsila, M. Lehtokangas and J. Saarinen,"Adaptive Equalization of Binary Data Bursts", Proc. of IASTED International Conference on Signal and Image Processing, SIP'97, New Orleans, Louisiana, USA, Dec 4-6, 1997, pp. 117-122.

    4. Kumpulainen Pekka, Imeläinen Heikki, and Joronen Tero, "Modelling Process with Data Modified Using Self Organizing Map," Nordic MATLAB Conference Final Program & Proceedings. Norra Latin, Conference Center Stockholm. Edited by Magnus Olsson. Computer Solutions Europe Ab, Stockholm. October 27-28, 1997, Stockholm, Sverige. pp. 11-109-112.

    5. P. Kolinummi, T. Hämäläinen, H. Klapuri and K. Kaski, "Mapping of Multilayer Perceptron Networks to Partial Tree Shape Parallel Neurocomputer," Proceedings of the International Conference on Artificial Neural Networks (ICANN96), Bochum, Germany, July 1996.

    6. Kolinummi, P., Hämäläinen, T. D. and Kaski, K., "Designing a Digital Neurocomputer; Improving ANN Processing with a Dedicated Hardware System," IEEE Circuits & Devices, The Optoelectronics Magazine 13, IEEE. 2, pp. 19-27.

    7. Kolinummi, P., Hämäläinen, T. and Saarinen, J., "Mapping of Radial Basis Function Networks to Partial Tree Shape Parallel Neurocomputer," In: Gerstner, W., Germond, A., Hasler, M. & Nicoud, J.-D. (eds). Artificial Neural Networks - ICANN '97, 7th International Conference, Lausanne, Switzerland, October 8-10, 1997, Proceedings. pp. 1259-1264.

    8. Kolinummi, P., Hämäläinen, T. and Kaski, K., "Mappings of SOM and LVQ on the Partial Tree Shape Neurocomputer," Proceedings of ICNN'97, 1997 International Conference on Neural Networks, 8-12 June, 1997, Houston, Texas, USA. 2. s. 904-909.

    9. J. Lahnajärvi, M. Lehtokangas and J. Saarinen, "Filtering Ground Reflection with Neural Networks," Proceedings of 5th European Congress on Intelligent Techniques and Soft Computing, EUFIT'97, Aachen, Germany, Sep. 8-12, 1997.

    10. J. Lahnajärvi, M. Lehtokangas, and J. Saarinen, "Neural Networks for Filtering Ground Reflection," The Yearbook of the Finnish Statistical Society 1996, 1997.

    11. M. Lehtokangas, P. Salmela, J. Saarinen, and K. Kaski, "Weight Initialization Techniques in Neural Network Systems and Their Application," In C. Leondes (ed.), Algorithms and Architectures, vol. 1 in the Neural Network Systems Techniques and Applications Series, Academic Press, 1997.

    12. V. Ruoppila, M. Vaalgamaa and M. Vuolahti, "On Subjective Quality in Quantization of LPC Parameters." submitted to IEEE Nordic Signal Processing Symposium (Norsig'98) 8-11 June 1998, Visgo Holiday Resort, Denmark.

    13. P. Salmela, S. Kuusisto, J. Saarinen, K. Laurila and P. Haavisto, "Isolated Spoken Number Recognition with Hybrid of Self-Organizing Map and Multilayer Perceptron," Proceedings of International Conference on Neural Networks, ICNN'96, Washington, USA, Vol. 4, 1996, pp. 1912-1917.

    14. P. Salmela, S. Kuusisto, J. Saarinen, K. Laurila and P. Haavisto, "The Hybrid of Self-Organizing Map and Multilayer Perceptron in Isolated Spoken Number Recognition," Proceedings of World Congress on Neural Networks, WCNN'96, San Diego, USA, 1996, pp. 63-68.

    15. P. Salmela, M. Lehtokangas and J. Saarinen, "Improving Convergence of MLP with MC Initialization in Speech Recognition Application," Proceedings of the European Congress on Intelligent Techniques and Soft Computing, EUFIT'97, Vol. 1, 1997, pp. 490-494.

    16. P. Salmela, M. Lehtokangas, Kari Laurila and J. Saarinen, "Discriminative Isolated Digit Recognition in Noisy Environments using Multilayer Perceptron Networks", Invited Special Session on Robust Speech Recognition, International Conference on Neural Information Processing, ICONIP'97, Vol. 2, 1997, pp. 1116-1119.

    17. J. Seppälä, P. Korpisaari, V. Varjonen, M. Lehtokangas and J. Saarinen, "Recognition of X-ray Image's ID-label with Neural Network," The Yearbook of the Finnish Statistical Society 1996, 1997.

    18. J. Seppälä, P. Korpisaari, V. Varjonen, M. Lehtokangas and J. Saarinen, "Character Segmentation from X-ray Image's ID-label," Proceedings of the 5th European Congress on Intelligent Techniques and Soft Computing, EUFIT'97, Aachen, Germany, Sep. 8-1 1, 1997.

    19. Suojärvi Mika, Ritala Risto, Ihalainen Heimo, Jokinen Heikki, "Finding Significant Changes from Process Data", The yearbook of the Finnish statistical society 1996. Part I . Neural Networks and Statistics. Annual Conference of Statisticians, 7.-8.11.1996. Hakapaino Oy, Helsinki 1997. pp. 75-83. ISSN 0355-5941.

    20. Suojärvi Mika: Neural network for estimation of quality. XIV IMEKO WORLD CONGRESS. New Measurements - Challenges and Visions. Vol. XA. Proceedings. Editor: Jouko Halttunen. June 1-6, 1997, Tampere, Finland. pp. 48-52. ISBN 952-5183-04-1.

    21. Söderholm Kaj and Ihalainen Heimo, "Signal Feature Extraction Using Fuzzy Gates," Nordic MATLAB Conference '97. Final Program & Proceedings. Norra Latin, Conference Center Stockholm. Edited by Magnus Olsson. Computer Solutions Europe Ab, Stockholm. October 27-28, 1997, Stockholm, Sverige. pp. II-1-6.

    22. Vehviläinen Pekko and Jokinen Heikki, "Determining the position of an edge of a dryer fabric by using neural and fuzzy computing," The yearbook of the Finnish statistical society 1996. Part I. Neural Networks and Statistics. Annual Conference of Statisticians, 7.-8.11.1996. Hakapaino Oy, Helsinki 1997. pp. 83-90. ISSN 0355-5941.

    23. Vehviläinen Pekko: Älykäs paikan määrittäminen. Teema: Automaatiopäivät 97. Automaatioväylä, 5/1997. ISSN 0784 6428. Automaatioväylä Oy. pp. 20-22. (In Finnish)

    24. Vehviläinen Pekko and Jokinen Heikki, "Processing the position indicator signal by neural and fuzzy computing," VTT SYMPOSIUM 172. Comadem'97. 10th International Congress and Exhibition on Condition Monitoring and Diagnostic Engineering Management. Vol. 2. Edited by Erkki Jantunen. June 9-11, 1997, Espoo, Finland. pp. 132-141. ISBN 951-38-4563-X, ISSN 0357-9387, UDC 658.58:061.3.

    25. Vehviläinen Pekko, Jokinen Heikki, Saloniemi Jyrki, Vähätalo Harri: Determining the Position of a Dryer Fabric by Neuro-Fuzzy Computing. CD ROM, networking, (6 pp.), ISBN 952-5183-07-6, CD-ROM/esitelmä. (11) 3/4: Viiran paikan määrittäminen neuro-sumealla järjestelmällä. Automaatio 97/esitelmien lyhennelmät, (s. 49), ISBN SAS-21, Automaatio 97, Helsingin Messukeskus. Suomen Automaatioseura ry, Helsinki, Finland 23.-25.9.1997.

    26. A. Vesterinen, A. Särelä, P. Vuorimaa, and M. Lehtokangas, "Anestesiakoneen sumea vikadiagnostiikka," in Suomen Tilastoseuran vuosikirja 1996, Helsinki, 1997, pp. 91-102.

    27. M.Vuolahti, V. Ruoppila, M. Vaalgamaa, J. Saarinen, "Numerical study on predictive quantization of line spectrum frequencies." Proceedings of IASTED Conference SIP -97, Louisiana, USA.

    28. M. Vuolahti, V. Ruoppila, "Performance of Predictive Vector Quantizers in Noisy Channels." Submitted to IEEE Nordic Signal Processing Symposium (Norsig'98) 8-11 June 1998, Visgo Holiday Resort, Denmark.

    29. E. Yli-Rantala, T. Ojala, and P. Vuorimaa, "Vector quantization of residual images using self-organizing map," in Proc Int. Conf. Neural Networks, ICNN'96, Washington D.C., USA.

    30. E. Yli-Rantala, T. Ojala, and P. Vuorimaa, "Vector quantization of residual images using self-organizing map with sample weighted training," in Proc. 4th European Congress on Intelligent Techniques and Soft Computing, EUFIT'96, Aachen, Germany, Sept. 2-5, 1996, pp. 325-328.

    Laboratory reports

    1. P. Kolinummi, Neurotietokoneiden arkkitehtuureja ja toteutusvaihtoehtoja, Raportti 3-96, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    2. P. Kolinummi, T. Hämäläinen, J. Saarinen, K. Kaski, Kaupalliset Neurotietokoneet, Raportti 4-96, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    3. M. Lehtokangas, P. Salmela, J. Saarinen and K. Kaski, "Weight Initialization Techniques in Neural Network Systems and Their Application," Research Reports B 12, Laboratory of Computational Engineering, Helsinki University of Technology, 1997.

    4. P. Salmela, Neural Networks in Spoken Number Recognition, Report 1- 1996, Tampere University of Technology, Electronics Laboratory, 1996, 99 pp.

    5. P. Salmela, Spoken Digit Recognition with Neural Networks and Hidden Markov Models, Report 3-1997, Tampere University of Technology, Signal Processing Laboratory, 1997, 140 pp.

    6. Suojärvi Mika: Laatusuureen mallinnus neuroverkkojen avulla. Raportti TTKK/MIT, Tampere 1996, 54 s. ISBN 951-722-552-0. ISSN 1236-3227. (In Finnish.)

    7. Suojärvi Mika: Signal features behind web break frequency. Report 39 (laboratory series), TTKK/MIT, Tampere 1996, 18 pp.

    8. Vehviläinen Pekko: Determining the position of a dryer fabric by soft computing. Report 40 (laboratory series), TTKK/MIT, Tampere 1997, 56 pp.

    Dr. Tech. Theses

    1 . T. Hämäläinen, Implementation and Algorithms of a Tree Shape Parallel Computer, Dr.Tech. Thesis, Tampere Univeristy of Technology, 1997, 203 pp.

    2. M. Lehtokangas, Modelling with Layered Feedforward Neural Networks, Dr.Tech. Thesis, Tampere University of Technology, 1995, 228 pp.

    3. P. Vuorimaa, Fuzzy Self-Organizing Map, and It's Applications, Dr.Tech. Thesis, Tampere Univeristy of Technology, 1995, 159 pp.

    Lic.Tech. Theses

    1. P. Salmela, Spoken Digit Recognition with Neural Networks and Hidden Markov Models, Licentiate Thesis, Tampere University of Technology, 1997, 140 pp.

    2. M. Vuolahti, Thesis for the Degree of Licentiate of Technology is in preparation.

    M.Sc. Theses

    1. I. Aho, Sähkönkulutuksen tehohuippujen optimaalinen leikkaus, pro-gradu tutkielma, Tampereen yliopisto, 1996. (In Finnish.)

    2. Haukijärvi Mikko, "Fuzzy Control for Video Encoding in Real Time Applications," Tampere University of Technology, 1997.

    3. J. Heikkinen, Sumea luokitus juotosliitosten vikadiagnostiikassa, diplomityö, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    4. A. Kantsila, Adaptive Equalization of Binary Data Bursts, Master of Science thesis, Tampere University of Technology, 1997.

    5. P. Kolinummi, Neurotietokoneiden arkkitehtuureja ja toteutusvaihtoehtoja, diplomityö, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    6. H. Laine, "Spectroscopic Concentration Determination using Neural Networks," Master of Science thesis, Tampere University of Technology, 1995.

    7. J. Lahnajärvi, Maaheijastuksen suodatus neuroverkolla, diplomityö, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    8. M. Onnia, M.Sc. Thesis, in preparation.

    9. J. Peltoniemi, Intelligent Methods for Chemical Agent Mixture Detection, Tampere University of Technology, 1996.

    10. P. Salmela, Neural Networks in Spoken Number Recognition, Master of Science Thesis, Tampere University of Technology, 1996, 99 pp.

    11. J. Seppälä, Tekstin tunnistus röntgenkuvan ID-kentästä neuroverkolla, diplomityö, Tampereen teknillinen korkeakoulu, 1997. (In Finnish.)

    12. Suojärvi Mika, Laatuanturin signaalin käsittely neuroverkkojen avulla (Signal processing of a quality sensor with neural networks). TTKK, Sähkötekniikan osasto, 1996. (In Finnish.)

    13. Söderholm Kaj: Tool for feature extraction using fuzzy gates. TTKK, Sähkötekniikan osasto, 1997.

    14. M. Tammi, M.Sc. thesis, in preparation.

    15. Vahteri Joni, "Fuzzy Assessment of Blocking Artifacts in MC/DCT Video Coding," Tampere University of Technology, 1997.

    16. Aki Vehtari, "Neural network modeling of pumping process," M.Sc. Thesis, Laboratory of Computational Engineering, Helsinki University of Technology, 1997. (In Finnish.)

    17. Vehviläinen Pekko, Determining the position of a dryer fabric by soft computing, TTKK, Automaatiotekniikan osasto, 1997.

    18. Vesterinen Arto, Fuzzy Logic in Anaesthesia Machine Fault Diagnosis, Tampere University of Technology, 1997.

    19. M. Vuolahti, WWW-palvelimen lukija-analyysi, diplomityö, Tampereen teknillinen korkeakoulu, 1996. (In Finnish.)

    20. E. Yli-Rantala, Self-Organizing Maps in Codebook Design for Vector Quantization, Tampere University of Technology, 1996.



    jukka.iivarinen@hut.fi
    http://www.cis.hut.fi/neuronet/Tekes/3.shtml
    Thursday, 30-Nov-2000 10:34:26 EET