References of "Sainlez, Matthieu"

Apprentissages automatiques supervisés pour le monitoring environnemental et énergétique d'une chaudière de régénération [Supervised machine learning for the environmental and energy monitoring of a recovery boiler]
Sainlez, Matthieu ULg

Doctoral thesis (2012)

The broad objective of this thesis is to apply and compare supervised learning techniques for predicting nitrogen oxide pollutant emissions from the recovery boiler of a Kraft pulp mill. In this task, we want to identify the technique that is best suited, and self-adapting, to transient boiler operating conditions.

The Kraft process is an alkaline process for producing chemical pulp: cellulose fibers are separated from lignin by cooking wood chips in a solution of sodium hydroxide (NaOH) and sodium sulfide (Na2S), called white liquor. The residual black liquor is concentrated and burned in a recovery furnace to yield an inorganic smelt of sodium carbonate (Na2CO3) and Na2S. The recovery boiler both regenerates the cooking chemicals and supplies high pressure steam to the pulp mill, but it is also a major source of atmospheric pollutants in the mill. In particular, nitrogen oxide formation is very complex because it involves several chemical and dynamic mechanisms: thermal NOx, prompt NOx and fuel NOx.

Nowadays, there is an increasing demand in such industries for efficient data analysis tools, especially for pollutant monitoring and/or energy management. The literature refers mainly to numerical solutions that require a complete description of the process and often assume stationary conditions as a working hypothesis. This is the case for the advanced data validation and reconciliation technique that we evaluate, which is based on thermodynamic models and on chemical and physical relationships between process parameters and equipment. It helps to highlight missing information about the process, but it failed to model the operating points of the steam and fumes utilities accurately. Indeed, in a Kraft recovery boiler, the total nitrogen oxide emission depends on several operating factors and heterogeneous conditions, e.g. operating fuels (black liquor or heavy fuel), furnace load, droplet size, air system operation, retention time and biomass characteristics.

For such a complex problem, machine learning techniques may be used as alternative methods in engineering analysis and prediction. They involve algorithms that improve automatically through experience collected in historical databases. Among supervised learning techniques, we focus mainly on neural network methods (static and dynamic architectures) and additionally on tree-based methods (regression trees and random forests) and linear ones. For each method, we evaluate its ability to predict NOx pollutant emissions under varying conditions.

A random forest is a collection of uncorrelated regression trees, induced from bootstrap samples of the training data. Its internal estimates are also used to measure variable importance and allow us to classify relevant variables for a model input selection task. Note that some additional a priori knowledge is needed to select the final input set. Among static neural network structures, the multilayer perceptron is the most widely used, particularly the two-layer structure in which the input units and the output layer are interconnected through an intermediate hidden layer. The model of each neuron in the network includes a differentiable nonlinear activation function; this network can perform a static mapping between an input space and an output space. Within dynamic architectures, we distinguish those that have only feed-forward connections from those that have feedback (recurrent) connections. In this work, we focus mainly on the NARX network (Nonlinear AutoRegressive model with eXogenous inputs) and additionally on the Elman recurrent neural network. The latter incorporates an additional layer, called the context layer, whose nodes are the one-step delay elements embedded in the local feedback paths. Nevertheless, Elman's approach has some drawbacks associated with its parameter learning scheme and its temporal gradient approximation.

The NARX network is used for input-output modeling of nonlinear dynamical systems. It is a recurrent model: model inputs are applied to a tapped-delay-line memory of n units and outputs are fed back to the input layer through another line of m units. The total model order s = n + m is therefore a key parameter, and the method of Lipschitz numbers is a tool for estimating it. An advantage of NARX is that the standard backpropagation algorithm can be used to train the network. Furthermore, to increase model robustness, we average the predictions of a set of individual neural predictors, which helps to reduce prediction variance across trials. Although generalization is tested on the worst-case configuration possible, we see that ensembles of NARX networks perform well at predicting NOx emissions during transient operations and that Lipschitz numbers are very helpful for estimating system orders. We illustrate the potential of a dynamic neural approach compared to the others in the nitrogen oxide prediction task. It is better suited to practical modeling needs and offers a way to model time and memory. It allows us to monitor NOx pollution and possibly to adjust control variables and perform diagnostics.

The thesis is divided into seven chapters covering several publications. Chapter 1 is about the Kraft process and its recovery boiler. We start with a short description of the Kraft pulp mill. Then we describe the Kraft recovery boiler, some chemical reactions in the furnace, the steam production equipment and the atmospheric pollutants. Finally, we discuss nitrogen oxide formation in the furnace and the effects of several operating conditions on its production. Chapter 2 is about data mining: what it is, what it is used for and what the main modeling cultures are. This chapter deals with system identification, modeling approaches (white box, grey box, black box), some definitions about learning and modeling, and finally some links between modeling and optimization techniques. Chapter 3 starts with the state of the art in numerical simulation of a Kraft recovery boiler; we then apply and evaluate a data validation scheme for modeling the steam and fumes utilities. Finally, we discuss the application of artificial intelligence techniques within the framework of a recovery boiler. Chapter 4 aims at selecting model inputs, starting with a supervised selection approach based on random forests. We introduce some methodological insights into tree-based methods, from a simple regression tree to random forests. Random forest internal estimates are used to measure the relative importance of each input variable in predicting a response, i.e. nitrogen oxide emission or high pressure steam production. Finally, we discuss some useful extra knowledge to take into account when selecting the final inputs. Chapter 5 is about neural network modeling: we introduce the perceptron, the multilayer perceptron and the associated backpropagation algorithm. We discuss static and dynamic architectures, especially the Elman recurrent neural network. Finally, we apply a multilayer perceptron and an Elman recurrent neural network to predict the high pressure steam flow rate from the Kraft recovery boiler. Chapter 6 presents some insights into input-output modeling of nonlinear dynamical systems, especially with the NARX network. At the end, we explain the Lipschitz method that is applied for system order estimation. Chapter 7 summarizes comparison results for the supervised learning techniques applied to predict nitrogen oxide pollutant emission from the recovery boiler. This comparison involves neural network techniques, tree-based methods and multiple linear regression. Finally, some research perspectives are presented and some conclusions are drawn.
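
The tapped-delay-line structure of the NARX model described in this abstract can be illustrated with a short sketch. It only shows how lagged inputs and lagged outputs might be assembled into a regressor of order s = n + m; the function name `narx_regressors`, the argument names and the toy data are assumptions made for illustration and are not taken from the thesis. Any regression model (for instance a multilayer perceptron) could then be trained on the resulting pairs.

```python
def narx_regressors(u, y, n, m):
    """Build (regressor, target) pairs for a NARX-style model.

    u: exogenous input series, y: measured output series (same length).
    n: number of input delays, m: number of output delays
    (hypothetical names mirroring the n and m of the abstract).
    """
    if len(u) != len(y):
        raise ValueError("u and y must have the same length")
    start = max(n, m)  # first time index with a full history available
    rows, targets = [], []
    for t in range(start, len(y)):
        past_inputs = [u[t - k] for k in range(1, n + 1)]   # u(t-1) ... u(t-n)
        past_outputs = [y[t - k] for k in range(1, m + 1)]  # y(t-1) ... y(t-m)
        rows.append(past_inputs + past_outputs)             # length s = n + m
        targets.append(y[t])                                # value to predict
    return rows, targets


# Toy series, purely illustrative.
rows, targets = narx_regressors([1, 2, 3, 4, 5], [10, 20, 30, 40, 50], n=2, m=1)
print(rows)     # each row holds 2 past inputs followed by 1 past output
print(targets)
```

Here the measured outputs are fed into the regressor, which corresponds to training on recorded data; in free-running prediction the model's own past predictions would take their place.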

Machine learning techniques for atmospheric pollutant monitoring
Sainlez, Matthieu ULg; Heyen, Georges ULg

Poster (2012, January 27)

Machine learning techniques are compared for predicting nitrogen oxide (NOx) pollutant emissions from the recovery boiler of a Kraft pulp mill. Starting from a large database of raw process data related to a Kraft recovery boiler, we consider a regression problem in which we try to predict the value of a continuous variable. Generalization is tested on the worst-case configuration possible to make sure the model is adequate: the training period covers stationary operations, while the test periods mainly focus on NOx emissions during transient operations.

Peer Reviewed
Comparison of supervised learning techniques for atmospheric pollutant monitoring in a Kraft pulp mill
Sainlez, Matthieu ULg; Heyen, Georges ULg

in Journal of Computational & Applied Mathematics (2012)

In this paper, supervised learning techniques are compared for predicting nitrogen oxide (NOx) pollutant emissions from the recovery boiler of a Kraft pulp mill. Starting from a large database of raw process data related to a Kraft recovery boiler, we consider a regression problem in which we try to predict the value of a continuous variable. Generalization is tested on the worst-case configuration possible to make sure the model is adequate: the training period covers stationary operations, while the test periods mainly focus on NOx emissions during transient operations. This comparison involves neural network techniques (i.e., multilayer perceptron and NARX network), tree-based methods and multiple linear regression. We illustrate the potential of a dynamic neural approach compared to the others in this task.

Les réseaux de neurones artificiels dans la finance [Artificial neural networks in finance]
Sainlez, Matthieu ULg

Scientific conference (2011, November 10)

Peer Reviewed
Comparison of Machine Learning techniques for atmospheric pollutant monitoring in a Kraft pulp mill
Sainlez, Matthieu ULg; Heyen, Georges ULg

Conference (2011, November)

In this paper, machine learning techniques are compared for predicting nitrogen oxide (NOx) pollutant emissions from the recovery boiler of a Kraft pulp mill. Starting from a large database of raw process data related to a Kraft recovery boiler, we consider a regression problem in which we try to predict the value of a continuous variable. Generalization is tested on the worst-case configuration possible to make sure the model is adequate: the training period covers stationary operations, while the test periods mainly focus on NOx emissions during transient operations. This comparison involves neural network techniques (i.e., static multilayer perceptron and dynamic NARX network), tree-based methods and multiple linear regression. We illustrate the potential of a dynamic neural approach compared to the others in this prediction task.

Peer Reviewed
Kraft RB : recurrent neural network prediction of steam production
Sainlez, Matthieu ULg; Heyen, Georges ULg

Poster (2011, May 30)

In this study, neural network approaches are compared for predicting the high pressure (HP) steam flow rate from a Kraft recovery boiler. We apply two types of neural networks: a static multilayer perceptron and a dynamic Elman recurrent neural network. Starting from a one-day database of raw process data related to the boiler, the goal is to model and predict the next 12 hours of HP steam flow from the boiler to the steam turbine. The results illustrate the potential of the dynamic approach in this task.

L'intelligence des données au service des industries [Data intelligence in the service of industry]
Sainlez, Matthieu ULg

Scientific conference (2011, March 30)

Peer Reviewed
Supervised learning for a Kraft recovery boiler: a data mining approach with Random Forests.
Sainlez, Matthieu ULg; Heyen, Georges ULg; Lafourcade, Sébastien

in Favrat, Daniel; Maréchal, François (Eds.) ECOS 2010 Volume IV (Power plants and Industrial processes) (2011, January 01)

A data mining methodology, random forests, is applied to predict high pressure steam production from the recovery boiler of a Kraft pulping process. Starting from a large database of raw process data, the goal is to identify the input variables that explain the most significant output variations and to predict the high pressure steam flow.

Peer Reviewed
Recurrent neural network prediction of steam production in a Kraft recovery boiler
Sainlez, Matthieu ULg; Heyen, Georges ULg

in Pistikopoulos, E. N.; Georgiadis, M. C.; Kokossis, A. C. (Eds.) 21st European Symposium on Computer Aided Process Engineering (Part B) (2011)

In this paper, neural network approaches are compared for predicting the high pressure (HP) steam flow rate from a Kraft recovery boiler. We apply two types of neural networks: a static multilayer perceptron and a dynamic Elman recurrent neural network. Starting from a one-day database of raw process data related to the boiler, the goal is to model and predict the next 12 hours of HP steam flow from the boiler to the steam turbine. The results illustrate the potential of the dynamic approach in this task.

Peer Reviewed
Kraft recovery boiler analysis : a data mining approach
Sainlez, Matthieu ULg

Poster (2010, July)

Peer Reviewed
Kraft RB : classification of relevant variables with Random Forests
Sainlez, Matthieu ULg

Poster (2010, June)

Peer Reviewed
Supervised learning for a Kraft recovery boiler: a data mining approach with Random Forests.
Sainlez, Matthieu ULg; Heyen, Georges ULg; Lafourcade, Sébastien

Conference (2010, June)

A data mining methodology, random forests, is applied to predict high pressure steam production from the recovery boiler of a Kraft pulping process. Starting from a large database of raw process data, the goal is to identify the input variables that explain the most significant output variations and to predict the high pressure steam flow.

Peer Reviewed
Performance monitoring of an industrial boiler: classification of relevant variables with Random Forests
Sainlez, Matthieu ULg; Heyen, Georges ULg

in Pierucci, Sauro; Ferraris, Guido Buzzi (Eds.) 20th European Symposium on Computer Aided Process Engineering – ESCAPE20 (2010)

A data mining methodology, random forests, is applied to analyze pollutant emissions from the recovery boiler of a Kraft pulping process. Starting from a large database of raw process data, the goal is to identify the input variables that explain most of the output variation.
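
The idea of ranking input variables by how much they explain the output can be sketched with a permutation-based importance score. This is a simplified stand-in, not the paper's procedure: a random forest computes such scores per tree on out-of-bag samples, whereas the sketch below shuffles one column at a time on a single data set and works with any predictor. The names `permutation_importance`, `mse` and `toy_model`, and the toy data, are assumptions made for illustration.

```python
import random


def mse(predict, rows, targets):
    """Mean squared error of a predictor over a data set."""
    return sum((predict(r) - t) ** 2 for r, t in zip(rows, targets)) / len(targets)


def permutation_importance(predict, rows, targets, seed=0):
    """Increase in MSE when each input column is shuffled in turn.

    A larger increase suggests that the predictor relies more on that variable.
    """
    rng = random.Random(seed)
    baseline = mse(predict, rows, targets)
    importances = []
    for j in range(len(rows[0])):
        column = [r[j] for r in rows]
        rng.shuffle(column)  # break the link between variable j and the target
        shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
        importances.append(mse(predict, shuffled, targets) - baseline)
    return importances


# Toy data: the target depends on the first variable only.
rows = [[float(i), float(i % 3)] for i in range(20)]
targets = [2.0 * r[0] for r in rows]
toy_model = lambda r: 2.0 * r[0]  # hypothetical fitted model

scores = permutation_importance(toy_model, rows, targets)
print(scores)  # the second score is exactly 0.0: that variable is never used
```

In practice the predictor would be a trained forest and the scores would be computed on data not used to grow each tree, which avoids rewarding variables the model has merely memorized.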

Peer Reviewed
Gene expression data analysis using spatiotemporal blind source separation
Sainlez, Matthieu ULg; Absil, Pierre-Antoine; Teschendorff, Andrew E.

in Verleysen, Michel (Ed.) ESANN'2009 proceedings, European Symposium on Artificial Neural Networks - Advances in Computational Intelligence and Learning. (2009, April)

We propose a “time-biased” and a “space-biased” method for spatiotemporal independent component analysis (ICA). The methods rely on computing an orthogonal approximate joint diagonalizer of a collection of covariance-like matrices. In the time-biased version, the time signatures of the ICA modes are constrained to be white, whereas the space-biased version imposes the same condition on the space signatures. We apply the two methods to the analysis of gene expression data, where the genes play the role of space and the cell samples stand for time. This study is a step towards addressing a question first raised by Liebermeister, namely whether ICA methods for gene expression analysis should impose independence across genes or across cell samples. Our preliminary experiment indicates that both approaches have value, and that exploring the continuum between these two extremes can provide useful information about the interactions between genes and their impact on the phenotype.
