Recent Advances in Chemical Process Optimization - Chemie Ingenieur Technik - 20
Recent Advances in Chemical Process Optimization - Chemie Ingenieur Technik - 20
Systematic optimization strategies are essential in process design, operations and control. This study provides a concise
overview of recent advances in process optimization, with a particular focus on nonlinear programming. Here, derivative-
free, SQP, reduced gradient, and interior point methods are described. Moreover, state of the art problem formulations
and modeling environments are described, and contrasted with popular process simulation tools. Finally research direc-
tions are identified for future trends in multi-scale optimization within large-scale equation-oriented optimization environ-
ments.
Keywords: Complementarity problems, Multi-scale modeling, Nonlinear programming, Optimization, Process simulation
Received: February 13, 2014; revised: March 01, 2014; accepted: March 21, 2014
Chem. Ing. Tech. 2014, 86, No. 7, 943–952 © 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.cit-journal.com
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
944 Review Ingenieur
Technik
www.cit-journal.com © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Chem. Ing. Tech. 2014, 86, No. 7, 943–952
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
Ingenieur Review 945
Technik
Table 1. Optimization Problem Classes and Process Applications. Global Optimization Methods
These methods for NLPs have
MILP MINLP Global LP, QP NLP Derivative Free
been developed to converge to the
HENS x x x x x x global optimum. More specifi-
MENS x x x x x x cally, one can show under mild
conditions that they converge to
Separations x x x an e-distance to the global opti-
Reactors x x x x mum with a finite number of
steps. Because there are no crite-
Equipment x x x
Design ria analogous to Karush-Kuhn-
Tucker conditions (KKT) for
Flowsheeting x x
NLPs, global optimization meth-
Planning x x x ods are based on spatial branch
Scheduling x x x x
and bound searches over the fea-
sible region. These methods par-
Real-time x x tition the feasible region into sub-
Optimization
regions, where upper bounds on
Linear MPC x the objective function are com-
Nonlinear MPC x puted, and lower bounds are de-
rived from convex relaxations of
Hybrid MPC x the objective function and con-
straints for each subregion. The
however, plays a crucial role at the level of real time optimi- algorithm then proceeds to eliminate all subregions that
zation. Finally, process control has traditionally relied on LP have infeasible constraint relaxations or lower bounds that
and NLP models with special structures, although MILPs are greater than the least upper bound. After this, the
are being increasingly used for hybrid systems. remaining regions are further partitioned to create new sub-
In this survey the focus lies on process optimization with regions and the cycle continues until the upper and lower
NLP models (Eq. (1) without integer variables) where the ob- bounds converge. Comprehensive development of these
jective and constraint functions are differentiable and deri- methods can be found in [7 – 9].
vatives are available from the model. Applications of these
form the NLP column in Tab. 1. Optimization methods that Mixed-Integer Programming
are not included in this survey include the following. Both MILP and MINLP deal with discrete and continuous
decision variables in Eq. (1). As with global optimization
Optimization Methods Without Derivatives methods, there are no optimality conditions, like the KKT
Derivative-free optimization methods (DFO) include the conditions, that can be applied directly. Instead, a systematic
classical direct search methods based on simplex and pat- search of the solution space, coupled with upper and lower
tern searches [2, 3] and phenomenological methods, such as bounding information, is applied. For MILPs, very efficient
simulated annealing (SA) [4] and genetic algorithms (GA) algorithms have been developed. Lower bounds are provided
[5]. More recently, classical direct search approaches have by LP solutions at intermediate nodes in the tree; these are
been extended so that rigorous convergence to local solu- relatively easy to calculate with the simplex method. Once
tions can be proved. These include the multivariable DFO the solution is known at an intermediate node, efficient pi-
algorithm that uses surrogate models, such as quadratic in- voting operations from this solution update solutions for
terpolation models, for the objective function within a trust the nodes at the next level. Moreover, a number of branch-
region method. DFO methods are formulated, analyzed and ing strategies navigate the tree very efficiently. A detailed de-
reviewed in [6]. These methods are easy to apply to a wide scription and availability of these and other MILP solvers
variety of problem types and optimization models and may can be found in [10]. Solution methods for MINLPs operate
also favor the search for global rather than locally optimal on the same concepts, but are more challenging, as NLPs re-
solutions. However, they are usually restricted to uncon- sult when binary variables are fixed, and these need to be
strained or bound constrained problems. Otherwise, they of- solved while navigating the search tree of binary decisions.
ten have difficulties with constraints, as equality constraint Strategies for the solution of MINLPs include NLP branch
elimination and addition of penalty functions for inequality and bound (NLP/BB), outer approximation (OA) and gener-
constraints are the only options. Finally, derivative-free alized disjunctive programming (GDP) methods. Compre-
methods scale poorly with the number of decision variables hensive reviews of MINLP methods can be found in [11, 12].
and are rarely applied to problems with more than a few In the next section, differentiable NLPs are considered
dozen decision variables. where efficient large-scale methods (to local solutions) are
briefly described. Sect. 3 continues with a discussion of opti-
Chem. Ing. Tech. 2014, 86, No. 7, 943–952 © 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.cit-journal.com
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
946 Review Ingenieur
Technik
h x 0; g x ≤ 0 (5)
www.cit-journal.com © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Chem. Ing. Tech. 2014, 86, No. 7, 943–952
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
Ingenieur Review 947
Technik
Chem. Ing. Tech. 2014, 86, No. 7, 943–952 © 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.cit-journal.com
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
948 Review Ingenieur
Technik
The variables are then partitioned as nonbasic variables Complementarities also arise naturally in the solution of
fixed at their bounds, superbasic variables, between bounds, multi-level optimization problems. Such problems are
that drive the optimization, and basic variables that can members of a more general problem class called mathema-
be solved from the equality constraints; this leads to tical programs with equilbrium constraints (MPECs) [16].
zT = [zNT, zST, zBT]. This partition is derived from local infor- For bi-level optimization problems of the form
mation and may change over the course of the optimization
iterations. The corresponding KKT conditions can be written min f(x)
as: s.t. (x,y) ∈ Z, y = arg{miny h (x,y), y ∈ C (x)} (24)
∇N f z ∇N c zc b a bb (19) Z and C(x) are defined as feasible regions for the upper
and lower level problems, respectively, with f(x,y) and h(x,y)
∇S f z ∇S c zc 0 (20) as objective functions for the upper and lower level prob-
lems, respectively. In particular, bi-level optimization prob-
∇B f z ∇B c zc 0 (21) lems can be reformulated as mathematical programs with
complementarity constraints (MPCCs) by writing the opti-
c z 0 (22) mality conditions of the inner optimization problem (espe-
cially if this problem is convex) as constraints on the outer
zN;j aj or bj ; b a;j ≥ 0; bb;j 0 or b b;j ≥ 0; ba;j 0 (23) problem. An MPCC takes the following general form:
min f(x)
where c and b are the KKT multipliers for the equality and s.t. h(x,y,z) = 0
bound constraints, respectively, and Eq. (23) replaces the g(x,y,z) ≤ 0 (25)
complementarity conditions from Eq. (6). Reduced gradient 0≤x⊥y≥0
methods work by nesting Eqs. (21) and (22) within Eqs. (19)
and (20). At iteration k, for fixed values of zNk and zSk, one where complementarity constraint in Eq. (25) implies xi = 0
can solve for zB using Eq. (22) and for c using Eq. (21). Fol- or yi = 0 (or both) for each vector element i. The complemen-
lowing this, bound multipliers b are calculated from tarity constraint may be written in several equivalent ways
Eq. (19). Over the course of the iterations, if the variables zB including:
or zS exceed their bounds or if some bound multipliers b
become negative, then the variable partition needs to be xT y = 0, x ≥ 0, y ≥ 0
changed and the Eqs. (19) – (23) are reconstructed. These re- xi yi = 0, x ≥ 0, y ≥ 0 (26)
duced gradient methods are embodied in the popular GRG2 xi yi ≤ 0, x ≥ 0, y ≥ 0
and CONOPT codes. In addition, reduced gradient concepts xi – max(0, xi – yi) = 0.
have been extended to develop large-scale reduced-space
SQP algorithms in SNOPT and rSQP. See [15] for further These alternate forms are particularly useful when apply-
details and references. ing existing NLP solution strategies to solve MPCCs.
On the other hand, MPCCs of the form of Eq. (25) cannot
be solved directly with most NLP solvers because the asso-
2.3 Handling Complementarity Constraints ciated optimality conditions of Eq. (25) do not satisfy con-
straint qualifications CQs. This leads to singular KKT con-
The MINLP (Eq. (1)) leads to the most general description of ditions, with unbounded constraint multipliers and
process optimization problems. To solve these problems, dependent constraints at every feasible point. Instead, the
NLP formulations are usually solved at a lower level for following MPCC reformulations allow standard NLP solvers
fixed values of the discrete variables. On the other hand, the to be applied.
ability to deal with some discrete decisions through continu-
ous variables, within an all-at-once NLP formulation may be min f(x)
more efficient than nested MINLP strategies. This is parti- s.t. h(x,y,z) = 0
cularly true if many discrete decisions must be handled. g(x,y,z) ≤ 0 (27)
One approach to modeling discrete decisions with contin- xiyi ≤ e
uous variables is through complementarity constraints. x,y ≥ 0
Complementarity problems arise in a large number of appli-
cations in engineering and economic systems. They include min f(x)
contact and friction problems in computational mechanics, s.t. h(x,y,z) = 0
equilibrium relations in economics, and a wide variety of g(x,y,z) ≤ 0 (28)
discrete events in process systems. However, these mathe- xTy ≤ e
matical programming formulations and related solution x,y ≥ 0
strategies are not yet fully developed in process engineering.
www.cit-journal.com © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Chem. Ing. Tech. 2014, 86, No. 7, 943–952
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
Ingenieur Review 949
Technik
Chem. Ing. Tech. 2014, 86, No. 7, 943–952 © 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.cit-journal.com
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
950 Review Ingenieur
Technik
Finally, for process models that comprise differential-alge- mization algorithms and remove the need for the user to in-
braic equations (DAEs) or partial differential-algebraic equa- terface with the solver directly. These platforms allow the
tions (PDAEs), as in dynamic and distributed systems, a si- general formulation for all problem classes discussed above
milar choice exists. These models can either be embedded with direct interfaces to state of the art optimization codes.
within a simulation module and interfaced to the NLP sol- Three representative platforms are GAMS (General Alge-
ver, or formulated with an EO strategy and incorporated braic Modeling Systems), AMPL (A Mathematical Program-
directly within the optimization model. For the EO model, ming Language) and AIMMS (Advanced Integrated Multidi-
the NLP is generally very large, with many equality con- mensional Modeling Software). All three require problem
straints. Moreover, discretization of control profiles often model input via a declarative modeling language and pro-
leads to many degrees of freedom for optimization. vide exact gradient and Hessian information through auto-
Additionally, evolution of these model formulations is clo- matic differentiation strategies. Although possible, these
sely tied to the choice of appropriate optimization algo- platforms were not designed to handle externally added pro-
rithms. Efficient NLP algorithms that assume open EO cedural models. As a result, these platforms are best applied
models are available with exact first and second derivatives on optimization models that can be developed entirely with-
for all of the constraint and objective functions. These algo- in their modeling framework. These platforms are widely
rithms work very efficiently with RTO, DAE and PDAE used for large-scale research and industrial applications.
models. On the other hand, for problems with embedded Simulation platforms with optimization are found in the
modules, where function evaluations are expensive, and gra- bottom three levels in Fig. 2. These are often dedicated,
dients and Hessians are difficult to obtain, it is clear that application-specific modeling tools to which optimization
large-scale NLP solvers should not be applied. Fig. 2 shows a solvers have been interfaced. These lead to very useful opti-
hierarchy of models paired with suitable NLP strategies. For mization studies, but because they were not originally de-
instance, large equation-based models are solved most effi- signed for optimization models, they need to be used with
ciently with structured barrier NLP solvers. On the other some caution. In particular, most of these platforms do not
hand, black-box process models with inexact, or approxi- provide exact derivatives to the optimization solver, often
mate, derivatives and few decision variables are poorly they are approximated through finite difference. In addi-
served by large-scale NLP solvers, and derivative-free optimi- tion, the models themselves are constructed and calculated
zation algorithms must be considered instead. The problem through numerical procedures, instead of through an open
formulations also include intermediate levels in Fig. 2, declarative language. Examples of these include widely used
where SQP and reduced gradient methods are expected to process simulators such as Aspen/Plus, PRO/II and Hysys.
perform well. Nevertheless, accurate derivatives can be obtained from
To exploit synergies due to model-solver interfaces, appro- simulation models through the application of efficient auto-
priate modeling platforms are essential for the optimization matic differentiation tools. Examples of these include ADIC,
task. These are classified into two broad areas: optimization ADOL-C, Open AD and Tapenade. When linked to modern
modeling platforms and simulation platforms with optimi- NLP and MINLP algorithms, these can be applied to existing
zation. procedural models to provide powerful optimization capabil-
Optimization modeling platforms represent the top level ities. It should also be noted that more recent platforms such
in Fig. 2. They provide general purpose interfaces for opti- as Aspen Custom Modeler and gPROMS include declarative
models and exact derivatives. The MATLAB platform allows
the flexible formulation of optimization models as well,
although it currently has only limited capabilities for auto-
matic differentiation and limited optimization solvers. More
information on these issues can be found in [15].
www.cit-journal.com © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Chem. Ing. Tech. 2014, 86, No. 7, 943–952
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
Ingenieur Review 951
Technik
of magnitude [24]. These algorithmic advances have been optimized product properties [27]. Because of the substan-
realized through software modeling frameworks that link tial complexity of these models, the computational costs for
optimization models to efficient nonlinear (NLP) and process optimization are prohibitive. While lumped para-
mixed-integer nonlinear programming (MINLP) solvers. meter flowsheet models take only a few CPU seconds to
Here it is important to note that these advances are enabled solve, CFD and multi-phase population balance models may
because these frameworks permit (indeed require) optimi- require many CPU hours or even days.
zation models to be formulated carefully as well-posed prob- Moreover, as shown in [28], multi-scale modeling chal-
lems with exact first and second derivatives. lenges need to be addressed within an optimization frame-
Unfortunately, many commercial simulation tools still work. Here, original detailed models comprising complex
struggle with full flowsheet optimization, thus restricting the differential-algebraic equations (DAEs) and partial differen-
size and scope of synthesis problems that can be practically tial-algebraic equations (PDAEs) are replaced by reduced
considered. Even with state of the art simulation packages models consisting of algebraic equations that can be
(e.g., AspenPlus, gPROMS, ProSIM, Pro/II) that offer gradi- handled within an EO optimization environment, and re-
ent-based optimization, the structure of the flowsheeting quire only a few CPU seconds to evaluate. Because such an
model typically limits the formulation of nonlinear optimiza- optimization strategy is allowed to evaluate and compare
tion problems to no more than 100 degrees of freedom. information from detailed models, this reduced model fra-
While integration of nonlinear optimization presents de- mework will capture complex multi-scale phenomena with-
manding challenges for simulation tools, many of the in the process optimization. However, these strategies need
capabilities for large-scale optimization are already available to consider the following basic questions:
in optimization modeling environments such as GAMS, – What properties are needed for optimization with re-
AMPL and AIMMS. Moreover, additional families of NLP duced models to converge to the optimum of the original
algorithms, such as interior point methods, have been devel- system models?
oped that are better suited for large optimization problems. – What properties govern the construction of reduced mod-
These have been tailored to exploit these characteristics and els in order to balance model accuracy with computa-
lead to very fast performance for NLPs with many thou- tional cost during the optimization?
sands of variables and degrees of freedom. In addition, the – Can an optimization strategy based on reduced models be
incorporation of complementarity conditions to treat non- performed efficiently without frequent recourse to the ori-
smooth elements, process switches and bi-level problem for- ginal models?
mulations leads to very flexible EO optimization models Based on extensions of trust region methods in [6, 29],
[19]. Unfortunately, these algorithms are not yet available in prototype algorithms have been developed, analyzed and de-
most commercial flowsheet simulators, as the optimization monstrated in [28]. These show tremendous potential in
models require fully open, equation-oriented (EO) models dealing with intractable multi-scale optimization models.
with exact first and second derivatives. Coupled with algorithmic and software breakthroughs in
In addition, truly multi-scale process optimization still the EO mode, it is clear that these superior process optimi-
needs effective problem formulation and modeling environ- zation strategies will continue to have tremendous impacts
ments. Most process simulation models consist of lumped in generating more efficient, more productive and more
parameter descriptions with a number of ideal assumptions, sustainable process engineering systems.
e.g., perfect mixing, plug flow, equilibrium behavior and
shortcut models. These models do not account for complex
transport effects, nor do they consider phenomena that take Lorenz T. Biegler is cur-
place at broader length scales. Consequently, they cannot in- rently the Bayer University
clude detailed interactions with material design, complex Professor and Head of
fluid flow and transport effects with multiphase interac- Chemical Engineering at
tions. These effects may also require integration between Carnegie Mellon Univer-
molecular, nanoscale and distributed, continuum levels. As sity, which he joined after
a result, there is a growing need for more detailed, distribu- receiving his Ph.D. from
ted parameter, multi-scale process models. the University of Wiscon-
For instance, optimization models for advanced power sin in 1981. His research
generation processes [25, 26] comprise a heterogeneous interests lie in computer
modeling environment with lumped parameter (algebraic) aided process engineering
models, such as heat exchangers, compressors and expan- (CAPE) and he was an
ders, dynamic models (e.g., for gas separation units) and institute fellow at the National Energy Technology Lab
multi-phase (partial differential-algebraic) CFD models, e.g., as well as a scientist-in-residence at Argonne and San-
for gasification and combustor-turbine units. Similarly, poly- dia National Labs. He is also a member of the National
mer processes require detailed population balance models Academy of Engineering and has authored over 350
to determine molecular weight distributions that provide publications including two books.
Chem. Ing. Tech. 2014, 86, No. 7, 943–952 © 2014 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.cit-journal.com
15222640, 2014, 7, Downloaded from https://onlinelibrary.wiley.com/doi/10.1002/cite.201400033 by CAPES, Wiley Online Library on [02/12/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Chemie
952 Review Ingenieur
Technik
www.cit-journal.com © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Chem. Ing. Tech. 2014, 86, No. 7, 943–952