Matrix-free algorithm for the optimization of multidisciplinary systems

504 Accesses
2 Citations
3 Altmetric
Explore all metrics

Abstract

Multidisciplinary engineering systems are usually modeled by coupling software components that were developed for each discipline independently. The use of disparate solvers complicates the optimization of multidisciplinary systems and has been a long-standing motivation for optimization architectures that support modularity. The individual discipline feasible (IDF) formulation is particularly attractive in this respect. IDF achieves modularity by introducing optimization variables and constraints that effectively decouple the disciplinary solvers during each optimization iteration. Unfortunately, the number of variables and constraints can be significant, and the IDF constraint Jacobian required by most conventional optimization algorithms is prohibitively expensive to compute. Furthermore, limited-memory quasi-Newton approximations, commonly used for large-scale problems, exhibit linear convergence rates that can struggle with the large number of design variables introduced by the IDF formulation. In this work, we show that these challenges can be overcome using a reduced-space inexact-Newton-Krylov algorithm. The proposed algorithm avoids the need for the explicit constraint Jacobian and Hessian by using a Krylov iterative method to solve the Newton steps. The Krylov method requires matrix-vector products, which can be evaluated in a matrix-free manner using second-order adjoints. The Krylov method also needs to be preconditioned, and a key contribution of this work is a novel and effective preconditioner that is based on approximating a monolithic solution of the (linearized) multidisciplinary system. We demonstrate the efficacy of the algorithm by comparing it with the popular multidisciplinary feasible formulation on two test problems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Numerical investigation of non-hierarchical coordination for distributed multidisciplinary design optimization with fixed computational budget

Article 03 June 2016

A new Lagrangian solution scheme for non-decomposable multidisciplinary design optimization problems

Article 29 June 2023

An automated selection algorithm for nonlinear solvers in MDO

Article 21 June 2018

References

Arreckx S, Lambe AB, Martins JRRA, Orban D (2016) A matrix-free augmented lagrangian algorithm with application to large-scale structural design optimization. Optim Eng 17(2):359–384
Article MATH MathSciNet Google Scholar
Benzi M, Golub GH, Liesen J (2005) Numerical solution of saddle point problems. Acta Numer 14:1–137. doi:10.1017/s0962492904000212
Article MATH MathSciNet Google Scholar
Biros G, Ghattas O (2005a) Parallel lagrange–newton–krylov–schur methods for pde-constrained optimization. part i: the krylov–schur solver. SIAM J Sci Comput 27(2):687–713
Article MATH MathSciNet Google Scholar
Biros G, Ghattas O (2005b) Parallel lagrange–newton–krylov–schur methods for pde-constrained optimization. part ii: the lagrange–newton solver and its application to optimal control of steady viscous flows. SIAM J Sci Comput 27(2):714–739
Article MATH MathSciNet Google Scholar
Bloebaum C (1995) Coupling strength-based system reduction for complex engineering design. Struct Optim 10(2):113–121
Article Google Scholar
Byrd RH, Curtis FE, Nocedal J (2008) An inexact sqp method for equality constrained optimization. SIAM J Optim 19(1):351– 369
Article MATH MathSciNet Google Scholar
Byrd RH, Curtis FE, Nocedal J (2010) An inexact newton method for nonconvex equality constrained optimization. Math Program 122(2):273–299
Article MATH MathSciNet Google Scholar
Carpenter MH, Gottlieb D, Abarbanel S (1994) Time-stable boundary conditions for finite-difference schemes solving hyperbolic systems: methodology and application to high-order compact schemes. J Comput Phys 111(2):220–236. doi:10.1006/jcph.1994.1057
Article MATH MathSciNet Google Scholar
Conn AR, Gould NI, Toint PL (2000) Trust region methods. SIAM
Cramer EJ, Dennis JE Jr, Frank PD, Lewis RM, Shubin GR (1994) Problem formulation for multidisciplinary optimization. SIAM J Optim 4(4):754–776
Article MATH MathSciNet Google Scholar
Eisenstat SC, Walker HF (1996) Choosing the forcing terms in an inexact newton method. SIAM J Sci Comput 17(1):16–32
Article MATH MathSciNet Google Scholar
Felippa CA (2001) A historical outline of matrix structural analysis: a play in three acts. Comput Struct 79 (14):1313–1324
Article Google Scholar
Fletcher R, Leyffer S (2002) Nonlinear programming without a penalty function. Math Program 91 (2):239–269
Article MATH MathSciNet Google Scholar
Funaro D, Gottlieb D (1988) A new method of imposing boundary conditions in pseudospectral approximations of bolic equations. Math Comput 51(184):599–613
Article MATH Google Scholar
Gill PE, Murray W, Saunders MA (2005) Snopt: an sqp algorithm for large-scale constrained optimization. SIAM Rev 47(1):99– 131
Article MATH MathSciNet Google Scholar
Haftka RT (1985) Simultaneous analysis and design. AIAA J 23(7):1099–1103
Article MATH MathSciNet Google Scholar
Haftka RT, Sobieszczanski-Sobieski J, Padula SL (1992) On options for interdisciplinary analysis and design optimization. Struct Optim 4(2):65–74
Article Google Scholar
Heinkenschloss M, Ridzal D (2008) An inexact trust-region sqp method with applications to pde-constrained optimization. In: Numerical mathematics and advanced applications. Springer, pp 613–620
Heinkenschloss M, Ridzal D (2014) A matrix-free trust-region sqp method for equality constrained optimization. SIAM J Optim 24(3):1507–1541
Article MATH MathSciNet Google Scholar
Heroux MA, Bartlett RA, Howle VE, Hoekstra RJ, Hu JJ, Kolda TG, Lehoucq RB, Long KR, Pawlowski RP, Phipps ET, Salinger AG, Thornquist HK, Tuminaro RS, Willenbring JM, Williams A, Stanley KS (2005) An overview of the trilinos project. ACM Trans Math Softw 31(3):397–423. doi:10.1145/1089014.1089021
Article MATH MathSciNet Google Scholar
Herskovits J, Mappa P, Goulart E, Soares CM (2005) Mathematical programming models and algorithms for engineering design optimization. Comput Methods Appl Mech Eng 194(30):3244–3268
Article MATH MathSciNet Google Scholar
Hicken JE, Dener A (2015) A flexible iterative solver for nonconvex, equality-constrained quadratic subproblems. SIAM J Sci Comput 37(4):A1801–A1824
Article MATH MathSciNet Google Scholar
Hicken JE, Zingg DW (2013) Summation-by-parts operators and high-order quadrature. J Comput Appl Math 237(1):111–125. doi:10.1016/j.cam.2012.07.015
Article MATH MathSciNet Google Scholar
Jameson A (1989) Aerodynamic design via control theory. In: Recent advances in computational fluid dynamics. Springer, pp 377– 401
Kennedy GJ, Martins JRRA (2010) Parallel solution methods for aerostructural analysis and design optimization. In: 13th AIAA/ISSMO multidisciplinary analysis optimization conference, p 9308
Kenway GKW, Kennedy GJ, Martins JRRA (2014) Scalable parallel approach for high-fidelity steady-state aeroelastic analysis and adjoint derivative computations. AIAA J 52(5):935–951
Article Google Scholar
Knoll DA, Keyes DE (2004) Jacobian-free newton–krylov methods: a survey of approaches and applications. J Comput Phys 193(2):357–397
Article MATH MathSciNet Google Scholar
Kodiyalam S, Sobieszczanski-Sobieski J (2001) Multidisciplinary design optimisation-some formal methods, framework requirements, and application to vehicle design. Int J Veh Des 25(1–2):3–22
Article Google Scholar
Kreiss HO, Scherer G (1974) Finite element and finite difference methods for hyperbolic partial differential equations. In: de Boor C (ed) Mathematical aspects of finite elements in partial differential equations. Mathematics Research Center, the University of Wisconsin. Academic Press, New York
Martins JRRA, Lambe AB (2013) Multidisciplinary design optimization: a survey of architectures. AIAA J 51(9):2049–2075
Martins JRRA, Alonso JJ, Reuther JJ (2004) High-fidelity aerostructural design optimization of a supersonic business jet. J Aircr 41(3):523–530
Article Google Scholar
Mattsson K, Svärd M, Nordström J (2004) Stable and accurate artificial dissipation. J Sci Comput 21(1):57–79
Article MATH MathSciNet Google Scholar
Maute K, Nikbay M, Farhat C (2001) Coupled analytical sensitivity analysis and optimization of three-dimensional nonlinear aeroelastic systems. AIAA J 39(11):2051–2061
Article MATH Google Scholar
Maute K, Nikbay M, Farhat C (2003) Sensitivity analysis and design optimization of three-dimensional non-linear aeroelastic systems by the adjoint method. Int J Numer Methods Eng 56(6):911–933
Article MATH Google Scholar
Nash SG, Nocedal J (1991) A numerical study of the limited memory bfgs method and the truncated-newton method for large scale optimization. SIAM J Optim 1(3):358–372
Article MATH MathSciNet Google Scholar
Özkaya E, Gauger NR (2009) Single-step one-shot aerodynamic shape optimization. In: Optimal control of coupled systems of partial differential equations. Springer, pp 191–204
Schittkowski K (1986) Nlpql: a fortran subroutine solving constrained nonlinear programming problems. Ann Oper Res 5(2):485–500
Article MathSciNet Google Scholar
Steihaug T (1983) The conjugate gradient method and trust regions in large scale optimization. SIAM J Numer Anal 20(3):626–637
Article MATH MathSciNet Google Scholar
Strand B (1994) Summation by parts for finite difference approximations for d/dx. J Comput Phys 110 (1):47–67. doi:10.1006/jcph.1994.1005
Article MATH MathSciNet Google Scholar
Ta’asan S, Kuruvila G, Salas M (1992) Aerodynamic design and optimization in one shot. In: 30th aerospace sciences meeting and exhibit, p 25
Turner M (1959) The direct stiffness method of structural analysis. Boeing Airplane Company
Wang Z, Navon IM, Le Dimet F, Zou X (1992) The second order adjoint analysis: theory and applications. Meteorog Atmos Phys 50(1–3):3–20
Article Google Scholar
Wright S, Nocedal J (2006) Numerical optimization, 2nd edn. Springer Science

Download references

Acknowledgements

This material is based upon work supported by the National Science Foundation under Grant No. 1332819. The authors gratefully acknowledge this support.

Author information

Authors and Affiliations

Mechanical, Aerospace and Nuclear Engineering, Rensselaer Polytechnic Institute, 110 8th Street, Troy, NY, 12180, USA
Alp Dener & Jason E. Hicken

Authors

Alp Dener
View author publications
You can also search for this author in PubMed Google Scholar
Jason E. Hicken
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alp Dener.

Appendices

Appendix

Proof of Theorem 1

Consider an MDA with d disciplines. Let the equation for the i th discipline be given by

$${\boldsymbol{\mathcal{R}}}_{i}\left( \boldsymbol{w}_{i}, \{ {\boldsymbol{\mathcal{E}}}_{ij}(\boldsymbol{w}_{j}) | j\,=\,1,\ldots,d,\! j\!\neq\! i \} \right), \qquad \forall i \,=\, 1,\ldots,d. $$

where the set notation in the second argument indicates that the residual depends on the states of the other d − 1 disciplines via the nonlinear mappings ${\boldsymbol {\mathcal {E}}}_{ij}$. Next, introduce the coupling variables, ${{\bar {\boldsymbol {w}}}}_{ij}$, which are defined by

$$\begin{array}{@{}rcl@{}} {\boldsymbol{\mathcal{C}}}_{ij}(\boldsymbol{w}_{j},{{\bar{\boldsymbol{w}}}}_{ij}) &\equiv& {\boldsymbol{\mathcal{E}}}_{ij}(\boldsymbol{w}_{j}) - {{\bar{\boldsymbol{w}}}}_{ij} = 0,\\ \forall i,j &=& 1,\dots,d, \; j\neq i. \end{array} $$

Using the above conditions, the residual of the i th discipline can be expressed as a function of the coupling variables:

$${\boldsymbol{\mathcal{R}}}_{i}\left( \boldsymbol{w}_{i}, \{{{\bar{\boldsymbol{w}}}}_{ij} | j=1,\ldots,d, j\neq i \}\right), \qquad \forall i = 1,\ldots,d. $$

Let $\boldsymbol {w}^{T} = [\boldsymbol {w}_{1}^{T},\ldots ,\boldsymbol {w}_{d}^{T}]$ and ${{\bar {\boldsymbol {w}}}}^{T} = [{{\bar {\boldsymbol {w}}}}_{12}^{T},\ldots ,{{\bar {\boldsymbol {w}}}}_{d,d-1}^{T}]$ denote the compound vectors containing all the states and coupling variables, respectively, and consider the compound residual given by

$$\left( \begin{array}{c} {\boldsymbol{\mathcal{R}}}(\boldsymbol{w},{{\bar{\boldsymbol{w}}}}) \\ {\boldsymbol{\mathcal{C}}}(\boldsymbol{w},{{\bar{\boldsymbol{w}}}}) \end{array}\right) =\left( \begin{array}{c} {\boldsymbol{\mathcal{R}}}_{1}(\boldsymbol{w}_{1}, {{\bar{\boldsymbol{w}}}}_{12}, \ldots, {{\bar{\boldsymbol{w}}}}_{1d}) \\ {\vdots} \\ {\boldsymbol{\mathcal{R}}}_{d}(\boldsymbol{w}_{d}, {{\bar{\boldsymbol{w}}}}_{d1}, \ldots, {{\bar{\boldsymbol{w}}}}_{d,d-1}) \\ {\boldsymbol{\mathcal{C}}}_{12}(\boldsymbol{w}_{2},{{\bar{\boldsymbol{w}}}}_{12}) \\ {\vdots} \\ {\boldsymbol{\mathcal{C}}}_{d,d-1}(\boldsymbol{w}_{d-1},{{\bar{\boldsymbol{w}}}}_{d,d-1}) \end{array}\right). $$

The Jacobian of this compound residual is the matrix

$${\textsf{D}} \equiv \left[\begin{array}{cc} \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial \boldsymbol{w}} & \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial {{\bar{\boldsymbol{w}}}}} \\ \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}} & -{\textsf{I}} \end{array}\right]. $$

The (2,2) block of D is the negative identity, which is clearly invertible, and the (1,1) block of D is the block diagonal matrix d i a g(A ₁,…, A _d), which is also invertible by assumption that the A _i are invertible. Therefore, we can form the Schur complement of D with respect to either block.

The Schur complement with respect to the (2,2) block of D is the matrix

$${\textsf{S}}_{(2,2)} \equiv \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial \boldsymbol{w}} + \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial {{\bar{\boldsymbol{w}}}}} \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}}. $$

Now, the (i, j) block of second term above is, for i ≠ j,

$$\left( \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial {{\bar{\boldsymbol{w}}}}} \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}} \right)_{i,j} = \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial {{\bar{\boldsymbol{w}}}}_{ij}} \frac{\partial {\boldsymbol{\mathcal{E}}}_{ij}}{\partial \boldsymbol{w}_{j}} = \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial \boldsymbol{w}_{j}}, $$

where the last equality follows from the chain rule. There is no contribution to the (i, i) block from the second term, since there is no coupling variable between a discipline and itself. Conversely, recall that the first term in the Schur complement is ${\mathsf {A}}_{i} = \partial {\boldsymbol {\mathcal {R}}}_{i}/\partial \boldsymbol {w}_{i}$, which has no off-diagonal blocks. Summarizing, the Schur complement with respect to the (2,2) block of D is the monolithic MDA Jacobian

$${\textsf{S}}_{(2,2)} = \left( \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial \boldsymbol{w}} + \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial {{\bar{\boldsymbol{w}}}}} \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}} \right)_{i,j} = \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial \boldsymbol{w}_{j}}. $$

Next, we consider the Schur complement of D with respect to the (1,1) block:

$${\textsf{S}}_{(1,1)} \equiv -{\textsf{I}} - \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}} \left[\frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial \boldsymbol{w}}\right]^{-1} \frac{\partial {\boldsymbol{\mathcal{R}}}}{\partial {{\bar{\boldsymbol{w}}}}}. $$

Now, the direct sensitivities of w _i with respect to ${{\bar {\boldsymbol {w}}}}_{ij}$ can be found by taking the total derivative of ${\boldsymbol {\mathcal {R}}}_{i} = \mathbf {0}$:

$$\begin{array}{@{}rcl@{}} &&\frac{d {\boldsymbol{\mathcal{R}}}_{i}}{d {{\bar{\boldsymbol{w}}}}_{ij}} = \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial {{\bar{\boldsymbol{w}}}}_{ij}} + \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial \boldsymbol{w}_{i}} \frac{d \boldsymbol{w}_{i}}{d {{\bar{\boldsymbol{w}}}}_{ij}} = 0 \\ &&\Rightarrow\qquad \frac{d \boldsymbol{w}_{i}}{d {{\bar{\boldsymbol{w}}}}_{ij}} = - \left[\frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial \boldsymbol{w}_{i}}\right]^{-1} \frac{\partial {\boldsymbol{\mathcal{R}}}_{i}}{\partial {{\bar{\boldsymbol{w}}}}_{ij}}. \end{array} $$

Consequently, substituting this expression, the Schur complement with respect to the (1,1) block of D simplifies to

$${\textsf{S}}_{(1,1)} = -{\textsf{I}} + \frac{\partial {\boldsymbol{\mathcal{C}}}}{\partial \boldsymbol{w}} \frac{d \boldsymbol{w}}{d {{\bar{\boldsymbol{w}}}}} = \frac{d {\boldsymbol{\mathcal{C}}}}{d {{\bar{\boldsymbol{w}}}}}, $$

which is the total derivative of the coupling constraints with respect to the coupling variables, i.e. the IDF Jacobian.

The desired result now follows. If the MDA Jacobian is invertible, i.e. the Schur complement S _(2,2) is invertible, then D is invertible. If D is invertible, then the IDF Jacobian must be invertible, since it is the Schur complement S _(1,1). The reverse implication is analogous. This completes the proof.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dener, A., Hicken, J.E. Matrix-free algorithm for the optimization of multidisciplinary systems. Struct Multidisc Optim 56, 1429–1446 (2017). https://doi.org/10.1007/s00158-017-1734-0

Download citation

Received: 06 April 2017
Revised: 18 May 2017
Accepted: 01 June 2017
Published: 23 June 2017
Issue Date: December 2017
DOI: https://doi.org/10.1007/s00158-017-1734-0

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Numerical investigation of non-hierarchical coordination for distributed multidisciplinary design optimization with fixed computational budget

A new Lagrangian solution scheme for non-decomposable multidisciplinary design optimization problems

An automated selection algorithm for nonlinear solvers in MDO

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Matrix-free algorithm for the optimization of multidisciplinary systems

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Numerical investigation of non-hierarchical coordination for distributed multidisciplinary design optimization with fixed computational budget

A new Lagrangian solution scheme for non-decomposable multidisciplinary design optimization problems

An automated selection algorithm for nonlinear solvers in MDO

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation