key: cord-0045678-riq7xe6p authors: Pietrenko-Dabrowska, Anna; Koziel, Slawomir; Leifsson, Leifur title: Reduced-Cost Constrained Modeling of Microwave and Antenna Components: Recent Advances date: 2020-05-25 journal: Computational Science - ICCS 2020 DOI: 10.1007/978-3-030-50426-7_4 sha: e175d2739c6f2d5682c201392ca0c6f6d1dbceb5 doc_id: 45678 cord_uid: riq7xe6p Electromagnetic (EM) simulation models are ubiquitous in the design of microwave and antenna components. EM analysis is reliable but CPU intensive. In particular, multiple simulations entailed by parametric optimization or uncertainty quantification may considerably slow down the design processes. In order to address this problem, it is possible to employ fast metamodels. Here, the popular solution approaches are approximation surrogates, which are versatile and easily accessible. Notwithstanding, the major issue for conventional modeling methods is the curse of dimensionality. In the case of high-frequency components, an added difficulty are highly nonlinear outputs that need to be handled. A recently reported constrained modeling attempts to broaden the applicability of approximation surrogates by confining the surrogate model setup to a small subset of the parameter space. The said region contains the parameter vectors corresponding to high-quality designs w.r.t. the considered figures of interest, which allows for a dramatic reduction of the number of training samples needed to render reliable surrogates without formally restricting the parameter ranges. This paper reviews the recent techniques employing these concepts and provides real-world illustration examples of antenna and microwave structures. Design of contemporary microwave and antenna components has been increasingly dependent on full-wave electromagnetic (EM) simulation tools. EM analysis permits reliable evaluation of arbitrary geometries and taking into account cross-couplings between system components, dielectric anisotropy, or the effects of installation fixtures and radomes. The growing involvement of EM simulation packages is especially pertinent to parameter tuning also referred to as design closure [1] . This is where the performance parameters of the structure at hand are enhanced subject to the assumed design constraints. It is most often achieved through numerical optimization, which entails significant computational expenses. Handling massive EM analyses is one of the major challenges pertaining to EM-based design processes even if local optimization is of concern. Solving tasks such as global optimization [2] , uncertainty quantification or tolerance-aware design [3] , may become computationally unmanageable when attempted directly at full-wave EM simulation level. Reducing the CPU cost of EM-driven design has been the subject of extensive research. One possible option is the development of more efficient numerical algorithms, e.g., incorporation of adjoint sensitivities to speed up gradient-based procedures [4] , or the employment of (local) surrogate models [5] . A notable example of the latter is space mapping [6] . Other approaches include response correction methods [7] , feature-based optimization [8] , or machine learning frameworks, often surrogateassisted [9] . Another option is an overall replacement of the EM model by its faster surrogate, which permits a rapid execution of all types of simulation-based design procedures. Data-driven surrogates belong to the most popular ones due to their versatility [10] . They are constructed by approximating the data sampled from the original (here, EM) model with no physical insight required. Commonly used modeling techniques include polynomial regression [11] , radial basis functions [12] , kriging [13] , support vector regression (SVR) [14] , and polynomial chaos expansion (PCE) [15] . Available alternatives include, among others, hybridization of one of the aforementioned methods. One of these is PC kriging [16] , in which polynomial chaos expansion surrogate becomes a trend function, whereas kriging interpolation is employed to account for the residuals. Despite their merits and popularity, applicability of approximation models is limited by the curse of dimensionality. In the case of high-frequency structures, which often feature nonlinear responses, reliable data-driven surrogates can be constructed for systems described by up to four or five variables. Design utility of such models is of course questionable given the complexity of contemporary devices (both antennas and microwave). A range of methods have been developed to mitigate these problems, including high-dimensional model representation (HDMR) [17] , feature-based modeling [18] , orthogonal matching pursuit (OMP) [19] , as well as variable-fidelity methods (Bayesian model fusion [20] , co-kriging [21] , or two-stage GPR [22] ). In [23] , an alternative approach has been suggested, where the issue of high cost of training data acquisition is addressed by confining the surrogate model domain to a region that contains the designs that are of high quality with respect to the figures of interest relevant for the system at hand. The volume of such a region is dramatically smaller than the conventional box-constrained domain so that restricting the model validity (and, consequently, training data allocation) leads to significant computational savings. Determination of the constrained domain requires additional knowledge, normally in the form of the reference designs pre-optimized with respect to the chosen figures of interest of choice. Several variations of the constrained modeling have been reported, including rudimentary frameworks rendering surrogates that can handle a single operating condition [23] , versions that allow accounting for supplementary figures of interest (e.g., substrate permittivity [24] ), to techniques that permit arbitrary allocation of the reference designs [25] . The nested kriging of [26] also allows for straightforward uniform domain sampling and surrogate model optimization, which was not possible in the earlier versions. This paper reviews the recent developments of the constrained modeling, focusing on three approaches: (i) modeling with structured reference design set [23, 24] , (ii) triangulation-based modeling [25] , as well as (iii) the nested kriging framework [26] . The presented methods are illustrated using real-world microwave and antenna structures, and benchmarked against conventional surrogate modeling methods. A generic formulation of the constrained modeling concept is also provided. We start by formulating the constrained modeling concept. One of the most important components is the confined surrogate model domain. The fundamental criterion deciding upon the domain geometry is design optimality for the assumed design objectives. [23] . Another example is the substrate permittivity [24] . The objective space F is defined by the ranges f k:min f An illustration example follows: if f k are operating frequencies of a multi-band antenna, U(Á) may be defined as −min{B 1 , …, B N }, where B j is the fractional bandwidth corresponding to f j . Thus, minimization of U f (f) leads to achieving the largest possible bandwidths. Fig. 1 ), which determines the region of interest from the point of view of the figures f k . It contains the designs that are of high quality w.r.t. f k as specified by U. Hence, the surrogate constructed in the vicinity of U f (F) is all one needs to carry out the design tasks where f k are of concern. Focusing on U f (F) yields considerable savings in terms of training data acquisition as compared to building the model in X. Yet, some problems arise: (i) how to identify U f (F), (ii) how to carry out design of experiments, and, (iii) how to employ the surrogate (e.g., for parametric optimization) given geometrical complexity of the domain. Sections 3 through 5 address these issues when discussing particular realizations of the constrained modeling concept. The modeling flow is shown in Fig. 2 . The fundamental step of the process is a definition of the model domain. The latter is based on information acquired from a certain number of reference designs optimized for the selected values of performance figures. A particular way of utilizing this data is what distinguishes different versions of the constrained modeling frameworks. The initial versions of constrained modeling framework utilized (structurally) fixed set of reference points. The advantage of this approach was simpler implementation. The downside was limited flexibility. This section describes a specific technique designed to model narrowband antennas with two figures of interest: operating frequency, and relative permittivity of the dielectric substrate [24] . Here, we assume two figures of interest, specifically, the antenna operating frequency f and the relative substrate permittivity e r . The goal is to construct the surrogate for f min f f max , and e min e r e max . The design that is optimum for particular values of f and e r will be denoted as U f (f, e r ). The domain of the model is a neighborhood of the surface defined using 9 reference points allocated within the discussed ranges of the operating frequency and substrate permittivity. We have U f ðf à ; e à r Þ, where f * 2 {f min , f 0 , f max } and e à r 2 {e min , e r0 , e max }, cf. Fig. 3 . We For consistency, v 9 = v 1 . Let z be a point in the parameter space, and P k (z) be its projection onto M k (cf. Fig 4(b) ). We have arg min where The least-square solution to (4) (equivalent to the solution of (3)) is given as where In practice, the expansion coefficients with respect to v k and v k+1 are of interest. These are given as a ¼ a À p k 1. x min is the range of variation of geometry parameters within M. Using these, we can define the domain X S by imposing the following conditions: a y 2 X S if 1. y is close to M in the sense that its orthogonal projection belongs to at least one M k , i.e., we have KðyÞ ¼ fk 2 f1; . . .; 8g : P k ðyÞ 2 M k g 6 ¼ £; 2. minfjjðy À P k ðyÞÞ==dxjj : k 2 KðyÞg d max (here, //stands for component-wise division); d max is a domain thickness parameter (typically, 0.1 d max 0.2). Note that d max in the second condition determines the "perpendicular" size of X S . The size of X S is dramatically smaller (volume-wise) than the size of the hypercube containing the reference designs. The surrogate itself is constructed using kriging interpolation of the EM model response R based on the training data allocated in X S [24] . The design of experiments is based on random sampling within the interval [x min , x max ] assuming uniform probability distribution. The samples allocated outside [x min , x max ] are rejected. The method is illustrated using a ring slot antenna of Fig. 5 (a) [27] , implemented on the 0.76-mm-thick substrate. The design parameters are x = [l f l d w d r s s d o g e r ] T ; e r represents relative permittivity of the substrate. The feed line width w f is computed for each e r to ensure 50 X input impedance. The EM is implemented in CST (*300,000 cells, simulation time 90 s). The goal is to construct the surrogate for the following ranged of the operating frequency and substrate permittivity: f min = 2.5 GHz to f max = 6.5 GHz, and e min = 2.0 to e max = 5.0. There are nine reference points generated by optimizing the antenna [28] for the pairs {f 0 , e r } with f 2 {2.5, 4.5, 6.5} GHz and e r 2 {2.0, 3.5, 5.0}. The optimization objective is matching improvement at f 0 . . For the sake of validation, the surrogate model was constructed using the training set of various sizes, from 100 to 1000 samples. In all cases, we used d max = 0.2. Model validation was performed using the split-sample method with 100 test designs. The benchmark was kriging interpolation surrogate established in the original domain X = [x min , x max ] using 1000 samples. The numerical results are gathered in Table 1 , see also Fig. 6 . Furthermore, Fig. 5(b) illustrates selected projections of the training data set for conventional (uniform) and proposed design of experiments. It can be observed that the accuracy improvement due to constrained sampling is considerable (by a factor of about 3.5). The constrained surrogate that exhibits the same predictive power as the corresponding conventional model can be obtained using around ten times less data samples. In [25] , a generalization of the technique presented in Sect. 3 has been proposed, which is based on triangulation of the reference designs. This technique does not only allow for arbitrary distribution of the reference points but it also has no limitations in the number of figures of interest that can be handled. The parameter and objective spaces are defined as in Sect. 2. The reference designs x ðjÞ ¼ ½x Fig. 7(a) ). In all cases, the surrogate model constructed using kriging interpolation [28] . The model domain X S is determined as a vicinity of the manifold M defined as j¼1 a j x ðk:jÞ : 0 a j 1; The region is defined using the distance from the surface M in the orthogonal complements of the subspaces containing S (k) . Given a point z, it is necessary to find the distance from it to M, which can be done by considering a projection P k (z) onto the affine subspace H k ' S (k) . We also define the simplex anchor x (0) = x (k.1) , and its spanning vectors v (j) = x (k.j+1) − x (0) , j = 1, …, N (cf. Fig. 7(b) ). The projection corresponds to the expansion coefficients w.r.t. v (j) [25] arg min where the vectors v ðjÞ are obtained from v (j) by orthogonalization (i.e., v Note that A is a triangular matrix that contains coefficients obtained from the above orthogonalization procedure. The problem (7) f (1) f (2) f (3) f (4) f (5) f (6) f (7) f (8) S (1) = [ x (1) x (2) x (4) ] S (2) = [ x (1) x (3) x (4) ] (2) x (5) x (6) ] S (4) = [ x (2) x (5) x (7) ] S (5) = [ x (2) x (4) x (6) ] S (6) = [ x (4) x (6) x (7) ] S (7) = [ x (5) x (6) x (8) ] S (8) = [ x (6) x (7) x (8) ] S (9) = [ x (5) x (8) x (9) ] S (4) S (5) S (6) S (7) S (8) f (9) S (9) x (2) x (1) x1 x2 x3 x (3) x (4) x (5) x (6) x (7) x (8) x (9) v (1) z P k (z) The expansion coefficients can be found analytically as The critical factor is whether P k (z) 2 hull(S (k) ) (hull() stands for the convex hull). For that, it is necessary to identify the expansion coefficients a (j) of z w.r.t. {v (j) }. The latter can be obtained as ½a ð1Þ . . . a ðNÞ T ¼ A½ a ð1Þ . . . a ðNÞ T . It should be noted that P k (z) 2 S (k) if 1. a ðjÞ ! 0 for j ¼ 1; . . .; N; and 2. a ð1Þ þ . . . þ a ðNÞ 1. that is, if P k (z) is a convex combination of the vectors v (j) . The next step of surrogate modeling domain definition is to define x max ¼ maxfx ðkÞ ; k ¼ 1; . . .; pg and x min ¼ minfx ðkÞ ; k ¼ 1; . . .; pg. The vector dx ¼ x max À x min is an indication of the geometry parameter variability within the surface M. Using these, the surrogate model domain X S can be defined as in [24] . More specifically, a vector y 2 X S if 1. KðyÞ ¼ fk 2 f1; . . .; N S g : P k ðyÞ 2 S ðkÞ g 6 ¼ £; 2. minfjjðy À P k ðyÞÞ==dxjj : k 2 KðyÞg d max (//stands for component-wise division); d max is a user-defined parameter determining the domain thickness. The major benefit of this definition is that X S is considerable smaller than the original domain X ¼ ½x min ; x max volume-wise, thus the surrogate can be constructed using a reduced training set. Notwithstanding, the domain still contains the optimum designs (with respect to the selected performance figures) so that the surrogate retains its design utility. This is demonstrated in the next section by modeling a dual-band antenna over broad ranges of both geometry and material parameters. For the sake of verification, let us consider a dipole antenna of Fig. 8(a) [30] . The structure is realized on Taconic RF-35 substrate of relative permittivity e r = 3.5 and thickness h = 0.762 mm. There are six adjustable variables x = [l 1 l 2 l 3 w 1 w 2 w 3 ] T . Other parameters are fixed: l 0 = 30, w 0 = 3, s 0 = 0.15 and o = 5 are fixed (dimensions in mm). The computational model R is simulated in CST Microwave Studio (*100,000 cells; simulation time 1 min). We aim at constructing the surrogate within the following objective space: 2.0 GHz f 1 4.0 GHz (lower band), and 4.5 GHz f 2 6.5 GHz (upper band). Figure 8(b) shows the allocation of the reference designs. The latter have been generated using variable-fidelity feature-based optimization [28] . Validation has been carried out by rendering the surrogates using various numbers of training samples, from 100 to 1600. In all cases, d max = 0.05 was employed. The numerical results are gathered in Table 2 . Conventional kriging metamodel is used as a benchmark. The antenna reflection characteristics according to the surrogate and EM simulation have been shown in Fig. 9 . The nested kriging framework proposed in [26] employs two kriging metamodels. One of these models is used to establish the surrogate model domain by mapping the figureof-interest space into the parameter space and to provide the first approximation of the region of interest. The major advantage of [26] is that design of experiments but also model optimization can be implemented in a convenient way. We use the same definitions as in Sect. 2.1 for the objective and parameter spaces. The reference designs optimized w.r.t. f ðjÞ ¼ ½f Similarly as for the method discussed before, d max denotes the domain thickness. The coefficients a k are used to delimit X S (see also Fig. 10 (b)) [26] by defining Using (12), we get The second-level surrogate is a kriging model rendered in X S based on fx ðkÞ B ; Rðx ðkÞ B Þg k¼1;...;NB , where R is the EM-simulation model of the structure of interest. Note that the definition of X S facilitates design of experiments which was a problem for both [23] and [25] . It is implemented using (13) and the mappings from the unit interval [0, 1] n onto X S . Let {z (k) }, k = 1, …, N B , where z (k) = [z 1 (k) … z n (k) ] T , denote the set of uniformly distributed data points in [0, 1] n (here, using LHS [31] ). The mapping is realized in two stages. First, the function h 1 . transforms the unit hypercube onto a F  [−1,1] n−N ( is a Cartesian product). Subsequently, a function h 2 is defined as This paper discussed the recent developments in constrained modeling of highfrequency structures. Domain confinement permits a reduction of the computational overhead related to training data acquisition. At the same time, reliable surrogates can be rendered without formally restricting the ranges of geometry and material parameters as well as the operating conditions. There were three specific realizations of this concept discussed in the work. The main differences between these were in analytical formulation of the surrogate model domain. The simplest technique, discussed in Sect. 3, requires structured allocation of the reference designs. The second method, the triangulation-based modeling of Sect. 4, is more flexible, allows for an arbitrary placement of the reference set, and it can directly handle arbitrary number of operating conditions. The downside is a non-trivial design of experiments. The nested kriging framework is the most comprehensive: the very formulation of the model domain incorporates the means to carry out uniform design of experiments as well as optimization of the surrogate. Notwithstanding, implementation of this approach is more involved. In general, surrogate modeling with domain confinement can be considered a viable workaround dimensionality and parameter range issues, both of which are the fundamental challenges of conventional methods. At the same time, the initial computational overhead required to generate the reference points is well justified when the surrogate is reused, for example, for the purpose of dimension scaling for various operating conditions. Expedited design closure of antennas by means of trustregion-based adaptive response scaling High-dimensional global optimization method for highfrequency electronic design Stochastic response surfaces based on non-intrusive polynomial chaos for uncertainty quantification Antenna design exploiting adjoint sensitivity-based geometry evolution Reliable multistage optimization of antennas for multiple performance figures in highly dimensional parameter spaces A space-mapping interpolating surrogate algorithm for highly optimized EM-based design of microwave devices Simulation-Driven Design by Knowledge-Based Response Correction Techniques Fast simulation-driven feature-based design optimization of compact dual-band microstrip branch-line coupler Gaussian Processes for Machine Learning Evolutionary model type selection for global surrogate modeling Polynomial-based surrogate modeling of RF and microwave circuits in frequency domain exploiting the multinomial theorem Sequential approximate optimization using radial basis function network for engineering optimization Kriging metamodeling in simulation: a review Microwave devices and antennas modelling by support vector regression machines Statistical modeling of disturbed antennas based on the polynomial chaos expansion Polynomial-chaos-based kriging An adaptive RBF-HDMR modeling approach under limited computational budget Computationally feasible narrow-band antenna modeling using response features Signal recovery from random measurements via orthogonal matching pursuit Bayesian model fusion: large-scale performance modeling of analog and mixed-signal circuits by reusing early-stage data Variable-fidelity electromagnetic simulations and co-kriging for accurate modeling of antennas Two-stage framework for efficient Gaussian process modeling of antenna input characteristics Low-cost data-driven surrogate modeling of antenna structures by constrained sampling On reduced-cost design-oriented constrained surrogate modeling of antenna structures Triangulation-based constrained surrogate modeling of antennas Performance-based nested surrogate modeling of antenna input characteristics Microstrip-fed ring slot antenna design with wideband harmonic suppression Fast simulation-driven antenna design using response-feature surrogates Delaunay Mesh Generation Dual-band slot dipole antenna fed by a coplanar waveguide A general theory for orthogonal array based Latin hypercube sampling Rapid simulation-driven multi-objective design optimization of decomposable compact microwave passives The modeling technique described in Sect. 5 .1 has been validated using a miniaturized impedance matching transformer [32] . The structure is shown in Fig. 11 (b). It is realized on Taconic RF-35 substrate of relative permittivity e r = 3.5 and thickness h = 0.762 mm. The circuit employees compact microstrip resonant cells (CMRCs) shown in Fig. 11 (a).The adjustable variables areThe figure-of-interest space contains the operating bands [f 1 f 2 ] with the following ranges: 1.5 GHz f 1 3.5 GHz, and 4.5 GHz f 2 6.5 GHz. Here, the optimum design is understood by minimization of the maximum reflection | S 11 | within [f 1 f 2 ] (which is a minimax problem). The allocation of the reference designs has been shown in Fig. 11(c) .Surrogate model validation has been carried out for various numbers of training data samples from 50 to 800. All models were generated using d max = 0.05. The modeling error was calculated using the split-sample method with 100 random testing designs. The numerical results are shown in Table 3 (see also Fig. 12 ). The benchmark includes conventional kriging and RBF metamodels. It can be observed that the nested kriging framework enables two-or even three-fold reduction of the modeling error assuming the same training data set size. It should be emphasized that for this case the predictive power of all conventional surrogates is poor. The primary reason is a large number of geometry parameters and, consequently, a large volume of the conventional domain.