Inverse freeform design in non-imaging optics: Hamilton’s theory of geometrical optics, optimal transport, and least-squares solvers

ten Thije Boonkkamp, J. H. M.; Mitra, K.; Anthonissen, M. J. H.; Kusch, L.; Braam, P. A.; IJzerman, W. L.

doi:10.3389/fphy.2025.1518660

ORIGINAL RESEARCH article

Front. Phys., 09 May 2025

Sec. Optics and Photonics

Volume 13 - 2025 | https://doi.org/10.3389/fphy.2025.1518660

This article is part of the Research TopicFreeform Optics: From Theory to ApplicationsView all 5 articles

Inverse freeform design in non-imaging optics: Hamilton’s theory of geometrical optics, optimal transport, and least-squares solvers

J. H. M. ten Thije Boonkkamp ¹*

K. Mitra¹

M. J. H. Anthonissen ¹

L. Kusch¹

P. A. Braam¹

W. L. IJzerman^1,2

¹ Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, Netherlands
² Signify Research, Eindhoven, Netherlands

We present a systematic derivation of three mathematical models of increasing complexity for optical design, based on Hamilton’s characteristic functions and conservation of luminous flux, and briefly explain the connection with the mathematical theory of optimal transport. We outline several iterative least-squares solvers for our models and demonstrate their performance for a few challenging problems.

1 Introduction

Nowadays, traditional incandescent light bulbs have largely been replaced by LED lamps, mainly because LEDs are highly energy efficient and last for long duration. However, to create a desired light output with LEDs, optical systems are required. Therefore, in the illumination optics industry, a lot of research is carried out on the design of optical systems for a myriad of applications.

The industry standard in optical design is (quasi-)Monte Carlo ray tracing. These are forward methods, which compute the target light distribution for a given source, typically LED, and a given optical system, tracing a large collection of rays from the source to target. Convergence of ray tracing is slow and requires millions or, maybe, even billions of rays to accurately compute the target distribution; see ([1], Chapter3). Moreover, to realize the desired target distribution, these methods have to be embedded in an iterative procedure, adjusting system parameters. More specifically to update these parameters, sophisticated design methods combine an iterative gradient-based optimization method, to minimize a suitably chosen merit function, with differentiable ray tracing to compute the required gradient. These methods are efficient and broadly applicable; see e.g., [31, 32]

A promising alternative are inverse methods. These methods compute the shape/location of the optical surface(s) in one go, given a source distribution and a desired target distribution. The surfaces are referred to as freeform since they have no imposed symmetries. Mathematical models for inverse methods are based on the principles of geometrical optics and conservation of luminous flux. Our models contain the following: a geometrical equation describing the shape/location of the optical surface(s), an (implicit) relation between the location of an optical surface and the optical map, which expresses target coordinates as a function of source coordinates, a matrix equation for the optical map, a conservation law of luminous flux, and a constraint on the luminous flux. The geometrical equation can be derived evaluating one of Hamilton’s characteristic functions and has multiple solutions. The optical map satisfies the luminous flux balance, which is a Jacobian equation, and a matrix equation, which has to be supplemented with the unconventional transport boundary condition. Our models are restricted to zero-étendue sources; more specifically, we consider only planar sources emitting a parallel beam of light or point sources.

In mathematics, the abstract theory of optimal transport is the correct framework to formulate optical design problems. The subject of optimal transport theory is to compute a transport plan (mapping) that transforms a given density function into another one, minimizing the transportation cost and subject to a(n) (integral) conservation law; see [2]. One of the governing equations is precisely the geometrical equation, containing a so-called cost function in the right hand side. The transportation cost is then a cost functional (an integral of the cost function). The analogy with optical design is obvious, the two density functions correspond to the source and target distributions, the transport plan to the optical map, the cost functional can be associated with the optical path length, and the conservation law applies to the luminous flux.

Based on the analogy with optimal transport theory, we can distinguish three mathematical models of increasing complexity. First, in the simplest model, the geometrical equation contains a quadratic cost function, and the Jacobian equation reduces to the standard Monge–Ampère (MA) equation, which is a second-order, non-linear, elliptic partial differential equation (PDE) [3], defining the location of an optical surface. In the next model, the cost function is no longer quadratic and the Jacobian equation corresponds to a generalized Monge–Ampère (GMA) equation. Finally, the most complicated model does not allow a description in terms of a cost function anymore; instead, a so-called generating function is required. The Jacobian equation now corresponds to a generated Jacobian (GJ) equation.

The procedure to derive our mathematical models consists of the following steps:

1. By applying principles of geometrical optics, derive the geometrical equation.

2. Formulate the luminous flux balance, which is a Jacobian equation for the optical map.

3. Based on optimal transport theory, select a uniquely defined solution from the geometrical equation.

4. Derive the necessary (zero-gradient) condition for this solution to define a unique relation between the location of an optical surface and the optical map.

5. Derive from the zero-gradient condition the matrix equation for the Jacobi matrix of the optical map.

6. Combine the luminous flux balance with the matrix equation to derive a constraint on the luminous flux.

There are many numerical methods available for inverse freeform optical design; for a detailed survey, see [4], Chapter 5. Without trying to be complete, a possible categorization is as follows: first, numerical methods for (G)MA equations governing the shape/location of an optical surface; second, optimization methods; third, geometrical construction methods; and fourth, ray mapping methods. In the first category are methods that directly solve the second-order PDE of MA type by discretization and subsequent iteration for the resulting algebraic system. Examples of these are reported in papers by Froese [5], using a wide-stencil finite difference scheme, Benamou et al. [6], using finite differences, Brix et al. [35], using B-spline collocation, and Kawecki et al. [7], using finite elements, to just name a few examples. Methods in the second category are based on optimal transport theory and determine the optical surface(s) by minimizing the cost functional subject to conservation of luminous flux, see, e.g., [33, 34] Next, based on the work of Caffarelli and Oliker [8], geometrical construction methods for optical surfaces with paraboloids have been investigated by Kitagawa et al. [9] and Mérigot and Thibert [10] for the GMA equation, and by Galouët et al. [11] for the GJ equation. Finally, in ray mapping methods, the computation of the optical surface(s) is split into two parts, i.e., first, an estimate of the optical map is computed from an optimal transport problem, and subsequently, the optical surface is computed employing the law of reflection/refraction; see e.g., [36–38]

Our methods are of the first category. The publication that inspired us to develop our numerical solvers is by Caboussat et al. [12]. In this paper, the authors present a least-squares method to solve the Dirichlet problem for the standard MA equation. Prins adjusted their method to include the transport boundary condition; see [13]. In a series of papers, Yadav and Romijn extended the method for the GMA equation, corresponding to a non-quadratic cost function; see [14], [15, 16]. Romijn further extended our least-squares methods to deal with the GJ equation; see [17, 18]. Specifically, for the cost function models, we have developed a two-stage algorithm, i.e., we first compute the optical map and subsequently the shape/location of the optical surface(s). For both stages, we apply a least-squares method. For the generating function model, the optical map and the optical surface(s) have to be computed simultaneously, also in a least-squares fashion.

In this paper, we present a systematic derivation of all models and outline numerical simulation methods. The content is then as follows. First, in Section 2, we introduce Hamilton’s characteristic functions needed to derive the geometrical equations. In Section 3, we give an outline of optimal transport theory, needed to select a uniquely defined solution of the geometrical equation. A generic conservation law for luminous flux is presented in Section 4. Next, in Section 5, we apply the results of the previous sections to present a hierarchy of models based on three example systems. A concise description of our numerical methods is presented in Section 6. To demonstrate the performance of our methods, we present a few challenging examples in Section 7. Concluding remarks are given in Section 8, and the paper is concluded with a nomenclature list.

2 Hamilton’s characteristic functions

In this section, we introduce Hamilton’s characteristic functions which are needed for the derivation of the geometrical equations in Section 5; see, e.g., [19], pp. 94–107; [20], Section 4.1 for a more detailed account. Our starting point is the eikonal equation given by

| \nabla φ | = n, (1)

where $φ = φ (\underset{̲}{r})$ is the phase of an electromagnetic wave and $n = n (\underset{̲}{r})$ is the refractive index field as functions of the (three-dimensional) position vector (or coordinates) $\underset{̲}{r} \in R^{3}$ of a typical point. A surface $φ (\underset{̲}{r}) = C o n s t$ is a wavefront. We denote three-dimensional vectors in bold font with an underbar $(\underset{̲}{})$ , to distinguish them from two-dimensional vectors, which are only written in bold font. Equation 1 describes free propagation of light waves and no longer holds when a light wave is reflected or refracted at an optical surface. The eikonal Equation 1 is a first-order non-linear PDE, which has the characteristic strip equations

\frac{d \underset{̲}{r}}{d s} = \frac{1}{n} \nabla φ, \frac{d φ}{d s} = n, \frac{d}{d s} (\nabla φ) = \nabla n . (2)

See, e.g., Kevorkian [21]. The curves $C : \underset{̲}{r} = \underset{̲}{r} (s)$ , parameterized by the arc length $s$ , are the characteristics of (1). The system of ordinary differential equations in (2) determines the location of the characteristics and the solution along the characteristics, from which the solution of (1) can be reconstructed. From the first equation in (2), we conclude that the characteristics are orthogonal trajectories to the wavefronts, i.e., characteristics are light rays; see, e.g., [20], p. 114.

Hamilton’s characteristic functions are related to the optical path length of a ray, which we, therefore, introduce next. We assume that $n (\underset{̲}{r})$ is piecewise continuous, with jumps at lens surfaces; consequently, rays are continuous and consist of piecewise differentiable curve segments; cf. Equation 2. In this setting, we define the optical path length (OPL) between the points $P_{1}$ (position vector $\underset{̲}{r} = {\underset{̲}{r}}_{1}$ ) and $P_{2}$ (position vector $\underset{̲}{r} = {\underset{̲}{r}}_{2}$ ), connected by a light ray along the curve $C$ , as

[P_{1}, P_{2}] = d_{opt} ({\underset{̲}{r}}_{1}, {\underset{̲}{r}}_{2}) ≔ \int_{C} n (\underset{̲}{r} (s)) d s . (3)

A consequence of Fermat’s principle is that the optical path length $[P_{1}, P_{2}]$ is stationary with respect to variations in the curve $C$ ; see, e.g., [20], p. 740.

Consider an optical system consisting of a source, located in a plane $z = z_{s}$ (source plane), emitting light which via a sequence of reflectors and/or lenses arrives at a plane $z = z_{t}$ (target plane). The $z$ -axis is referred to as the optical axis. In this paper, we use the subscripts $s$ and $t$ for variables/vectors to denote their value at the source and target, respectively. Let $\hat{\underset{̲}{v}} = {(v_{1} v_{2} v_{3})}^{T} = d \underset{̲}{r} / d s$ denote the unit tangent vector of a ray; thus, $\nabla φ = n \hat{\underset{̲}{v}}$ ; cf. Equation 2. Unit vectors are denoted with a hat $(\hat{})$ . Let $Q_{s}$ (position vector $\underset{̲}{r} = {\underset{̲}{r}}_{s}$ ) and $Q_{t}$ (position vector $\underset{̲}{r} = {\underset{̲}{r}}_{t}$ ) be two points on the source and target plane, respectively; then, from Equation 3, we obtain

[Q_{s}, Q_{t}] = d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t}) = \int_{C} n \hat{\underset{̲}{v}} \cdot \hat{\underset{̲}{v}} d s = \int_{C} \nabla φ \cdot d \underset{̲}{r} = φ ({\underset{̲}{r}}_{t}) - φ ({\underset{̲}{r}}_{s}) . (4)

Note that reversing the direction of integration yields the symmetry property of $d_{opt}$ , i.e.,

\begin{aligned} d_{opt} ({\underset{̲}{r}}_{t}, {\underset{̲}{r}}_{s}) & = \int_{- C} (- n \hat{\underset{̲}{v}}) \cdot (- \hat{\underset{̲}{v}}) d s \\ = \int_{- C} - \nabla φ \cdot d \underset{̲}{r} \\ = - φ ({\underset{̲}{r}}_{s}) + φ ({\underset{̲}{r}}_{t}) \\ = d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t}) \end{aligned} (5)

as one would expect from the definition in Equation 3. Observe that $d_{opt}$ is a metric since $d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t}) \geq 0$ , with equality only if ${\underset{̲}{r}}_{s} = {\underset{̲}{r}}_{t}$ , and it satisfies the triangle inequality due to Fermat’s principle. The symmetry of $d_{opt}$ is stated in Equation 5.

For ease of presentation, we use the shorthand notation ${\hat{\underset{̲}{v}}}_{s} = \hat{\underset{̲}{s}}$ and ${\hat{\underset{̲}{v}}}_{t} = \hat{\underset{̲}{t}}$ . Straightforward differentiation of the expression for $d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t})$ in Equation 4 gives

\nabla_{s} d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t}) = - \nabla φ ({\underset{̲}{r}}_{s}) = - n_{s} \hat{\underset{̲}{s}}, \nabla_{t} d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t}) = \nabla φ ({\underset{̲}{r}}_{t}) = n_{t} \hat{\underset{̲}{t}}, (6)

where $\nabla_{s}$ denotes the gradient with respect to the source coordinates ${\underset{̲}{r}}_{s}$ and likewise for $\nabla_{t}$ . From these relations, we readily conclude that

| \nabla_{s} d_{opt} | = n_{s}, | \nabla_{t} d_{opt} | = n_{t},

i.e., $d_{opt}$ satisfies the eikonal equation in both source and target coordinates.

A ray is completely determined by its position and direction coordinates. Position coordinates $q \in R^{2}$ are defined as the projection on a plane $z = C o n s t$ of the position vector $\underset{̲}{r}$ of a typical point on the ray. Position coordinates in the source and target planes are denoted by $q_{s}$ and $q_{t}$ , respectively. Thus, $Q_{s}$ , with position vector ${\underset{̲}{r}}_{s} = (q_{s}, z_{s})$ , will denote the point at which a light ray is emitted from the source plane, and $Q_{t}$ , with the position vector ${\underset{̲}{r}}_{t} = (q_{t}, z_{t})$ , as the point at which it arrives at the target plane. Next, selecting the first two components from the equation $n \frac{d \underset{̲}{r}}{d s} = n \hat{\underset{̲}{v}} = \nabla φ$ , we introduce the momentum vector $p \in R^{2}$ as

p ≔ n \frac{d q}{d s} = n (\begin{matrix} v_{1} \\ v_{2} \end{matrix}) = (\begin{matrix} \frac{\partial φ}{\partial q_{1}} \\ \frac{\partial φ}{\partial q_{2}} \end{matrix}) ≕ \frac{\partial φ}{\partial q} . (7)

Thus, the momentum vector is the projection of $\nabla φ (x)$ on a plane $z = C o n s t$ and determines the direction of a light ray. Momentum vectors at the source and target are denoted with $p_{s}$ and $p_{t}$ , respectively. The variables $q = q (z)$ and $p = p (z)$ are referred to as phase-space coordinates and together constitute phase space.

In the remainder of this paper, we assume that $n (\underset{̲}{r})$ is piecewise constant, with possible discontinuities at optical interfaces, and consequently, a light ray consists of piecewise straight line segments. Below, we define four characteristic functions which relate the optical path length to phase-space coordinates at the source and target.

Point characteristic

As a function of the position coordinates $q_{s}$ and $q_{t}$ , the point characteristic $V$ is the optical path length of the ray connecting the points $Q_{s}$ and $Q_{t}$ , i.e.,

V (q_{s}, q_{t}) ≔ d_{opt} ((q_{s}, z_{s}), (q_{t}, z_{t})) = [Q_{s}, Q_{t}] . (8)

From Equation 4, we conclude that $V (q_{s}, q_{t}) = φ (q_{t}, z_{t}) - φ (q_{s}, z_{s})$ . Combining this relation with the equations in Equation 6, selecting the first two components in both equations, we find for the directional derivatives $\frac{\partial V}{\partial q_{s}}$ and $\frac{\partial V}{\partial q_{t}}$ the following expressions:

\frac{\partial V}{\partial q_{s}} = - n_{s} (\begin{matrix} s_{1} \\ s_{2} \end{matrix}) = - p_{s}, \frac{\partial V}{\partial q_{t}} = n_{t} (\begin{matrix} t_{1} \\ t_{2} \end{matrix}) = p_{t}; (9)

cf. Equation 7. Typically, Equations 8, 9 are used to derive a geometrical equation for optical systems described in terms of the position coordinates $q_{s}$ and $q_{t}$ . An example is the parallel-to-near-field reflector, discussed in Section 5.3.

Mixed characteristic of the first kind

Let a light ray connect the points $Q_{s} (q_{s}, z_{s})$ and $Q_{t} (q_{t}, z_{t})$ with momentum $p_{s}$ and $p_{t}$ , respectively; then, the mixed characteristic of the first kind $W$ is defined as

W (q_{s}, p_{t}) ≔ V (q_{s}, q_{t}) - q_{t} \cdot p_{t} . (10)

Straightforward differentiation yields $\partial W / \partial p_{s} = 0$ and $\partial W / \partial q_{t} = \partial V / \partial q_{t} - p_{t} = 0$ ; cf. Equation 9, second relation. Consequently, $W = W (q_{s}, p_{t})$ indeed. Moreover, we readily verify that

\frac{\partial W}{\partial q_{s}} = - p_{s}, \frac{\partial W}{\partial p_{t}} = - q_{t} .

$W$ can be interpreted as a modification of $[Q_{s}, Q_{t}]$ , which we show as follows. Assume that $z_{t} > z_{s}$ and define the point $P_{t}$ as the intersection of the ray segment arriving at $Q_{t}$ , parameterized by $\underset{̲}{r} (λ) = {\underset{̲}{r}}_{t} + λ \hat{\underset{̲}{t}}$ , and the plane passing through the origin $O_{t} (0, z_{t})$ of the target plane (position vector ${\underset{̲}{o}}_{t}$ ) perpendicular to this ray, given by the normal equation $(\underset{̲}{r} - {\underset{̲}{o}}_{t}) \cdot \hat{\underset{̲}{t}} = 0$ ; see Figure 1. The intersection point is given by $λ = λ (P_{t}) = - ({\underset{̲}{r}}_{t} - {\underset{̲}{o}}_{t}) \cdot \hat{\underset{̲}{t}}$ . Since the third component of ${\underset{̲}{r}}_{t} - {\underset{̲}{o}}_{t}$ vanishes, this can be rewritten as $λ (P_{t}) = - q_{t} \cdot (\begin{matrix} t_{1} \\ t_{2} \end{matrix}) = - q_{t} \cdot p_{t} / n_{t}$ ; cf. Equation 7 or Equation 9. For the $z$ -component of $P_{t}$ , we have $δ z_{t} ≔ z (P_{t}) - z_{t} = λ (P_{t}) t_{3}$ with $t_{3} > 0$ since the ray is directed toward the target plane $(z_{t} > z_{s})$ . From this, we conclude that $λ (P_{t}) > 0$ if $δ z_{t} > 0$ , i.e., $P_{t}$ is located behind the target plane as seen from the source, and $λ (P_{t}) < 0$ otherwise. Obviously, the distance $d (Q_{t}, P_{t}) = \pm λ (P_{t})$ since $\hat{\underset{̲}{t}}$ is a unit vector. If $δ z_{t} > 0$ , then $λ (P_{t}) > 0$ , implying that $d (Q_{t}, P_{t}) = λ (P_{t})$ and $q_{t} \cdot p_{t} = - n_{t} d (Q_{t}, P_{t}) = - [Q_{t}, P_{t}]$ ; cf. Equation 3. Analogously, if $δ z_{t} < 0$ , then $q_{t} \cdot p_{t} = [Q_{t}, P_{t}]$ . Finally, we can easily verify that $σ_{t} ≔ s g n (q_{t} \cdot p_{t}) = - s g n (δ z_{t})$ ; thus, $W$ can be interpreted as

W (q_{s}, p_{t}) = [Q_{s}, Q_{t}] - σ_{t} [Q_{t}, P_{t}] .

The characteristic function $W$ is convenient to describe optical systems in terms of $q_{s}$ and $p_{t}$ . An example is the parallel-to-far-field reflector, as described in Section 5.1.

Figure 1

Figure 1. Interpretation of the mixed characteristics of the first and second kind. (a) Mixed characteristic W ( q _s, p _t ) of the first kind; σ_t > 0. (b) Mixed characteristic W* ( p _s, q _t ) of the second kind; σ_s < 0.

Mixed characteristic of the second kind

Under the same setting, the mixed characteristic of the second kind $W^{*}$ is defined as

W^{*} (p_{s}, q_{t}) ≔ V (q_{s}, q_{t}) + q_{s} \cdot p_{s} .

In this case, using the first relation in Equation 9, we readily verify that $\partial W^{*} / \partial q_{s} = \partial V / \partial q_{s} + p_{s} = 0$ and $\partial W^{*} / \partial p_{t} = 0$ , so $W^{*}$ is a function of $p_{s}$ and $q_{t}$ indeed. Furthermore, straightforward differentiation yields

\frac{\partial W^{*}}{\partial p_{s}} = q_{s}, \frac{\partial W^{*}}{\partial q_{t}} = p_{t} .

In addition, $W^{*}$ can be interpreted as a modified OPL. We assume again that $z_{t} > z_{s}$ and introduce the point $P_{s}$ as the intersection of the ray segment emitted from $Q_{s}$ , parameterized by $\underset{̲}{r} (λ) = {\underset{̲}{r}}_{s} + λ \hat{\underset{̲}{s}}$ , and the plane passing through the origin $O_{s} (0, z_{s})$ of the source plane (position vector ${\underset{̲}{o}}_{s}$ ) perpendicular to this ray, given by $(\underset{̲}{r} - {\underset{̲}{o}}_{s}) \cdot \hat{\underset{̲}{s}} = 0$ ; see Figure 1. Analogous to the previous derivation, we obtain $λ = λ (P_{s}) = - ({\underset{̲}{r}}_{s} - {\underset{̲}{o}}_{s}) \cdot \hat{\underset{̲}{s}} = - q_{s} \cdot (\begin{matrix} s_{1} \\ s_{2} \end{matrix}) = - q_{s} \cdot p_{s} / n_{s}$ . The $z$ -component of $P_{s}$ is given by $z (P_{s}) = z_{s} + λ (P_{s}) s_{3}$ with $s_{3} > 0$ . Thus, if $δ z_{s} ≔ z (P_{s}) - z_{s} > 0$ , then $λ (P_{s}) > 0$ ; otherwise, $λ (P_{s}) < 0$ . Following the same line of reasoning as before, we find that $q_{s} \cdot p_{s} = σ_{s} [Q_{s}, P_{s}]$ with $σ_{s} = s g n (q_{s} \cdot p_{s})$ . Therefore, the second mixed characteristic can be interpreted as

W^{*} (p_{s}, q_{t}) = [Q_{s}, Q_{t}] + σ_{s} [Q_{s}, P_{s}] .

Note that $W^{*} (- p_{t}, q_{s}) = W (q_{s}, p_{t})$ , i.e., the mixed characteristic of the second kind of the ray in reversed direction equals the mixed characteristic of the first kind of the original ray. The function $W^{*}$ can be used to characterize optical systems in terms of $p_{s}$ and $q_{t}$ . An example is the point-to-near-field reflector.

Angular characteristic

The angular characteristic $T$ is another modification of $[Q_{s}, Q_{t}]$ and is a function of the momenta $p_{s}$ and $p_{t}$ . Its definition and interpretation (for $z_{t} > z_{s}$ ) are given by

\begin{aligned} T (p_{s}, p_{t}) & ≔ V (q_{s}, q_{t}) + q_{s} \cdot p_{s} - q_{t} \cdot p_{t} \\ = [Q_{s}, Q_{t}] + σ_{s} [Q_{s}, P_{s}] - σ_{t} [Q_{t}, P_{t}] . \end{aligned} (11)

From Equations 9, 11, we readily verify that $\partial T / \partial q_{s} = \partial V / \partial q_{s} + p_{s} = 0$ and $\partial T / \partial q_{t} = \partial V / \partial q_{t} - p_{t} = 0$ , implying that $T = T (p_{s}, p_{t})$ . Moreover, it is evident that

\frac{\partial T}{\partial p_{s}} = q_{s}, \frac{\partial T}{\partial p_{t}} = - q_{t} .

Note that $T (- p_{t}, - p_{s}) = T (p_{s}, p_{t})$ , i.e., the angular characteristic is invariant if the direction of the ray is reversed. This function is used to describe optical systems in terms of $p_{s}$ and $p_{t}$ , such as the point-to-far-field lens; see Section 5.2.

3 Geometrical description of optical systems: an optimal transport point of view

To derive a geometrical description of an optical system, i.e., an algebraic relation defining the location and shape of the optical surface(s), we have to evaluate one of the four Hamilton’s characteristic functions. However, this geometrical equation does not have a unique solution. In this section, we discuss how to select a uniquely defined solution from this equation. Subsequently, we derive an equation for the optical map.

First, we introduce some notation. We denote with $S$ and $T$ the (physical) source and target domain, respectively. The source and target domain are parameterized by $x \in X \subset R^{2}$ and $y \in Y \subset R^{2}$ , respectively, i.e., $X$ and $Y$ are the parameter domains for the source and target. We assume that $X$ and $Y$ are compact, i.e., bounded and closed, and consequently, any continuous function defined on one of these domains assumes a minimum and a maximum. The key in the following discussion is the optical map $m : X \to Y$ , defining how a point on the source domain (specified by $x$ ) is connected via a light ray with a point on the target domain (specified by $y = m (x)$ ). Given source and target domains, the optical map has to satisfy the condition $Y = m (X)$ , i.e., $Y$ is the image of $X$ under the mapping $m$ .

We consider optical systems having one or two freeform optical surfaces, i.e., surfaces without any symmetry. Consider an arbitrary ray connecting the source and target, for which $y = m (x)$ . Evaluation of the appropriate Hamilton’s characteristic function for such a ray leads to either one of the two algebraic equations:

u_{1} (x) + u_{2} (y) = c (x, y), (12a)

u_{2} (y) = H (x, y, u_{1} (x)), (12b)

which we refer to as geometrical equations, where $u_{1} (x)$ defines the location of (one of) the optical surface(s) and $u_{2} (y)$ is either an auxiliary variable or defines the location of the other surface. We like to emphasize that in Equation 12, source and target coordinates are related via $y = m (x)$ . In Section 5, we will derive geometrical equations for several optical systems. Note that in Equation 12a, source and target coordinates are separated in the left-hand side. This equation occurs in the theory of optimal transport, where $u_{1} (x)$ and $u_{2} (y)$ are referred to as Kantorovich potentials and $c (x, y)$ as the cost function; see, e.g., [2] for a rigorous mathematical account. In Equation 12b, source and target coordinates are no longer separated. It will turn out that for fixed $x$ and $y$ , the function $H (x, y, \cdot)$ has a unique inverse $H^{- 1} (x, y, \cdot) = G (x, y, \cdot)$ , referred to as the generating function; see [22]. Thus, the following holds:

\forall x \in X \forall y \in Y (u_{2} (y) = H (x, y, u_{1} (x)) ⟺ u_{1} (x) = G (x, y, u_{2} (y))) .

Obviously, Equation 12a is a special case of Equation 12b if we choose $H (x, y, w) = c (x, y) - w$ . Nevertheless, for the sake of completeness, we discuss both cases separately.

$c$ -convex analysis

Equation 12a has multiple solution pairs $(u_{1} (x), u_{2} (y))$ . From the theory of optimal transport, we know that a possible solution of Equation 12a is given by

\forall x \in X u_{1} (x) = \max_{y \in Y} (c (x, y) - u_{2} (y)), (13a)

\forall y \in Y u_{2} (y) = \max_{x \in X} (c (x, y) - u_{1} (x)), (13b)

which implicitly defines the optical map as an alternative to a geometrical optics derivation. To be more precise, for any $x \in X$ , its image $y * = m (x) \in Y$ is the maximizer in Equation 13a. In addition, for any $y \in Y$ , we have that $x * \in X$ such that $y = m (x *)$ is the maximizer in Equation 13b, which is possible since $Y = m (X)$ . Under certain conditions, to be specified shortly, the maxima in (13) are unique, which we henceforth assume. The solution in (13) is referred to as a $c$ -convex pair; see [23]. The proof of the equivalence of (Equation 13a) and (Equation 13b) proceeds as follows. Suppose, Equation 12a and the expression for $u_{1} (x)$ in Equation 13a hold, then we have to derive the expression for $u_{2} (y)$ in Equation 13b. From Equation 13a we conclude

\forall x \in X \forall y \in Y u_{1} (x) \geq c (x, y) - u_{2} (y),

or equivalently, swapping $u_{1} (x)$ and $u_{2} (y)$ ,

\forall x \in X \forall y \in Y u_{2} (y) \geq c (x, y) - u_{1} (x) .

Since the latter inequality holds for all $x \in X$ , we can take the maximum over all $x \in X$ to obtain

\forall y \in Y u_{2} (y) \geq \max_{x \in X} (c (x, y) - u_{1} (x)) .

Recall that $Y = m (X)$ . Let $y \in Y$ , then $y = m (\tilde{x})$ for some $\tilde{x} \in X$ . From Equation 12a, we conclude that

u_{2} (y) = c (\tilde{x}, y) - u_{1} (\tilde{x}) \leq \max_{x \in X} (c (x, y) - u_{1} (x)) .

Combining both inequalities, we obtain the expression for $u_{2} (y)$ in Equation 13b and conclude that $\tilde{x} = x *$ is the (unique) maximizer of $c (x, y) - u_{1} (x)$ . Thus, the expression for $u_{1} (x)$ in Equation 13a combined with Equation 12a implies the expression for $u_{2} (y)$ in Equation 13b. Conversely, given the expression for $u_{2} (y)$ in Equation 13b and Equation 12a, we can derive in a similar manner the expression for $u_{1} (x)$ defined in Equation 13a.

Alternatively, Equation 12a has a $c$ -concave solution pair, for which the maxima in Equation 13 are replaced by minima, defining another mapping $m$ . In either case, a necessary condition is that $(x, m (x))$ is a stationary point of $c (x, y) - u_{1} (x)$ for all $x \in X$ , i.e.,

\nabla_{x} c (x, m (x)) - \nabla u_{1} (x) = 0, (14)

where $\nabla_{x} c$ is the gradient of $c$ with respect to the source coordinates $x$ . A sufficient condition for the $c$ -convex pair (max/max solution) in (Equation 13) is that $D_{x x} c (x, m (x)) - D^{2} u_{1} (x)$ be symmetric negative definite (SND) for all $x \in X$ , where $D_{x x} c$ and $D^{2} u_{1}$ are the Hessian matrices of $c$ (with respect to $x$ ) and $u_{1}$ , respectively. The function $c (\cdot, y) - u_{1}$ is then concave, guaranteeing a unique maximum in Equation 13b. Alternatively, for a $c$ -concave pair (min/min solution), the sufficient condition is that $D_{x x} c (x, m (x)) - D^{2} u_{1} (x)$ be symmetric positive definite (SPD) for all $x \in X$ . Then, $c (\cdot, y) - u_{1}$ is convex and has a unique minimum.

According to the implicit function theorem, the optical map $y = m (x)$ is well-defined by Equation 14, provided the mixed derivative matrix $D_{x y} c = (\frac{\partial^{2} c}{\partial x_{i} \partial y_{j}})$ is regular in $(x, m (x))$ for all $x \in X$ , the so-called twist condition. However, for many optical systems, the actual computation of $y = m (x)$ from Equation 14 is quite complicated, if not impossible. Therefore, to derive an equation for the optical map, we differentiate the zero-gradient condition in Equation 14 to obtain the equation

\begin{align} C (x, m (x)) D m = P (x, m (x)), \\ C (x, y) = D_{x y} c (x, y), \\ P (x, y) = D^{2} u_{1} (x) - D_{x x} c (x, y) \end{align}, (15)

where $D m = (\frac{\partial m_{i}}{\partial x_{j}})$ is the Jacobi matrix of the optical map. We refer to Equation 15 as the matrix-Jacobi equation. Note that the matrix $P$ is either SPD, for a $c$ -convex, or SND for a $c$ -concave solution pair.

Equation 15 has to be supplemented with the so-called transport boundary condition

m (\partial X) = \partial Y, (16)

stating that the boundary of the source domain $\partial X$ is mapped to the boundary of the target domain $\partial Y$ . This condition is a consequence of the edge-ray principle of geometrical optics [24] and guarantees that all light emitted from the source arrives at the target.

For some basic optical systems $c (x, y) = x \cdot y$ , implying that $C = I$ and $D_{x x} c = O$ ; see Section 5.1. The $c$ -convex solution pair in (13a) reduces to a conjugate pair of Legendre–Fenchel transforms, see e.g., [4], for which the necessary condition (14) reduces to $m (x) - \nabla u_{1} (x) = 0$ . The sufficient condition for the $c$ -convex solution pair is that $D^{2} u_{1} (x)$ is SPD for all $x \in X$ , i.e., $u_{1} (x)$ is a convex function. Likewise, for the $c$ -concave solution pair, $D^{2} u_{1} (x)$ has to be SND for all $x \in X$ , and $u_{1} (x)$ is then concave. In both cases, $c (\cdot, y) - u_{1}$ has a unique extremum.

In our numerical algorithm, the matrix-Jacobi (Equation 15), subject to the transport boundary condition (16), is employed to compute the optical map, and subsequently, $u_{1} (x)$ is reconstructed from Equation 14. If needed, $u_{2} (y)$ is computed from Equation 12a. From $u_{1} (x)$ and possibly $u_{2} (y)$ , the shape of the optical surface(s) is computed. A concise account of the numerical algorithm is given in Section 6.

$H$ -convex analysis

Next, we consider the geometrical Equation 12b, which can always be formulated such that $H_{w} (x, y, \cdot) > 0$ for all $x \in X$ and $y \in Y$ , implying that $H (x, y, \cdot)$ has an inverse. Also, Equation 12b has multiple solution pairs $(u_{1} (x), u_{2} (y))$ . In analogy with (Equation 13), a possible solution reads

\forall x \in X u_{1} (x) = \max_{y \in Y} G (x, y, u_{2} (y)), (17a)

\forall y \in Y u_{2} (y) = \min_{x \in X} H (x, y, u_{1} (x)), (17b)

implicitly defining the optical map as follows: $y * = m (x) \in Y$ is the maximizer in Equation 17a and $x * \in X$ such that $y = m (x *)$ is the minimizer in Equation 17b. Provided a condition on $H$ holds, which we specify shortly, the extrema in Equation 17 are unique, which we henceforth assume. The functions $u_{1} (x)$ and $u_{2} (y)$ are referred to as a $G$ -convex $H$ -concave solution pair; see [4]. We prove that (Equation 17a) together with (Equation 12b) implies (Equation 17b); the implication in the reverse order proceeds in a similar way. Assume that the expression for $u_{1} (x)$ in (Equation 17a) holds, then

\forall x \in X \forall y \in Y u_{1} (x) \geq G (x, y, u_{2} (y)) .

Applying the inverse $H (x, y, \cdot) = G^{- 1} (x, y, \cdot)$ and using that $H_{w} (x, y, \cdot) > 0$ , we obtain

\forall x \in X \forall y \in Y H (x, y, u_{1} (x)) \geq u_{2} (y) .

Since this inequality holds for all $x \in X$ , we can take the minimum and find

\forall y \in Y u_{2} (y) \leq \min_{x \in X} H (x, y, u_{1} (x)) .

Let $y = m (\tilde{x}) \in Y$ for some $\tilde{x} \in X$ . Then, by virtue of Equation 12b.

u_{2} (y) = H (\tilde{x}, y, u_{1} (\tilde{x})) \geq \min_{x \in X} H (x, y, u_{1} (x)) .

Combining both inequalities, we arrive at the expression for $u_{2} (y)$ in Equation 17b with $\tilde{x} = x *$ the unique minimizer. Conversely, given the expression for $u_{2} (y)$ in Equation 17b, we can derive in a similar manner the expression for $u_{1} (x)$ in Equation 17a.

Equation 12b also has a $G$ -concave $H$ -convex (min/max) solution pair. The necessary condition for both solutions is that $(x, m (x))$ is a stationary point of $\tilde{H} (x, y) = H (x, y, u_{1} (x))$ , i.e.,

\nabla_{x} \tilde{H} (x, m (x)) = \nabla_{x} H (x, m (x), u_{1} (x)) + H_{w} (x, m (x), u_{1} (x)) \nabla u_{1} (x) = 0 . (18)

A sufficient condition for the max/min pair is that $D_{x x} \tilde{H} (x, m (x))$ be SPD for all $x \in X$ . Alternatively, for the min/max pair, the Hessian matrix $D_{x x} \tilde{H} (x, m (x))$ should be SND for all $x \in X$ . In both cases, the solution pair is unique.

Equation 18 implicitly defines the optical map $y = m (x)$ , provided the mixed derivative matrix $D_{x y} \tilde{H} (x, y)$ is regular in $(x, m (x))$ for all $x \in X$ , but the actual computation of the optical map from Equation 18 is virtually impossible for most optical systems. So, we differentiate the zero-gradient condition in Equation 18 and recover the matrix-Jacobi equation in Equation 15; however, with matrices $C$ and $P$ given by

C (x, y, u_{1} (x)) = D_{x y} \tilde{H} (x, y), P (x, y, u_{1} (x)) = - D_{x x} \tilde{H} (x, y) . (19)

Notice, there is one difficulty, i.e., the function $\tilde{H} (x, y)$ depends on $u_{1} (x)$ , and consequently, also the matrices $C$ and $P$ do. Therefore, the computation of the optical map and the function $u_{1} (x)$ can no longer be decoupled; see Section 6 for more details. Finally, also, in this case, the transport boundary condition (16) applies.

4 Conservation of luminous flux

In the previous section, we presented equations describing the geometry of an optical system, more specifically (Equation 12) for the shape/location of the optical surface(s), the zero-gradient conditions (Equations 14, 18), and the matrix-Jacobi equation for the optical map, defined in Equations 15, 19. To close the model, we have to formulate the conservation law of luminous flux.

We first introduce stereographic projections of a unit vector $\hat{\underset{̲}{v}} \in S^{2}$ , needed in some of the flux balances. A unit vector $\hat{\underset{̲}{v}}$ can be represented by a point $P$ or $Q$ on the unit sphere; see Figure 2. There are two stereographic projections of $\hat{\underset{̲}{v}}$ , i.e., the projection from the north pole and from the south pole. To compute the stereographic projection from the north pole $N$ , we have to compute the intersection of the line through $N$ and $P$ , given by

\underset{̲}{r} (λ) = (\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}) + λ (\begin{matrix} v_{1} \\ v_{2} \\ v_{3} - 1 \end{matrix}),

with the equator plane $z = 0$ . In this way, we obtain the stereographic projection $z = S_{N} (\hat{\underset{̲}{v}})$ given by

S_{N} (\hat{\underset{̲}{v}}) = \frac{1}{1 - v_{3}} (\begin{matrix} v_{1} \\ v_{2} \end{matrix}), (20a)

which is defined for $v_{3} \neq 1$ . The latter condition means that $P$ and $N$ should not coincide. Using the relations $| \hat{\underset{̲}{v}} |^{2} = 1$ and $v_{3} \neq 1$ , we can compute the inverse $\hat{\underset{̲}{v}} = S_{N}^{- 1} (z)$ and find

S_{N}^{- 1} (z) = \frac{1}{1 + | z |^{2}} (\begin{matrix} 2 z_{1} \\ 2 z_{2} \\ - 1 + | z |^{2} \end{matrix}) . (20b)

In a similar manner, we can determine the stereographic projection from the south pole and its inverse. These are given by

z = S_{S} (\hat{\underset{̲}{v}}) = \frac{1}{1 + v_{3}} (\begin{matrix} v_{1} \\ v_{2} \end{matrix}), \hat{\underset{̲}{v}} = S_{S}^{- 1} (z) = \frac{1}{1 + | z |^{2}} (\begin{matrix} 2 z_{1} \\ 2 z_{2} \\ 1 - | z |^{2} \end{matrix}) (21)

and are defined for $v_{3} \neq - 1$ . Both projections are parameterizations of $S^{2}$ , and in either case, an area element $d S (z)$ on $S^{2}$ generated by $\hat{\underset{̲}{v}}$ is given by

d S (z) = |\frac{\partial \hat{\underset{̲}{v}}}{\partial z_{1}} \times \frac{\partial \hat{\underset{̲}{v}}}{\partial z_{2}}| d z = \frac{4}{{(1 + | z |^{2})}^{2}} d z ≕ J (\hat{\underset{̲}{v}} (z)) d z . (22)

The choice for the stereographic projection depends on the direction of $\hat{\underset{̲}{v}}$ . If $v_{3} > 0$ , a suitable choice is the projection from the south pole; since then, the domain for $z$ is included in the interior of the disc $| z | \leq 1$ . The projection from the north pole gives a domain outside this disc; see Figure 2. Likewise, if $v_{3} < 0$ , the projection from the north pole is the proper choice.

Figure 2

Figure 2. Stereographic projections of the unit sphere $S^{2}$ . (A) From the north pole N. (B) From the south pole S.

Assuming there is no energy lost, the most generic form of the luminous flux balance reads

\int_{A} f (x) d S (x) = \int_{m (A)} g (y) d S (y), (23)

for every subset $A \subset X$ and image set $m (A) \subset Y$ , where $f (x)$ and $g (y)$ are the flux densities, to be specified shortly, at the source and target, respectively. Here, $d S (x)$ denotes an area element on the (curved) surface $S$ describing the source and parameterized by $x$ . Analogously, $d S (y)$ is an area element on the target surface $T$ , parameterized by $y$ . Equation 23 should hold for any subsets $A \subset X$ and $m (A) \subset Y$ , so also for $A = X$ and $m (A) = Y$ , giving the global luminous flux balance. This implies that an inverse design problem can only have a solution if $f$ and $g$ satisfy this global flux balance.

The precise form of the flux balance depends on the source and target. For the source, we consider two options: first, a planar source located in $z = 0$ and emitting a parallel beam of light with $\hat{\underset{̲}{s}} = {\hat{\underset{̲}{e}}}_{z}$ , and second, a point source located in the origin and emitting a conical beam of light with $\hat{\underset{̲}{s}} = {\hat{\underset{̲}{e}}}_{r}$ . Both are zero-étendue (English: zero-extent) sources, meaning that the emitted beam has either a spatial or angular extent, but in the embedding four-dimensional phase space, it has zero volume; see Chaves [25]. For the planar source $S = {\underset{̲}{r} (x) = (x, 0) | x \in X \subset R^{2}}$ , $x = q_{s}$ are spatial coordinates (Cartesian/polar), $d S (x) = d x$ , the area element in the source plane, and $f (x) = M (\underset{̲}{r} (x))$ , the emittance, i.e., the luminous flux per unit area emitted by the source. The point source has no spatial extent but emits light rays in an angular domain, specified by $\hat{\underset{̲}{s}} \in S \subset S^{2}$ . As parameterization, we choose a stereographic projection, either from the north or south pole, the area element $d S (x) = J (\hat{\underset{̲}{s}} (x)) d x$ , cf. Equation 22, and $f (x) = I_{s} (\hat{\underset{̲}{s}} (x))$ , the intensity of the source, i.e., the emitted luminous flux per unit solid angle. We can express both flux densities in the form $f = \tilde{f} ◦ P_{s}^{- 1}$ , where $\tilde{f} = M$ and $P_{s} : (x, 0) \mapsto x$ , the trivial projection from $S$ to $X$ , for the planar source and $\tilde{f} = I_{s}$ and $P_{s} = S_{N}$ or $P_{s} = S_{S}$ for the point source.

For the target, we consider the following cases; first, a near-field target located on a curved surface $T = {\underset{̲}{r} (y) = (y, v (y)) | y \in Y \subset R^{2}}$ , and second, a far-field target. For the near-field target, $y = q_{t}$ are spatial coordinates in a reference plane $z = z_{t}$ , the area element

d S (y) = |\frac{\partial \underset{̲}{r}}{\partial y_{1}} \times \frac{\partial \underset{̲}{r}}{\partial y_{2}}| d y = \sqrt{| \nabla v (y) |^{2} + 1} d y,

and $g (y) = E (\underset{̲}{r} (y))$ , the illuminance, i.e., the luminous flux per unit area incident on the target surface. The domain of a far-field target is an angular domain, determined by the transmitted rays with $\hat{\underset{̲}{t}} \in T \subset S^{2}$ . Therefore, we choose for $y$ one of the two stereographic projections, for which $d S (y) = J (\hat{\underset{̲}{t}} (y)) d y$ ; cf. Equation 22. However, $g (y) = I_{t} (\hat{\underset{̲}{r}} (y))$ , the intensity in the target domain as a function of $\hat{\underset{̲}{r}}$ , the unit direction vector directed straight from source to target domain, skipping all optical surfaces, rather than a function of $\hat{\underset{̲}{t}}$ . In the far-field approximation, the distance from the source to target is large compared to the size of the optical system, specifically $| \underset{̲}{r} - d \hat{\underset{̲}{t}} | / | \underset{̲}{r} | ≪ 1$ , where $d$ is the distance from the optical surface to target, measured along a reflected ray; see, e.g., [4], pp. 59–60. The optical system is considered a point contracted at the origin, so we simply consider $g (y) = I_{t} (\hat{\underset{̲}{t}} (y))$ . In addition, for the target, both flux densities can be unified in the expression $g = \tilde{g} ◦ P_{t}^{- 1}$ , where for the near field $\tilde{g} = E$ and $P_{t} : (y, v (y)) \mapsto y$ is the orthogonal projection from $T$ to $Y$ , and where for the far-field target, $\tilde{g} = I_{t}$ and either $P_{t} = S_{N}$ or $P_{t} = S_{S}$ .

5 Reflector and lens equations: a hierarchy of mathematical models

We present a hierarchy of mathematical models for optical systems with a zero-étendue source, based on three example systems, i.e., a parallel-to-far-field reflector, point-to-far-field lens, and parallel-to-near-field reflector. These example systems are selected based on the mathematical model (quadratic cost function, non-quadratic cost function, and generating function) but are otherwise arbitrary. Other choices like a parallel-to-far-field-lens (quadratic cost function) or point-to-near-field reflector (generating function) would be equally appropriate. In this section, we combine the geometrical equations with the conservation law for luminous flux.

5.1 Parallel-to-far-field reflector

We consider a light source located in the plane $z = 0$ emitting a parallel beam of light, for which $\hat{\underset{̲}{s}} = {\hat{\underset{̲}{e}}}_{z}$ . Light rays strike the reflector $R$ defined by $z = u (x)$ , $x \in X \subset R^{2}$ , and are reflected off in the direction $\hat{\underset{̲}{t}} \in T \subset S^{2}$ . The reflected rays intersect an auxiliary plane $z = L$ , which we introduce to be able to determine an optical distance; see Figure 3. More precisely, to derive a geometrical equation for the reflector surface, we determine from Equation 10 the mixed characteristic of the first kind $W (q_{s}, p_{t})$ for an arbitrary ray with $q_{s} = x$ and $p_{t} = {(t_{1} t_{2})}^{T}$ , connecting the source with the plane $z = L$ . The point characteristic $V (q_{s}, q_{t})$ involved is given by

V (q_{s}, q_{t}) = [Q_{s}, Q_{t}] = u (x) + d, d = \sqrt{| q_{t} - x |^{2} + {(L - u (x))}^{2}}, (24)

where $d = d (P, Q_{t})$ is the distance between the points $P (x, u (x))$ and $Q_{t} (q_{t}, L)$ , which are the points where the reflected ray hits the reflector and target plane, respectively, loosely referred to as intersection points. For the direction vector $\hat{\underset{̲}{t}}$ of the reflected ray, we have

p_{t} = (\begin{matrix} t_{1} \\ t_{2} \end{matrix}) = \frac{1}{d} (q_{t} - x), t_{3} = \frac{1}{d} (L - u (x)) . (25)

Moreover, $\partial W / \partial q_{s} = - p_{s} = 0$ since $\hat{\underset{̲}{s}}$ is perpendicular to the source plane, implying that $W = W (p_{t})$ . Elaborating the expression for $W$ , using the relations in Equations 24, 25, we find

\begin{aligned} W (p_{t}) & = u (x) + d - q_{t} \cdot p_{t} \\ = u (x) + \frac{1}{d} (| q_{t} - x |^{2} + {(L - u (x))}^{2} - q_{t} \cdot (q_{t} - x)) \\ = u (x) + \frac{1}{d} (- x \cdot (q_{t} - x) + d t_{3} (L - u (x))) \\ = u (x) (1 - t_{3}) - x \cdot p_{t} + L t_{3} . \end{aligned}

Figure 3

Figure 3. Sketch of a parallel-to-far-field reflector.

To separate the variables $x$ (source) and $p_{t}$ (target), we bring all terms that solely depend on $\hat{\underset{̲}{t}}$ to the left-hand side, and we obtain

\frac{W (p_{t}) - L t_{3}}{1 - t_{3}} = u (x) - \frac{x \cdot p_{t}}{1 - t_{3}} . (26)

Note that $t_{3} \neq 1$ since rays cannot pass straight through the reflector. Using the expressions for $d$ and $t_{3}$ , we can show that

\frac{\partial W}{\partial L} = \frac{\partial d}{\partial L} - \frac{\partial q_{t}}{\partial L} \cdot p_{t}, \frac{\partial d}{\partial L} = p_{t} \cdot \frac{\partial q_{t}}{\partial L} + t_{3},

from which we readily conclude that the left-hand side in Equation 26 is independent of $L$ , which makes sense since a far-field target domain should not depend on the choice of a particular plane $z = L$ . As stated in Section 4, the proper choice to parameterize the target $T$ is the stereographic projection from the north pole; thus, $y = p_{t} / (1 - t_{3})$ and $Y = S_{N} (T)$ ; cf. Equation 20. Substituting the expression for $y$ in Equation 26, we arrive at the geometrical equation

u_{1} (x) + u_{2} (y) = c (x, y), c (x, y) = x \cdot y, (27)

where $u_{1} (x) = u (x)$ and $u_{2} (y) = (L t_{3} - W (p_{t})) / (1 - t_{3})$ .

Referring to Section 3, a possible solution of Equation 27 is the max/max solution in Equation 13, which is a conjugate pair of Legendre–Fenchel transforms since $c (x, y) = x \cdot y$ . The necessary condition in Equation 14 and the matrix equation for the optical map in Equation 15 reduce to

y - \nabla u_{1} (x) = 0, (28a)

D m = P (x), P (x) = D^{2} u_{1} (x) . (28b)

Recall that Equation 28b has to be supplemented with the transport boundary condition (Equation 16).

To close the model, we elaborate the flux balance in Equation 23. Substituting $f (x) = M (\underset{̲}{r} (x))$ , $d S (x) = d x$ , $g (y) = I_{t} (\hat{\underset{̲}{t}} (y))$ , and $d S (y) = J (\hat{\underset{̲}{t}} (y)) d y$ , we obtain

\int_{A} M (\underset{̲}{r} (x)) d x = \int_{m (A)} I_{t} (\hat{\underset{̲}{t}} (y)) J (\hat{\underset{̲}{t}} (y)) d y,

for arbitrary $A \subset X$ and $m (A) \subset Y$ . Substituting $y = m (x)$ and the expression for $J (\hat{\underset{̲}{t}} (y))$ , replacing $\hat{\underset{̲}{v}} (z)$ by $\hat{\underset{̲}{t}} (y)$ in Equation 22, we can rewrite the integral in the right-hand side and find

\int_{A} M (\underset{̲}{r} (x)) d x = \int_{A} I_{t} (\hat{\underset{̲}{t}} (m (x))) \frac{4}{{(1 + | m (x) |^{2})}^{2}} \det (D m) d x, (29)

where we used that $\det (D m) > 0$ , which is correct because $D m$ is either SPD or SND. Since Equation 29 should hold for arbitrary $A \subset X$ , we obtain the differential form

\det (D m) = \frac{1}{4} {(1 + | m (x) |^{2})}^{2} \frac{M (\underset{̲}{r} (x))}{I_{t} (\hat{\underset{̲}{t}} (m (x)))} ≕ F (x, m (x)),

referred to as the Jacobian equation. Substituting $m = \nabla u_{1}$ , this equation reduces to the standard MA equation $\det (D^{2} u_{1}) = F (x, \nabla u_{1} (x))$ ; see, e.g., Gutiérrez [3].

5.2 Point-to-far-field lens

We consider a point source located in the origin $O_{s}$ (position vector given by $q_{s} = 0$ and $z_{s} = 0$ ), emitting upward a conical beam of light with the direction vector $\hat{\underset{̲}{s}} = {\hat{\underset{̲}{e}}}_{r}$ . Light rays hit a lens of refractive index $n$ ; see Figure 4. The first (entrance) surface is spherical with center $O_{s}$ and radius $R$ ; hence, rays are not refracted there; however, the second (exit) surface is freeform and given by the parameterization $\underset{̲}{r} (\hat{\underset{̲}{s}}) = u (\hat{\underset{̲}{s}}) \hat{\underset{̲}{s}}$ . Rays are refracted at the exit surface and arrive at an auxiliary plane $z = L > 0$ . To derive a geometrical equation for the freeform surface, we evaluate the angular characteristic $T (p_{s}, p_{t})$ defined in Equation 11 for an arbitrary ray connecting the source with the plane $z = L$ . The point characteristic $V (q_{s}, q_{t})$ needed to evaluate $T (p_{s}, p_{t})$ is given by

\begin{aligned} V (q_{s}, q_{t}) & = [O_{s}, Q_{t}] = R + n (u (\hat{\underset{̲}{s}}) - R) + d \\ d = \sqrt{| q_{t} - u (\hat{\underset{̲}{s}}) p_{s} |^{2} + {(L - u (\hat{\underset{̲}{s}}) s_{3})}^{2}} \end{aligned} (30)

where $d = d (P, Q_{t})$ is the distance between the points $P (u (\hat{\underset{̲}{s}}) p_{s}, u (\hat{\underset{̲}{s}}) s_{3})$ and $Q_{t} (q_{t}, L)$ , which are the intersection points of the refracted ray with exit surface and target screen, respectively. The direction vector $\hat{\underset{̲}{t}}$ of the refracted ray is given by

p_{t} = (\begin{matrix} t_{1} \\ t_{2} \end{matrix}) = \frac{1}{d} (q_{t} - u (\hat{\underset{̲}{s}}) p_{s}), t_{3} = \frac{1}{d} (L - u (\hat{\underset{̲}{s}}) s_{3}) . (31)

Note that $\partial T / \partial p_{s} = q_{s} = 0$ , implying that $T = T (p_{t})$ . Evaluating the expression for $T (p_{t})$ in Equation 11 using the relations in Equations 30, 31, we obtain

\begin{aligned} T (p_{t}) & = R + n (u (\hat{\underset{̲}{s}}) - R) + d - q_{t} \cdot p_{t} \\ = (1 - n) R + n u (\hat{\underset{̲}{s}}) + \frac{1}{d} (| q_{t} - u (\hat{\underset{̲}{s}}) p_{s} |^{2} + {(L - u (\hat{\underset{̲}{s}}) s_{3})}^{2} \\ - q_{t} \cdot (q_{t} - u (\hat{\underset{̲}{s}}) p_{s})) \\ = (1 - n) R + n u (\hat{\underset{̲}{s}}) + \frac{1}{d} (- u (\hat{\underset{̲}{s}}) p_{s}) \cdot (q_{t} - u (\hat{\underset{̲}{s}}) p_{s}) \\ + d, t_{3}, (L - u (\hat{\underset{̲}{s}}) s_{3})) \\ = (1 - n) R + n u (\hat{\underset{̲}{s}}) - u (\hat{\underset{̲}{s}}) p_{s} \cdot p_{t} + t_{3} (L - u (\hat{\underset{̲}{s}}) s_{3}) \\ = (1 - n) R + (n - \hat{\underset{̲}{s}} \cdot \hat{\underset{̲}{t}}) u (\hat{\underset{̲}{s}}) + L t_{3} . \end{aligned} (32)

To derive the last expression for $T$ in Equation 32, we substituted the relation $\hat{\underset{̲}{s}} \cdot \hat{\underset{̲}{t}} = p_{s} \cdot p_{t} + s_{3} t_{3}$ . Next, we move all terms that solely depend on $\hat{\underset{̲}{t}}$ and the constant $(1 - n) R$ to the left-hand side and find

T (p_{t}) - L t_{3} + (n - 1) R = (n - \hat{\underset{̲}{s}} \cdot \hat{\underset{̲}{t}}) u (\hat{\underset{̲}{s}}) . (33)

Differentiating the first expression in Equation 32 with respect to $L$ and combining the expression for $d$ with Equation 31, we obtain

\frac{\partial T}{\partial L} = \frac{\partial d}{\partial L} - \frac{\partial q_{t}}{\partial L} \cdot p_{t}, \frac{\partial d}{\partial L} = p_{t} \cdot \frac{\partial q_{t}}{\partial L} + t_{3},

from which we readily see that the left-hand side in Equation 33 is independent of $L$ , as anticipated.

Figure 4

Figure 4. Sketch of a point-to-far-field lens.

To separate source and target coordinates, analogous to Equation 27, we have to take the logarithm of the left- and right-hand sides of Equation 33. Note that $n - \hat{\underset{̲}{s}} \cdot \hat{\underset{̲}{t}} > 0$ and $u (\hat{\underset{̲}{s}}) > 0$ ; consequently, the left-hand side is positive as well, and we can take the logarithms. In addition, we substitute the stereographic projections (from the south pole) $x$ of $\hat{\underset{̲}{s}}$ and $y$ of $\hat{\underset{̲}{t}}$ since both $\hat{\underset{̲}{s}}$ and $\hat{\underset{̲}{t}}$ are directed upward; cf. Equation 21. In this way, we obtain

u_{1} (x) + u_{2} (y) = c (x, y), (34a)

where the variables $u_{1} (x)$ and $u_{2} (y)$ and the cost function $c (x, y)$ are defined by

u_{1} (x) = \log u (\hat{\underset{̲}{s}} (x)), (34b)

u_{2} (y) = - \log (T (p_{t} (y)) - L t_{3} (y) + (n - 1) R), (34c)

c (x, y) = - \log (n - \hat{\underset{̲}{s}} (x) \cdot \hat{\underset{̲}{t}} (y)) . (34d)

Formulated in terms of stereographic coordinates, the cost function reads

c (x, y) = - \log (n - 1 + \frac{2 | x - y |^{2}}{(1 + | x |^{2}) (1 + | y |^{2})}),

which is no longer quadratic. A possible solution of Equation 34a is the $c$ -convex pair in Equation 13, for which the necessary condition (Equation 14) holds. For the optical map, matrix Equation 15 accompanied with the transport boundary condition (Equation 16) is given.

Referring to Section 4, in the far-field approximation, the luminous flux balance reads

\int_{A} I_{s} (\hat{\underset{̲}{s}} (x)) J (\hat{\underset{̲}{s}} (x)) d x = \int_{m (A)} I_{t} (\hat{\underset{̲}{t}} (y)) J (\hat{\underset{̲}{t}} (y)) d y, (35)

for arbitrary $A \subset X$ and image set $m (A) \subset Y$ . To derive the differential form, we substitute the expressions for $J (\hat{\underset{̲}{s}} (x))$ and $J (\hat{\underset{̲}{t}} (y))$ , according to Equation 22, and subsequently substitute $y = m (x)$ . Assuming $\det (D m) > 0$ , we obtain

\det (D m) = {(\frac{1 + | m (x) |^{2}}{1 + | x |^{2}})}^{2} \frac{I_{s} (x)}{I_{t} (m (x))} ≕ F (x, m (x)) .

Combining this equation with the matrix equation in Equation 15, we obtain the GMA equation $\det (D^{2} u_{1} - D_{x x} c) = \det (C (x, m (x)) F (x, m (x))$ .

5.3 Parallel-to-near-field reflector

The last example concerns a planar source, located in $z = 0$ , emitting a parallel beam of light rays in the direction $\hat{\underset{̲}{s}} = {\hat{\underset{̲}{e}}}_{z}$ , consequently $p_{s} = 0$ , which hits a reflector $R$ given by $z = u (x)$ , with $x = q_{s}$ spatial coordinates in the source domain, and are reflected off in the direction $\hat{\underset{̲}{t}}$ and lands at a target surface given by $z = v (y)$ , with $y = q_{t}$ spatial coordinates in the reference plane $z = z_{t} = 0$ . Thus, source and target planes coincide; see Figure 5. We evaluate Hamilton’s point characteristic and find

\begin{aligned} V (q_{s}, q_{t}) = [Q_{s}, Q_{t}] \\ = u (x) + d, d = \sqrt{| y - x |^{2} + {(v (y) - u (x))}^{2}}, \end{aligned}

where $d = d (P, Q_{t})$ is the distance between the intersection points $P (x, u (x))$ and $Q_{t} (y, v (y))$ of the reflected ray with the reflector and the target surface, respectively. Since $\partial V / \partial q_{s} = - p_{s} = 0$ , we find that $V = V (q_{t})$ . In this case, it is no longer possible to separate the source and target coordinates $x$ and $y$ like in Equation 27 or (Equation 34a). Instead, we write

u_{2} (y) = H (x, y, u_{1} (x)), H (x, y, w) = w + \sqrt{| y - x |^{2} + {(v (y) - w)}^{2}}, (36)

where $u_{1} (x) = u (x)$ and $u_{2} (y) = V (q_{t})$ . Straightforward differentiation gives $H_{w} (x, y, \cdot) > 0$ , implying that for fixed $x, y$ , the inverse $G (x, y, \cdot) = H^{- 1} (x, y, \cdot)$ exists. Therefore, we can explicitly determine $u_{1} (x)$ from Equation 36, and we obtain

u_{1} (x) = G (x, y, u_{2} (y)), G (x, y, w) = \frac{1}{2} (w + v (y) - \frac{| y - x |^{2}}{w - v (y)}) . (37)

A possible solution of Equations 36, 37 is the $G$ -convex $H$ -concave pair in Equation 17, for which the necessary condition (18) holds. The matrix equation for $m$ is given in Equation 15 with matrices $C$ and $P$ defined in Equation 19. Recall that these matrices explicitly depend on $u_{1} (x)$ . Obviously, also the transport boundary condition (16) holds.

Figure 5

Figure 5. Sketch of a parallel-to-near-field reflector.

Making the proper choices for the flux densities and area elements, the flux balance (23) reduces to

\int_{A} M (\underset{̲}{r} (x)) d x = \int_{m (A)} E (\underset{̲}{r} (y)) \sqrt{| \nabla v (y) |^{2} + 1} d y, (38)

for an arbitrary set $A \subset X$ and image set $m (A) \subset Y$ , where $M (\underset{̲}{r} (x))$ and $E (\underset{̲}{r} (y))$ denote the emittance and illuminance of the source and target, respectively. Substituting $y = m (x)$ , assuming $\det (D m) > 0$ , we obtain

\det (D m) = \frac{1}{\sqrt{| \nabla v (m (x)) |^{2} + 1}} \frac{M (\underset{̲}{r} (x))}{E (\underset{̲}{r} (m (x)))} ≕ F (x, m (x)) .

Combining this equation with Equation 15 and the matrices $C$ and $P$ defined in Equation 19, we obtain the equation $\det (D_{x x} \tilde{H}) = \det (C (x, m (x), u_{1} (x)) F (x, m (x))$ , referred to as the GJ equation.

5.4 Summary of mathematical models

All mathematical models considered consist of a geometrical equation, a (zero-gradient) condition for a stationary point, a matrix equation for the optical map coupled to the transport boundary condition, and a luminous flux balance $\det (D m) = F (x, m (x))$ . The conditions for the stationary point read

cost function \nabla_{x} c (x, y) - \nabla u_{1} (x) = 0, (39a)

generating  function \nabla_{x} \tilde{H} (x, y) = 0 . (39b)

Substituting the optical map $y = m (x)$ and differentiating with respect to $x$ gives the matrix equation $C D m = P$ , where the matrices $C$ and $P$ are given by

quadratic  cost  function C = I, P = D^{2} u_{1}, (40a)

general  cost  function C = D_{x y} c, P = D^{2} u_{1} - D_{x x} c, (40b)

generating  function C = D_{x y} \tilde{H}, P = - D_{x x} \tilde{H} . (40c)

Combining the matrix equation for $m$ with the flux balance gives

\det (C) F (x, m (x)) = \det (P),

which we refer to as the luminous flux constraint. Note that for the cost-function model, the matrices $C$ and $P$ explicitly depend on $x$ and $m (x)$ ; on the other hand, for the generating-function model, both matrices in addition depend on $u_{1} (x)$ . In [26], we have given a similar overview of mathematical models of 16 base optical systems.

In the next section, we outline numerical methods to compute the optical map $y = m (x)$ and the auxiliary variable $u_{1} (x)$ . The shape/location of the optical surface is then reconstructed from a simple algebraic equation, i.e., $u (x) = u_{1} (x)$ for the parallel-to-far-field and parallel-to-near-field reflectors and $u (\hat{\underset{̲}{s}} (x)) = e^{u_{1} (x)}$ for the parallel-to-far-field lens.

6 Iterative least-squares methods

In this section, we outline iterative least-squares methods for the models presented in the previous section, starting with the base scheme for the cost-function models and adding modifications for the generation-function model. A detailed account of these methods is presented in a series of theses—see [27]; [23]; [4]—and papers—see [13]; [28]; [14, 29]; [15–18]; [26].

Base scheme for cost function

To compute the layout of the optical system, we apply a two-stage method, i.e., we first compute the optical map $m$ and subsequently, the auxiliary variable $u_{1}$ and possibly $u_{2}$ , from which we trivially can compute the shape/location of the optical surface(s). We employ a uniform rectangular grid covering the parameter space of the source domain.

We iteratively compute a symmetric matrix $P$ as approximation of the symmetric part of $C D m$ , satisfying $\det (P) = \det (C (\cdot, m)) F (\cdot, m)$ , and a vector field $b : \partial X \to \partial Y$ , from which we subsequently compute $m$ . Our solution strategy is then to minimize the functionals

\begin{align} J_{I} [m, P] = \frac{1}{2} \int_{X} | | C D m - P | |_{F}^{2} d x, \\ subject t o \det (P) = \det (C (\cdot, m)) F (\cdot, m), \end{align} (41a)

J_{B} [m, b] = \frac{1}{2} \oint_{\partial X} | m - b |_{2}^{2} d s, (41b)

J [m, P, b] = α J_{I} [m, P] + (1 - α) J_{B} [m, b], (0 < α < 1), (41c)

where $| | \cdot | |_{F}$ denotes the Frobenius norm. The minimization of $J_{B} [m, b]$ is to enforce the transport boundary condition (16). Given an initial guess $m^{0}$ , the iteration scheme then reads

P^{k + 1} = {argmin}_{P \in P (m^{k})} J_{I} [m^{k}, P], (42a)

b^{k + 1} = {argmin}_{b \in B} J_{B} [m^{k}, b], (42b)

m^{k + 1} = {argmin}_{m \in M} J [m, P^{k + 1}, b^{k + 1}], (42c)

where the corresponding function spaces are given by

P (m) = \{P \in C^{1} {(X)}^{2 \times 2} | P^{T} = P, \det (P) = \det (C (\cdot, m (\cdot))) F (\cdot, m (\cdot))\}, (43a)

B = \{b \in C {(\partial X)}^{2} | b (x) \in \partial Y\}, (43b)

M = C^{2} {(X)}^{2} . (43c)

In common parlance, $P (m)$ is the space of $2 \times 2$ matrix functions that are continuously differentiable on $X$ , are symmetric, and satisfy the luminous flux constraint. $B$ is the space of continuous vector functions that map the boundary $\partial X$ to the boundary $\partial Y$ .

The minimization of $J_{I} [m, P]$ to compute $P$ can be performed point-wise and requires the solution of a constrained minimization problem. $P$ has to be either SPD or SND, which can be enforced by a constraint on $t r (P)$ . The minimization of $J_{B} [m, b]$ to compute $b$ is a piecewise projection of $m$ on $\partial X$ . For the minimization of $J [m, P, b]$ , we impose the first variation with respect to $m$ to vanish, i.e.,

δ J [m, P, b] (η) = \lim_{ε \to 0} \frac{1}{ε} (J [m + ε η, P, b] - J [m, P, b]) = 0, (44)

for arbitrary $η \in C^{2} {(X)}^{2}$ . Evaluating this limit, applying Gauss’s theorem and the fundamental lemma of calculus of variations ([30], p. 185), we derive the following coupled boundary value problem (BVP) for $m$ :

\nabla \cdot (C^{T} C D m) = \nabla \cdot (C^{T} P), x \in X, (45a)

(1 - α) m + α (C^{T} C D m) \hat{n} = (1 - α) b + α C^{T} P \hat{n}, x \in \partial X, (45b)

where $\hat{n}$ is the unit outward normal on $\partial X$ . The divergence of a $2 \times 2$ matrix function $A = (a_{1} a_{2})$ is defined as $\nabla \cdot A = (\begin{matrix} \nabla \cdot a_{1} \\ \nabla \cdot a_{2} \end{matrix})$ . For discretization of the BVP in (Equation 45), we employ the standard finite volume method ([4], pp. Appendix B).

Next, upon convergence of the iteration (42), we compute $u_{1}$ from (Equation 39a) by minimizing the functional

I [u_{1}] = \frac{1}{2} \int_{X} | \nabla_{x} c (\cdot, m) - \nabla u_{1} |_{2}^{2} d x .

Analogous to the derivation of (Equation 45), we set the first variation $δ I [u_{1}] (v) = 0$ for arbitrary $v \in C^{2} (X)$ , to obtain the Neumann problem

\nabla^{2} u_{1} = \nabla \cdot \nabla_{x} c (\cdot, m), x \in X, (46a)

\nabla u_{1} \cdot \hat{n} = \nabla_{x} c (\cdot, m) \cdot \hat{n}, x \in \partial X . (46b)

We employ standard finite differences for the BVP in 46. $u_{1}$ is determined up to an additive constant, which translates in an additive constant in the location of the parallel-to-far-field reflector and a multiplicative constant for the location of the point-to-far-field lens. To compute a unique solution from 46 we either fix the distance from the source to optical surface along a specific ray, or prescribe an average for $u_{1}$ .

Extension to generating function

The optical map $m$ and the variable $u_{1}$ have to be computed simultaneously since the matrices $C$ and $P$ depend on both variables. Like in the previous case, the optical map satisfies the equation $C D m = P$ , with the matrices $C$ and $P$ defined in Equation 40c, and the luminous flux balance $\det (D m) = F$ . Consequently, the functional $J_{I} [m, P]$ remains the same, albeit with different matrices $C$ and $P$ . The variable $u_{1}$ has to be computed from Equation 39b, for which we introduce the functional

\begin{aligned} I [u_{1}, m] & = \frac{1}{2} \int_{X} {|\nabla_{x} \tilde{H} (\cdot, m)|}_{2}^{2} d x \\ = \frac{1}{2} \int_{X} {|\nabla_{x} H (\cdot, m, u_{1}) + H_{w} (\cdot, m, u_{1}) \nabla u_{1}|}_{2}^{2} d x . \end{aligned} (47)

To minimize the functional in Equation 47, we require $δ I [u_{1}, m] (v) = 0$ for arbitrary $v \in C^{2} (X)$ , with $δ I [u_{1}, m] (v)$ the first variation in $I$ with respect to $u_{1}$ , defined analogously to Equation 44. This way we can derive the BVP

\nabla \cdot (H_{w} \nabla_{x} H + H_{w}^{2} \nabla u_{1}) = \frac{1}{2} \frac{d}{d w} {|\nabla_{x} H + H_{w} \nabla u_{1}|}_{2}^{2}, x \in X, (48a)

H_{w} (\nabla_{x} H + H_{w} \nabla u_{1}) \cdot \hat{n} = 0, x \in \partial X . (48b)

We employ the standard finite volume method for discretization. We can show that also the solution of the BVP in Equation 48 is determined up to an additive constant and compute a unique solution prescribing the average value of $u_{1}$ ; see ([4], pp. 166–167) for more details. Finally, given an initial guess $m^{0}$ and $u_{1}^{0}$ , the iteration scheme then reads

P^{k + 1} = {argmin}_{P \in P (m^{k})} J_{I} [m^{k}, u_{1}^{k}, P], (49a)

b^{k + 1} = {argmin}_{b \in B} J_{B} [m^{k}, b], (49b)

m^{k + 1} = {argmin}_{m \in M} J [m, u_{1}^{k}, P^{k + 1}, b^{k + 1}], (49c)

u_{1}^{k + 1} = {argmin}_{v \in U} I [v, m^{k + 1}], (49d)

with $U = C^{2} (X)$ , where we have included $u_{1}^{k}$ in the argument list of $J_{I}$ and $J$ to denote their implicit dependence on $u_{1}$ via the matrix $C$ .

7 Numerical examples

We present two numerical examples, viz. the point-to-far-field lens discussed in Section 5.2 and the parallel-to-near-field reflector from Section 5.3.

7.1 Point-to-far-field lens

We consider a point source, located in the origin $O_{s}$ of the source plane $z = 0$ , emitting upward a beam of light with uniform intensity $I_{s} (\hat{\underset{̲}{s}} (x))$ on the circular stereographic source domain $X = {x \in R^{2} | | x | < 0.2}$ . In terms of spherical coordinates $(ϕ, θ)$ , this corresponds to the domain $0 \leq ϕ \leq \arccos (12 / 13)$ and $0 \leq θ < 2 π$ with $ϕ$ and $θ$ as the polar and azimuthal angle, respectively, i.e., the emitted conical light beam has an opening angle of $\arccos (12 / 13) = 0.39 r a d$ . For the target, we choose the stereographic domain $Y = [- 0.2, 0.2] \times [- 0.1, 0.1]$ and intensity $I_{t} (\hat{\underset{̲}{t}} (y))$ corresponding to the gray-scale image of the TU/e logo given in Figure 6a. Both intensities are scaled such that the global luminous flux balance holds, i.e., Equation 35 for $A = X$ and $m (A) = Y$ . Both stereographic coordinates are projections from the south pole; cf. Equation 21. The refractive index of the lens $n = 1.5$ , and we enforce a unique solution for the lens surface by setting $u (x_{c}) = 1$ , and therefore, $u_{1} (x_{c}) = 0$ , corresponding to the central light ray with stereographic source coordinate $x_{c} = 0$ and direction vector $\hat{\underset{̲}{s}} (x_{c}) = {\hat{\underset{̲}{e}}}_{z}$ ; cf. Equation 34b.

Figure 6

Figure 6. Point-to-far-field lens: target intensity pattern, mapping, freeform surface with a ray-traced target intensity pattern, and 3D illuminance distribution of TU/e-logo. (a) TU/e-logo as a target intensity pattern. (b) Mapping displayed in the stereographic projection plane of $\hat{\underset{̲}{t}}$ . (c) Freeform lens and ray-traced target intensity pattern. (d) 3D normalized illuminance distribution, using 50 × 50 bins.

For space discretization of the BVPs in (45) and (46), we cover the source domain with a uniform $201 \times 201$ grid and evaluate 500 iterations of scheme (42)–(43), where the associated functionals are defined in Equation 41. We choose $α = 1 0^{- 3}$ . The computational time, on a laptop with Intel Core i7 - 11800 H 2.30 GHz processor and a RAM of 16.0 GB, is $443.65$ s (7.39 min). The optical mapping, computed as an image of the source grid, is shown in Figure 6b, and clearly exhibits the TU/e-logo. The computed freeform lens with a ray traced target intensity pattern is shown in Figure 6c. This intensity pattern is computed with the commercial software code LightTools using $5 \times 1 0^{6}$ rays and clearly resembles the desired intensity. The corresponding (normalized) 3D illuminance distribution, projected on the plane $z = 100$ , is shown in Figure 6d.

7.2 Parallel-to-near-field reflector

We consider a reflector that converts a uniform source emittance $M (x, 0)$ into the illuminance $E (\underset{̲}{r} (y))$ corresponding to the gray-scale image of the painting ‘the Milkmaid’ by Johannes Vermeer as shown in Figure 7a, projected on a spherical target surface. The source domain $X = [- 0.25, 0.25] \times [- 2.25, - 2.75]$ and the target domain is located on the northern hemisphere of the unit sphere given by $z = v (y) = \sqrt{1 - | y |^{2}}$ with $Y = [- 0.42, 0.42] \times [- 0.82, 0.02]$ . The parameters $x$ and $y$ are both Cartesian coordinates. The emittance and illuminance are scaled such that the global flux balance holds, i.e., Equation 38 for $A = X$ and $m (A) = Y$ . We enforce a unique solution for the location of the reflector by setting $u (x_{c}) = u_{1} (x_{c}) = 3$ , where $x_{c} = {(0, - 2.5)}^{T}$ is the center point of the source domain $X$ .

Figure 7

Figure 7. Parallel-to-near-field reflector: target illuminance pattern, mapping, sag map, and freeform surface with ray-traced target illuminance. (a) “Milkmaid” as target illuminance pattern. (b) Mapping displayed in the orthogonal projection plane of the target surface. (c) Sag map of the freeform reflector. (d) Free-form reflector with ray-traced target illuminance pattern on a sphere.

To discretize the BVPs in Equations 45, 48, we cover the source domain $X$ with a uniform $201 \times 201$ grid and 500 times apply the iteration scheme defined in Equations 49, 43. The associated functionals are defined in Equation 41, where we choose $α = 0.05$ , and Equation 47. On the same laptop/processor, the computational time is 496.63 s (8.28 min). The optical map as image of the uniform rectangular source grid is shown in Figure 7b. Clearly, the girl shown in Figure 7a is recognizable in optical mapping as grid points come closer together in regions of high contrast, showing her contours. The sag map of the reflector is shown in Figure 7c.

To verify the reflector computed by the least-squares algorithm, we used LightTools. Implementing the reflector coordinates into this code and specifying the uniformly distributed rectangular source along with the curved target surface, we obtained the reflector system illustrated in Figure 7d, where only 50 rays are shown. The resulting target intensity was obtained tracing $5 \times 1 0^{7}$ rays. The image of the ‘Milkmaid’ is clearly visible, confirming the validity of the reflector computed by the least-squares algorithm.

8 Concluding remarks

We have presented a systematic and generic derivation of the governing equations for inverse optical design. Using Hamilton’s characteristic functions, we could derive a geometrical equation defining the shape/location of the optical surface(s). From this equation, applying concepts from optimal transport theory, we derived equations for the optical map. To close the model, we presented a generic conservation law of luminous flux. Combining all equations, we derived the luminous flux constraint. Subsequently, we elaborated the generic model in three different models of increasing complexity, on the basis of three example optical systems. We briefly outlined numerical least-squares methods for all models and demonstrated their performance for a few examples.

We intend to expand our research on inverse methods along the following lines. First, we will adjust our numerical methods to incorporate two target distributions; second, we will develop mathematical models and numerical methods for catadioptric systems, where light rays propagate via different paths from the source to target, and finally, and probably the most challenging, we intent to generalize our models and numerics to optical systems with finite sources, as opposed to zero-étendue sources.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author contributions

JT: writing–original draft and writing–review and editing. KM: writing–original draft and writing–review and editing. MA: writing–review and editing. LK: writing–review and editing. PB: writing–review and editing. WI: writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was partially supported by the Dutch Research Council (Dutch: Nederlandse Organisatie voor Wetenschappelijk Onderzoek–NWO) through the (grants P15-36 and P21-20).

Acknowledgments

The authors would like to thank R. Beltman from Signify for executing all LightTools simulations.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Filosa C. Phase space ray tracing for illumination optics. Eindhoven: Eindhoven University of Technology (2018). PhD Thesis.

Google Scholar

2. Santambrogio F. Optimal transport for applied mathematicians. Birkäuser, New York: Springer (2015).

Google Scholar

3. Gutiérrez C. The Monge-Ampère equation. 2nd ed. Switzerland: Birkhäuser: Springer International Publishing AG (2016).

Google Scholar

4. Romijn L. Generated Jacobian equations in freeform optical design. Eindhoven: Eindhoven University of Technology (2021). PhD Thesis.

Google Scholar

5. Froese B. A numerical method for the elliptic Monge–Ampère equation with transport boundary conditions. SIAM J Scientific Comput (2012) 34:A1432–59. doi:10.1137/110822372

CrossRef Full Text | Google Scholar

6. Benamou J, Froese B, Oberman A. Numerical solution of the optimal transportation problem using the Monge–Ampère equation. J Comput Phys (2014) 260:107–26. doi:10.1016/j.jcp.2013.12.015

CrossRef Full Text | Google Scholar

7. Kawecki E, Lakkis O, Pryer T. A finite element method for the Monge-Ampère equation with transport boundary conditions. arXiv preprint arXiv:1807 (2018).

Google Scholar

8. Caffarelli L, Oliker V. Weak solutions of one inverse problem in geometric optics. J Math Sci (2008) 154:39–49. doi:10.1007/s10958-008-9152-x

CrossRef Full Text | Google Scholar

9. Kitagawa J, Mérigot Q, Thibert B. Convergence of a Newton algorithm for semi-discrete optimal transport. J Eur Math Soc (2019) 21:2603–51. doi:10.4171/jems/889

CrossRef Full Text | Google Scholar

10. Mérigot Q, Thibert B. Mirrors, lenses and Monge-Ampère equations. Eur Math Soc Mag (2021) 120:16–28. doi:10.4171/mag-21

CrossRef Full Text | Google Scholar

11. Gallouet A, Mérigot Q, Thibert B. A damped Newton algorithm for generated Jacobian equations. Calculus of Variations and Partial Differential Equations (2021) 61:49–59. doi:10.1007/s00526-021-02147-7

CrossRef Full Text | Google Scholar

12. Caboussat A, Glowinski R, Sorensen D. A least-squares method for the numerical solution of the Dirichlet problem for the elliptic Monge − Ampère equation in dimension two. ESAIM: Control, Optimisation and Calculus of Variations (2013) 19:780–810. doi:10.1051/cocv/2012033

CrossRef Full Text | Google Scholar

13. Prins C, Beltman R, ten Thije Boonkkamp J, IJzerman W, Tukker T. A least-squares method for optimal transport using the Monge-Ampère equation. SIAM J Scientific Comput (2015) 37:B937–61. doi:10.1137/140986414

CrossRef Full Text | Google Scholar

14. Yadav N, ten Thije Boonkkamp J, IJzerman W. A Monge-Ampère problem with non-quadratic cost function to compute freeform lens surfaces. J Sci Comput (2019) 80:475–99. doi:10.1007/s10915-019-00948-9

CrossRef Full Text | Google Scholar

15. Romijn L, ten Thije Boonkkamp J, IJzerman W. Freeform lens design for a point source and far-field target. J Opt Soc Am A (2019) 36:1926–39. doi:10.1364/josaa.36.001926

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Romijn L, ten Thije Boonkkamp J, IJzerman W. Inverse reflector design for a point source and far-field target. J Comput Phys (2020) 408:109283. doi:10.1016/j.jcp.2020.109283

CrossRef Full Text | Google Scholar

17. Romijn L, Anthonissen M, ten Thije Boonkkamp J, IJzerman W. The generating function approach for double freeform lens design. J Opt Soc Am A (2021) 38:356–68. doi:10.1364/OE.438920

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Romijn L, ten Thije Boonkkamp J, Anthonissen M, IJzerman W. An iterative least-squares method for generated Jacobian equations in freeform optical design. SIAM J Scientific Comput (2021) 43:B298–B322. doi:10.1364/OE.438920

CrossRef Full Text | Google Scholar

19. Luneburg R. Mathematical theory of optics. Berkeley and Los Angeles: University of California Press (1966).

Google Scholar

20. Born M, Wolf E. Principles of optics: electromagnetic theory of propagation, interference and diffraction of light. 5th ed. Oxford: Pergamon Press (1975).

Google Scholar

21. Kevorkian J. Partial differential equations: analytical solution techniques. Pacific Grove, California: Brooks/Cole (1990).

Google Scholar

22. Guillen N. A primer on generated Jacobian equations: geometry, optics, economics. Notices Am Math Soc (2019) 66:1401–11. doi:10.1090/noti1956

CrossRef Full Text | Google Scholar

23. Yadav N. Monge-Ampère problems with non-quadratic cost function: application to freeform optics. Eindhoven: Eindhoven University of Technology (2018). PhD Thesis.

Google Scholar

24. Ries H, Rable A. Edge-ray principle of nonimaging optics. J Opt Soc Am A (2002) 19:590–5. doi:10.1364/OE.438920

CrossRef Full Text | Google Scholar

25. Chaves J. Introduction to nonimaging optics. 2nd ed. Boca Raton, Florida: CRC Press (2016).

CrossRef Full Text | Google Scholar

26. Anthonissen M, ten Thije Boonkkamp J, IJzerman W. Unified mathematical framework for a class of fundamental freeform optical systems. Opt Express (2021) 29:31650–64. doi:10.1364/OE.438920

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Prins C. Inverse methods for illumination optics. Eindhoven: Eindhoven University of Technology (2014). PhD Thesis.

Google Scholar

28. Beltman R, ten Thije Boonkkamp J, IJzerman W. A least-squares method for the inverse reflector problem in arbitrary orthogonal coordinates. J Comput Phys (2018) 367:347–73. doi:10.1016/j.jcp.2018.04.041

CrossRef Full Text | Google Scholar

29. Yadav N, Romijn L, ten Thije Boonkkamp J, IJzerman W. A least-squares method for the design of two-reflector optical systems. J Phys Photon (2019) 1:034001. doi:10.1088/2515-7647/ab2db3

CrossRef Full Text | Google Scholar

30. Courant R, Hilbert D. Methods of mathematical physics, 2. New York: John Wiley and Sons (1989). doi:10.1002/9783527617210

CrossRef Full Text | Google Scholar

31. Wang C, Luo Y, Li H, Zang Z, Xu Y, Han Y, et al. Extended ray-mapping method based on differentiable ray-tracing for non-paraxial and off-axis freeform illumination lens design. Optics Express (2023) 31:30066–30078.

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Heemels A, de Koning B, Möller M, Adam A. Optimizing freeform lenses for extended sources with algorithmic differentiable ray tracing and truncated hierarchical B-splines. Optics Express (2024) 32:9730–9746.

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Doskolovich L, Bykov D, Andreev E, Bezus E, Oliker V. Designing double freeform surfaces for collimated beam shaping with optimal mass transportation and linear assignment problems. Optics Express (2018) 26:24602–24613.

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Doskolovich L, Bykov D, Mingazov A, Bezus E. Optimal mass transportation and linear assignment problems in the design of freeform refractive optical elements generating far-field irradiance distributions. Optics Express (2019) 27:13083–13097.

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Brix K, Hafizogullari Y, Platen A. Designing illumination lenses and mirrors by the numerical solution of Monge-Ampère equations. J Opt Soc Am A (2015) 32:2227–2236.

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Feng Z, Froese B, Liang R. Freeform illumination optics construction following an optimal transport map. Appl. Optics (2016) 55:4301–4306.

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Bösel C, Gross H. Single freeform surface design for prescribed input wavefront and target irradiance. J Opt Soc Am (2017) 34:1490–1499.

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Bösel C, Gross H. Double freeform illumination design for prescribed wavefronts and irradiances. J Opt Soc Am (2017) 35:236–243.

PubMed Abstract | CrossRef Full Text | Google Scholar

Nomenclature

Scalar variables

$c (x, y)$ cost function

$d (P, Q)$ distance between the points $P$ and $Q$

$d_{opt} ({\underset{̲}{r}}_{s}, {\underset{̲}{r}}_{t})$ optical distance, i.e., optical path length $[P_{s}, P_{t}]$ as function of the position vectors ${\underset{̲}{r}}_{s}$ and ${\underset{̲}{r}}_{t}$

$E (\underset{̲}{r} (y))$ illuminance at a near-field target

$f (x)$ generic flux density of the source

$g (y)$ generic flux density of the target

$G (x, y, w)$ generating function

$H (x, y, w)$ inverse of generating function

$I_{s} (\hat{\underset{̲}{s}} (x))$ intensity of the point source

$I_{t} (\hat{\underset{̲}{t}} (y))$ intensity of the far-field target

$J (\hat{\underset{̲}{v}} (z))$ factor in the area element $d S (z)$ on $S^{2}$ generated by $\hat{\underset{̲}{v}}$ and parameterized by $z$

$M (\underset{̲}{r} (x))$ emittance of the planar source

$n (\underset{̲}{r})$ refractive index field

$s$ arc length along a curve $C$ representing a ray

$T (p_{s}, p_{t})$ angular characteristic (function)

$u_{1} (x)$ Kantorovich potential for source domain

$u_{2} (y)$ Kantorovich potential for target domain

$V (q_{s}, q_{t})$ point characteristic (function)

$W (q_{s}, p_{t})$ mixed characteristic (function) of the first kind

$W^{*} (p_{s}, q_{t})$ mixed characteristic (function) of the second kind

$z$ coordinate along the optical axis

$φ (\underset{̲}{r})$ phase of an electromagnetic wave

$[P_{s}, P_{t}]$ optical path length between the points $P_{s}$ and $P_{t}$

Vectors

$p (z)$ 2D momentum vector of a ray

$q (z)$ 2D position vector of a ray

$\underset{̲}{r}$ 3D position vector of an arbitrary point

$\hat{\underset{̲}{s}}$ 3D unit direction/tangent vector of a ray at the source plane

$\hat{\underset{̲}{t}}$ 3D unit direction/tangent vector of a ray at the target plane

$\hat{\underset{̲}{v}}$ 3D unit direction/tangent vector at an arbitrary point of a ray

$x$ 2D vector parameterizing the source domain

$y$ 2D vector parameterizing the target domain

Keywords: geometrical optics, Hamilton’s characteristic functions, optimal transport, optical map, conservation of luminous flux, inverse methods, least-squares methods

Citation: ten Thije Boonkkamp JHM, Mitra K, Anthonissen MJH, Kusch L, Braam PA and IJzerman WL (2025) Inverse freeform design in non-imaging optics: Hamilton’s theory of geometrical optics, optimal transport, and least-squares solvers. Front. Phys. 13:1518660. doi: 10.3389/fphy.2025.1518660

Received: 28 October 2024; Accepted: 24 January 2025;
Published: 09 May 2025.

Edited by:

Tong Yang, Beijing Institute of Technology, China

Reviewed by:

Huazhong Xiang, University of Shanghai for Science and Technology, China
Haisong Tang, Beijing Institute of Technology, China

Copyright © 2025 Ten Thije Boonkkamp, Mitra, Anthonissen, Kusch, Braam and IJzerman. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: J. H. M. ten Thije Boonkkamp, ai5oLm0udGVudGhpamVib29ua2thbXBAdHVlLm5s

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Inverse freeform design in non-imaging optics: Hamilton’s theory of geometrical optics, optimal transport, and least-squares solvers

1 Introduction

2 Hamilton’s characteristic functions

Point characteristic

Mixed characteristic of the first kind

Mixed characteristic of the second kind

Angular characteristic

3 Geometrical description of optical systems: an optimal transport point of view

c -convex analysis

H -convex analysis

4 Conservation of luminous flux

5 Reflector and lens equations: a hierarchy of mathematical models

5.1 Parallel-to-far-field reflector

5.2 Point-to-far-field lens

5.3 Parallel-to-near-field reflector

5.4 Summary of mathematical models

6 Iterative least-squares methods

Base scheme for cost function

Extension to generating function

7 Numerical examples

7.1 Point-to-far-field lens

7.2 Parallel-to-near-field reflector

8 Concluding remarks

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Generative AI statement

Publisher’s note

References

Nomenclature

Scalar variables

Vectors

$c$ -convex analysis

$H$ -convex analysis