-
PDF
- Split View
-
Views
-
Cite
Cite
Bartosz Malman, Javad Mashreghi, Ryan O’Loughlin, Thomas Ransford, Double-Layer Potentials, Configuration Constants, and Applications to Numerical Ranges, International Mathematics Research Notices, Volume 2025, Issue 8, April 2025, rnaf084, https://doi.org/10.1093/imrn/rnaf084
- Share Icon Share
Abstract
Given a compact convex planar domain |$\Omega $| with non-empty interior, the classical Neumann’s configuration constant |$c_{\mathbb{R}}(\Omega )$| is the norm of the Neumann–Poincaré operator |$K_\Omega $| acting on the space of continuous real-valued functions on the boundary |$\partial \Omega $|, modulo constants. We investigate the related operator norm |$c_{\mathbb{C}}(\Omega )$| of |$K_\Omega $| on the corresponding space of complex-valued functions, and the norm |$a(\Omega )$| on the subspace of analytic functions. This change requires introduction of techniques much different from the ones used in the classical setting. We prove the equality |$c_{\mathbb{R}}(\Omega ) = c_{\mathbb{C}}(\Omega )$|, the analytic Neumann-type inequality |$a(\Omega ) < 1$|, and provide various estimates for these quantities expressed in terms of the geometry of |$\Omega $|. We apply our results to estimates for the holomorphic functional calculus of operators on Hilbert space of the type |$\|p(T)\| \leq K \sup _{z \in \Omega } |p(z)|$|, where |$p$| is a polynomial and |$\Omega $| is a domain containing the numerical range of the operator |$T$|. Among other results, we show that the well-known Crouzeix–Palencia bound |$K \leq 1 + \sqrt{2}$| can be improved to |$K \leq 1 + \sqrt{1 + a(\Omega )}$|. In the case that |$\Omega $| is an ellipse, this leads to an estimate of |$K$| in terms of the eccentricity of the ellipse.
1 Introduction
1.1 Double-layer potential
Throughout this article, |$\Omega $| will denote a compact convex planar domain with non-empty interior. If |$C(\partial \Omega )$| is the space of continuous functions on the boundary |$\partial \Omega $| and |$f \in C(\partial \Omega )$|, then its double-layer potential |$u$| is the harmonic function
Here |$ds = |d\sigma |$| is the arclength measure on the rectifiable curve |$\partial \Omega $|, |$\Omega ^{o}$| is the interior of |$\Omega $|, and |$N(\sigma )$| is the outer-pointing normal at the boundary point |$\sigma $|. The equality between the two expressions for |$u(z)$| above follows from an elementary computation in the case that |$\partial \Omega $| is sufficiently smooth. In the general case, we interpret |$N(\sigma )(\sigma - z)^{-1}$| as a Borel measurable function on |$\partial \Omega $|. By convexity of the domain, both the tangent |$T(\sigma )$| and the normal |$N(\sigma )$| exist and are continuous at all but a countable number of points |$\sigma $|, which we will call corners, at which the discontinuity of |$T$| and |$N$| amounts to a jump in the argument. In Appendix A we include more details regarding boundaries of planar convex domains, and other facts mentioned below.
The Neumann–Poincaré operator appears in connection with the study of boundary behaviour of the double-layer potential. It is known that |$u$| given by (1) has a continuous extension to |$\partial \Omega $|, and we have the representation
where |$K_\Omega $| denotes the Neumann–Poincaré integral operator
Here |$\mu _\zeta $| is the probability measure
where |$\theta _\zeta $| can be interpreted as the angle of the aperture at the possible corner at |$\zeta $| of |$\partial \Omega $|, |$\delta _\zeta $| is a unit mass at the point |$\zeta $|, and |$\rho _\zeta $| is the Radon–Nikodym derivative
It is natural to use the convention that |$\theta _\zeta = \pi $| if |$\zeta $| is not a corner. This occurs precisely when |$\mu _\zeta $| assigns no mass to the singleton |$\{\zeta \}$|. We will say that the collection of measures |$\{\mu _\zeta \}_{\zeta \in \partial \Omega }$| is the Neumann–Poincaré kernel of |$\Omega $|.
The density |$\rho _\zeta $| has the following useful geometric interpretation. If |$\sigma \in \partial \Omega \setminus \{\zeta \}$| is not a corner, and |$R_{\zeta , \sigma }$| is the radius of the unique circle passing through |$\zeta $| that is tangent to |$\partial \Omega $| at |$\sigma $|, then the equality
holds. The radius |$R_{\zeta , \sigma }$| may degenerate to |$\infty $| if |$\zeta $| is contained in the tangent line to |$\partial \Omega $| passing through |$\sigma $|. In that case we see easily that |$\rho _\zeta (\sigma ) = 0$|, so (5) still holds. To establish the formula, note that the center |$m$| of the circle in question is of the form |$m = \sigma - R N(\sigma )$|, where the radius |$R = R_{\zeta ,\sigma }> 0$| of the circle satisfies |$|m - \zeta |^{2} = |(\sigma - \zeta ) - R N(\sigma )|^{2} = R^{2}$|. Expanding the squares and solving for |$R$| leads to (5).

Example domain |$\Omega $| with corner of angle |$\theta _{\zeta ^{\prime}}$| at |$\zeta ^{\prime}$|, and a circle of radius |$R_{\zeta , \sigma }$| with center |$m$|, tangent to |$\partial \Omega $| at |$\sigma $| and passing through |$\zeta $|.
1.2 Neumann’s configuration constant
1.2.1 Real configuration constant
Historically, the Neumann–Poincaré operator has been used to solve the Dirichlet problem of finding a harmonic extension to |$\Omega ^{o}$| of a given continuous function |$u$| on |$\partial \Omega $|. The extension can be obtained by finding |$f \in C(\partial \Omega )$|, which solves (2). Indeed, if such an |$f$| is found, then the extension of |$u$| to |$\Omega ^{o}$| is given by the double-layer potential in (1). This naturally leads to questions of invertibility of the operator |$I + K_\Omega $| appearing on the right-hand side of (2), and consequently to the introduction of the Neumann’s configuration constant, which we shall soon define as the operator norm of |$K_\Omega $| acting on an appropriate space. Note that if |$\mathbf{1}$| is the constant function, then we have that |$K_\Omega \mathbf{1} = \mathbf{1}$|, since each |$\mu _\zeta $| is a probability measure. Thus |$K_\Omega $| can be naturally defined as a linear mapping on the quotient space |$C(\partial \Omega )/{\mathbb{C}} \mathbf{1}$|. The classical approach is to instead consider |$K_\Omega $| as acting on the space of real-valued continuous functions |$C_{\mathbb{R}}(\partial \Omega )$|, in which case the corresponding quotient space |$C_{\mathbb{R}}(\partial \Omega )/{\mathbb{R}} \mathbf{1}$| is endowed with the norm
It is not hard to see that the two above expressions for the norm of the coset |$g + {\mathbb{R}} \mathbf{1}$| are equivalent: they are both equal to half of the length of the interval |$g(\partial \Omega ):= \{ g(\partial \Omega ): \zeta \in \partial \Omega \}$|, the image of |$g$|. The right-most expression is minimized by choosing |$r$| to be the mid-point of the image interval. Neumann’s (real) configuration constant |$c_{\mathbb{R}}(\Omega )$| is defined as the operator norm of |$K_\Omega $| acting on the quotient space |$C_{\mathbb{R}}(\partial \Omega )/ {\mathbb{R}} \mathbf{1}$|:
It is not hard to see that we may let |$K_\Omega $| instead act from |$C_{\mathbb{R}}(\partial \Omega )$| into the quotient |$C_{\mathbb{R}}(\partial \Omega ) / {\mathbb{R}} \mathbf{1}$| without affecting the operator norm. Since each measure |$\mu _\zeta $| is of unit mass, we have |$0 \leq c_{\mathbb{R}}(\Omega ) \leq 1$|. If
then
where we use the total variation norm (functional norm) on the right-hand side. By varying |$f$| over the unit ball of |$C_{\mathbb{R}}(\partial \Omega )$| and |$\zeta , \zeta ^{\prime}$| over |$\partial \Omega $|, we obtain the important relation
This expression for |$c_{\mathbb{R}}(\Omega )$| will play a fundamental role in our study.
1.2.2 Neumann’s lemma
From (8) we can immediately deduce that |$c_{\mathbb{R}}(\Omega ) = 1$| in the case that |$\Omega $| is a triangle or a convex quadrilateral. Indeed, in those cases one sees from (3) and (4) that if |$\zeta _{1}$| and |$\zeta _{2}$| are corners of |$\Omega $| (opposing, in the case of the quadrilateral) then |$\mu _{\zeta _{1}}$| and |$\mu _{\zeta _{2}}$| are mutually singular, and so |$\| \mu _{\zeta _{1}} - \mu _{\zeta _{2}}\| = 2$|, implying |$c_{\mathbb{R}}(\Omega ) = 1$|. Neumann’s lemma, which appears initially in Neumann’s book [14], states that the cases of the triangle and quadrilateral are exceptional. For any other type of domain we have the strict inequality |$c_{\mathbb{R}}(\Omega ) < 1$|. See [17] for a proof of this claim by Schober, and the curious history of incomplete attempts at a valid proof in full generality. Neumann’s lemma implies the invertibility of |$I + K_\Omega $| on |$C_{\mathbb{R}}(\partial \Omega )/ {\mathbb{R}} \mathbf{1}$|, and thus the solvability of the Dirichlet problem on a convex domain |$\Omega $|, which is not one of the two exceptional cases. The remaining cases can be handled by considering instead powers of |$K_\Omega $|. See, for instance, [13, Theorem 3.8], [6, Proposition 7], or the article [16], which contains also an exposition of the double-layer potential and Neumann’s lemma.
At the other extreme, we have |$c_{\mathbb{R}}(\Omega ) = 0$| if and only if |$\Omega $| is a disk. This result will be proved in Section 5.
1.3 Complex and analytic configuration constants
1.3.1 Two new configuration constants
In the present article we will discuss certain applications of the double-layer potential to operator theory, which motivate the definition of the complex configuration constant
The difference between (7) and (9) is that the latter is the norm of |$K_\Omega $| on the larger space of complex-valued functions. As a consequence, we have |$c_{\mathbb{R}}(\Omega ) \leq c_{\mathbb{C}}(\Omega ) \leq 1$|. There is a principal difference between the geometric interpretations of the norms in the quotient spaces |$C_{\mathbb{R}}(\partial \Omega ) / {\mathbb{R}} \mathbf{1}$| and |$C(\partial \Omega ) / {\mathbb{C}} \mathbf{1}$|. In the former case, as we have already noted, the norm (6) of the coset represented by the real-valued function |$g$| is equal to half of the length of the image of |$g$|, this image being an interval on the real line |${\mathbb{R}}$|. In the case of complex-valued |$g$|, the quotient norm
can instead be interpreted as the radius of the smallest disk containing the image of |$g$|. A crucial difference is that we lose the ability to estimate the norm of the coset |$g + {\mathbb{C}} \mathbf{1}$| by considering the quantities |$|g(\zeta ) - g(\zeta ^{\prime})|$| only. This is the essence of why new tools are required to treat this case.
We will also study an analogous analytic constant, which is the norm of the operator |$K_\Omega $| restricted to the subspace of analytic functions in |$C(\partial \Omega )$|. More precisely, we let |$\mathcal{A}(\Omega )$| be the space of functions that are continuous in |$\Omega $| and analytic in |$\Omega ^{o}$|. Each function in |$\mathcal{A}(\Omega )$| has a unique restriction to |$\partial \Omega $|, and thus |$\mathcal{A}(\Omega )$| can be naturally identified with a subspace of |$C(\partial \Omega )$|. We define the analytic configuration constant as
The space |$\mathcal{A}(\Omega )$| is not invariant under |$K_\Omega $|, but we do have that |$K_\Omega f$| is the complex conjugate of a function in |$\mathcal{A}(\Omega )$| (in [5, proof of Lemma 2.1] this claim is established for |$\Omega $| with smooth boundary, but the same argument works in general). Clearly, we have the inequality |$a(\Omega ) \leq c_{\mathbb{C}}(\Omega )$|. We note also that if |$\widetilde{\Omega }$| is the image of |$\Omega $| under an affine transformation of the plane, then the configuration constants of the two domains are equal. We shall verify this claim in Section 6.
1.3.2 An application to functional calculi
Given an operator |$T$| on a Hilbert space |$\mathcal{H}$| with numerical range
we are interested in the optimal constant |$K> 0$| in the inequality
where |$p$| is an analytic polynomial, and the left-hand side is the operator norm of |$p(T)$| acting on |$\mathcal{H}$|. More generally, if |$W(T)$| in (12) is replaced by an arbitrary domain |$\Omega $|, and if the corresponding inequality holds for some |$K$|, then we say that |$\Omega $| is a |$K$|-spectral set for |$T$|. Von Neumann’s inequality says that the unit disk is a |$1$|-spectral set for any contraction |$T$|, and a result of Okubo–Ando from [15] says that any disk containing |$W(T)$| is a |$2$|-spectral set for |$T$|.
The numerical range |$W(T)$| is a bounded convex subset of the plane, its closure |$\overline{W(T)}$| contains the spectrum |$\sigma (T)$| of |$T$|, and it has non-empty interior in the case that |$T$| is not a normal operator (see, for instance, [10, Chapter 1]). For normal operators, the bound (12) with constant |$K = 1$| is a consequence of the spectral theorem, and it suffices to take the supremum on the right-hand side over the smaller set |$\sigma (T)$|. For general |$T$|, even establishing the existence of a bound as in (12) is a non-trivial task. A result of Delyon–Delyon from [6, Theorem 3] establishes the existence of the bound, and shows that |$K$| can be chosen depending only on the area and the diameter of |$W(T)$|. The remarkable work of Crouzeix in [3] establishes that (12) holds with |$K \leq 11.08$|. A subsequent work of Crouzeix and Palencia in [5] improves the estimate to |$K \leq 1 + \sqrt{2}$|. The Neumann–Poincaré operator appears as an essential tool in all of the mentioned works. The standing conjecture of Crouzeix from [2] is that the bound holds with |$K = 2$|. This bound is presently known to hold in the case |$\mathcal{H}$| being of dimension 2, and has been established by Crouzeix in [2].
Our interest in the new notions of configuration constants is inspired by a recent work of Schwenninger and de Vries in [18], where bounds for general homomorphisms between uniform algebras and the algebras of bounded linear operators are studied. In Section 6 we will combine their arguments with the methods of Crouzeix–Palencia to obtain the following estimate:
For instance, if |$W$| is a disk, then |$a(W) = 0$|, which gives the Okubo–Ando result mentioned above. In [18], Schwenninger and de Vries recovered this result also. The estimate (13) is our motivation for the following investigation of the configuration constants |$c_{\mathbb{R}}(\Omega ), c_{\mathbb{C}}(\Omega )$|, and |$a(\Omega )$|, and the relations between them.
1.4 Main results
1.4.1 Relation between the real and complex constants
Consider the situation in Figure 2, where the triangular image of the complex-valued function |$g: \partial \Omega \to{\mathbb{C}}$| is contained in a disk of radius |$1$|, and intersects the boundary circle of the disk in three distinct points. The three-point set |$\{g(\zeta _{1}), g(\zeta _{2}), g(\zeta _{3})\}$| is not contained in any open half-circle of the boundary, and it follows from a simple geometric argument (which we shall present in the proofs below) that |$\| g + {\mathbb{C}} \mathbf{1}\|_{\partial \Omega } = 1$|. However, the sides of the triangular image of |$g$| are all of lengths strictly less than |$2$|, and this implies that

A triangular image of a complex-valued function |$g$| contained in a disk of radius |$1$|, with three points on the boundary of a disk.
If such a function |$g$| lies in the image of the unit ball of |$C(\partial \Omega )$| under the Neumann–Poincaré operator |$K_\Omega $| for some domain |$\Omega $| that satisfies |$c_{\mathbb{R}}(\Omega ) < 1$|, then a strict inequality |$c_{\mathbb{R}}(\Omega ) < c_{\mathbb{C}}(\Omega )$| occurs. Our first main result excludes this possibility, and so establishes the simplest possible relation between the real and complex configuration constants.
It follows that every considered domain has a well-defined configuration constant |$c(\Omega )$|, which is equal to the operator norm of |$K_\Omega $| on |$C(\partial \Omega ) / {\mathbb{C}} \mathbf{1}$|, and which can be computed according to the right-hand side of (8). An important consequence of this result is the inequality
which, as we shall soon see, has some interesting consequences.
Theorem 1 doesn’t appear nearly as straightforward to prove as it is to state, and the proof takes up a large portion of the article. However, the only property of the Neumann–Poincaré operator used in the proof is that its integral kernel |$\{\mu _\zeta \}_{\zeta \in \partial \Omega }$| consists of real-valued measures. In fact, the theorem will be deduced as a corollary of a result, which we call the Three-measures theorem, and which is a general statement regarding the geometry of the space |$C(X)$| of continuous functions on a compact Hausdorff space |$X$|. This result, which we discuss and prove in Section 2, puts a restriction on the possible configurations of point sets in the plane, which arise as values of a collection of real-valued functionals on |$C(X)$|.
1.4.2 Analytic Neumann’s lemma
Note that the above estimate in (14), together with Neumann’s lemma, implies that |$a(\Omega ) < 1$| whenever |$\Omega $| is not a triangle or a quadrilateral. This can be improved, for we have an analytic version of Neumann’s lemma, in which no exceptional cases occur.
Our proof of Theorem 2 is much different from the one given by Schober in his proof of the real Neumann’s lemma in [17], but it works also in the real context. At the end of Section 4 we show how our technique leads to a different proof of Neumann’s lemma.
1.4.3 Functional calculus bounds
The following result has already been mentioned above.
Recall that if the numerical range of an operator has empty interior, then the operator is normal, and so (12) holds with |$K = 1$|. From this observation and Theorem 2 we obtain that for any fixed operator |$T:{\mathcal{H}} \to{\mathcal{H}}$|, the optimal constant |$K$| in (12) is always strictly smaller than |$1 + \sqrt{2}$|. In fact, we deduce from our results that we have the inequality
with a constant
which depends only on the shape of |$W = \overline{W(T)}$|, and not on the operator |$T$| itself. We show in Section 5 that no better universal bound can be obtained by means of the analytic configuration constant: for any |$\epsilon> 0$| there exists a “thin” quadrilateral |$\Omega _\epsilon $| for which we have |$a(\Omega _\epsilon )> 1 - \epsilon $|. However, fixing the dimension of the Hilbert space |${\mathcal{H}}$|, one may combine earlier results of Crouzeix to obtain a uniform improvement. The optimal constant |$K$| in (12) varies with |$T$|, and we may consider the supremum of these quantities among all operators |$T$| on a Hilbert space |${\mathcal{H}}$| of a fixed dimension |$N$|. In [4, Theorem 2.2], Crouzeix proved that there exists an operator realizing this supremum. An immediate corollary of his result, Theorem 2 and Theorem 3 is the following.
This improves the Crouzeix–Palencia bound, although by an indefinite amount.
1.4.4 Estimates for the configuration constants
In Section 5 we present also other computations and estimates for the configuration constants. Surprisingly, in the case of an elliptical domain, the configuration constant is computable exactly, and we obtain
where |$a$| and |$b$| are lengths of the semi-axes of the ellipse |$\Omega _{a,b}$|, and |$e$| is the eccentricity of the ellipse, given by |$e:= \sqrt{1 - b^{2}/a^{2}}$| in case that |$a \geq b$|. This fact, together with Theorem 1, estimate (13), and the inequality |$a(\Omega ) \leq c(\Omega )$|, has the following consequence.
Note that the function |$a \mapsto K(a,1)$| is continuous and increasing for |$a \geq 1$|, and we have
Hence the estimate in Corollary 5 gets worse as the eccentricity of the ellipse |$\Omega _{a,b}$| grows, and approaches the Crouzeix–Palencia bound in the limit |$a \to \infty $|. On the other hand, as |$a \to 1$|, the eccentricity of the ellipse |$\Omega _{a,1}$| tends to |$0$|. The estimate is then close to the conjectured optimal bound |$K = 2$| and coincides with the Okubo–Ando bound for |$a=1$|, in which case the domain is a disk. From this perspective, Corollary 5 may be interpreted as an elliptical generalization of the Okubo–Ando estimate.
For many other types of domains, the exact value of |$c(\Omega )$| is inaccessible. To help the situation, we establish an integral estimate, which gives an upper bound on |$c(\Omega )$| in terms of the curvature of |$\partial \Omega $|, roughly speaking. For a fixed |$\sigma $| that is not a corner of |$\partial \Omega $|, recall the definition of |$R_{\zeta , \sigma }$| in (5), and consider
If |$\kappa (\sigma )$| is the curvature of |$\partial \Omega $| at |$\sigma $|, then |$R_\Omega (\sigma )$| is at least as large as the radius of curvature
which is also the radius of the osculating circle at |$\sigma $|. Geometrically, |$R_\Omega (\sigma )$| is the radius of the smallest disk tangent to |$\partial \Omega $| at |$\sigma $|, which contains |$\Omega $|, if such a disk exists, and it is equal to |$\infty $| otherwise. The latter case occurs, for instance, if |$\sigma $| lies on a straight line segment contained in |$\partial \Omega $|. However, if |$\partial \Omega $| is sufficiently curved on a segment of |$\partial \Omega $|, then |$R_\Omega $| will be bounded above there. We obtain in such a situation a non-trivial upper bound on |$c(\Omega )$|.
The result implies spectral constant estimates similar to the one in Corollary 5 above. It also generalizes some similar results in the literature. See Section 5 for further details and examples.
1.4.5 An unresolved matter
We have mentioned above that |$c(\Omega ) = 0$| if and only if |$\Omega $| is a disk. With some additional effort, we will show in Section 5 that the condition |$a(\Omega ) = 0$| also characterizes disks. In this case, we have the equality |$a(\Omega ) = c(\Omega )$|. It is natural to ask whether other domains exist for which the equality occurs, or if the case of the disk is exceptional.
As a consequence of Theorem 2 and the exceptional cases of Neumann’s lemma, we see that the strict inequality holds whenever |$\Omega $| is a triangle or a quadrilateral. The authors have not been able to confirm that the inequality holds in any other examples.
1.5 Notations
Some of our notation has already been introduced above. For a continuous function |$f$| defined on a set |$X$|, we denote by |$\| f \|_{X}$| the supremum of |$|f|$| over |$X$|. For cosets of the form |$f + {\mathbb{C}}\mathbf{1}$| we use the convention
with similar convention for real-valued |$f$| and cosets |$f + {\mathbb{R}}\mathbf{1}$|. A norm |$\| \cdot \|$| without a subscript usually denotes a linear functional norm or a total variation norm of a measure. The distinction will be unimportant and should anyway be easy to deduce from context. We use boldface letters, such as |$\mathbf{x}$|, to denote vectors in |${\mathbb{R}}^{n}$|, and plain letters, such as |$x_{j}$|, to denote the coordinates.
2 The Three-Measures Theorem
2.1 Definitions of relevant spaces and operators
Theorem 1 will be proved as a corollary of our analysis of three-point configurations
where |$x$| is an element of a given normed space |$\mathcal{N}$|, and |$\ell _{1}, \ell _{2}, \ell _{3} \in \mathcal{N}^{*}$| are three bounded linear functionals on |$\mathcal{N}$|. A point configuration of this type has to satisfy certain conditions. For instance, we must have the distance bound
Our principal interest will be in estimating the radius of the smallest disk that contains such a three-point set.
In order to use the tools of functional analysis, we will formulate our problem as one of estimating the norm of an operator between normed spaces. To this end, we use the space |${\mathbb{C}}^{3}$| of triples of complex numbers, and we equip it with the following norm:
Similarly to our previous notational conventions, we shall set |$\mathbf{1}:= (1,1,1) \in{\mathbb{C}}^{3}$|. The quotient norm in the quotient space |${\mathbb{C}}^{3} / {\mathbb{C}} \mathbf{1}$| satisfies
and it has the geometric interpretation adequate to our problem: it is the radius of the smallest disk containing the three point set |$\{a,b,c\}$|. Given a normed space |$\mathcal{N}$| and three linear functionals |$\ell _{1}, \ell _{2}, \ell _{3} \in \mathcal{N}^{*}$|, we introduce the linear operator |$\mathcal{L}: \mathcal{N} \to{\mathbb{C}}^{3} / {\mathbb{C}} \mathbf{1}$| defined by
With these conventions, each three-point configuration |$\big ( \ell _{1}(x), \ell _{2}(x), \ell _{3}(x) \big )$| is contained in a disk of radius at most |$\|\mathcal{L}\|_{\mathcal{N} \to{\mathbb{C}}^{3} / {\mathbb{C}} \mathbf{1}} \cdot \|x\|_\mathcal{N}$|. We want to estimate the operator norm |$\|\mathcal{L}\|_{\mathcal{N} \to{\mathbb{C}}^{3} / {\mathbb{C}} \mathbf{1}}$|.
2.2 Statement of the theorem
Without any information regarding the space |$\mathcal{N}$| or the functionals |$\ell _{1}, \ell _{2}, \ell _{3}$|, the optimal estimate is
Indeed, we see that we cannot do better by choosing |$\mathcal{N} = {\mathbb{C}}$|, |$x = 1$|, and the functionals (scalars) to be the vertices of an equilateral triangle inscribed in the unit circle. For instance,
The sides of the triangle have the common length equal to |$| \ell _{i} - \ell _{j}| = \sqrt{3}$|, and the smallest disk containing the three points |$\ell _{i}(x) = \ell _{i}$| is the unit disk itself. Thus, in this case, (19) holds with equality. The estimate holds in general as a consequence of Jung’s theorem, which appeared first in [11], and which in the context of the plane says that any set of diameter |$d$| is contained in a disk of radius |$d/\sqrt{3}$|. In our setting |$d \leq \max _{j,k} \|\ell _{j} - \ell _{k}\|_{\mathcal{N}^{*}}$|, and so the estimate (19) follows from Jung’s theorem.
In our intended application, the role of the space |$\mathcal{N}$| is played by |$C(X)$|, the Banach space of continuous functions on a compact Hausdorff space |$X$|, and the functionals are given by integration against real-valued measures
It turns out that the three-point configurations that arise in this way are contained in disks of radius smaller than predicted by Jung’s theorem. The main result of the section is the following.
It is the “|$\leq $|” estimate in (20) that is the critical one. The lower bound “|$\geq $|” follows from the definition of the functional norm
We will spend the rest of the section on proving Theorem 7. The outline of the proof is as follows. We will first use duality to formulate the problem in terms of the adjoint operator |$\mathcal{L}^{*}$| between the dual spaces. Next, a discretization will help us reduce the dual problem to a finite-dimensional optimization problem. Finally, we will solve the finite-dimensional problem by the use of techniques of convex analysis.
Before proceeding, we remark that the natural generalization of the above theorem to an arbitrary |$n$|-tuple of real-valued measures is valid. See Theorem 17 below.
2.3 Dual problem
Let us denote by |$Y$| the space |${\mathbb{C}}^{3}/{\mathbb{C}} \mathbf{1}$| equipped with the norm in (17). Then the dual space |$Y^{*}$| is the two-dimensional space of three-tuples |$(\alpha ,\beta , \gamma )$| of complex numbers that satisfy
and the norm on |$Y^{*}$| is given by
In the case |$\mathcal{N} = C(X)$|, the dual space |$(C(X))^{*}$| is just the space of finite Borel measures on |$X$|. The adjoint operator |$\mathcal{L}^{*}: Y^{*} \to \mathcal{N}^{*}$| is then given by
and the estimate (20) is equivalent to
Since |$\alpha + \beta = -\gamma $| and |$(\alpha + \beta + \gamma )\mu _{3} = 0$|, we may rewrite the above inequality into
where
Note that |$\nu _{1}$| and |$\nu _{2}$| are real-valued if |$\mu _{1}, \mu _{2}, \mu _{3}$| are real-valued. Theorem 7 is thus a consequence of the following slightly more general statement in which the topological structure of |$X$| does not play a role.
In our next step, we shall simplify the problem further, and show that Proposition 8 can be established by considering finite sets |$X$| only.
2.4 Discretization
With notations as in Proposition 8, set |$\sigma :=|\nu _{1}|+|\nu _{2}|$|. Then |$\sigma $| is a positive finite measure on |$X$|, and by the Radon–Nikodym theorem we have |$d\nu _{1}=f\,d\sigma $| and |$d\nu _{2}=g\,d\sigma $|, where |$f,g$| are bounded real measurable functions on |$X$|. For a moment, let |$\|\cdot \|_{\sigma ,1}$| denote the norm
Then Proposition 8 is equivalent to the inequality
We will say that a function is simple if it only takes on a finite number of distinct values. By standard measure theory, there exist simple measurable real functions |$f_{m}$|, |$g_{m}$| on |$X$| such that |$f_{m}\to f$| and |$g_{m}\to g$| uniformly on |$X$|. Clearly |$\| \alpha f_{m} + \beta g_{m}\|_{\sigma ,1} \to \| \alpha f + \beta g\|_{\sigma ,1}$|. Likewise |$\|f_{m}\|_{\sigma ,1}\to \|f\|_{\sigma ,1}$| and |$\|g_{m}\|_{\sigma ,1}\to \|g\|_{\sigma ,1}$| and |$\|f_{m}-g_{m}\|_{\sigma ,1}\to \|f-g\|_{\sigma ,1}$|. Thus, if the inequality (24) holds for each pair of simple functions, then it holds for |$f,g$|. So it suffices to establish (24) when |$f,g$| are simple measurable real functions.
Hence, suppose that |$f,g$| are simple measurable real functions on |$X$|. We can write them as |$f=\sum _{j=1}^{n} a_{j}1_{X_{j}}$| and |$g=\sum _{j=1}^{n} b_{j} 1_{X_{j}}$|, where |$\{ X_{1},\dots ,X_{n} \}$| is a measurable partition of |$X$|, and |$a_{j},b_{j}\in{\mathbb{R}}$| for all |$j$|. The inequality in (24) becomes
Writing
and
we see that this becomes
where now |$\mathbf{x},\mathbf{y}$| are vectors in |${\mathbb{R}}^{n}$| and |$\|\cdot \|_{1}$| denotes the usual |$\ell ^{1}$|-norm on |${\mathbb{R}}^{n}$| given by
To summarize, to prove Proposition 8 and consequently to prove Theorem 7, it suffices to establish the following discrete result.
This reduction of the problem to the finite-dimensional setting allows us to use the tools of convex analysis.
2.5 Optimization over a convex set
Consider the set
Thus |$C_{n}$| is a compact convex polytope in |${\mathbb{R}}^{n} \times{\mathbb{R}}^{n}$|, and so it has a finite number of extreme points. That is, points of |$C_{n}$| that do not lie in the interior of any line segment in |$C_{n}$|. A well-known theorem of Carathéodory says that each point of a compact convex polytope is a convex combination of its extreme points.
In order to establish Proposition 9, it suffices to show that the inequality (26) holds for every extreme point of |$C_{n}$|.
From the above lemma and our sequence of reductions above, it follows that in order to prove Theorem 7 it suffices show that the inequality (26) holds at every extreme point of the polytope |$C_{n}$|. Proposition 11 below characterizes these extreme points by partitioning them into three equivalence classes.
Note that |$C_{n}$| is invariant under the following linear symmetries:
where |$\pi $| is any permutation of |$\{1,2,\dots ,n\}$|,
for any choice of |$\epsilon _{1},\dots ,\epsilon _{n}\in \{-1,1\}$|,
and
Denote by |$G_{n}$| the group generated by these symmetries. As these symmetries are linear automorphisms of |${\mathbb{R}}^{n} \times{\mathbb{R}}^{n}$|, it is clear that |$G_{n}$| leaves invariant the set of extreme points |$C_{n}$|. We say that two extreme points of |$C_{n}$| are |$\textit{G}_{n}$|-equivalent if there is an element of |$G_{n}$| mapping one of them to the other. Thus the action of |$G_{n}$| on |$C_{n}$| partitions the set of extreme points of |$C_{n}$| into a finite number of equivalence classes. Note that if the inequality (26) holds for some |$(\mathbf{x},\mathbf{y}) \in C_{n}$|, then it holds also for any point of |$C_{n}$| in the orbit of |$(\mathbf{x},\mathbf{y})$| under the group action of |$G_{n}$| on |$C_{n}$|.
The extreme points of |$C_{n}$| are identified in the following proposition.
One can readily check that each of the three above pairs really is an extreme point of |$C_{n}$|. We omit the proof, since we do not actually need this fact. In the case that |$n = 1$|, the same result holds, but only the first kind of pair can arise. Likewise, if |$n = 2$|, the same result holds, but only the first two types of pairs can arise.
We will prove Proposition 11 in Section 2.6. For now let us see how Theorem 7 follows. In order to verify (26) for all extreme points of |$C_{n}$|, it suffices to verify the inequality for the three pairs of vectors appearing in Proposition 11. This is an easy task. For instance, if |$(\mathbf{x},\mathbf{y})$| is the second pair in Proposition 11, then we have
The inequality for the other two pairs is verified similarly. Then from Lemma 10 we conclude that Proposition 9 holds, from which Theorem 7 follows by the earlier reduction.
It remains to prove Proposition 11.
2.6 Extreme points of the polytope
In the proof of Proposition 11, the group |$G_{n}$| generated by the symmetries (29)–(32) will be extensively used. In particular we will use the property that |$(\mathbf{x},\mathbf{y})$| is an extreme point of |$C_{n}$| if and only if some extreme point of |$C_{n}$| is |$G_{n}$|-equivalent to it. Moreover, the following two observations will be useful to single out.
More generally, if for two distinct indices |$j, k$| we have that two of the quantities |$x_{j}x_{k}$|, |$y_{j}y_{k}$| and |$(x_{j}-y_{j})(x_{k}-y_{k})$| are non-zero and have the same sign, then |$(\mathbf{x},\mathbf{y})$| is not an extreme point of |$C_{n}$|.
The more general statement follows by applications of a sequence of symmetries in (29)–(32) to transform |$(\mathbf{x},\mathbf{y})$| satisfying the more general assumption into a point |$(\mathbf{x^{\prime}}, \mathbf{y^{\prime}})$| where the first two coordinates of the vectors |$\mathbf{x^{\prime}}$| and |$\mathbf{y^{\prime}}$| are positive.
If for a pair |$(\mathbf{x},\mathbf{y}) \in C_{n}$| the vector |$\mathbf{x}$| or |$\mathbf{y}$| has at least three non-zero coordinates, then |$(\mathbf{x},\mathbf{y})$| is not an extreme point of |$C_{n}$|.
By using symmetries (29)–(31) we may suppose that coordinates |$x_{1}, x_{2}, x_{3}$| are non-zero and positive. If two of the coordinates |$y_{1}, y_{2}, y_{3}$| are positive, then by Lemma 12 we conclude that |$(\mathbf{x},\mathbf{y})$| is not an extreme point of |$C_{n}$|. In the contrary case, two of the coordinates |$y_{1}, y_{2}, y_{3}$| are non-positive. Then again by Lemma 12 and the symmetry (32) the pair |$(\mathbf{x},\mathbf{x-y}) \in C_{n}$| is not extreme, and thus neither is |$(\mathbf{x},\mathbf{y})$|, since these two pairs are |$G_{n}$|-equivalent.
We are ready to prove Proposition 11. We denote by |$\ell ^{1}_{n}$| the space |${\mathbb{R}}^{n}$| equipped with the norm |$\| \cdot \|_{1}$| given by (25). Recall that the extreme points of the unit ball |$B:= \{ \mathbf{x} \in{\mathbb{R}}^{n}: \|\mathbf{x}\|_{1} \leq 1\}$| are the vectors with precisely one non-zero coordinate, this coordinate being equal to |$\pm 1$|.
We will split up the proof into three cases, each case corresponding to one of the pairs in the statement of the proposition.
Case 1: At least one of the norms |$\|\mathbf{x}\|_{1},\|\mathbf{y}\|_{1},\|\mathbf{x-y}\|_{1}$| is strictly less than |$1$|. We will show that in this case |$(\mathbf{x},\mathbf{y})$| is |$G_{n}$|-equivalent to the first pair in the statement of the proposition.
By applying a suitable combination of symmetries (29)–(32), we may suppose that in fact |$\|\mathbf{x-y}\|_{1}<1$|. We claim that |$\mathbf{x}$| must be an extreme point of the unit ball of |$\ell ^{1}_{n}$|. For if not, then it lies at the midpoint of a line segment |$I$| such that |$\|\mathbf{x^{\prime}}\|_{1}\le 1$| for all |$\mathbf{x^{\prime}}\in I$|. Since |$\|\mathbf{x-y}\|_{1} < 1$|, by shrinking |$I$| if necessary, we also have |$\|\mathbf{x^{\prime}-y}\|_{1}<1$| for all |$\mathbf{x^{\prime}}\in I$|. Thus |$I\times \{\mathbf{y}\}$| is a line segment in |$C_{n}$| with interior point |$(\mathbf{x},\mathbf{y})$|, contradicting the fact that |$(\mathbf{x},\mathbf{y})$| is extreme.
Likewise, |$\mathbf{y}$| is extreme in the unit ball of |$\ell ^{1}_{n}$|. Applying a suitable symmetry, we may suppose that |$x_{1}=1$| and |$y_{j}=\pm 1$| for some |$j$|, all the other entries of |$\mathbf{x}$| and |$\mathbf{y}$| being |$0$|. Since we must have |$\|\mathbf{x-y}\|_{1}<1$|, this implies that actually |$j=1$| and |$y_{1}=1$|. Thus |$(\mathbf{x},\mathbf{y})$| is equivalent to the first pair of vectors listed in the statement of the proposition. This concludes Case 1.
Case 2: We have |$\|\mathbf{x}\|_{1} = \|\mathbf{y}\|_{1} = \|\mathbf{x-y}\|_{1} = 1$|, and one of the vectors |$\mathbf{x}$|, |$\mathbf{y}$|, or |$\mathbf{x-y}$| has only one non-zero coordinate. In this case, |$(\mathbf{x},\mathbf{y})$| will be now shown to be |$G_{n}$|-equivalent to the second pair in the statement of the proposition.
Case 3: We have |$\|\mathbf{x}\|_{1} = \|\mathbf{y}\|_{1} = \|\mathbf{x-y}\|_{1} = 1$|, and all of the vectors |$\mathbf{x}$|, |$\mathbf{y}$| and |$\mathbf{x-y}$| have exactly two non-zero coordinates. We will show that |$(\mathbf{x},\mathbf{y})$| is |$G_{n}$|-equivalent to the third pair in the statement of the proposition.
This case is slightly more complicated than the previous two. As before, we may suppose that |$x_{1}> 0$| and |$x_{2}> 0$|. We claim that |$y_{1}$| and |$y_{2}$| cannot both be equal to zero. If they were, then |$\mathbf{x-y}$| has four non-zero coordinates, contrary to the assumption. In fact, precisely one of |$y_{1}$| and |$y_{2}$| must be non-zero. If both were non-zero, then since |$\mathbf{x-y}$| has exactly two non-zero coordinates, we would have |$x_{1} - y_{1} \neq 0$| and |$x_{2} - y_{2} \neq 0$|. Then the three quantities |$x_{1}x_{2}$|, |$y_{1}y_{2}$| and |$(x_{1}-y_{1})(x_{2} - y_{2})$| would be non-zero, and Lemma 12 would imply that |$(\mathbf{x},\mathbf{y})$| is not an extreme point.
Recall that |$\mathbf{x-y}$| has only two non-zero coordinates. Since |$1-t \neq 0$| and |$s \neq 0$|, we conclude from the above that |$t = y_{1}$|. But then |$\|\mathbf{x-y}\|_{1} = 1-t + s = 1$|, and so |$t = s$|. Finally, |$1 = \|\mathbf{y}\|_{1} = t + s = 2s$| shows that |$s = t = 1/2$|, and so |$(\mathbf{x},\mathbf{y})$| is |$G_{n}$|-equivalent to the third pair in the statement of the proposition.
3 Proof of Theorem 1
In addition to Theorem 7 from Section 2, we will also need some facts from plane geometry in order to prove Theorem 1. In particular, we will need to discuss the minimum enclosing disk problem appearing in computational geometry.
3.1 Minimal enclosing disk
Let |$K$| be a compact subset of |$\mathbb{C}$| containing at least two points. Among all closed disks that contain |$K$| there exists a unique one of minimal radius. We will denote this disk by |${\mathbb{D}}_{K}$| and call it the minimal disk for |$K$|. The radius of |${\mathbb{D}}_{K}$| will be denoted by |$R(K)$|.
If |${\mathbb{D}}_{K}$| is minimal for |$K$|, then the intersection |$K \cap \partial{\mathbb{D}}_{K}$| must obviously be non-empty. In fact, this intersection must contain at least two points, and there is also a restriction on the locations of the points in |$K \cap \partial{\mathbb{D}}_{K}$|.
Let |$K$| be a compact subset of |$\mathbb{C}$|, which contains at least two points. Then the intersection |$\partial{\mathbb{D}}_{K} \cap K$| is not contained in any arc of |$\partial{\mathbb{D}}_{K}$|, which has length strictly smaller than half of the circumference of |$ {\mathbb{D}}_{K}$|. In particular, if |$K \cap \partial{\mathbb{D}}_{K} = \{a,b\}$| is a two-point set, then |$a$| and |$b$| are antipodal on |$\partial{\mathbb{D}}_{K}$|.

The initial disk |${\mathbb{D}}_{K}$| is the dashed circle, and we assume that |$\partial{\mathbb{D}}_{K} \cap K$| is contained in the black thick arc. Then |$K$| will be contained in the grey disk, which is obtained from |${\mathbb{D}}_{K}$| by first translating |${\mathbb{D}}_{K}$| in the direction of the positive real axis, and then slightly shrinking the translated disk. This contradicts the minimality of |${\mathbb{D}}_{K}$|.

The thick arc |$J$| between |$a$| and |$b$| is the smallest containing the compact set |$K$|. It follows that the shorter arc between the antipodal points |$\tilde{a}$| an |$\tilde{b}$| must contain points of |$K$|.
Let |$T = \{a, b, c\}$| be a three-point set. If |$D$| is a closed disk for which |$T \subset \partial D$|, and |$T$| is not contained in any arc of |$\partial D$|, which is strictly smaller than half of the circumference of |$D$|, then |$D = {\mathbb{D}}_{T}$|.
Assume, seeking a contradiction, that |$D \neq{\mathbb{D}}_{T}$|, and so that |$R(T)$| is strictly smaller than the radius of |$D$|. Since |$\partial D$| is the unique circle passing through the three points |$a,b,c$|, we must have that |$T \cap \partial{\mathbb{D}}_{T}$| contains precisely two points. Say |$a, b \in \partial{\mathbb{D}}_{T}$| but |$c \not \in \partial{\mathbb{D}}_{T}$|. Lemma 14 implies that |$a$| and |$b$| are antipodal on |${\mathbb{D}}_{T}$|. By translation, rescaling, and rotation, we may assume that |${\mathbb{D}}_{T}$| is the unit disk, |$a = i, b = -i$|, |$c$| has non-negative real part and |$|c| < 1$|. After these operations, we have that |$R(T) = 1$| and the circumference of |$D$| is larger than |$2 \pi $|. Thus by hypothesis, surely |$T$| is not contained in any arc of |$\partial D$| of length strictly smaller than |$\pi $|. But the shorter of the arcs of |$\partial D$| that contains |$T$| is then contained in |$\{ z \in \mathbb{C}: 0 \leq \textrm{Re} z, |z| \leq 1\}$|, and so this arc must have a length smaller than |$\pi $|. This is a contradiction, and the lemma follows.
3.2 Reduction to three-point sets
The following simple result on minimal disks makes it possible to apply Theorem 7 to more than three measures.
Let |$K$| be a compact subset of |${\mathbb{C}}$| containing at least two points. There exists a subset |$T \subset K$|, which contains at most three points and for which |${\mathbb{D}}_{K} = {\mathbb{D}}_{T}$|. In particular, |$R(K) = R(T)$|.
It may be convenient to refer to Figure 4 during the reading of the proof.
If there are two points in |$K$| that are antipodal on |$\partial{\mathbb{D}}_{K}$|, then we take |$T$| to consist of those two points. Clearly |${\mathbb{D}}_{K} = {\mathbb{D}}_{T}$|. In the case that no pair of antipodal points of |$\partial{\mathbb{D}}_{K}$| are contained in |$K$|, let |$J$| be the shortest closed arc of |$\partial{\mathbb{D}}_{K}$|, which contains |$K$|, and let |$a,b \in J \cap K$| be the end-points of |$J$|. By Lemma 14, the length of |$J$| is strictly larger than half of the circumference of |$\partial{\mathbb{D}}_{K}$|, and so |$J$| is the longer of the two arcs between |$a$| and |$b$|. Let |$\tilde{a}$| and |$\tilde{b}$| be points on |$\partial{\mathbb{D}}_{K}$|, which are antipodal to |$a$| and |$b$|, respectively. By assumption, |$\tilde{a} \not \in K, \tilde{b} \not \in K$|. We claim that the shorter of the two open arcs between |$\tilde{a}$| and |$\tilde{b}$| must contain points of |$K$|. If not, then the longer of the two arcs between |$\tilde{a}$| and |$\tilde{b}$| would contain |$K$| in its interior, and this arc has the same length as |$J$|. A routine compactness argument would lead to a contradiction to the minimality of |$J$|.
Let |$T = \{a, b, c\}$|, where |$c \in K$| is any point contained in the shorter open arc between |$\tilde{a}$| and |$\tilde{b}$|. Note that any arc containing |$T$| must contain either |$\tilde{a}$| or |$\tilde{b}$|. Then such an arc contains two antipodal points on |${\mathbb{D}}_{K}$|, and so it has a length that is at least half of the circumference of |${\mathbb{D}}_{K}$|. By Lemma 15 we conclude that |${\mathbb{D}}_{K} = {\mathbb{D}}_{T}$|.
3.3 Finalizing the proof
We are finally ready to give a proof of the equality |$c_{\mathbb{R}}(\Omega ) = c_{\mathbb{C}}(\Omega )$|.
The earlier mentioned extension of Theorem 7 to an n-measures theorem is obtained by employing the same argument as in the above proof. The normed space |${\mathbb{C}}^{n} / {\mathbb{C}} \mathbf{1}$| appearing below is defined analogously to the case |$n=3$| treated in Section 2.1.
We use Lemma 16 to pick a three-point subset |$T$| of |$K = \{ \mu _{j}(f) \}_{j=1}^{n}$| for which we have |$R(K) = R(T)$|, and apply Theorem 7 as in the preceding proof.
4 Proof of Theorem 2
4.1 Exploiting subsequences
We will argue by contradiction in order to prove Theorem 2. That is, we will assume that there exists a convex domain |$\Omega $| with |$a(\Omega ) = 1$|, and so that there exists a sequence of functions |$(f_{n})$| in |$\mathcal{A}(\Omega )$|, which satisfy
and
We shall see that this leads to a contradiction. The proof technique below is different from the one employed by Schober in [17] in his proof of Neumann’s lemma, and analyticity is used only at the very end of the proof. In fact, we shall remark at the end of the section how our arguments lead to a new proof of Neumann’s lemma that is different from the one in [17].
Thus, for now, we assume merely that |$f_{n} \in C(\partial \Omega )$|, and we will derive certain consequences of (33). In the course of the proof we shall replace the sequence |$(f_{n})$| by a subsequence multiple times, and for convenience we will not be changing the subscripts. We may suppose that |$\|f_{n}\|_\Omega = 1$|, and consequently that the images
are contained in a closed disk of radius |$1$| centred at the origin. For large |$n$|, this observation and (33) forces there to be points of the image of |$K_\Omega f_{n}$| outside of any disk centred at the origin of radius strictly less than |$1$|. By exchanging |$f_{n}$| for a unimodular multiple of itself, we may thus assume that there exists a sequence of points |$(\zeta _{n})$| in |$\partial \Omega $| for which we have
Using that the functions |$f_{n}$| are bounded by |$1$| in modulus, and the positive measures |$d\mu _\zeta $| are of unit mass, we obtain
Recall from (3) that |$\rho _{\zeta _{n}}$| denotes the |$ds$|-absolutely continuous part of |$\mu _{\zeta _{n}}$|. The above computation implies that
Compactness of the boundary |$\partial \Omega $| implies that we may assume convergence of the sequence |$(\zeta _{n})$| to some points |$\zeta \in \partial \Omega $|. The following lemma shows that we may replace in (35) the densities |$\rho _{\zeta _{n}}$| with the density |$\rho _{\zeta }$|.
Out next observation extracts more information from (33). Consider the strips
These strips have a fixed large “length” but shrinking “width”. One such strip is marked in Figure 5. We claim that each one of the strips |$S_\delta $| intersects the images |$K_\Omega f_{n}(\Omega )$| non-trivially for infinitely many indices |$n$|. For if not, then for some fixed |$\delta> 0$|, we would have that |$S_\delta \cap K_\Omega f_{n}(\partial \Omega )= \emptyset $| for all sufficiently large |$n$|, which means that the images |$K_\Omega f_{n}(\partial \Omega )$| are entirely contained in |$B(0,1) \setminus S_\delta $|, where |$B(0,1)$| denotes the closed disk of radius |$1$| centred at the origin. But if |$\epsilon _{1}$| and |$\epsilon _{2}$| are sufficiently small positive numbers, then |$B(0,1) \setminus S_\delta \subset B(\epsilon _{1}, 1-\epsilon _{2})$|, a disk of radius |$1-\epsilon _{2}$| centred at the point |$\epsilon _{1} \in{\mathbb{R}}$|. See Figure 5. Recalling the geometric interpretation of the norm |$\| K_\Omega f_{n} + {\mathbb{C}}\mathbf{1}\|_{\partial \Omega }$| as the radius of the smallest disk containing the image of |$K_\Omega f_{n}$|, we would arrive at a contradiction to (33). Thus every strip |$S_\delta $| contains points in the image of |$K_\Omega f_{n}$| for infinitely many |$n$|.

The unit disk in dark grey with the strip |$S_\delta $| removed. The dotted circle containing the dark grey area has a radius slightly smaller than |$1$|.
Since each strip |$S_\delta $| intersects the images of |$K_\Omega f_{n}$| for infinitely many |$n$|, passing to a subsequence and a routine compactness argument produces a sequence |$(\zeta ^{\prime}_{n})$| convergent to some |$\zeta ^{\prime} \in \partial \Omega $|, for which |$K_\Omega f_{n}(\zeta ^{\prime}_{n}) \to \alpha $|, with |$\alpha $| unimodular and lying in the closure of each of the strips |$S_\delta $|. Thus |$\alpha \neq 1$|. We therefore merely need to repeat the previous arguments to see that, after passing to a subsequence, we will have |$f_{n}(\sigma ) \to \alpha $| for almost every |$\sigma $| with respect to the measure |$\rho _{\zeta ^{\prime}} d s$|.
4.2 Proof of Theorem 2
The above arguments are valid for |$f_{n} \in C(\partial \Omega )$|. However, under the assumption of analyticity, the sequence |$(f_{n})$| cannot converge to two different constants on two different sets of positive arclength measure. To make this statement precise, we appeal to the classical theory of analytic functions in the (open) unit disk |${\mathbb{D}} = \{ z \in{\mathbb{C}}: |z| < 1 \}$|. Here [8, Chapter II] is an excellent reference for the claims made in the following proof.
Let |$H^\infty = H^\infty ({\mathbb{D}})$| be the space of bounded analytic functions in |${\mathbb{D}}$|, identified as usual through boundary function correspondence with a weak-star closed subspace of the space |$L^\infty (\partial{\mathbb{D}}) = (L^{1}(\partial{\mathbb{D}})^{*}$| of bounded measurable functions on |$\partial{\mathbb{D}}$|, the dual of the Lebesgue space |$L^{1}(\partial{\mathbb{D}})$| of functions integrable on |$\partial{\mathbb{D}}$| with respect to the Lebesgue measure (arclength measure) on |$\partial{\mathbb{D}}$|. It is well known that a function |$\tilde{f} \in H^\infty $| that vanishes on a subset of positive Lebesgue measure on |$\partial{\mathbb{D}}$| must vanish identically.
4.3 A proof of Neumann’s lemma
We indicate how one may proceed to use our above arguments to obtain a proof of Neumann’s lemma, stating that |$c(\Omega ) = 1$| if and only if |$\Omega $| is a triangle or a quadrilateral. We need only the following simple geometric observation regarding the densities |$\rho _\zeta $|.
Fix |$\zeta \in \partial \Omega $|. Any |$\sigma \in \partial \Omega \setminus \{\zeta \}$| that is not a corner of |$\partial \Omega $| and that satisfies |$\rho _\zeta (\sigma ) = 0$| is contained in the union of at most two line segments of |$\partial \Omega $| containing |$\zeta $|.
It will suffice to show that all |$\sigma $| satisfying the above conditions are contained in at most two different tangent lines to |$\Omega $|. To see this, recall formula (5). The condition |$\rho _\zeta (\sigma ) = (2 \pi R_{\zeta ,\sigma })^{-1} = 0$| gives |$R_{\zeta , \sigma } = \infty $|, and so |$\zeta $| is contained in the tangent line to |$\Omega $| at |$\sigma $|. The tangent line divides the plane |${\mathbb{C}}$| into two half-planes, one of which contains |$\Omega $|. Assume that two different tangent lines, at |$\sigma $| and |$\sigma ^{\prime}$|, intersect at |$\zeta $|. They divide the plane |${\mathbb{C}}$| into four sectors, and by convexity precisely one of those sectors contains |$\Omega $|. Now, any line that passes through |$\zeta $| and the open sector containing |$\Omega $| must separate |$\sigma , \sigma ^{\prime} \in \partial \Omega $|. Therefore, it is not a tangent to |$\Omega $|.
Neumann’s lemma is established as follows. Assume that |$c(\Omega ) = 1$|. From Lemmas 18 and 19 we see that two points |$\zeta , \zeta ^{\prime}$| exist for which the measures |$\rho _\zeta ds$| and |$\rho _{\zeta ^{\prime}} ds$| are mutually singular. From Lemma 20 we deduce that the support of |$\rho _\zeta ds$| is the union of at most two line segments containing |$\zeta ^{\prime}$|, and the complement of the support of |$\rho _\zeta ds$| is also a union of at most two line segments. Thus |$\partial \Omega $| is the union of at most four line segments.
5 Examples
In this section, we compute and estimate the configuration constants for some types of domains.
5.1 Configuration constant of an ellipse
For |$a,b>0$|, let
be the ellipse centred at the origin with semi-axes of lengths |$a$| and |$b$|, respectively. It is quite remarkable that the configuration constant can in this case be computed explicitly.
In order to prove the proposition, our first step is to derive an expression for the density of the Neumann–Poincaré kernel of |$\Omega _{a,b}$|. The boundary |$\partial \Omega _{a,b}$| is parametrized by
Here |$[0,2\pi ]$| can be replaced by any interval of length |$2\pi $|. Recalling formula (4) for |$\mu _\zeta $| and setting |$\zeta = \gamma (s)$|, |$\sigma = \gamma (t)$|, we obtain
Using (37), this formula can be greatly simplified.
The lemma is established by combining (37) and (38), and then using elementary trigonometric identities to simplify the resulting expression.
With this formula in hand, we now evaluate the configuration constant of the ellipse |$\Omega _{a,b}$|.
5.2 Integral estimates
For a general domain, the exact value of |$c(\Omega )$| is often inaccessible. In this section, we will present a simple estimate that is applicable to domains |$\Omega $| with a non-flat part of the boundary that leads to an upper bound on |$c(\Omega )$|.
Assume that we find a Borel measure |$\nu $| on |$\partial \Omega $| such that
If so, then, for every |$\phi \in C(\partial \Omega )$| with |$\|\phi \|_{\partial \Omega }\le 1$|, we have
which shows that the image of |$K_\Omega \phi $| is contained in a disk of radius |$k^\nu _\Omega $| centred at |$\int _{\partial \Omega } \phi \, \textrm{d}\nu $|. Thus,
One approach is to seek a positive measure |$\nu $| on |$\partial \Omega $| satisfying |$\mu _\zeta \ge \nu $| for all |$\zeta \in \partial \Omega $|. Then
and so |$k^\nu _\Omega = 1- \nu (\partial \Omega )$|.
We will construct the largest non-negative Borel measure |$\nu $| on |$\partial \Omega $|, which satisfies |$\mu _\zeta - \nu \geq 0$|. The construction is based on the geometric interpretation of the density |$\rho _\zeta (\sigma )$| in (5) and the quantity |$R_\Omega $| appearing in (15). In order to avoid the need to establish Borel measurability of |$R_\Omega $| defined as a supremum of an uncountable family as in (15), we proceed to define it in a slightly different but equivalent way. Namely, it is easy to see that, given |$\sigma \in \partial \Omega $|, if there exists a closed disk |$\Delta $| such that |$\Omega \subset \Delta $| and |$\sigma \in \partial \Delta $|, then there exists one of smallest radius. We denote this radius by |$R_\Omega (\sigma )$|. Note that if |$\sigma $| is not a corner of |$\partial \Omega $|, then the corresponding disk must be tangent to |$\partial \Omega $| at |$\sigma $|. If no disk passing through |$\sigma $| exists that contains |$\Omega $|, then we set |$R_\Omega (\sigma ):=\infty $|. This happens, for instance, if |$\sigma $| is contained in the interior of a line segment in |$\partial \Omega $|. In particular, |$R_\Omega (\sigma ) = \infty $| for all but a finite number of points of any polygonal domain.

A domain |$\Omega $| with two circles corresponding to values |$R_\Omega (\sigma )$| and |$R_\Omega (\sigma ^{\prime})$|
The function |$R_\Omega :\partial \Omega \to (0,\infty ]$| is lower semicontinuous. In particular, it is Borel measurable.
Let |$\sigma \in \partial \Omega $| and let |$(\sigma _{n})$| be a sequence in |$\partial \Omega $| such that |$\sigma _{n}\to \sigma $|. We need to show that |$\liminf _{n\to \infty }R_\Omega (\sigma _{n})\ge R_\Omega (\sigma )$|. We can suppose that |$L:=\liminf _{n\to \infty }R_\Omega (\sigma _{n})<\infty $|, otherwise there is nothing to prove. Let |$L^{\prime}>L$|. Then, replacing |$(\sigma _{n})$| by a subsequence, we can suppose that |$R_\Omega (\sigma _{n})<L^{\prime}$| for all |$n$|. Thus, for each |$n$|, there exists a closed disk |$\Delta _{n}$| of radius |$L^{\prime}$| such that |$\Omega \subset \Delta _{n}$| and |$\sigma _{n}\in \partial \Delta _{n}$|. The sequence of centres |$(c_{n})$| of the disks |$\Delta _{n}$| is bounded, so there exists a convergent subsequence |$c_{n_{j}}\to c$|. Let |$\Delta $| be the closed disk with centre |$c$| and radius |$L^{\prime}$|. Then we have |$\Omega \subset \Delta $| and |$\sigma \in \partial \Delta $|. It follows that |$R_\Omega (\sigma )\le L^{\prime}$|. As this last inequality holds for all |$L^{\prime}>L$|, we deduce that |$R_\Omega (\sigma )\le L$|. This completes the proof.
We set
By the above lemma, |$\nu $| is a non-negative Borel measure on |$\partial \Omega $|. For any |$\zeta \in \partial \Omega $|, we have
To see this, note that if |$\sigma $| is not a corner and |$R_{\zeta ,\sigma }$| is the radius of the unique circle tangent to |$\partial \Omega $| at |$\sigma $| and passing through |$\zeta $|, then |$R_\Omega (\sigma ) \geq R_{\zeta , \sigma }$|. Therefore, according to (5),
for almost every |$\sigma $| with respect to arclength measure on |$\partial \Omega $|. Inequality (41) follows. Although we shall skip a formal proof, we mention also that |$\nu $| is in fact the largest measure satisfying |$\mu _\zeta \geq \nu $| for all |$\zeta \in \partial \Omega $|. This maximality property of |$\nu $| is to be interpreted in the following sense: if |$\nu ^{\prime}$| is any measure satisfying |$\mu _\zeta \geq \nu ^{\prime}$| for all |$\zeta $|, then |$\nu \geq \nu ^{\prime}$|.

A quadrilateral domain |$\Omega _\epsilon $| with |$a(\Omega _\epsilon )> 1 - (4/\pi )\epsilon $|.
By our earlier discussion, we obtain the following upper estimate for the configuration constant:
Note that this is precisely the assertion of Theorem 6 stated in Section 1.
We will now mention some consequences. Recall that if |$\gamma $| is a plane curve of class |$C^{2}$|, then the radius of curvature of |$\gamma $| is the reciprocal of its curvature.
In this case, one sees from (15) and (16) that |$R_\Omega (\sigma )\le \rho $| for all |$\sigma \in \partial \Omega $|, from which the result follows.
This last result was already known. See for example [7, pp. 45–46] and [12, pp. 128–129]. However the proofs in these references are quite different from the one above.
5.3 Analytic configuration constants of quadrilaterals
Theorem 2 shows that |$a(\Omega ) < 1$| for every |$\Omega $|. Here we show by example that |$a(\Omega )$| may be arbitrarily close to |$1$|. Figure 7 shows a narrow quadrilateral domain for which this phenomenon occurs.
Let |$f$| be a conformal mapping of the interior of |$\Omega _\epsilon $| onto the unit disk |${\mathbb{D}}$|. By Carathéodory’s theorem, |$f$| extends to a homeomorphism of |$\Omega _\epsilon $| onto |$\overline{{\mathbb{D}}}$|, and so clearly |$f\in A(\Omega )$|. Post-composing with a suitable automorphism of |$\overline{{\mathbb{D}}}$|, we may further suppose that |$f(1)=1$| and |$f(-1)=-1$|.
Finally, by trigonometry, |$\theta _{1}$| is related to |$\epsilon $| by |$\tan (\theta _{1}/2)=\epsilon $|, whence |$\theta _{1}=2\arctan \epsilon \le 2\epsilon $|. The result follows.
5.4 Configuration constants equal to zero
Recall the estimate (13) from Section 1, which will be proved in the next section. The estimate is strongest if |$a(W) = 0$|, and in this case we reach the conjectured bound |$K = 2$|. Unfortunately, the only domain |$W$| for which we have |$a(W) = 0$| is a disk, and in this case (13) reduces to the well-known Okubo–Ando bound from [15]. For completeness we give a proof of the statement that |$a(\Omega ) = 0$| if and only if |$\Omega $| is a disk. More precisely, we have the following.
Let |$\Omega $| be a compact convex domain with non-empty interior. The following are equivalent:
- (i)
|$\Omega $| is a disk,
- (ii)
|$c(\Omega ) = 0$|,
- (iii)
|$a(\Omega ) = 0$|.
In the case that |$\Omega $| is a disk, then (5) implies readily that |$\rho _\zeta (\sigma )$| is a constant independent of |$\zeta $|, and so for every |$\zeta \in \partial \Omega $|, the measure |$\mu _\zeta $| is a normalized arclength measure on the circular boundary |$\partial \Omega $|. Then it follows from the definition that |$K_\Omega f$| is a constant function, and consequently |$\|K_\Omega f + {\mathbb{C}} \mathbf{1}\|_{C(\partial \Omega } = 0$|, so |$c(\Omega ) = a(\Omega ) = 0$|. This shows that the implications |$(i) \Rightarrow (ii)$| and |$(ii) \Rightarrow (iii)$| hold.
The conclusion that |$\Omega $| is a disk is now a consequence of the geometric formula for |$\rho _\zeta (\sigma )$| in (5). Fix any |$\sigma \in \partial \Omega $|, which is not a corner. Since the measures |$\mu _\zeta $| are all equal, so are their densities |$\rho _\zeta (\sigma )$|. Then the circles passing through |$\zeta \in \partial \Omega $| and tangential to |$\partial \Omega $| at |$\sigma $| all have the same radius, and so they all coincide with each other. Thus one circle passes through all points |$\zeta \in \partial \Omega $|. Consequently |$\partial \Omega $| is a circle, and so |$\Omega $| is a disk.
6 Application to Numerical Ranges
6.1 Spectral constant estimate
Our principal motivation for the introduction of the analytic configuration constant is the following result that was mentioned in the Section 1 and that we will now prove.
Of course, if |$W$| has no interior, then its convexity forces it to be a line segment. In that case |$T$| is a normal operator, and the spectral theorem gives us the better estimate |$\| f(T)\| \leq \|f\|_{\sigma (T)}$|, where |$f$| may be any Borel measurable function on the spectrum |$\sigma (T)$|. Thus Theorem 28 implies Theorem 3. In what follows, we will assume that |$W$| has non-empty interior.
Let us make some initial remarks before going into the proof of Theorem 28. In the case |$\sigma (T)$| is contained in the interior of |$W$|, then |$f(T)$| is defined, as usual, through the Dunford–Riesz holomorphic functional calculus. If |$\partial W \cap \sigma (T) \neq \emptyset $|, then this definition does not work. Nevertheless, if |$f \in \mathcal{A}(W)$|, then it is a standard result of approximation theory that a sequence of analytic polynomials |$(p_{n})$| exists, which converges to |$f$| uniformly on |$W$|. In the presence of any uniform bound of the form |$\|p(T)\| \leq K \|p(T)\|_{W}$| for polynomials |$p$|, we may then define |$f(T)$| as the limit of the sequence |$(p_{n}(T))$| in the operator norm. Such bounds are known to exists, the strongest known bound |$K \leq 1 + \sqrt{2}$| being due to Crouzeix and Palencia. Theorem 28 improves this estimate given information about the numerical range of |$T$|.
Our proof of Theorem 28 combines the argument of Crouzeix and Palencia from [5] with ideas of Schwenninger and de Vries from [18], where bounds for various functional calculi are derived as a consequence of the existence of extremal functions and extremal vectors. Let |$U$| be an open set in the plane, and |$H^\infty (U)$| be the algebra of bounded holomorphic functions on |$U$|. Given an operator |$T: {\mathcal{H}} \to{\mathcal{H}}$| with |$\sigma (T)$| contained in |$U$|, it is elementary that the quantity
is finite. A normal-families argument shows that an |$f \in H^\infty (U)$| exists with |$\|f\|_{U} = 1$| for which the supremum above is attained. Any such |$f$| will be called for an extremal function. If, moreover, a vector |$x \in{\mathcal{H}}$| with |$\|x\|_{\mathcal{H}} = 1$| exists for which
then we will say that |$x$| is an extremal vector, and |$(f,x)$| is an extremal pair. Unless |$\dim{\mathcal{H}} < \infty $|, an extremal vector might not exist, but we will be able to reduce the proof to the finite-dimensional case. The importance of the concept of extremal pairs |$(f,x)$| stems from the following result. We refer the reader to [1, Theorem 4.5] for a proof (see also [18, Proposition 3]).
The next two lemmas will reduce our task to consideration of finite-dimensional Hilbert spaces, in which extremal vectors exist, and will dispose of the problematic set |$\sigma (T) \cap \partial W$|. The first observation is essentially contained in [18, Proposition 9].
The proof of the next lemma will use affine invariance of the configuration constants. Let us fix |$\alpha , \beta \in{\mathbb{C}}$|, |$\alpha \neq 0$|, and an affine mapping |$A(z):= \alpha z + \beta $|. Then |$A$| is a conformal transformation of |${\mathbb{C}}$| with the additional property of taking a line segment of length |$L$| to a line segment of length |$|\alpha |L$|, and a circle of radius |$R$| to a circle of radius |$|\alpha |R$|. Let |$\widetilde{\Omega } = A(\Omega )$| be the affine image of |$\Omega $| under |$A$|, and recall the formula for the Neumann–Poincaré kernel in (3) and its geometric interpretation. If |$\zeta , \sigma \in \partial \Omega $|, |$E$| is a Borel subset of |$\partial \Omega $|, and |$s, \widetilde{s}$| are the arclength measures on |$\partial \Omega $| and |$\partial \widetilde{\Omega }$|, respectively, then it follows from the properties of |$A$| listed above that
- (i)
|$\theta _\zeta = \theta _{A(\zeta )}$|,
- (ii)
|$|\alpha | s(E) = \widetilde{s}(A(E))$|,
- (iii)
|$|\alpha | R_{\zeta , \sigma } = R_{A(\zeta ), A(\sigma )}$|.
A consequence is that the Neumann–Poincaré kernels |$\{\mu _\zeta \}_{\zeta \in \partial \Omega }$| and |$\{\widetilde{\mu }_{A(\zeta )}\}_{A(\zeta ) \in \partial \Omega }$| of the respective domains satisfy
Then a change of variables shows that |$K_\Omega (\widetilde{f} \circ A) = K_{\widetilde{\Omega }} \widetilde{f}$| for any |$\widetilde{f} \in C(\partial \widetilde{\Omega })$|, and it follows that
Armed with these equalities, we make our second observation.
By Lemma 31, it will be sufficient to establish the estimate (42) whenever |$\Omega $| contains |$W(T)$| in its interior |$\Omega ^{o}$|. Moreover, by Lemma 30, we may assume that |$T$| is an operator on a finite-dimensional Hilbert space |${\mathcal{H}}$|. Let |$U = \Omega ^{o}$| and |$(f,x)$| be an extremal pair corresponding to |$U$| and the operator |$T$|. If |$\|f(T)\| \leq 1$|, then (42) certainly holds, so we may assume that |$\|f(T)\|> 1$|.
Funding
This work has been done during Malman’s visit at Département de mathématiques et de statistique, Université Laval, supported by Simons-CRM Scholar-in-Residence program. Mashreghi’s research was supported by an NSERC Discovery Grant and the Canada Research Chairs program. O’Loughlin was supported by a CRM-Laval Postdoctoral Fellowship. Ransford’s research was supported by an NSERC Discovery Grant.
Acknowledgments
We are grateful to the referee for the careful reading of the paper, and for the suggestions that we used to improve the manuscript.
A Double-Layer Potential on a General Convex Domain
A.1 Convex domains
Let |$\Omega $| be a compact convex domain in the plane |$\mathbb{C}$| with non-empty interior |$\Omega ^{o}$|. We will be making no assumptions regarding smoothness of the boundary |$\partial \Omega $|. However, convexity itself implies that |$\partial \Omega $| is a rectifiable simple closed curve with some additional properties.
The orientation of |$\partial \Omega $| is to be counter-clockwise (i.e., positive), and we use |$\sigma ^{\prime} \uparrow \sigma $| and |$\sigma ^{\prime} \downarrow \sigma $| to denote, respectively, the counter-clockwise and clockwise one-sided convergence of |$\sigma ^{\prime}$| to |$\sigma $| within |$\partial \Omega $|. As a consequence of convexity of |$\Omega $|, the one-sided tangent angles exist at every point |$\sigma \in \partial \Omega $|, are locally given by
and satisfy
Strict inequality may occur at most at a countable subset of |$\partial \Omega $|. If it occurs at |$\sigma $|, then we say that |$\partial \Omega $| has a corner at |$\sigma $|. At any point that is not a corner, the tangent angle
is well-defined, and so is the tangent |$T(\sigma ):= e^{i \alpha (\sigma )}$| itself. If |$t \mapsto \gamma (t)$| is any (positively-oriented) parametrization of |$\partial \Omega $|, and we set |$\alpha (\sigma ) = \alpha _{+}(\sigma )$| at the corners, then the locally defined function |$\alpha (\gamma (t))$| is increasing in |$t$|, and consequently the tangent |$T$| is continuous at every point that is not a corner of |$\partial \Omega $|. At a corner, the discontinuity of |$T$| amounts to a jump of the argument of |$T$|. We denote by |$N(\sigma ):= -i T(\sigma )$| the outward-pointing normal at |$\sigma \in \partial \Omega $|.
A.2 Double-layer potential
Let |$\Omega ^{o}$| denote the interior of |$\Omega $|. To each |$z \in \Omega ^{o}$| we associate the measure |$\mu _{z}$| on |$\partial \Omega $|, which for any arc |$J \subset \partial \Omega $| satisfies
Here |$\arg (\sigma - z)$| is any locally defined continuous determination of the argument function on |$\partial \Omega $|. Non-negativity of |$\mu _{z}$| follows from convexity of |$\Omega $| and our choice of positive orientation of |$\partial \Omega $|. With respect to this orientation, every arc |$J =(a,b) \subset \partial \Omega $| has a start-point |$a$| and an end-point |$b$|, and it is easy to see that
In particular, |$\mu _{z}(\partial \Omega ) = 2$|.
The measure |$\mu _{z}$| is absolutely continuous with respect to arclength |$s$| on |$\partial \Omega $|. Indeed, if |$\sigma _{0} \in \partial \Omega $|, |$J_{n} = (a_{n}, b_{n})$| is a sequence of arcs of |$\partial \Omega $| that are shrinking to |$\sigma _{0}$|, and |$|J_{n}|$| are the corresponding arclengths, then
We use above an appropriate locally defined holomorphic branch of the logarithm. As |$n \to \infty $|, the first factor inside the brackets satisfies
while the second factor stays bounded as a consequence of the inequality |$|b_{n} - a_{n}| \leq |J_{n}|$|. Thus
and from elementary measure theory we obtain that |$\mu _{z}$| is absolutely continuous with respect to |$s$|. If moreover |$\sigma _{0}$| is not a corner, then it can be shown that
and so in additional to boundedness we even have the convergence
Thus the Radon–Nikodym derivative satisfies
at every |$\sigma \in \partial \Omega $|, which is not a corner.
A.3 Boundary kernel
The Neumann–Poincaré kernel is the boundary version of the family of measures |$\{\mu _{z}\}_{z \in \Omega ^{o}}$| introduced above. To each point |$\zeta \in \partial \Omega $| we associate the Borel probability measure on |$\partial \Omega $| defined by (A.1) for arcs |$J \subset \partial \Omega $| not containing the point |$\zeta $|. Because |$\zeta \in \partial \Omega $|, this definition implies that |$\mu _\zeta ( \partial \Omega \setminus \{\zeta \}) = \theta _\zeta /\pi $|, where |$\theta _\zeta = \pi - \alpha _{+}(\zeta ) + \alpha _{-}(\zeta )$| can be interpreted as the angle of the aperture at |$\zeta $|. Indeed, |$\theta _\zeta $| is equal to the increase in the argument of |$\sigma - \zeta $| as we traverse one loop around |$\partial \Omega $| starting and ending at the point |$\zeta $|, and since |$\mu _\zeta $| is a probability measure, we must have
With the exception of this possible point mass, |$\mu _\zeta $| is otherwise absolutely continuous with respect to arclength. The corresponding Radon–Nikodym derivative is given by
The formula (A.3) is established analogously to (A.2). All in all, the measure |$\mu _\zeta $| can be decomposed as
where |$\delta _\zeta $| is a unit mass at |$\zeta \in \partial \Omega $|, |$\theta _\zeta $| is the angle of the aperture at |$\zeta $| (with the convention that |$\theta _\zeta = \pi $| if |$\zeta $| is not a corner), and where the density |$\rho _\zeta $| is given by (A.3).
A.4 Weak-star convergence
We establish now that
in the sense of the weak-star topology on measures. Note that if |$B = B(\zeta , \delta )$| is a ball of radius |$\delta> 0$| centred at |$\zeta \in \partial \Omega $|, then expressions (A.2) and (A.3) for the densities of |$\mu _{z}$| and |$\mu _\zeta $| show that
for every |$f \in C(\partial \Omega )$|. In particular, choosing |$f = \mathbf{1}$|, we obtain
Since
we see that given |$\epsilon> 0$| for all sufficiently small |$\delta> 0$| we will have
Returning to general |$f \in C(\partial \Omega )$|, we have
On the right-hand side, the first term tends to zero as |$z \to \zeta $|, the second can be made arbitrarily small by continuity of |$f$|, the crude estimate |$\mu _{z}(B) \leq 2$| and choice of sufficiently small |$\delta $|, the third is dominated in modulus by |$\|f\|_{\partial \Omega } \cdot \epsilon $| for |$z$| sufficiently close to |$\zeta $|, and the fourth is dominated by |$\|f\|_{\partial \Omega } \cdot \mu _\zeta (B \setminus \{\zeta \})$|, which also can be made arbitrarily small by choice of sufficiently small |$\delta $|. The desired weak-star convergence follows.
References