Moduli Space of Hermite-Einstein Connections
This is the first of three posts where we will delve into the moduli space of Hermite-Einstein connections. We will start by examining some of the properties of these connections. Next, we will prove that the moduli space of irreducible Hermite-Einstein connections constitutes an open subset of the moduli space of stable bundles. After that, we will demonstrate that the moduli space of Hermite-Einstein connections is a smooth manifold and explore an application of this theory in the context of surfaces. Finally, we will consider the Kobayashi–Hitchin correspondence.
Throughout the trilogy, $M$ denotes a compact, connected Kähler manifold with a Kähler form $\omega$, and $\dim_\Bbb C M \ge 2$ unless stated otherwise. Recall that in complex geometry, we have the three important operators
- The Lefschetz operator $$ \begin{align*} L : \Omega^k(M) &\to \Omega^{k+2}(M) \\ \alpha &\mapsto \alpha \wedge \omega. \end{align*} $$
- The Hodge $\star$-operator $$ \begin{align*} \star : \Omega^k(M) &\to \Omega^{2n-k}(M). \end{align*} $$
- The dual Lefschetz operator $$ \begin{align*} \Lambda : \Omega^k(M) &\to \Omega^{k-2}(M). \end{align*} $$
Also $\frac{\omega^n}{n!}$ serves as the volume form of $M$. This is used implicitly as a measure in order to integrate functions on $M$. Recall from Hermite-Einstein Metrics and Stability the definition for a Hermite-Einstein connection.
Since we’ve already had discussions about the properties of these kinds of connections in the past, we’ll go straight into the properties of bundles admitting such connections.
- There exists a unique holomorphic structure on $E$ compatible with $\nabla$.
- If $\deg(E) := \int_M c_1(E) \wedge [\omega]^{n-1} < 0$, then $E$ has no holomorphic sections.
- $E$ is semi-stable and is a direct sum of stable bundles.
- $\nabla$ minimizes the Yang-Mills functional $$ \operatorname{YM}(\nabla) = \int_M |F_\nabla|^2. $$
- This is just an application of the Newlander-Nirenberg theorem.
- Note that for any holomorphic section $s$ we have that $$ \frac{1}{2}\Delta |s|^2 = h(i\Lambda_\omega F_\nabla(s), s) - |\nabla s|^2. $$ Now the Hermite-Einstein condition gives $$ \frac{1}{2}\Delta |s|^2 = \lambda|s|^2 - |\nabla s|^2. $$ Integrating this equation yields $$ \int_M \frac{1}{2}\Delta |s|^2 = \int_M \lambda|s|^2 - \int_M |\nabla s|^2 $$ and since $\int_M \frac{1}{2}\Delta |s|^2 = 0$ by the divergence theorem we obtain $$ \lambda \int_M |s|^2 - \int_M |\nabla s|^2 = 0. $$ Both of these integrands are non-negative and so both of the integrals are non-negative. Hence $$ \lambda \int_M |s|^2 = 0 \quad \text{and} \quad \int_M |\nabla s|^2 = 0. $$ As $\deg(E) < 0$ we know that $\lambda < 0$ and so $|s|^2 = 0$ everywhere on $M$ giving us that $s = 0$.
- Let $F$ be a reflexive subsheaf of $E$ with $$ 0 < s = \operatorname{rank}(F) < r = \operatorname{rank}(E). $$ The inclusion $\iota : F \hookrightarrow E$ induces a morphism $\det \iota : \det(F) \to \left(\bigwedge^s E\right)^{\ast\ast} \cong \bigwedge^s E$ which is a monomorphism of sheaves since $F$ is torsion-free. Tensoring with $(\det(F))^\ast$ one obtains a non-trivial global holomorphic section $$ f : \mathcal{O}_M \to \bigwedge^s E \otimes (\det(F))^\ast. $$ Also $\lambda = \frac{\pi n}{\operatorname{vol}(M)} \mu(E)$, and $\det(F)$ as a line bundle admits a Hermite-Einstein metric with a factor $$ \gamma = \frac{\pi n s}{\operatorname{vol}(M)}\mu(F). $$ The bundle $\bigwedge^s E \otimes (\det(F))^\ast$ is thus Hermite-Einstein with a factor $s\lambda - \gamma$, and from the existence of the section $f$ one gets $$ s\lambda - \gamma \ge 0 $$ or equivalently $$ \mu(F) \le \mu(E) $$ as desired. To see the second assertation, note that if $E$ is not stable, then there exists a reflexive subsheaf $F$ with $\mu(F) = \mu(E)$ and the sequence $$ 0 \longrightarrow F \longrightarrow E \longrightarrow E/F \longrightarrow 0 $$ splits. From $\mu(F) = \mu(E)$ one gets that $F$ and $E/F$ satisfy the Hermite-Einstein condition with a factor $\lambda$ and inducting on the rank of $E$ completes the proof.
- For any curvature tensor $F$ of $E$ consider the characteristic number $$ C=\int_{M} \operatorname{tr}\left(\frac{i}{2 \pi} F_\nabla-a \omega \operatorname{id}_{E}\right)^2 \wedge \omega^{n-2}, $$ where $a = \frac{\mu(E)}{\int_M \omega^n}$. Expanding the square yields $$ C=\int_M \left(c_{1}^{2}(E)-2 c_{2}(E)\right)\wedge [\omega]^{n-2} -ra^{2} \int_{M} \omega^{n}. $$ Decomposing $F_\nabla$ into its self-dual and anti-self-dual parts $F_\nabla = F^+_\nabla + F^-_\nabla$ we obtain $$ C=-(n-1)!\left\|\frac{i}{2 \pi} F^{-}_\nabla\right\|^{2}+\int_{M} \operatorname{tr}\left(\frac{i}{2 \pi} F^{+}_\nabla - a \omega \operatorname{id}_{E}\right)^{2} \wedge \omega^{n-2}. $$ Let $k$ be the last integral in the above expression, then $k \ge 0$ and the equality holds if and only if $F_\nabla$ is Einstein. Now $$ \begin{aligned} k & =\int_{M} \operatorname{tr}\left(\frac{i}{2 \pi} F^{+}_\nabla\right)^{2} \wedge \omega^{n-2}-2 a c_{1}(E) \wedge \omega^{n-1}+r a^{2} \omega^{n} \\ & =(n-1)!\left\|\frac{i}{2 \pi} F^{+}_{\nabla}\right\|^{2}-2(n-1) \cdot(n-2)!\left\|\frac{i}{2 \pi} F_\nabla^{2,0}\right\|^{2}-r a^{2} \int_{M} \omega^{n} \end{aligned} $$ and so $$ \left\|\frac{i}{2 \pi} F^{+}_{\nabla}\right\|^{2} = \frac{1}{(n-1)!}\left(k+2(n-2) \cdot(n-2)!\left\|\frac{\sqrt{-1}}{2 \pi} F_\nabla^{2,0}\right\|^{2}+r a^{2} \int_{M} \omega^{n}\right), $$ which combined with $\left\|\frac{i}{2 \pi} F_\nabla\right\|^{2}=-\frac{1}{(n-2)!} C+\frac{1}{(n-2)!} k$ gives $$ \begin{aligned} \int_{M} |F_\nabla |^{2} &= \left\|F^{+}_{\nabla}\right\|^{2}+\left\|F^{-}_{\nabla}\right\|^{2} \\ &= 4 \pi^{2}\left(\frac{n C}{(n-1)!}+\frac{2(n-2)}{n-1}\left\|\frac{\sqrt{-1}}{2 \pi} F_\nabla^{2,0}\right\|^{2}+\frac{n r}{(n-1)!} a^{2} \int_{M} \omega^{n}\right. \\ & \left.-\frac{1}{(n-2)!}\int_M \left(c_{1}^{2}(E)-2 c_{2}(E)\right) \wedge [\omega]^{n-2}\right). \end{aligned} $$ It follows that $\operatorname{YM}$ is bounded below by a topological constant given by $$ \begin{aligned} \operatorname{YM} &\ge \frac{4 \pi^{2}}{r(n-2)!}\left(\frac{\left(\int_M c_{1}(E) \wedge [\omega]^{n-1} \right)^{2}}{\int_M [\omega]^{n}}-\int_M c_{1}^{2}(E)\wedge [\omega]^{n-2}\right) \\ & +\frac{1}{n-1}\frac{\left(\int_M c_{1}(E) \wedge [\omega]^{n-1} \right)^{2}}{\int_M [\omega]^{n} } \\ & \left.+\int_M \left(2 r c_{2}(E)-(r-1) c_{1}^{2}(E)\right)\wedge [\omega]^{n-2}\right). \end{aligned} $$
Next, we’ll define so-called irreducible connections.
- $E$ has no $\nabla$-invariant subbundle $F$ except the trivial one and itself.
- The holonomy group acts irreducibly on each fibre.
- Every parallel skew-Hermitian endomorphism of $E$ is constant.
- Every parallel endomorphism of $E$ is constant.
- $E$ is not a direct sum of proper $\nabla$-invariant subbundles.
- $E$ is not an orthogonal sum of proper $\nabla$-invariant subbundles.
Under certain topological constraints one can show that for every rank $2$ bundle over $M$, any Hermite-Einstein connection is irreducible. Moreover this yields stability for $E$.
Ultimately, we are aiming to build the theory around the Kobayashi–Hitchin correspondence. I’ve briefly mentioned about this in Hermite-Einstein Metrics and Stability as the theorem by Donaldson, Uhlenbeck and Yau, but let’s briefly go over a bit of history around this before we march onto looking at moduli spaces.
The whole concept of stability originates from algebro-geometric notions, whereas the Hermite-Einstein condition is purely differential-geometric. On one hand we have the moduli space of (poly)stable holomorphic vector bundles and on the ohter, the moduli space of Hermite-Einstein connections up to gauge equivalence. The Kobayashi–Hitchin correspondence states that these two moduli spaces are the same.
To set up shop, let $E$ be a complex vector bundle of rank $r$ over a complex manifold $M$ and denote the group of smooth bundle automorphisms of $E$ by $\mathrm{GL}(E)$. The following propositions will serve as our guiding light towards the definition for the moduli space of holomorphic structures.
The connection in the above proof is called the Chern connection. For a proof of the proposition, see Holomorphic Connections and The Atiyah Class. The converse is the bit about which we are mostly interested today.
Denote now by $\mathscr{D}^{0,1}(E)$ the set of all $\Bbb C$-linear maps
\[\nabla^{0,1} : \Omega^0(E) \to \Omega^{0,1}(E),\]satisfying $\nabla^{0,1}(fs) = \bar{\partial}f \otimes s + f \nabla^{0,1}s$. Recall from Connections that $\mathscr{D}^{0,1}(E)$ is an affine space. Let $\mathscr{H}^{0,1}(E) \subset \mathscr{D}^{0,1}(E)$ be the subset consisting of $\nabla^{0,1}$ with
\[\left(\nabla^{0,1}\right)^2 = \nabla^{0,1} \circ \nabla^{0,1} = 0.\]Now if $E$ is a holomorphic vector bundle, then $\bar{\partial} : \Omega^0(E) \to \Omega^{0,1}(E)$ gives a well-defined element of $\mathscr{H}^{0,1}(E)$. Conversely, by the Koszul-Malgrange Theorem every $\nabla^{0,1} \in \mathscr{H}^{0,1}(E)$ defines a unique holomorphic structure in $E$ for which $\nabla^{0,1} = \bar{\partial}$. Hence one can consider $\mathscr{H}^{0,1}(E)$ as the set of holomorphic bundle structures in $E$. The action of $\mathrm{GL}(E)$ on $\mathscr{D}^{0,1}(E)$ is given by conjugation
\[(f,\nabla^{0,1}) \mapsto f^{-1} \circ \nabla^{0,1} \circ f.\]Note that $f^{-1} \circ \nabla^{0,1} \circ f = \nabla^{0,1} + f^{-1}\bar{\partial}f$. For $\nabla^{0,1} \in \mathscr{H}^{0,1}(E)$ we have that
\[\begin{align*} (f^{-1} \circ \nabla^{0,1} \circ f)^2 = 0 \end{align*}\]and so $\mathrm{GL}(E)$ preserves $\mathscr{H}^{0,1}(E)$. Therefore, one can introduce the orbit space
\[\mathscr{H}^{0,1}(E)/\mathrm{GL}(E),\]which we will call the moduli space of holomorphic structures in $E$. Unfortunately, there is a slight problem with this definition. The space we are considering is in general non-Hausdorff. The way to resolve this is to consider (semi)stable bundles.
Fix now a Hermitian structure $h$ in the complex vector bundle $E$ and consider the subgroup $\mathrm{U}(E,h)$ of $\mathrm{GL}(E)$ consisting of unitary automorphisms of $E$. Let $\mathscr{D}(E,h)$ be the set of connections $\nabla$ in $E$ that preserve $h$. That is, $\mathscr{D}(E,h)$ consits of $\Bbb C$-linear maps $\nabla : \Omega^0(E) \to \Omega^1(E)$ for which
\[\nabla(fs) = df \otimes s + f\nabla s\]and
\[dh(s,t) = h(\nabla s, t) + h(s,\nabla t).\]Recall that such connections extend to connections $\nabla : \Omega^{p,q}(E) \to \Omega^{p+1,q}(E) \oplus \Omega^{p,q+1}(E)$. Now for $\nabla = \nabla^{1,0} + \nabla^{0,1}$, we have that $\nabla^{0,1} \in \mathscr{D}^{0,1}(E)$ and so we have a natural bijection
\[\begin{align*} \mathscr{D}(E,h) &\to \mathscr{D}^{0,1}(E) \\ \nabla &\mapsto \nabla^{0,1}. \end{align*}\]Indeed for $\nabla^{0,1} \in \mathscr{D}^{0,1}(E)$, the map $\nabla^{1,0}$ is determined by
\[\bar{\partial}h(s,t) = h(\nabla^{0,1}s,t) + h(s,\nabla^{1,0}t).\]Let $\operatorname{End}(E,h)$ denote the vector bundle of skew-Hermitian endomorphisms of $(E,h)$. If one fixes a connection $\nabla_0 \in \mathscr{D}(E,h)$, then for every $\nabla \in \mathscr{D}(E,h)$ the difference $\alpha = \nabla - \nabla_0$ satisfies
\[h(\alpha s, t) + h(s,\alpha t) = 0.\]Thus $\alpha$ can be regarded as an element of $\Omega^1(\operatorname{End}(E,h))$. Conversely, given any such form $\alpha$, setting $\nabla = \nabla_0 + \alpha$ gives an element in $\mathscr{D}(E,h)$. It follows that $\mathscr{D}(E,h)$ is an affine space over $\Omega^1(\operatorname{End}(E,h))$. Let $\mathscr{H}(E,h)$ denote the subset of $\mathscr{D}(E,h)$ consisting of elements with $\nabla = \nabla^{1,0} + \nabla^{0,1}$ such that
\[\nabla^{0,1} \circ \nabla^{0,1} = 0.\]That is,
\[\mathscr{H}(E,h) = \{\nabla \in \mathscr{D}(E,h) \mid \nabla^{0,1} \in \mathscr{H}^{0,1}(E)\}.\]It’s not too hard to verify that we again obtain a bijection $\mathscr{H}(E,h) \to \mathscr{H}^{0,1}(E)$. Putting two and two together we have the following diagram:
\[\xymatrix{ \mathscr{D}(E,h) \ar@{<->}[r] & \mathscr{D}^{0,1}(E) \\ \mathscr{H}(E,h) \ar@{->}[u] \ar@{<->}[r] & \mathscr{H}^{0,1}(E). \ar@{->}[u] }\]Note that for $f \in \mathrm{U}(E,h)$ and $\nabla \in \mathscr{H}(E,h)$ we have
\[(f^{-1} \circ \nabla \circ f)^2 = 0\]and so the action of $\mathrm{U}(E,h)$ on $\mathscr{D}(E,h)$ by conjugation preserves $\mathscr{H}(E,h)$. Since there is a one-to-one correspondence between $\mathscr{D}(E,h)$ and $\mathscr{D}^{0,1}(E)$, the group $\mathrm{GL}(E)$ acting on $\mathscr{D}^{0,1}(E)$ must act also on $\mathscr{D}(E,h)$. This action is given by
\[(f,\nabla) \mapsto f^\ast \circ \nabla^{1,0} \circ (f^{\ast})^{-1} + f^{-1} \circ \nabla^{0,1} \circ f.\]Here $f^\ast$ is the adjoint of $f$. Now since the action of $\mathrm{GL}(E)$ on $\mathscr{D}^{0,1}(E)$ preserves $\mathscr{H}^{0,1}(E)$, the action on $\mathscr{D}(E,h)$ will also preserve $\mathscr{H}(E,h)$.
The action of $\mathrm{U}(E,h)$ on $\mathscr{D}(E,h)$ or $\mathscr{H}(E,h)$ is not effective. For a unitary automorphism $f$, the equality
\[f^\ast \circ \nabla^{1,0} \circ (f^{\ast})^{-1} + f^{-1} \circ \nabla^{0,1} \circ f = \nabla\]yields $\nabla \circ f = f \circ \nabla$, i.e. $f$ is parallel with respect to $\nabla$. We obtain the following proposition.
What we would like to achieve here is that the spaces $\mathscr{D}(E,h)/\mathrm{U}(E,h)$ and $\mathscr{H}(E,h)/\mathrm{U}(E,h)$ were Hausdorff. To do so there is one more thing we need to consider. Is the action of $\mathrm{U}(E,h)$ to these spaces proper?
Let’s then see what kind of data we obtain if we give $M$ more structure. Suppose that $M$ is a compact Kähler manifold with a Kähler metric $g$. Let $\mathscr{E}(E,h)$ denote the subset of $\mathscr{H}(E,h)$ consisting of Hermite-Einstein connections $\nabla$. That is,
\[\mathscr{E}(E,h) = \{ \nabla \in \mathscr{H}(E,h) \mid i\Lambda F_\nabla = \lambda \operatorname{id}_E\}.\]The first thing to look at is whether or not the action of $\mathrm{U}(E,h)$ on $\mathscr{D}(E,h)$ preserves $\mathscr{E}(E,h)$. Note that for $\nabla \in \mathscr{D}(E,h)$ and $f \in \mathrm{U}(E,h)$ we have that
\[\begin{align*} F_{f^{-1} \circ \nabla \circ f} &= (f^{-1} \circ \nabla \circ f) \circ (f^{-1} \circ \nabla \circ f) \\ &= f^{-1} \circ \nabla \circ \nabla \circ f \\ &= f^{-1} \circ F_\nabla \circ f. \end{align*}\]Using this we obtain
\[\begin{align*} i\Lambda F_{f^{-1} \circ \nabla \circ f} &= i\Lambda(f^{-1} \circ F_\nabla \circ f) \\ &= f^{-1} \circ i\Lambda F_\nabla \circ f. \end{align*}\]It follows that the action of $\mathrm{U}(E,h)$ on $\mathscr{D}(E,h)$ preserves $\mathscr{E}(E,h)$. Also the natural map
\[\mathscr{E}(E,h)/\mathrm{U}(E,h) \to \mathscr{H}(E,h)/\mathrm{U}(E,h)\]is injective for obvious reasons. The uniqueness of Hermite-Einstein connections guarantees that even the map
\[\mathscr{E}(E,h)/\mathrm{U}(E,h) \to \mathscr{H}^{0,1}(E,h)/\mathrm{GL}(E)\]is injective. The space $\mathscr{E}(E,h)/\mathrm{U}(E,h)$ is called the moduli space of Hermite-Einstein structures in $E$.
The next thing we aim to show is that the moduli space of Hermite-Einstein connections $\mathscr{E}(E,h)/\mathrm{U}(E,h)$ is open in the moduli space $\mathscr{H}^{0,1}(E)/\mathrm{GL}(E)$ of stable holomorphic structures on $E$.
The surjectivity of $\mathscr{E}(E,h)/\mathrm{U}(E,h) \to \mathscr{H}^{0,1}(E,h)/\mathrm{GL}(E)$ was open for quite a while, but eventually resolved by the Kobayashi–Hitchin correspondence. There was though, some evidence that this should hold:
Next time we’ll look at an elliptic complex that arises from the infinitesimal deformations of a Hermitian-Einstein structure and compute its index.