Open Access Article

Geometric Theory of Heat from Souriau Lie Groups Thermodynamics and Koszul Hessian Geometry: Applications in Information Geometry for Exponential Families

Frédéric Barbaresco

Advanced Radar Concepts Business Unit, Thales Air Systems, Limours 91470, France

Entropy 2016, 18(11), 386; https://doi.org/10.3390/e18110386

Submission received: 4 August 2016 / Revised: 17 September 2016 / Accepted: 27 September 2016 / Published: 4 November 2016

(This article belongs to the Special Issue Differential Geometrical Theory of Statistics)

Abstract

We introduce the symplectic structure of information geometry based on Souriau’s Lie group thermodynamics model, with a covariant definition of Gibbs equilibrium via invariances through the co-adjoint action of a group on its moment space, defining physical observables such as energy, heat, and moment as purely geometrical objects. Using the geometric Planck temperature of the Souriau model and the notion of symplectic cocycle, the Fisher metric is identified as a Souriau geometric heat capacity. The Souriau model is based on the affine representation of a Lie group and its Lie algebra, which we compare with Koszul’s works on G/K homogeneous spaces and the bijective correspondence between the set of G-invariant flat connections on G/K and the set of affine representations of the Lie algebra of G. In the framework of Lie group thermodynamics, an Euler-Poincaré equation is elaborated with respect to thermodynamic variables, and a new variational principle for thermodynamics is built through an invariant Poincaré-Cartan-Souriau integral. The Souriau-Fisher metric is linked to the KKS (Kostant–Kirillov–Souriau) 2-form that associates a canonical homogeneous symplectic manifold to the co-adjoint orbits. We apply this model in the framework of information geometry to the action of an affine group for exponential families, and provide some illustrative use cases for multivariate Gaussian densities. Information geometry is presented in the context of the seminal work of Fréchet and his Clairaut-Legendre equation. The Souriau model of statistical physics is validated as compatible with the Balian gauge model of thermodynamics. We recall the precursor work of Casalis on affine group invariance for natural exponential families.

Keywords:

Lie group thermodynamics; moment map; Gibbs density; Gibbs equilibrium; maximum entropy; information geometry; symplectic geometry; Cartan-Poincaré integral invariant; geometric mechanics; Euler-Poincaré equation; Fisher metric; gauge theory; affine group

When the fact that one encounters is in opposition to a prevailing theory, one must accept the fact and abandon the theory, even when the latter, supported by great names, is generally adopted
—Claude Bernard in “Introduction à l’Étude de la Médecine Expérimentale” [1]

At the outset, the theory of structural stability had seemed to me of such scope and such generality that with it I could hope, in a way, to replace thermodynamics by geometry, to geometrize thermodynamics in a certain sense, to eliminate from thermodynamic considerations all aspects of a measurable and stochastic character, so as to keep only the corresponding geometric characterization of the attractors.
—René Thom in “Logos et théorie des Catastrophes” [2]

1. Introduction

This MDPI Entropy Special Issue on “Differential Geometrical Theory of Statistics” collects a limited number of selected invited and contributed talks presented during the GSI’15 conference on “Geometric Science of Information” in October 2015. This paper is an extended version of the paper [3] “Symplectic Structure of Information Geometry: Fisher Metric and Euler-Poincaré Equation of Souriau Lie Group Thermodynamics” published in the GSI’15 Proceedings. At the GSI’15 conference, a special session on “Lie groups and geometric mechanics/thermodynamics”, dedicated to Jean-Marie Souriau’s works in statistical physics, was organized by Gery de Saxcé and Frédéric Barbaresco, and Charles-Michel Marle gave an invited talk on “Actions of Lie groups and Lie algebras on symplectic and Poisson manifolds. Application to Lagrangian and Hamiltonian systems”, addressing “Souriau’s thermodynamics of Lie groups”. In honor of Jean-Marie Souriau, who died in 2012, and Claude Vallée [4,5,6], who passed away in 2015, this Special Issue publishes three papers on Souriau’s thermodynamics: Marle’s paper “From Tools in Symplectic and Poisson Geometry to Souriau’s Theories of Statistical Mechanics and Thermodynamics” [7], de Saxcé’s paper “Link between Lie Group Statistical Mechanics and Thermodynamics of Continua” [8], and the present paper by Barbaresco. This paper also proposes new developments compared to paper [9], where the relations between the Souriau and Koszul models were first explored.

This paper, like those of Marle and de Saxcé in this Special Issue, is intended to honor the memory of the French physicist Jean-Marie Souriau and to popularize his currently little-known works on statistical physics and thermodynamics. Souriau is well known for his seminal and major contributions to geometric mechanics, the discipline he created in the 1960s from Lagrange’s earlier works, which he conceptualized in the framework of symplectic geometry. Very few people, however, know or have exploited Souriau’s works contained in Chapter IV of his book “Structure des systèmes dynamiques”, published in 1970 [10] and only translated into English in 1995 as “Structure of Dynamical Systems: A Symplectic View of Physics” [11], in which he applied the formalism of geometric mechanics to statistical physics. The author’s personal contribution is to place the work of Souriau in the broader context of the emerging “Geometric Science of Information” [12] (addressed at the GSI’15 conference): the author will show that the Souriau model of statistical physics is particularly well adapted to generalize “information geometry”, which is illustrated for exponential families of densities and multivariate Gaussian densities. The author will observe that the Riemannian metric introduced by Souriau is a generalization of the Fisher metric used in “information geometry”, being identified with the hessian of the logarithm of the generalized partition function (Massieu characteristic function), for the case of densities on homogeneous manifolds where a non-abelian group acts transitively. For the group of time translations, we recover classical thermodynamics, and for the Euclidean space, we recover the classical Fisher metric used in statistics.
The author elaborates a new Euler-Poincaré equation for Souriau’s thermodynamics, acting on the “geometric heat” variable Q (element of the dual Lie algebra) and parameterized by the “geometric temperature” (element of the Lie algebra). The author will integrate Souriau thermodynamics into a variational model by defining an extended Cartan-Poincaré integral invariant based on the Souriau “geometric characteristic function” (the logarithm of the generalized Souriau partition function parameterized by geometric temperature). These results are illustrated for multivariate Gaussian densities, where the associated group is identified in order to compute a Souriau moment map and reduce the Euler-Poincaré equation of geodesics. In addition, the symplectic cocycle and the Souriau-Fisher metric are deduced from the Lie group thermodynamics model.

The main contributions of the author in this paper are the following:

The Souriau model of Lie group thermodynamics is presented with the standard notations of Lie group theory, in place of Souriau’s equations, which use less classical conventions (these have limited the understanding of his work by his contemporaries).
We prove that the Souriau Riemannian metric introduced with the symplectic cocycle is a generalization of the Fisher metric (called the Souriau-Fisher metric in the following) that preserves the property of being defined as a hessian of the logarithm of the partition function, $g_{β} = - \frac{\partial^{2} Φ}{\partial β^{2}} = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}$, as in classical information geometry. We then establish the equality of two terms: the first given by Souriau’s definition from the Lie group cocycle $Θ$, parameterized by the “geometric heat” Q (element of the dual Lie algebra) and the “geometric temperature” β (element of the Lie algebra), and the second, the hessian of the characteristic function $Φ (β) = - \log ψ_{Ω} (β)$ with respect to the variable β:

$g_{β} ([β, Z_{1}], [β, Z_{2}]) = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 Q, [Z_{1}, [β, Z_{2}]] 〉 = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}$

(1)

A dual Souriau-Fisher metric, the inverse of the previous one, can also be built from the hessian of the “geometric entropy” $s (Q)$ with respect to the variable Q: $\frac{\partial^{2} s (Q)}{\partial Q^{2}}$.
For the maximum entropy density (Gibbs density), the following three terms coincide: $\frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}$, which describes the convexity of the log-likelihood function; $I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}]$, the Fisher metric, which describes the covariance of the log-likelihood gradient; and $I (β) = E [(ξ - Q) {(ξ - Q)}^{T}] = V a r (ξ)$, which describes the covariance of the observables.
This Souriau-Fisher metric is also identified as proportional to the first derivative of the heat, $g_{β} = - \frac{\partial Q}{\partial β}$, and is then comparable by analogy to a geometric “specific heat” or “calorific capacity”.
We observe that the Souriau metric is invariant with respect to the action of the group, $I (A d_{g} (β)) = I (β)$, because the characteristic function $Φ (β)$ after the action of the group changes only by a term that is linear in $β$. As the Fisher metric is proportional to the hessian of the characteristic function, we have the following invariance:

$I (A d_{g} (β)) = - \frac{\partial^{2} (Φ - 〈 θ (g^{- 1}), β 〉)}{\partial β^{2}} = - \frac{\partial^{2} Φ}{\partial β^{2}} = I (β)$

(2)
We have proposed, based on Souriau’s Lie group model and on the analogy with mechanical variables, a variational principle of thermodynamics deduced from the Poincaré-Cartan integral invariant. The variational principle holds on the Lie algebra $g$, for variations $δ β = \dot{η} + [β, η]$, where $η (t)$ is an arbitrary path that vanishes at the endpoints, $η (t_{0}) = η (t_{1}) = 0$:

$δ \int_{t_{0}}^{t_{1}} Φ (β (t)) \cdot d t = 0$

(3)

where the Poincaré-Cartan integral invariant $\int_{C_{a}} Φ (β) \cdot d t = \int_{C_{b}} Φ (β) \cdot d t$ is defined with $Φ (β)$ , the Massieu characteristic function, with the 1-form $ω = Φ (β) \cdot d t = (〈 Q, β 〉 - s) \cdot d t = 〈 Q, (β \cdot d t) 〉 - s \cdot d t$
We have deduced Euler-Poincaré equations for the Souriau model:

$\begin{array}{l} \frac{d Q}{d t} = a d_{β}^{*} Q \text{ and } \begin{cases} s (Q) = 〈 β, Q 〉 - Φ (β) \\ β = \frac{\partial s (Q)}{\partial Q} \in g, \; Q = \frac{\partial Φ (β)}{\partial β} \in g^{*} \end{cases} \text{ and } \frac{d}{d t} (A d_{g}^{*} Q) = 0 \\ \text{with } \begin{cases} g^{*} : \text{dual Lie algebra} \\ a d_{X}^{*} Y : \text{coadjoint operator} \end{cases} \end{array}$

(4)

where $Q$ is the Souriau geometric heat (element of the dual Lie algebra) and $β$ is the Souriau geometric temperature (element of the Lie algebra). The second equation is linked to Souriau’s result, based on the moment map, that a homogeneous symplectic manifold is always an affine coadjoint orbit of its group of Hamiltonian transformations (a symplectic manifold that is homogeneous under the action of a Lie group is isomorphic, up to a covering, to a coadjoint orbit; symplectic leaves are the orbits of the affine action that makes the moment map equivariant).
We have established that the affine representation of Lie groups and Lie algebras by Jean-Marie Souriau is equivalent to Jean-Louis Koszul’s affine representation, developed in the framework of the hessian geometry of sharp convex cones. Both Souriau and Koszul derived the equations that the Lie group and Lie algebra must satisfy to ensure the existence of an affine representation. We have compared the two approaches in a table.
We have applied the Souriau model for exponential families and especially for multivariate Gaussian densities.
We have applied the Gibbs density of the Souriau-Koszul model to compute the maximum entropy density for symmetric positive definite matrices, using the inner product $〈 η, ξ 〉 = T r (η^{T} ξ)$, $\forall η, ξ \in S y m (n)$, given by the Cartan-Killing form. The Gibbs density (a generalization of the Gaussian law for these matrices, defined as the maximum entropy density) is:

$p_{\hat{ξ}} (ξ) = e^{- 〈 Θ^{- 1} (\hat{ξ}), ξ 〉 + Φ (Θ^{- 1} (\hat{ξ}))} = ψ_{Ω} (I_{d}) \cdot [\det (α {\hat{ξ}}^{- 1})] \cdot e^{- T r (α {\hat{ξ}}^{- 1} ξ)} \text{ with } α = \frac{n + 1}{2}$

(5)
For the case of multivariate Gaussian densities, we have considered $G A (n)$ a sub-group of affine group, that we defined by a (n + 1) × (n + 1) embedding in matrix Lie group $G_{a f f}$ , and that acts for multivariate Gaussian laws by:

$[\begin{matrix} Y \\ 1 \end{matrix}] = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] [\begin{matrix} X \\ 1 \end{matrix}] = [\begin{matrix} R^{1 / 2} X + m \\ 1 \end{matrix}], \quad \begin{cases} (m, R) \in \mathbb{R}^{n} \times S y m^{+} (n) \\ M = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] \in G_{a f f} \end{cases}, \quad X \sim \mathcal{N} (0, I) \to Y \sim \mathcal{N} (m, R)$

(6)
For multivariate Gaussian densities, as we have identified the acting sub-group of affine group $M$ , we have also developed the computation of the associated Lie algebras $η_{L}$ and $η_{R}$ , adjoint and coadjoint operators, and especially the Souriau “moment map” $Π_{R}$ :

$\begin{array}{l} 〈 η_{L}, M^{- 1} η_{R} M 〉 = 〈 Π_{R}, η_{R} 〉 \\ \text{with } M = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}], \; η_{L} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & R^{- 1 / 2} \dot{m} \\ 0 & 0 \end{matrix}] \text{ and } η_{R} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & \dot{m} - R^{- 1 / 2} {\dot{R}}^{1 / 2} \dot{m} \\ 0 & 0 \end{matrix}] \\ \Rightarrow Π_{R} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} + R^{- 1} \dot{m} m^{T} & R^{- 1} \dot{m} \\ 0 & 0 \end{matrix}] \end{array}$

(7)

Using Souriau Theorem (geometrization of Noether theorem), we use the property that this moment map $Π_{R}$ is constant (its components are equal to Noether invariants):

$\frac{d Π_{R}}{d t} = 0 \Rightarrow \begin{cases} R^{- 1} \dot{R} + R^{- 1} \dot{m} m^{T} = B = \text{const} \\ R^{- 1} \dot{m} = b = \text{const} \end{cases}$

(8)

to reduce the Euler-Lagrange equation of geodesics between two multivariate Gaussian densities:

$\begin{cases} \ddot{R} + \dot{m} {\dot{m}}^{T} - \dot{R} R^{- 1} \dot{R} = 0 \\ \ddot{m} - \dot{R} R^{- 1} \dot{m} = 0 \end{cases}$

(9)

to this reduced equation of geodesics:

$\begin{cases} \dot{m} = R b \\ \dot{R} = R (B - b m^{T}) \end{cases}$

(10)

that we solve by a “geodesic shooting” technique based on the Eriksen equation of the exponential map.
For the families of multivariate Gaussian densities, that we have identified as homogeneous manifold with the associated sub-group of the affine group $[\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}]$ , we have considered the elements of exponential families, that play the role of geometric heat $Q$ in Souriau Lie group thermodynamics, and $β$ the geometric (Planck) temperature:

$Q = \hat{ξ} = [\begin{matrix} E [z] \\ E [z z^{T}] \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}], β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}]$

(11)

We have considered that these elements are homeomorphic to the (n + 1) × (n + 1) matrix elements:

$Q = \hat{ξ} = [\begin{matrix} R + m m^{T} & m \\ 0 & 0 \end{matrix}] \in g^{*}, β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] \in g$

(12)

to compute the Souriau symplectic cocycle of the Lie group:

$θ (M) = \hat{ξ} (A d_{M} (β)) - A d_{M}^{*} \hat{ξ}$

(13)

where the adjoint operator is equal to:

$A d_{M} β = [\begin{matrix} \frac{1}{2} Ω^{- 1} & - Ω^{- 1} n \\ 0 & 0 \end{matrix}] \text{ with } Ω = R'^{1 / 2} R R'^{- 1 / 2} \text{ and } n = (\frac{1}{2} m' + R'^{1 / 2} m)$

(14)

with

$\hat{ξ} (A d_{M} (β)) = [\begin{matrix} Ω + n n^{T} & n \\ 0 & 0 \end{matrix}]$

(15)

and the co-adjoint operator:

$A d_{M}^{*} \hat{ξ} = [\begin{matrix} R + m m^{T} - m m'^{T} & R'^{1 / 2} m \\ 0 & 0 \end{matrix}]$

(16)
Finally, we have computed the Souriau-Fisher metric $g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])$ for multivariate Gaussian densities, given by:

$\begin{matrix} g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}]) = \tilde{Θ} (Z_{1}, [β, Z_{2}]) + 〈 \hat{ξ}, [Z_{1}, [β, Z_{2}]] 〉 \\ = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 \hat{ξ}, [Z_{1}, [β, Z_{2}]] 〉 \end{matrix}$

(17)

with element of Lie algebra given by $Z = [\begin{matrix} \frac{1}{2} Ω^{- 1} & - Ω^{- 1} n \\ 0 & 0 \end{matrix}]$ .
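The affine action of Equation (6) on Gaussian laws can be checked numerically. The following is a minimal sketch, not the author's implementation: the helper names `sym_sqrt` and `affine_element` and the numerical values of `m` and `R` are hypothetical, and $R^{1/2}$ is assumed to be the symmetric square root obtained from an eigendecomposition.

```python
import numpy as np

def sym_sqrt(R):
    """Symmetric square root of a symmetric positive definite matrix (assumed choice of R^{1/2})."""
    eigvals, eigvecs = np.linalg.eigh(R)
    return (eigvecs * np.sqrt(eigvals)) @ eigvecs.T

def affine_element(m, R):
    """(n+1)x(n+1) matrix [[R^{1/2}, m], [0, 1]] of Equation (6)."""
    n = m.size
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = sym_sqrt(R)
    M[:n, n] = m
    M[n, n] = 1.0
    return M

# Hypothetical mean and covariance, chosen only for illustration.
m = np.array([1.0, -2.0])
R = np.array([[2.0, 0.6], [0.6, 1.0]])
M = affine_element(m, R)

# Action on a single point x, written in homogeneous coordinates [x, 1].
x = np.array([0.3, -0.7])
y = (M @ np.append(x, 1.0))[:2]

# Push-forward of N(0, I): the mean is the translation block,
# the covariance is A I A^T with A the linear block.
A = M[:2, :2]
mean_Y = M[:2, 2]
cov_Y = A @ A.T

print(np.allclose(y, sym_sqrt(R) @ x + m))
print(np.allclose(mean_Y, m), np.allclose(cov_Y, R))
```

The sketch only verifies the first two moments of the transported law; it does not touch the moment map or the cocycle computations of Equations (7) to (17).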

The plan of the paper is as follows. After this introduction in Section 1, we develop in Section 2 the position of the Souriau symplectic model of statistical physics in the historical development of thermodynamic concepts. In Section 3, we develop and revisit the Lie group thermodynamics model of Jean-Marie Souriau in modern notations. In Section 4, we make the link between the Souriau Riemannian metric and the Fisher metric, defined as a geometric heat capacity of Lie group thermodynamics. In Section 5, we elaborate the Euler-Poincaré equations of Lie group thermodynamics and a variational model based on the Poincaré-Cartan integral invariant. In Section 6, we explore the Souriau affine representation of Lie groups and Lie algebras (including the notions of affine representations and cocycles, Souriau moment map and cocycles, equivariance of the Souriau moment map, action of a Lie group on a symplectic manifold, and dual spaces of finite-dimensional Lie algebras), and we analyze the links and parallels with the Koszul affine representation, developed in another context (the comparison is summarized in a table). In Section 7, we illustrate the Koszul and Souriau Lie group models of information geometry for multivariate Gaussian densities. In Section 8, after identifying the affine group acting on these densities, we compute the Souriau moment map to obtain the Euler-Poincaré equation, solved by a geodesic shooting method. In Section 9, the Souriau Riemannian metric defined by the cocycle is computed for multivariate Gaussian densities. We give a conclusion in Section 10 with research prospects in the framework of affine Poisson geometry [13], Bismut stochastic mechanics [14] and the second order extension of the Gibbs state [15,16].
We have three appendices: Appendix A develops the Clairaut(-Legendre) equation of Maurice Fréchet associated with “distinguished functions” as a seminal equation of information geometry; Appendix B is about the Balian gauge model of thermodynamics and its compliance with the Souriau model; Appendix C is devoted to the link between Casalis-Letac’s works on affine group invariance for natural exponential families and Souriau’s works.

2. Position of Souriau Symplectic Model of Statistical Physics in Historical Developments of Thermodynamic Concepts

In this section, we explain the emergence of the thermodynamic concepts that gave rise to the generalization achieved by the Souriau model of statistical physics. To understand Souriau’s theoretical model of heat, we first have to consider his work in geometric mechanics, where he introduced the concepts of “moment map” and “symplectic cohomology”. We will then introduce the concept of “characteristic function” developed by François Massieu and generalized by Souriau on homogeneous symplectic manifolds. In his statistical physics model, Souriau also generalized the notion of “heat capacity”, which was initially extended by Pierre Duhem as a key structure to consider mechanics and thermodynamics jointly under the umbrella of the same theory. Pierre Duhem also integrated Massieu’s characteristic function into the corpus as a thermodynamic potential. Souriau’s idea of developing a covariant model of Gibbs density on homogeneous manifolds was also influenced by the seminal work of Constantin Carathéodory, who axiomatized thermodynamics in 1909 based on Carnot’s works. Souriau adapted his geometric mechanical model to the theory of heat, succeeding where Henri Poincaré had not in his paper on attempts at a mechanical explanation of the principles of thermodynamics.

Lagrange’s works on “mécanique analytique (analytic mechanics)” have been interpreted by Jean-Marie Souriau in the framework of differential geometry, initiating a new discipline called, after Souriau, “mécanique géométrique (geometric mechanics)” [17,18,19]. Souriau observed that the collection of motions of a dynamical system is a manifold with an antisymmetric flat tensor that is a symplectic form, whose structure contains all the pertinent information on the state of the system (positions, velocities, forces, etc.). Souriau said: “Ce que Lagrange a vu, que n’a pas vu Laplace, c’était la structure symplectique” (What Lagrange saw, that Laplace didn’t see, was the symplectic structure) [20]. Using the symmetries of a symplectic manifold, Souriau introduced a mapping which he called the “moment map” [21,22,23], which takes its values in a space attached to the group of symmetries (the dual space of its Lie algebra). He [10] called dynamical groups every finite-dimensional group of symplectomorphisms (an isomorphism between symplectic manifolds, a transformation of phase space that is volume-preserving), and introduced the Galileo group for classical mechanics and the Poincaré group for relativistic mechanics (both are sub-groups of the affine group [24,25]). For instance, the Galileo group can be represented in matrix form by (with A the rotation, b the boost, c the space translation and e the time translation):

$[\begin{matrix} x' \\ t' \\ 1 \end{matrix}] = \underset{\text{Galileo group}}{[\begin{matrix} A & b & c \\ 0 & 1 & e \\ 0 & 0 & 1 \end{matrix}]} [\begin{matrix} x \\ t \\ 1 \end{matrix}] \text{ with } \begin{cases} A \in S O (3) \\ b, c \in \mathbb{R}^{3} \\ e \in \mathbb{R} \end{cases}, \quad \text{Lie algebra } [\begin{matrix} ω & η & γ \\ 0 & 0 & ε \\ 0 & 0 & 0 \end{matrix}] \text{ with } \begin{cases} ω \in s o (3) \\ η, γ \in \mathbb{R}^{3} \\ ε \in \mathbb{R}^{+} \end{cases}$

(18)
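The matrix form of Equation (18) can be exercised numerically. The sketch below is illustrative only: the helper names `galilei_matrix`, `rotation_z` and `act` and all numerical values are hypothetical. It builds two group elements, applies one to an event $(x, t)$, and checks that the product of two elements keeps the same block structure.

```python
import numpy as np

def galilei_matrix(A, b, c, e):
    """5x5 matrix [[A, b, c], [0, 1, e], [0, 0, 1]] of Equation (18)."""
    M = np.eye(5)
    M[:3, :3] = A   # rotation
    M[:3, 3] = b    # boost
    M[:3, 4] = c    # space translation
    M[3, 4] = e     # time translation
    return M

def rotation_z(angle):
    """Rotation about the z axis, an element of SO(3)."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def act(M, x, t):
    """Apply a group element to the event (x, t) in homogeneous coordinates."""
    y = M @ np.concatenate([x, [t, 1.0]])
    return y[:3], y[3]

# Two hypothetical group elements.
g1 = galilei_matrix(rotation_z(0.3), b=[1.0, 0.0, 0.0], c=[0.0, 2.0, 0.0], e=0.5)
g2 = galilei_matrix(rotation_z(-1.1), b=[0.0, 0.5, 0.0], c=[1.0, 1.0, 1.0], e=2.0)

x, t = np.array([1.0, 2.0, 3.0]), 4.0
x_new, t_new = act(g1, x, t)   # x' = A x + b t + c,  t' = t + e

# Closure: the product has the same zero/one pattern and a rotation block.
product = g1 @ g2
print(x_new, t_new)
print(product[3:])
```

The last two rows of `product` show that time translations add under composition, while the upper-left block remains a rotation.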

Souriau associated with this moment map the notion of symplectic cohomology, linked to the fact that such a moment is defined up to an additive constant, which brings into play an algebraic mechanism (called cohomology). Souriau proved that the moment map is a constant of the motion, and provided a geometric generalization of Emmy Noether’s invariant theorem (the invariants of Noether’s theorem are the components of the moment map). For instance, Souriau gave an ontological definition of mass in classical mechanics as the measure of the symplectic cohomology of the action of the Galileo group (the mass is no longer an arbitrary variable but a characteristic of the space). This is no longer true for the Poincaré group in relativistic mechanics, where the symplectic cohomology is null, explaining the lack of conservation of mass in relativity. All the details of classical mechanics thus appear as geometric necessities, as ontological elements. Souriau also observed that the symplectic structure can be reconstructed from its symmetries alone, through a 2-form (called the Kirillov–Kostant–Souriau form) defined on coadjoint orbits. Souriau said that the different versions of mechanical science can be classified by the geometry that each implies for space and time; this geometry is determined by the covariance group of the theory. Thus, Newtonian mechanics is covariant under the Galileo group, relativity under the Poincaré group, and general relativity under the “smooth” group (the group of diffeomorphisms of space-time). Souriau added: “However, there are some statements of mechanics whose covariance belongs to a fourth group rarely considered: the affine group, a group shown in the following diagram for inclusion. How is it possible that a unitary point of view (which would be necessarily a true thermodynamics), has not yet come to crown the picture? Mystery...” [26]. See Figure 1.

As early as 1966, Souriau applied his theory to statistical mechanics, developed it in Chapter IV of his book “Structure of Dynamical Systems” [11], and elaborated what he called a “Lie group thermodynamics” [10,11,27,28,29,30,31,32,33,34,35,36,37]. Using Lagrange’s viewpoint, in Souriau’s statistical mechanics a statistical state is a probability measure on the manifold of motions (and no longer in phase space [38]). Souriau observed that Gibbs equilibrium [39] is not covariant with respect to the dynamic groups of physics. To resolve this breaking of symmetry, Souriau introduced a new “geometric theory of heat” where the equilibrium states are indexed by a parameter $β$ with values in the Lie algebra of the group, generalizing the Gibbs equilibrium states, where $β$ plays the role of a geometric (Planck) temperature. The invariance with respect to the group, and the fact that the entropy $s$ is a convex function of this geometric temperature $β$, impose very strict, universal conditions (e.g., there necessarily exists a critical temperature beyond which no equilibrium can exist). Souriau observed that the group of time translations of classical thermodynamics [40,41] is not a normal subgroup of the Galilei group, proving that if a dynamical system is conservative in one inertial reference frame, it need not be conservative in another. Based on this fact, Souriau generalized the formulation of the Gibbs principle to become compatible with Galileo relativity in classical mechanics and with Poincaré relativity in relativistic mechanics. The maximum entropy principle [42,43,44,45,46,47,48,49,50,51] is preserved, and the Gibbs density is given by the density of maximum entropy (among the equilibrium states for which the average value of the energy takes a prescribed value, the Gibbs measures are those which have the largest entropy), but with a new principle: “If a dynamical system is invariant under a Lie subgroup G’ of the Galileo group, then the natural equilibria of the system form the Gibbs ensemble of the dynamical group G’” [10]. The classical notion of Gibbs canonical ensemble is extended to a homogeneous symplectic manifold on which a Lie group (dynamic group) has a symplectic action. When the group is not abelian (non-commutative group), the symmetry is broken, and new “cohomological” relations should be verified in the Lie algebra of the group [52,53,54,55]. A natural equilibrium state will thus be characterized by an element of the Lie algebra of the Lie group, determining the equilibrium temperature $β$. The entropy $s (Q)$, parametrized by the geometric heat $Q$ (mean of the energy $U$, element of the dual Lie algebra), is defined by the Legendre transform [56,57,58,59] of the Massieu potential $Φ (β)$ parametrized by $β$ ($Φ (β)$ is the minus logarithm of the partition function $ψ_{Ω} (β)$):

$s (Q) = 〈 β, Q 〉 - Φ (β) \text{ with } \begin{cases} Q = \frac{\partial Φ}{\partial β} \in g^{*} \\ β = \frac{\partial s}{\partial Q} \in g \end{cases}$

(19)

$\begin{array}{l} p_{\mathrm{Gibbs}} (ξ) = e^{Φ (β) - 〈 β, U (ξ) 〉} = \frac{e^{- 〈 β, U (ξ) 〉}}{\int_{M} e^{- 〈 β, U (ξ) 〉} d ω}, \quad Q = \frac{\partial Φ (β)}{\partial β} = \frac{\int_{M} U (ξ) e^{- 〈 β, U (ξ) 〉} d ω}{\int_{M} e^{- 〈 β, U (ξ) 〉} d ω} = \int_{M} U (ξ) p (ξ) d ω \\ \text{with } Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d ω \end{array}$

(20)
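Equations (19) and (20) can be checked on a toy example. The sketch below makes simplifying assumptions that are not in the text: the manifold $M$ is replaced by four discrete states with unit weights, the observable $U$ takes hypothetical values in $\mathbb{R}^{2}$, and the pairing $〈 β, U 〉$ is the Euclidean dot product (an abelian case, so no cocycle appears). It verifies numerically that $Q = \partial Φ / \partial β$, that $s = 〈 β, Q 〉 - Φ$, and that $- \partial Q / \partial β$ equals the covariance of $U$.

```python
import numpy as np

# Hypothetical observable U(xi) on four discrete states (rows), with values in R^2.
U = np.array([[0.0, 1.0], [1.0, 0.5], [2.0, -1.0], [0.5, 2.0]])
beta = np.array([0.7, 0.3])  # hypothetical "temperature" parameter

def massieu(beta):
    """Phi(beta) = -log sum_k exp(-<beta, U_k>), the discrete analogue of Equation (20)."""
    return -np.log(np.sum(np.exp(-U @ beta)))

def gibbs(beta):
    """Gibbs probabilities p_k = exp(Phi(beta) - <beta, U_k>)."""
    return np.exp(massieu(beta) - U @ beta)

def heat(beta):
    """Q = sum_k U_k p_k, the mean of the observable."""
    return gibbs(beta) @ U

def num_grad(f, x, h=1e-5):
    """Central finite differences; row i holds the derivative with respect to x[i]."""
    rows = []
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = h
        rows.append((f(x + step) - f(x - step)) / (2 * h))
    return np.array(rows)

p = gibbs(beta)
Q = heat(beta)
entropy = -np.sum(p * np.log(p))
covariance = (U - Q).T @ (p[:, None] * (U - Q))

print(np.allclose(num_grad(massieu, beta), Q, atol=1e-7))          # Q = dPhi/dbeta
print(np.allclose(entropy, beta @ Q - massieu(beta)))              # Legendre relation (19)
print(np.allclose(-num_grad(heat, beta), covariance, atol=1e-6))   # -dQ/dbeta = Var(U)
```

Finite differences are used here only to keep the sketch dependency-free; an automatic differentiation library would serve equally well.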

Souriau completed his “geometric heat theory” by introducing a 2-form in the Lie algebra, that is a Riemannian metric tensor in the values of the adjoint orbit of $β$, $[β, Z]$ with $Z$ an element of the Lie algebra. This metric is given for $(β, Q)$ by:

$g_{β} ([β, Z_{1}], [β, Z_{2}]) = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 Q, [Z_{1}, [β, Z_{2}]] 〉$

(21)

where $Θ$ is a cocycle of the Lie algebra, defined by $Θ = T_{e} θ$ with $θ$ a cocycle of the Lie group defined by $θ (M) = Q (A d_{M} (β)) - A d_{M}^{*} Q$. We have observed that this metric $g_{β}$ is also given by the hessian of the Massieu potential, $g_{β} = - \frac{\partial^{2} Φ}{\partial β^{2}} = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}$, as the Fisher metric in classical information geometry theory [60], and so it is a generalization of the Fisher metric for homogeneous manifolds. We call this new metric the Souriau-Fisher metric. As $g_{β} = - \frac{\partial Q}{\partial β}$, Souriau compared it by analogy with classical thermodynamics to a “geometric specific heat” (geometric calorific capacity).
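To make the heat-capacity analogy concrete, the scalar case can be worked out explicitly. This is a sketch under assumptions not stated in the text: the group is reduced to time translations, $β = \frac{1}{T}$ is a scalar (units in which the Boltzmann constant equals 1), and $C = \frac{\partial Q}{\partial T}$ denotes the classical heat capacity, a symbol introduced here for illustration only:

$g_{β} = - \frac{\partial Q}{\partial β} = - \frac{\partial Q}{\partial T} \frac{\partial T}{\partial β} = T^{2} \frac{\partial Q}{\partial T} = T^{2} C \quad \text{since } \frac{\partial T}{\partial β} = - \frac{1}{β^{2}} = - T^{2}$

Under these assumptions, positivity of the metric $g_{β}$ corresponds to positivity of the heat capacity $C$.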

The potential theory of thermodynamics and the introduction of the “characteristic function” (the function $Φ (β) = - \log ψ_{Ω} (β)$ above in Souriau’s theory) were initiated by François Jacques Dominique Massieu [61,62,63,64]. Massieu was the son of Pierre François Marie Massieu and Thérèse Claire Castel. He married Mlle Morand in 1862 and had two children. He graduated from the Ecole Polytechnique in 1851 and the Ecole des Mines de Paris in 1856, and joined the “Corps des Mines”. He defended his Ph.D. in 1861 on “Sur les intégrales algébriques des problèmes de mécanique” and on “Sur le mode de propagation des ondes planes et la surface de l’onde élémentaire dans les cristaux biréfringents à deux axes” [65] before a jury composed of Lamé, Delaunay and Puiseux. In 1870, François Massieu presented his paper on “characteristic functions of the various fluids and the theory of vapors” [61] to the French Academy of Sciences. The design of the characteristic function is Massieu’s finest scientific achievement. A prominent judge, Joseph Bertrand, did not hesitate to declare, in a statement read to the French Academy of Sciences on 25 July 1870, that “the introduction of this function in formulas that summarize all the possible consequences of the two fundamental theorems seems, for the theory, a service almost equivalent to the one Clausius rendered by linking Carnot’s theorem to entropy” [66]. The final manuscript was published by Massieu in 1873: “Exposé des principes fondamentaux de la théorie mécanique de la chaleur (Note destinée à servir d’introduction au Mémoire de l’auteur sur les fonctions caractéristiques des divers fluides et la théorie des vapeurs)” [63].

Massieu introduced the following potential $Φ (β)$, called the “characteristic function”, as illustrated in Figure 2; it is the potential used by Souriau to generalize the theory: $s (Q) = 〈 β, Q 〉 - Φ (β) \underset{β = \frac{1}{T}}{\Rightarrow} Φ = \frac{Q}{T} - S$. However, in his third paper, Massieu was influenced by M. Bertrand, as illustrated in Figure 3, to replace the variable $β = \frac{1}{T}$ (that he used in his first two papers) by $T$. We then had to wait 50 more years for the paper of Planck, who reintroduced the appropriate variable $β = \frac{1}{T}$, which was then generalized by Souriau, giving the Planck temperature $β$ an ontological and geometric status as an element of the Lie algebra of the dynamic group.

This Lie group thermodynamics of Souriau is able to explain astronomical phenomena (rotation of celestial bodies: the Earth and the stars rotating about themselves). The geometric temperature $β$ can also be interpreted as a space-time vector (generalization of the temperature vector of Planck), where the temperature vector and entropy flux are in duality, unifying heat conduction and viscosity (equations of Fourier and Navier). In the case of a centrifuge system (e.g., used for the enrichment of uranium), the Gibbs equilibrium states [60,67] are given by Souriau’s equations as the variation in concentration of the components of an inhomogeneous gas. Classical statistical mechanics corresponds to the dynamical group of time translations, for which we recover from Souriau’s equations the concepts and principles of classical thermodynamics (temperature, energy, heat, work, entropy, thermodynamic potentials) and of the kinetic theory of gases (pressure, specific heats, Maxwell’s velocity distribution, etc.).

Souriau also studied continuous medium thermodynamics, where the “temperature vector” is no longer constrained to be in the Lie algebra, but only constrained by phenomenological equations (e.g., Navier equations, etc.). For thermodynamic equilibrium, the “temperature vector” is then a Killing vector of space-time. For each point X, there is a “temperature vector”

β (X)

, such that it is an infinitesimal conformal transformation of the metric of the universe

g_{i j}

. Conservation equations can then be deduced for components of impulsion-energy tensor

T^{i j}

and entropy flux

S^{j}

with

{\hat{\partial}}_{i} T^{i j} = 0 \quad \text{and} \quad \partial_{j} S^{j} = 0

. Temperature and metric are related by the following equations:

\begin{array}{l} \begin{cases} {\hat{\partial}}_{i} β_{j} + {\hat{\partial}}_{j} β_{i} = λ g_{i j} \\ \partial_{i} β_{j} + \partial_{j} β_{i} - 2 Γ_{i j}^{k} β_{k} = λ g_{i j} \end{cases} \quad \text{with} \quad \begin{cases} {\hat{\partial}}_{i} : \text{covariant derivative} \\ β_{j} : \text{component of the temperature vector} \end{cases} \\ λ = 0 \Rightarrow \text{Killing equation} \end{array}

(22)
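As a minimal worked check of Equation (22), consider flat space-time in Cartesian coordinates, so that all $Γ_{i j}^{k}$ vanish, and a temperature field that is affine in $X$ with a constant antisymmetric matrix $A$. Both choices, and the symbols $b_{j}$ and $A_{j k}$, are illustrative assumptions and are not taken from Souriau's text:

```latex
\begin{aligned}
β_{j}(X) &= b_{j} + A_{jk} X^{k}, \qquad A_{jk} = -A_{kj} \ \text{(constant)}, \\
\partial_{i} β_{j} &= A_{ji}, \\
\partial_{i} β_{j} + \partial_{j} β_{i} - 2 Γ_{ij}^{k} β_{k} &= A_{ji} + A_{ij} - 0 = 0 = λ g_{ij}
\quad \text{with } λ = 0 .
\end{aligned}
```

Under these assumptions the field satisfies the Killing equation: the constant part $b_{j}$ plays the role of a uniform temperature vector, and the antisymmetric part $A_{j k}$ that of a rigid rotation, in line with the rotating equilibria mentioned above.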

Leon Brillouin made the link between Boltzmann entropy and the negentropy of information theory [68,69,70,71], but before Jean-Marie Souriau, only Constantin Carathéodory and Pierre Duhem [72,73,74,75] had initiated theoretical work to generalize thermodynamics.

After three years as lecturer at Lille university, Duhem published a paper in the official revue of the Ecole Normale Supérieure, in 1891, “On general equations of thermodynamics” [72] (Sur les équations générales de la Thermodynamique) in Annales Scientifiques de l’Ecole Normale Supérieure. Duhem generalized the concept of “virtual work” under the action of “external actions” by taking into account both mechanical and thermal actions. In 1894, the design of a generalized mechanics based on thermodynamics was further developed: ordinary mechanics had already become “a particular case of a more general science”. Duhem writes “We made dynamics a special case of thermodynamics, a science that embraces common principles in all changes of state bodies, changes of places as well as changes in physical qualities” (Nous avons fait de la dynamique un cas particulier de la thermodynamique, une Science qui embrasse dans des principes communs tous les changements d’état des corps, aussi bien les changements de lieu que les changements de qualités physiques). In the equations of his generalized mechanics-thermodynamics, some new terms had to be introduced, in order to account for the intrinsic viscosity and friction of the system. As observed by Stefano Bordoni, Duhem aimed at widening the scope of physics: the new physics could not confine itself to “local motion” but had to describe what Duhem qualified “motions of modification”. If Boltzmann had tried to proceed from “local motion” to attain the explanation of more complex transformations, Duhem was trying to proceed from general laws concerning general transformation in order to reach “local motion” as a simplified specific case. 
Four scientists were credited by Duhem with having carried out “the most important researches on that subject”: Massieu had managed to derive thermodynamics from a “characteristic function and its partial derivatives”; Gibbs had shown that Massieu’s functions “could play the role of potentials in the determination of the states of equilibrium” in a given system; von Helmholtz had put forward “similar ideas”; von Oettingen had given “an exposition of thermodynamics of remarkable generality” based on general duality concept in “Die thermodynamischen Beziehungen antithetisch entwickelt” published at St. Petersburg in 1885. Duhem took into account a system whose elements had the same temperature and where the state of the system could be completely specified by giving its temperature and n other independent quantities. He then introduced some “external forces”, and held the system in equilibrium. A virtual work corresponded to such forces, and a set of n + 1 equations corresponded to the condition of equilibrium of the physical system. From the thermodynamic point of view, every infinitesimal transformation involving the generalized displacements had to obey to the first law, which could be expressed in terms of the (n + 1) generalized Lagrangian parameters. The amount of heat could be written as a sum of (n + 1) terms. The new alliance between mechanics and thermodynamics led to a sort of symmetry between thermal and mechanical quantities. The n + 1 functions played the role of generalized thermal capacities, and the last term was nothing other than the ordinary thermal capacity. The knowledge of the “equilibrium equations of a system” allowed Duhem to compute the partial derivatives of the thermal capacity with regard to all the parameters which described the state of the system, apart from its derivative with regard to temperature. The thermal capacities were therefore known “except for an unspecified function of temperature”.

The axiomatic approach of thermodynamics was published in 1909 in Mathematische Annalen [76] under the title “Examination of the Foundations of Thermodynamics” (Untersuchungen über die Grundlagen der Thermodynamik) by Constantin Carathéodory, based on Carnot’s works [77]. Carathéodory introduced entropy through a mathematical approach based on the geometric behavior of a certain class of partial differential equations called Pfaffians. Carathéodory’s investigations start by revisiting the first law and reformulating the second law of thermodynamics in the form of two axioms. The first axiom applies to a multiphase system change under adiabatic conditions (axiom of classical thermodynamics due to Clausius [78,79]). The second axiom assumes that in the neighborhood of any equilibrium state of a system (of any number of thermodynamic coordinates), there exist states that are inaccessible by reversible adiabatic processes. In the book of Misha Gromov “Metric Structures for Riemannian and Non-Riemannian Spaces”, written and edited by Pierre Pansu and Jacques Lafontaine, a new metric is introduced called the “Carnot-Carathéodory metric”. In one of his papers, Misha Gromov [80,81] gives historical remarks: “This result (which seems obvious by the modern standards) appears (in a more general form) in the 1909-paper by Carathéodory on formalization of the classical thermodynamics where horizontal curves roughly correspond to adiabatic processes. In fact, the above proof may be performed in the language of Carnot (cycles) and for this reason the metrics distH were christened ‘Carnot-Carathéodory’ in Gromov-Lafontaine-Pansu book” [82]. When I asked Pierre Pansu about this, he answered that “The section 4 of [76], entitled Hilfsatz aus der Theorie des Pfaffschen Gleichungen (Lemma from the theory of Pfaffian equations) opens with a statement relating to the differential 1-forms.
Carathéodory says, If a Pfaffian equation dx0 + X1 dx1 + X2 dx2 + … + Xn dxn = 0 is given, in which the Xi are finite, continuous, differentiable functions of the xi, and one knows that in any neighborhood of an arbitrary point P of the space of xi there is a point that one cannot reach along a curve that satisfies this equation then the expression must necessarily possess a multiplier that makes it into a complete differential”. This is confirmed in the introduction of his paper [76], where Carathéodory said “Finally, in order to be able to treat systems with arbitrarily many degrees of freedom from the outset, instead of the Carnot cycle that is almost always used, but is intuitive and easy to control only for systems with two degrees of freedom, one must employ a theorem from the theory of Pfaffian differential equations, for which a simple proof is given in the fourth section”.

We must also mention Henri Poincaré [83], who published the paper “On attempts of mechanical explanation for the principles of thermodynamics (Sur les tentatives d’explication mécanique des principes de la thermodynamique)” in the Comptes rendus de l’Académie des sciences in 1889 [84], in which he tried to consolidate the links between the principles of mechanics and thermodynamics. These elements were also developed in Poincaré’s lecture of 1892 [85] on “thermodynamique” in Chapter XVII “Reduction of thermodynamics principles to the general principles of mechanics (Réduction des principes de la Thermodynamique aux principes généraux de la mécanique)”. Poincaré writes in his book [85] “It is otherwise with the second law of thermodynamics. Clausius was the first to attempt to bring it to the principles of mechanics, but did not succeed satisfactorily. Helmholtz, in his memoir on the principle of least action, developed a theory much more perfect than that of Clausius. However, it cannot account for irreversible phenomena. (Il en est autrement du second principe de la thermodynamique. Clausius a, le premier, tenté de le ramener aux principes de la Mécanique, mais sans y réussir d’une manière satisfaisante. Helmholtz dans son mémoire sur le principe de la moindre action, a développé une théorie beaucoup plus parfaite que celle de Clausius; cependant elle ne peut rendre compte des phénomènes irréversibles.)”. About Helmholtz’s work, Poincaré observes [85] “It follows from these examples that the Helmholtz hypothesis is true in the case of bodies turning around an axis; so it seems applicable to vortex motions of molecules (Il résulte de ces exemples que l’hypothèse d’Helmholtz est exacte dans le cas de corps tournant autour d’un axe; elle paraît donc applicable aux mouvements tourbillonnaires des molécules.)”, but he adds in the following that the Helmholtz model is also true in the case of vibrating motions such as molecular motions.
However, he finally observes that the Helmoltz model cannot explain the increasing of entropy and concludes [85] “All attempts of this nature must be abandoned; the only ones that have any chance of success are those based on the intervention of statistical laws, for example, the kinetic theory of gases. This view, which I cannot develop here, can be summed up in a somewhat vulgar way as follows: Suppose we want to place a grain of oats in the middle of a heap of wheat; it will be easy; then suppose we wanted to find it and remove it; we cannot achieve it. All irreversible phenomena, according to some physicists, would be built on this model (Toutes les tentatives de cette nature doivent donc être abandonnées; les seules qui aient quelque chance de succès sont celles qui sont fondées sur l’intervention des lois statistiques comme, par exemple, la théorie cinétique des gaz. Ce point de vue, que je ne puis développer ici, peut se résumer d’une façon un peu vulgaire comme il suit: Supposons que nous voulions placer un grain d’avoine au milieu d’un tas de blé; cela sera facile; supposons que nous voulions ensuite l’y retrouver et l’en retirer; nous ne pourrons y parvenir. Tous les phénomènes irréversibles, d’après certains physiciens, seraient construits sur ce modèle)”. In Poincaré’s lecture, Massieu has greatly influenced Poincaré to introduce Massieu characteristic function in probability [86]. As we have observed, Poincaré has introduced characteristic function in probability lecture after his lecture on thermodynamics where he discovered in its second edition [85], the Massieu’s characteristic function. We can read that “Since from the functions of Mr. Massieu one can deduce other functions of variables, all equations of thermodynamics can be written so as to only contain these functions and their derivatives; it will thus result in some cases, a great simplification (Puisque des fonctions de M. 
Massieu on peut déduire les autres fonctions des variables, toutes les équations de la Thermodynamique pourront s’écrire de manière à ne plus renfermer que ces fonctions et leurs dérivées; il en résultera donc, dans certains cas, une grande simplification).” [85]. He [85] added “MM. Gibbs, von Helmholtz, Duhem have used this function H = TS − U assuming that T and V are constant. Mr. von Helmholtz has called it ‘free energy’ and also proposes to give it the name of ‘kinetic potential’; Duhem called it ‘the thermodynamic potential at constant volume’; this is the most justified naming (MM. Gibbs, von Helmholtz, Duhem ont fait usage de cette fonction H = TS − U en y supposant T et V constants. M. von Helmholtz l’a appelée énergie libre et a proposé également de lui donner le nom de potentiel cinétique; M. Duhem la nomme potentiel thermodynamique à volume constant; c’est la dénomination la plus justifiée)”. In 1906, Henri Poincaré also published a note [87] “Reflections on the kinetic theory of gases” (Réflexions sur la théorie cinétique des gaz), where he said that: “The kinetic theory of gases leaves awkward points for those who are accustomed to mathematical rigor … One of the points which embarrassed me most was the following one: it is a question of demonstrating that the entropy keeps decreasing, but the reasoning of Gibbs seems to suppose that, having varied the outside conditions, we wait until the regime is established before varying them again. Is this supposition essential, or in other words, could we arrive at results opposite to the principle of Carnot by varying the outside conditions too fast for the permanent regime to have time to become established?”.

Jean-Marie Souriau elaborated a disruptive and innovative “théorie géométrique de la chaleur (geometric theory of heat)” [88] after the works of his predecessors, as illustrated in Figure 4: “théorie analytique de la chaleur (analytic theory of heat)” by Jean Baptiste Joseph Fourier [88], “théorie mécanique de la chaleur (mechanical theory of heat)” by Rudolf Clausius [89] and François Massieu, and “théorie mathématique de la chaleur (mathematical theory of heat)” by Siméon-Denis Poisson [90,91].

3. Revisited Souriau Symplectic Model of Statistical Physics

In this section, we revisit the Souriau model of thermodynamics with modern notation, replacing the personal conventions that Souriau used in his 1970 book with more classical ones.

In 1970, Souriau introduced the concept of co-adjoint action of a group on its momentum space (or “moment map”: a mapping induced by the symmetries of a symplectic manifold), based on work on the orbit method, which makes it possible to define physical observables like energy, heat and momentum or moment as purely geometrical objects (the moment map takes its values in a space determined by the group of symmetries: the dual space of its Lie algebra). The moment(um) map is a constant of the motion and is associated to symplectic cohomology (the assignment of algebraic invariants to a topological space that arises from the algebraic dualization of the homology construction). Souriau introduced the moment map in 1965 in lecture notes at Marseille University and published it in 1966. He gave its formal definition, and its name based on its physical interpretation, in 1967. Souriau then studied its equivariance properties, and formulated the coadjoint orbit theorem in his 1970 book. However, in Chapter IV of that book, Souriau also observed that Gibbs equilibrium states are not covariant under dynamical groups (Galileo or Poincaré groups), and he then developed a covariant model that he called “Lie group thermodynamics”, where equilibria are indexed by a “geometric (Planck) temperature”, given by a vector

β

that lies in the Lie algebra of the dynamical group. For Souriau, all the details of classical mechanics appear as geometric necessities (e.g., mass is the measure of the symplectic cohomology of the action of a Galileo group). Based on this new covariant model of thermodynamic Gibbs equilibrium, Souriau has formulated statistical mechanics and thermodynamics in the framework of symplectic geometry by use of symplectic moments and distribution-tensor concepts, giving a geometric status for temperature, heat and entropy.

There is a controversy about the name “momentum map” or “moment map”. Smale [92] referred to this map as the “angular momentum”, while Souriau used the French word “moment”. Cushman and Duistermaat [93] have suggested that the proper English translation of Souriau’s French word was “momentum”, which fits better with standard usage in mechanics. On the other hand, Guillemin and Sternberg [94] have validated the name given by Souriau and have used “moment” in English. In this paper, we will see that the name “moment” given by Souriau was the most appropriate word. In Chapter IV of his book [10], studying statistical mechanics, Souriau ingeniously observed that moments of inertia in mechanics are equivalent to moments in probability in his new geometric model of statistical physics. We will see that in the Souriau Lie group thermodynamic model, these statistical moments are given by the energy and the heat defined geometrically by Souriau, and are associated with the “moment map” in the dual Lie algebra.

This work has been extended by Claude Vallée [5,6] and Gery de Saxcé [4,8,95,96]. More recently, Kapranov has also given a thermodynamical interpretation of the moment map for toric varieties [97], and Pavlov has studied thermodynamics from the differential geometry standpoint [98].

The conservation of the moment of a Hamiltonian action was called by Souriau the “symplectic or geometric Noether theorem”. Considering phase space as a symplectic manifold (the cotangent bundle of configuration space with its canonical symplectic form), if the Hamiltonian is invariant under the action of a Lie algebra of symmetries, then the moment map is constant along the integral curves of the system. The Noether theorem is obtained by considering each component of the moment map independently.

In a first step to establish new foundations of thermodynamics, Souriau [10] defined a Gibbs canonical ensemble on a symplectic manifold M for a Lie group action on M. In classical statistical mechanics, a state is given by the solution of the Liouville equation on the phase space, the partition function. As symplectic manifolds have a completely continuous measure, invariant under diffeomorphisms, the Liouville measure λ, all statistical states will be the product of the Liouville measure by the scalar function given by the generalized partition function

e^{Φ (β) - 〈 β, U (ξ) 〉}

defined by the energy

U

(defined in the dual of the Lie algebra of this dynamical group) and the geometric temperature

β

, where

Φ

is a normalizing constant such that the total probability mass is equal to 1,

Φ (β) = - \log \int_{M} e^{- 〈 β, U (ξ) 〉} d λ

[99]. Jean-Marie Souriau then generalizes the Gibbs equilibrium state to all symplectic manifolds that have a dynamical group. To ensure that all integrals that will be defined could converge, the canonical Gibbs ensemble is the largest open proper subset (in Lie algebra) where these integrals are convergent. This canonical Gibbs ensemble is convex. The derivative of

Φ

Q = \frac{\partial Φ}{\partial β}

(thermodynamic heat) is equal to the mean value of the energy

U

. The minus derivative of this generalized heat

Q

K = - \frac{\partial Q}{\partial β}

is symmetric and positive (this is a geometric heat capacity). Entropy

s

is then defined by Legendre transform of

Φ

s = 〈 β, Q 〉 - Φ

. If this approach is applied to the group of time translations, one recovers the classical theory of thermodynamics. However, Souriau [10] observed that if we apply this theory to a non-commutative group (Galileo or Poincaré groups), the symmetry is broken: classical Gibbs equilibrium states are no longer invariant under this group. This symmetry breaking provides new equations, discovered by Souriau [10].
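The definitions above can be sketched numerically in the simplest setting. The example assumes the group of time translations (so that β is a scalar) and replaces the symplectic manifold with its Liouville measure by a finite set of states with counting measure; the energy levels and the function name `gibbs_quantities` are hypothetical choices made only for illustration.

```python
import numpy as np

def gibbs_quantities(energies, beta):
    """Return (Phi, Q, K, s) for a finite state space with counting measure."""
    energies = np.asarray(energies, dtype=float)
    weights = np.exp(-beta * energies)
    partition = weights.sum()
    phi = -np.log(partition)               # Phi(beta) = -log sum exp(-beta U)
    p = weights / partition                # Gibbs density exp(Phi - beta U)
    q = np.sum(p * energies)               # heat Q = dPhi/dbeta = mean of U
    k = np.sum(p * energies ** 2) - q ** 2 # capacity K = -dQ/dbeta = var(U)
    s = beta * q - phi                     # entropy as Legendre transform of Phi
    return phi, q, k, s

energies = [0.0, 1.0, 2.0, 5.0]  # hypothetical energy levels
beta, h = 0.7, 1e-5

phi, q, k, s = gibbs_quantities(energies, beta)
plus = gibbs_quantities(energies, beta + h)
minus = gibbs_quantities(energies, beta - h)

# Central finite differences of Phi and Q with respect to beta.
dphi_dbeta = (plus[0] - minus[0]) / (2 * h)
dq_dbeta = (plus[1] - minus[1]) / (2 * h)

# Shannon entropy of the Gibbs density, for comparison with s.
p = np.exp(phi - beta * np.asarray(energies))
shannon = -np.sum(p * np.log(p))

print(dphi_dbeta - q, -dq_dbeta - k, s - shannon)
```

In this sketch the three printed differences should be close to zero: the derivative of Φ reproduces the mean energy, minus the derivative of Q reproduces the variance, and the Legendre transform of Φ coincides with the Shannon entropy of the Gibbs density.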

We can read in his paper this prophetic sentence: “Perhaps this Lie group thermodynamics is of mathematical interest (Peut-être cette Thermodynamique des groupes de Lie a-t-elle un intérêt mathématique)” [30]. He explains that for the dynamical Galileo group with only one axis of rotation, this thermodynamic theory is the theory of the centrifuge, where the temperature vector dimension is equal to 2 (sub-group of invariance of size 2), used to make “uranium 235” and “ribonucleic acid” [30]. The physical meaning of these two dimensions for vector-valued temperature is “thermal conduction” and “viscosity”. Souriau said that the model unifies “heat conduction” and “viscosity” (Fourier and Navier equations) in the same theory of irreversible processes. Souriau applied this theory in detail to the relativistic ideal gas, with the Poincaré group as the dynamical group.

Before introducing the Souriau model of Lie group thermodynamics, we first recall the classical notation of Lie group theory as applied to Lie group thermodynamics:

The coadjoint representation of $G$ is the contragredient of the adjoint representation. It associates to each $g \in G$ the linear isomorphism $A d_{g}^{*} \in G L (g^{*})$ , which satisfies, for each $ξ \in g^{*}$ and $X \in g$ :

$〈 A d_{g}^{*} (ξ), X 〉 = 〈 ξ, A d_{g^{- 1}} (X) 〉$

(23)
The adjoint representation of the Lie algebra $g$ is the linear representation of $g$ into itself which associates, to each $X \in g$ , the linear map $a d_{X} \in g l (g)$ . Here $a d$ is the tangent map of $A d$ at the neutral element $e$ of $G$ :

$\begin{array}{l} a d = T_{e} A d : T_{e} G \to E n d (T_{e} G) \\ X, Y \in T_{e} G \mapsto a d_{X} (Y) = [X, Y] \end{array}$

(24)
The coadjoint representation of the Lie algebra $g$ is the contragredient of the adjoint representation. It associates, to each $X \in g$ , the linear map $a d_{X}^{*} \in g l (g^{*})$ which satisfies, for each $ξ \in g^{*}$ and $Y \in g$ :

$〈 a d_{X}^{*} (ξ), Y 〉 = 〈 ξ, a d_{- X} (Y) 〉$

(25)

We can illustrate this for the group of matrices $G = G L_{n} (K)$ with $K = R$ or $C$ .

$T_{e} G = M_{n} (K), \quad X \in M_{n} (K), \; g \in G \quad \Rightarrow \quad A d_{g} (X) = g X g^{- 1}$

(26)

$X, Y \in M_{n} (K) \quad \Rightarrow \quad a d_{X} (Y) = {(T_{e} A d)}_{X} (Y) = X Y - Y X = [X, Y]$

(27)

Then, the curve from $e = I_{d} = c (0)$ tangent to $X = \dot{c} (0)$ is given by $c (t) = \exp (t X)$ and is transformed by $A d$ into $γ (t) = A d_{\exp (t X)}$ :

$a d_{X} (Y) = {(T_{e} A d)}_{X} (Y) = {\left. \frac{d}{d t} γ (t) Y \right|}_{t = 0} = {\left. \frac{d}{d t} \exp (t X) Y \exp {(t X)}^{- 1} \right|}_{t = 0} = X Y - Y X$

(28)
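The matrix case above can be checked numerically. The following sketch differentiates $t \mapsto A d_{\exp (t X)} (Y)$ at $t = 0$ by finite differences and compares the result with the commutator; it also checks the coadjoint pairing of Equation (23). It assumes that the dual of the matrix Lie algebra is identified with matrices through the trace pairing $〈 ξ, X 〉 = tr (ξ X)$, under which $A d_{g}^{*} (ξ) = g ξ g^{- 1}$; this identification, the random test matrices and the helper `expm_taylor` are illustrative assumptions, not part of the original text.

```python
import numpy as np

def expm_taylor(a, terms=30):
    """Matrix exponential by truncated Taylor series (adequate for small norms)."""
    result = np.eye(a.shape[0])
    term = np.eye(a.shape[0])
    for n in range(1, terms):
        term = term @ a / n
        result = result + term
    return result

def Ad(g, x):
    """Adjoint action of the group on the Lie algebra: g x g^{-1}."""
    return g @ x @ np.linalg.inv(g)

def ad(x, y):
    """Adjoint action of the Lie algebra on itself: the commutator [x, y]."""
    return x @ y - y @ x

def pair(xi, x):
    """Trace pairing used here to identify the dual with matrices (assumption)."""
    return np.trace(xi @ x)

def coAd(g, xi):
    """Coadjoint action under the trace pairing: g xi g^{-1}."""
    return g @ xi @ np.linalg.inv(g)

rng = np.random.default_rng(0)
X = 0.5 * rng.standard_normal((3, 3))
Y = rng.standard_normal((3, 3))
xi = rng.standard_normal((3, 3))

# d/dt Ad_{exp(tX)}(Y) at t = 0, by central finite differences.
h = 1e-5
derivative = (Ad(expm_taylor(h * X), Y) - Ad(expm_taylor(-h * X), Y)) / (2 * h)
ad_error = np.max(np.abs(derivative - ad(X, Y)))

# <Ad*_g xi, Y> versus <xi, Ad_{g^{-1}} Y>.
g = expm_taylor(X)
lhs = pair(coAd(g, xi), Y)
rhs = pair(xi, Ad(np.linalg.inv(g), Y))

print(ad_error, abs(lhs - rhs))
```

Both printed values should be small, the first limited by the finite-difference step and the second by floating-point round-off.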

For each temperature $β$ , element of the Lie algebra $g$ , Souriau has introduced a tensor ${\tilde{Θ}}_{β}$ , equal to the sum of the cocycle $\tilde{Θ}$ and the heat coboundary (with [.,.] Lie bracket):

${\tilde{Θ}}_{β} (Z_{1}, Z_{2}) = \tilde{Θ} (Z_{1}, Z_{2}) + 〈 Q, a d_{Z_{1}} (Z_{2}) 〉 \quad \text{with} \quad a d_{Z_{1}} (Z_{2}) = [Z_{1}, Z_{2}]$

(29)

This tensor ${\tilde{Θ}}_{β}$ has the following properties:
$\tilde{Θ} (X, Y) = 〈 Θ (X), Y 〉$ where the map $Θ$ is the one-cocycle of the Lie algebra $g$ with values in $g^{*}$ , with $Θ (X) = T_{e} θ (X (e))$ where $θ$ is the one-cocycle of the Lie group G. $\tilde{Θ} (X, Y)$ is constant on M and the map $\tilde{Θ} : g \times g \to ℜ$ is a skew-symmetric bilinear form, called the symplectic cocycle of the Lie algebra $g$ associated to the moment map $J$ , with the following properties:

$\tilde{Θ} (X, Y) = J_{[X, Y]} - \{ J_{X}, J_{Y} \} \quad \text{with } \{ ., . \} \text{ the Poisson bracket and } J \text{ the moment map}$

(30)

$\tilde{Θ} ([X, Y], Z) + \tilde{Θ} ([Y, Z], X) + \tilde{Θ} ([Z, X], Y) = 0$

(31)

where $X \mapsto J_{X}$ is a linear map from $g$ to the differentiable functions on $M$ : $\begin{array}{l} g \to C^{\infty} (M, R) \\ X \mapsto J_{X} \end{array}$ and $J$ is the associated differentiable map, called the moment(um) map:

$J : M \to g^{*}, \quad x \mapsto J (x), \quad \text{such that } J_{X} (x) = 〈 J (x), X 〉, \; X \in g$

(32)

If instead of $J$ we take the following moment map: $J' (x) = J (x) + Q, x \in M$
where $Q \in g^{*}$ is constant, the symplectic cocycle $θ$ is replaced by $θ' (g) = θ (g) + Q - A d_{g}^{*} Q$
where $θ' - θ = Q - A d_{g}^{*} Q$ is a one-coboundary of $G$ with values in $g^{*}$ . We also have the properties $θ (g_{1} g_{2}) = A d_{g_{1}}^{*} θ (g_{2}) + θ (g_{1})$ and $θ (e) = 0$ .
The geometric temperature, element of the algebra $g$ , is in the kernel of the tensor ${\tilde{Θ}}_{β}$ :

$β \in \mathrm{Ker} \, {\tilde{Θ}}_{β}, \text{ such that } {\tilde{Θ}}_{β} (β, β) = 0, \; \forall β \in g$

(33)
The following symmetric tensor $g_{β}$ , defined on all values of $a d_{β} (.) = [β, .]$ is positive definite:

$g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])$

(34)

$g_{β} ([β, Z_{1}], Z_{2}) = {\tilde{Θ}}_{β} (Z_{1}, Z_{2}), \forall Z_{1} \in g, \forall Z_{2} \in Im (a d_{β} (.))$

(35)

$g_{β} (Z_{1}, Z_{2}) \geq 0, \forall Z_{1}, Z_{2} \in Im (a d_{β} (.))$

(36)

where the linear map $a d_{X} \in g l (g)$ is the adjoint representation of the Lie algebra $g$ defined by $X, Y \in g (= T_{e} G) \mapsto a d_{X} (Y) = [X, Y]$ , and the co-adjoint representation of the Lie algebra $g$ is the linear map $a d_{X}^{*} \in g l (g^{*})$ which satisfies, for each $ξ \in g^{*}$ and $X, Y \in g$ : $〈 a d_{X}^{*} (ξ), Y 〉 = 〈 ξ, - a d_{X} (Y) 〉$ .
These equations are universal, because they are not dependent on the symplectic manifold but only on the dynamical group G, the symplectic cocycle $Θ$ , the temperature $β$ and the heat $Q$ . Souriau called this model “Lie groups thermodynamics”.
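The statement in the list above, that shifting the moment map by a constant $Q \in g^{*}$ replaces $θ$ by $θ' (g) = θ (g) + Q - A d_{g}^{*} Q$, can be complemented by a short check that $θ'$ still satisfies the cocycle property. The derivation below is a sketch that uses only the identity $θ (g_{1} g_{2}) = A d_{g_{1}}^{*} θ (g_{2}) + θ (g_{1})$ and the fact that $g \mapsto A d_{g}^{*}$ is a representation, so that $A d_{g_{1} g_{2}}^{*} = A d_{g_{1}}^{*} A d_{g_{2}}^{*}$:

```latex
\begin{aligned}
θ'(g_{1} g_{2})
  &= θ(g_{1} g_{2}) + Q - Ad_{g_{1} g_{2}}^{*} Q \\
  &= Ad_{g_{1}}^{*} θ(g_{2}) + θ(g_{1}) + Q - Ad_{g_{1}}^{*} Ad_{g_{2}}^{*} Q \\
  &= Ad_{g_{1}}^{*} \bigl( θ(g_{2}) + Q - Ad_{g_{2}}^{*} Q \bigr)
     + \bigl( θ(g_{1}) + Q - Ad_{g_{1}}^{*} Q \bigr) \\
  &= Ad_{g_{1}}^{*} θ'(g_{2}) + θ'(g_{1}),
\qquad θ'(e) = θ(e) + Q - Q = 0 .
\end{aligned}
```

The third line follows because the two extra terms $A d_{g_{1}}^{*} Q$ and $- A d_{g_{1}}^{*} Q$ cancel, so the cohomology class of the cocycle is unaffected by the choice of the constant $Q$.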

We will give the main theorem of Souriau for this “Lie group thermodynamics”:

Theorem 1 (Souriau Theorem of Lie Group Thermodynamics).

Let

Ω

be the largest open proper subset of

g

, Lie algebra of G, such that

\int_{M} e^{- 〈 β, U (ξ) 〉} d λ

and

\int_{M} ξ \cdot e^{- 〈 β, U (ξ) 〉} d λ

are convergent integrals. This set

Ω

is convex and is invariant under every transformation

A d_{g} (.)

, where

g \mapsto A d_{g} (.)

is the adjoint representation of G, such that

A d_{g} = T_{e} i_{g}

with

i_{g} : h \mapsto g h g^{- 1}

. Let

a : G \times g^{*} \to g^{*}

be the unique affine action

a

whose linear part is the coadjoint representation of

G

, that is, the contragredient of the adjoint representation. It associates to each

g \in G

the linear isomorphism

A d_{g}^{*} \in G L (g^{*})

, satisfying, for each

ξ \in g^{*} \text{ and } X \in g : 〈 A d_{g}^{*} (ξ), X 〉 = 〈 ξ, A d_{g^{- 1}} (X) 〉 .

Then, the fundamental equations of Lie group thermodynamics are given by the action of the group:

Action of Lie group on Lie algebra:

$β \to A d_{g} (β)$

(37)
Transformation of characteristic function after action of Lie group:

$Φ \to Φ - 〈 θ (g^{- 1}), β 〉$

(38)
Invariance of entropy with respect to action of Lie group:

$s \to s$

(39)
Action of Lie group on geometric heat, element of dual Lie algebra:

$Q \to a (g, Q) = A d_{g}^{*} (Q) + θ (g)$

(40)
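Equations (37) to (40) can be illustrated on a small example in which everything is available in closed form. The sketch below assumes $M = S^{2}$ (the unit sphere with its area form as Liouville measure), $G = S O (3)$, the identification of the Lie algebra and its dual with $R^{3}$, the moment map $J (x) = x$ and a vanishing cocycle $θ = 0$; these choices are illustrative assumptions, not an example taken from Souriau. Under them, $A d_{g}$ and $A d_{g}^{*}$ both act as the rotation matrix, and $Φ (β) = - \log (4 π \sinh | β | / | β |)$.

```python
import numpy as np

def phi(beta):
    """Characteristic function Phi(beta) = -log of the integral of exp(-<beta, x>) over S^2."""
    b = np.linalg.norm(beta)
    return -np.log(4.0 * np.pi * np.sinh(b) / b)

def heat(beta):
    """Geometric heat Q = dPhi/dbeta, the mean of the moment map J(x) = x."""
    b = np.linalg.norm(beta)
    return -(1.0 / np.tanh(b) - 1.0 / b) * beta / b

def entropy(beta):
    """Entropy s = <beta, Q> - Phi (Legendre transform of Phi)."""
    return float(np.dot(beta, heat(beta)) - phi(beta))

def rotation(axis, angle):
    """Rotation matrix from Rodrigues' formula."""
    u = np.asarray(axis, dtype=float)
    u = u / np.linalg.norm(u)
    k = np.array([[0.0, -u[2], u[1]],
                  [u[2], 0.0, -u[0]],
                  [-u[1], u[0], 0.0]])
    return np.eye(3) + np.sin(angle) * k + (1.0 - np.cos(angle)) * (k @ k)

beta = np.array([0.3, -1.2, 0.8])      # hypothetical geometric temperature
R = rotation([1.0, 2.0, -1.0], 0.9)    # hypothetical group element g

# Finite-difference gradient of Phi, to confirm that heat() is dPhi/dbeta.
h = 1e-6
grad_fd = np.array([(phi(beta + h * e) - phi(beta - h * e)) / (2 * h)
                    for e in np.eye(3)])

phi_shift = phi(R @ beta) - phi(beta)                        # (38) with theta = 0
heat_defect = np.max(np.abs(heat(R @ beta) - R @ heat(beta)))  # (40) with theta = 0
entropy_shift = entropy(R @ beta) - entropy(beta)            # (39)

print(phi_shift, heat_defect, entropy_shift)
```

With a zero cocycle the characteristic function and the entropy are unchanged and the heat transforms by the coadjoint action alone, so the three printed values should vanish up to round-off. A group with a non-trivial cocycle, such as the Galileo group, would add the affine term $θ (g)$, which this sketch does not model.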

Souriau equations of Lie group thermodynamics are summarized in the following Figure 5 and Figure 6:

For Hamiltonian actions of a Lie group on a connected symplectic manifold, the equivariance of the moment map with respect to an affine action of the group on the dual of its Lie algebra has been studied by Marle and Libermann [100] and Lichnerowicz [101,102]:

Theorem 2 (Marle Theorem on Cocycles).

Let

G

be a connected and simply connected Lie group,

R : G \to G L (E)

be a linear representation of

G

in a finite-dimensional vector space E, and

r : g \to g l (E)

be the associated linear representation of its Lie algebra

g

. For any one-cocycle

Θ : g \to E

of the Lie algebra

g

for the linear representation r, there exists a unique one-cocycle

θ : G \to E

of the Lie group

G

for the linear representation R such that

Θ (X) = T_{e} θ (X (e))

, which has

Θ

as associated Lie algebra one-cocycle. The Lie group one-cocycle

θ

is a Lie group one-coboundary if and only if the Lie algebra one-cocycle

Θ

is a Lie algebra one-coboundary.

Let

G

be a Lie group whose Lie algebra is

g

. The skew-symmetric bilinear form

\tilde{Θ}

on

g = T_{e} G

can be extended into a closed differential two-form on

G

, since the identity (31) satisfied by

\tilde{Θ}

means that its exterior differential

d \tilde{Θ}

vanishes. In other words,

\tilde{Θ}

is a 2-cocycle for the restriction of the de Rham cohomology of

G

to left invariant differential forms. In the framework of Lie group action on a symplectic manifold, equivariance of moment could be studied to prove that there is a unique action a(.,.) of the Lie group

G

on the dual

g^{*}

of its Lie algebra for which the moment map

J

is equivariant, that means for each

x \in M

J (Φ_{g} (x)) = a (g, J (x)) = A d_{g}^{*} (J (x)) + θ (g)

(41)

where

Φ : G \times M \to M

is an action of the Lie group G on the differentiable manifold M. The fundamental field associated to an element

X

of Lie algebra

g

of the group G is the vector field

X_{M}

on M:

X_{M} (x) = {\frac{d}{d t} Φ_{\exp (- t X)} (x) |}_{t = 0}

(42)

with

Φ_{g_{1}} (Φ_{g_{2}} (x)) = Φ_{g_{1} g_{2}} (x)

and

Φ_{e} (x) = x

. The action

Φ

is Hamiltonian on a symplectic manifold

M

, if

Φ

is symplectic and if for all

X \in g

, the fundamental field

X_{M}

is globally Hamiltonian. The cohomology class of the symplectic cocycle

θ

only depends on the Hamiltonian action

Φ

, and not on

J

In Appendix B, we observe that Souriau Lie group thermodynamics is compatible with Balian gauge theory of thermodynamics [103], that is obtained by symplectization in dimension 2n + 2 of contact manifold in dimension 2n + 1. All elements of the Souriau geometric temperature vector are multiplied by the same gauge parameter.

We conclude this section with this Bourbakist quotation from Jean-Marie Souriau [34]:

It is obvious that one can only define average values on objects belonging to a vector (or affine) space; Therefore—so this assertion may seem Bourbakist—that we will observe and measure average values only as quantity belonging to a set having physically an affine structure. It is clear that this structure is necessarily unique—if not the average values would not be well defined. (Il est évident que l’on ne peut définir de valeurs moyennes que sur des objets appartenant à un espace vectoriel (ou affine); donc—si bourbakiste que puisse sembler cette affirmation—que l’on n’observera et ne mesurera de valeurs moyennes que sur des grandeurs appartenant à un ensemble possédant physiquement une structure affine. Il est clair que cette structure est nécessairement unique—sinon les valeurs moyennes ne seraient pas bien définies.).

4. The Souriau-Fisher Metric as Geometric Heat Capacity of Lie Group Thermodynamics

We observe that the Souriau Riemannian metric, introduced with the symplectic cocycle, is a generalization of the Fisher metric, which we call the Souriau-Fisher metric, and which preserves the property of being defined as a hessian of the logarithm of the partition function

g_{β} = - \frac{\partial^{2} Φ}{\partial β^{2}} = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

as in classical information geometry. We will establish the equality of two terms, between Souriau definition based on Lie group cocycle

Θ

and parameterized by “geometric heat” Q (element of dual Lie algebra) and “geometric temperature” β (element of Lie algebra) and hessian of characteristic function

Φ (β) = - \log ψ_{Ω} (β)

with respect to the variable β:

g_{β} ([β, Z_{1}], [β, Z_{2}]) = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 Q, [Z_{1}, [β, Z_{2}]] 〉 = \frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

(43)

If we differentiate the following relation from Souriau’s theorem

Q (A d_{g} (β)) = A d_{g}^{*} (Q) + θ (g)

, we obtain:

\frac{\partial Q}{\partial β} (- [Z_{1}, β], .) = \tilde{Θ} (Z_{1}, [β, .]) + 〈 Q, a d_{Z_{1}} ([β, .]) 〉 = {\tilde{Θ}}_{β} (Z_{1}, [β, .])

(44)

- \frac{\partial Q}{\partial β} ([Z_{1}, β], Z_{2}) = \tilde{Θ} (Z_{1}, [β, Z_{2}]) + 〈 Q, a d_{Z_{1}} ([β, Z_{2}]) 〉 = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])

(45)

\Rightarrow - \frac{\partial Q}{\partial β} = g_{β} ([β, Z_{1}], [β, Z_{2}])

(46)

As the entropy is defined by the Legendre transform of the characteristic function, this Souriau-Fisher metric is also equal to the inverse of the hessian of “geometric entropy”

s (Q)

with respect to the variable Q:

\frac{\partial^{2} s (Q)}{\partial Q^{2}}

For the maximum entropy density (Gibbs density), the following three terms coincide:

\frac{\partial^{2} \log ψ_{Ω}}{\partial β^{2}}

which describes the convexity of the log-likelihood function;

I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}]

the Fisher metric, which describes the covariance of the log-likelihood gradient; and

I (β) = E [(ξ - Q) {(ξ - Q)}^{T}] = Var (ξ)

which describes the covariance of the observables.
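The coincidence of these terms can be checked numerically on a toy model. The sketch below assumes a hypothetical finite sample space of five points in R^2 and the Gibbs density p_β(ξ) = exp(-〈β, ξ〉)/ψ(β); the data, the helper names and the finite-difference step are illustrative choices, not taken from the paper. Since log p_β(ξ) = -〈β, ξ〉 - log ψ(β), the Fisher matrix equals the Hessian of log ψ for every ξ, so only that Hessian and the covariance need to be compared.

```python
import numpy as np

# Hypothetical finite sample space: five observable values xi in R^2 (illustrative data).
xi = np.array([[0.0, 1.0], [1.0, 0.5], [2.0, -1.0], [0.5, 2.0], [-1.0, 0.0]])

def log_psi(beta):
    """Logarithm of the partition function psi(beta) = sum_i exp(-<beta, xi_i>)."""
    return np.log(np.sum(np.exp(-xi @ beta)))

def gibbs(beta):
    """Gibbs (maximum entropy) probabilities p_beta(xi_i)."""
    w = np.exp(-xi @ beta)
    return w / w.sum()

def hessian_fd(fun, beta, h=1e-4):
    """Central finite-difference Hessian of a scalar function."""
    n = beta.size
    eye = np.eye(n)
    H = np.zeros((n, n))
    for a in range(n):
        for b in range(n):
            ea, eb = eye[a] * h, eye[b] * h
            H[a, b] = (fun(beta + ea + eb) - fun(beta + ea - eb)
                       - fun(beta - ea + eb) + fun(beta - ea - eb)) / (4 * h * h)
    return H

beta = np.array([0.3, -0.2])
p = gibbs(beta)
Q = p @ xi                                    # mean of the observable, Q = E[xi]
cov = (xi - Q).T @ (p[:, None] * (xi - Q))    # E[(xi - Q)(xi - Q)^T]
hess = hessian_fd(log_psi, beta)              # d^2 log psi / d beta^2

print(np.max(np.abs(hess - cov)))             # small, up to finite-difference error
```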

We can also observe that the Fisher metric

I (β) = - \frac{\partial Q}{\partial β}

is exactly the Souriau metric defined through symplectic cocycle:

I (β) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}]) = g_{β} ([β, Z_{1}], [β, Z_{2}])

(47)

The Fisher metric

I (β) = - \frac{\partial^{2} Φ (β)}{\partial β^{2}} = - \frac{\partial Q}{\partial β}

has been considered by Souriau as a generalization of “heat capacity”. Souriau called K the “geometric capacity”. For β = \frac{1}{k T}, we have:

K = - \frac{\partial Q}{\partial β} = - \frac{\partial Q}{\partial T} {(\frac{\partial (1 / k T)}{\partial T})}^{- 1} = k T^{2} \frac{\partial Q}{\partial T}

which links the geometric capacity to the calorific capacity. The Fisher metric can then be introduced in the Fourier heat equation (see Figure 7):

\frac{\partial T}{\partial t} = \frac{κ}{C \cdot D} Δ T with \frac{\partial Q}{\partial T} = C \cdot D \Rightarrow \frac{\partial β^{- 1}}{\partial t} = κ {[(β^{2} / k) \cdot I_{F i s h e r} (β)]}^{- 1} Δ β^{- 1}

(48)

We can also observe that Q is related to the mean, and K to the variance of U:

K = I (β) = - \frac{\partial Q}{\partial β} = var (U) = \int_{M} U {(ξ)}^{2} \cdot p_{β} (ξ) d ω - {(\int_{M} U (ξ) \cdot p_{β} (ξ) d ω)}^{2}

(49)
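The chain K = -∂Q/∂β = k T^2 ∂Q/∂T = var(U) can be illustrated on a scalar example. The sketch below assumes a hypothetical three-level system, natural units with k = 1, and central finite differences; none of these values come from the paper.

```python
import numpy as np

k = 1.0                           # Boltzmann constant in natural units (assumption)
U = np.array([0.0, 1.0, 2.5])     # hypothetical energy levels

def mean_energy(beta):
    """Q(beta) = E[U] under the Gibbs density proportional to exp(-beta U)."""
    w = np.exp(-beta * U)
    return np.sum(U * w) / np.sum(w)

def variance(beta):
    """var(U) = E[U^2] - E[U]^2 under the same Gibbs density."""
    w = np.exp(-beta * U)
    p = w / w.sum()
    return np.sum(U**2 * p) - np.sum(U * p) ** 2

T = 2.0
beta = 1.0 / (k * T)
h = 1e-5
dQ_dbeta = (mean_energy(beta + h) - mean_energy(beta - h)) / (2 * h)
dQ_dT = (mean_energy(1.0 / (k * (T + h))) - mean_energy(1.0 / (k * (T - h)))) / (2 * h)

K_from_beta = -dQ_dbeta           # K = -dQ/dbeta
K_from_T = k * T**2 * dQ_dT       # K = k T^2 dQ/dT
K_from_var = variance(beta)       # K = var(U)

print(K_from_beta, K_from_T, K_from_var)
```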

Under the action of the group, we observe that the entropy s is unchanged, whereas Φ is changed, but only by a term that depends linearly on β. As a consequence, the Souriau-Fisher metric is invariant:

s [Q (A d_{g} (β))] = s (Q (β)) and I (A d_{g} (β)) = - \frac{\partial^{2} (Φ - 〈 θ (g^{- 1}), β 〉)}{\partial β^{2}} = - \frac{\partial^{2} Φ}{\partial β^{2}} = I (β)

(50)

We have observed that the concept of “heat capacity” is important in the Souriau model because it gives a geometric meaning to its definition. The notion of “heat capacity” has been generalized by Pierre Duhem in his general equations of thermodynamics.

Souriau [34] proposed a principle for a thermometer (θερμός) device that could measure this geometric temperature using a “relative ideal gas thermometer”, based on a theory of dynamical group thermometry, and also recovered the (geometric) Laplace barometric law.

5. Euler-Poincaré Equations and Variational Principle of Souriau Lie Group Thermodynamics

When a Lie algebra acts locally transitively on the configuration space of a Lagrangian mechanical system, Henri Poincaré proved that the Euler-Lagrange equations are equivalent to a new system of differential equations defined on the product of the configuration space with the Lie algebra. Marle has written the Euler-Poincaré equations [104] in an intrinsic form, without any reference to a particular system of local coordinates, proving that they can be conveniently expressed in terms of the Legendre and moment maps of the lift to the cotangent bundle of the Lie algebra action on the configuration space. The Lagrangian is a smooth real-valued function L defined on the tangent bundle T M. To each parameterized continuous, piecewise smooth curve γ : [t_{0}, t_{1}] \to M, defined on a closed interval [t_{0}, t_{1}] with values in M, one associates the value at γ of the action integral:

I (γ) = \int_{t_{0}}^{t_{1}} L (\frac{d γ (t)}{d t}) d t

(51)

The partial differential d_{2} \bar{L} of the function \bar{L} = L \circ φ : M \times g \to ℜ with respect to its second variable, which plays an important part in the Euler-Poincaré equation, can be expressed in terms of the moment and Legendre maps:

d_{2} \bar{L} = p_{g^{*}} \circ φ^{t} \circ L \circ φ

with J = p_{g^{*}} \circ φ^{t} the moment map (so that d_{2} \bar{L} = J \circ L \circ φ), p_{g^{*}} : M \times g^{*} \to g^{*} the canonical projection on the second factor, and L : T M \to T^{*} M the Legendre transform, with:

φ : M \times g \to T M / φ (x, X) = X_{M} (x) and φ^{t} : T^{*} M \to M \times g^{*} / φ^{t} (ξ) = (π_{M} (ξ), J (ξ))

(52)

The Euler-Poincaré equation can therefore be written under the form:

(\frac{d}{d t} - a d_{V (t)}^{*}) (J \circ L \circ φ (γ (t), V (t))) = J \circ d_{1} \bar{L} (γ (t), V (t)) with \frac{d γ (t)}{d t} = φ (γ (t), V (t))

(53)

with

H (ξ) = 〈 ξ, L^{- 1} (ξ) 〉 - L (L^{- 1} (ξ)), ξ \in T^{*} M, L : T M \to T^{*} M, H : T^{*} M \to R .

(54)

Following the remark made by Poincaré at the end of his note [105], the most interesting case is when the map

\bar{L} : M \times g \to R

only depends on its second variable

X \in g

. The Euler-Poincaré equation becomes:

(\frac{d}{d t} - a d_{V (t)}^{*}) (d \bar{L} (V (t))) = 0

(55)

By analogy of structure, when the convex Gibbs ensemble is homogeneous [106], we can apply the Euler-Poincaré equation to Lie group thermodynamics. Consider Clairaut’s equation:

s (Q) = 〈 β, Q 〉 - Φ (β) = 〈 Θ^{- 1} (Q), Q 〉 - Φ (Θ^{- 1} (Q))

(56)

with Q = Θ (β) = \frac{\partial Φ}{\partial β} \in g^{*} and β = Θ^{- 1} (Q) \in g. A Souriau-Euler-Poincaré equation can then be elaborated for Souriau Lie group thermodynamics:

\frac{d Q}{d t} = a d_{β}^{*} Q

(57)

\frac{d}{d t} (A d_{g}^{*} Q) = 0 .

(58)

The first equation, the Euler-Poincaré equation, is a reduction of the Euler-Lagrange equations using symmetries, and especially the fact that a group acts homogeneously on the symplectic manifold:

\frac{d Q}{d t} = a d_{β}^{*} Q and {\begin{cases} s (Q) = 〈 β, Q 〉 - Φ (β) \\ β = \frac{\partial s (Q)}{\partial Q} \in g, Q = \frac{\partial Φ (β)}{\partial β} \in g^{*} \end{cases}

(59)

Returning to the Koszul model of information geometry, we can then deduce an equivalent of the Euler-Poincaré equation for statistical models:

\frac{d x^{*}}{d t} = ad_{x}^{*} x^{*} and {\begin{cases} Φ^{*} (x^{*}) = 〈 x, x^{*} 〉 - Φ (x) \\ x = \frac{\partial Φ^{*} (x^{*})}{\partial x^{*}} \in Ω, x^{*} = \frac{\partial Φ (x)}{\partial x} \in Ω^{*} \end{cases}

(60)

We can use this Euler-Poincaré equation to deduce an associated equation on entropy:

\frac{d s}{d t} = 〈 \frac{d β}{d t}, Q 〉 + 〈 β, a d_{β}^{*} Q 〉 - \frac{d Φ}{d t}

that reduces to

\frac{d s}{d t} = 〈 \frac{d β}{d t}, Q 〉 - \frac{d Φ}{d t}

(61)

due to

〈 ξ, ad_{V} X 〉 = - 〈 ad_{V}^{*} ξ, X 〉 \Rightarrow 〈 β, ad_{β}^{*} Q 〉 = - 〈 Q, ad_{β} β 〉 = 0

With these new equations of thermodynamics, \frac{d Q}{d t} = ad_{β}^{*} Q and \frac{d}{d t} (Ad_{g}^{*} Q) = 0, we can observe that the important new notion is that of co-adjoint orbits, which Souriau associates to a symplectic manifold through the KKS 2-form.
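The role of co-adjoint orbits can be illustrated on the simplest non-commutative case. The sketch below is an assumption-laden toy model, not the paper's computation: it takes the Lie algebra so(3), identified with R^3 through the cross product, writes ad*_β Q as β × Q (the sign depends on the convention chosen for ad*), and uses a hypothetical quadratic function s(Q) with β = ∂s/∂Q. A classical Runge-Kutta integration of dQ/dt = ad*_β Q then keeps Q on a sphere (a co-adjoint orbit of SO(3)) and keeps s constant, because dQ/dt is orthogonal to both Q and ∂s/∂Q.

```python
import numpy as np

A = np.diag([1.0, 2.0, 3.0])    # hypothetical matrix defining the toy function s

def entropy(Q):
    """Illustrative function s(Q) = -1/2 Q^T A Q on so(3)* identified with R^3."""
    return -0.5 * Q @ A @ Q

def beta_of(Q):
    """beta = ds/dQ, the gradient of s."""
    return -A @ Q

def rhs(Q):
    """dQ/dt = ad*_beta Q, written here as beta x Q (sign convention assumed)."""
    return np.cross(beta_of(Q), Q)

def rk4_step(Q, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = rhs(Q)
    k2 = rhs(Q + 0.5 * dt * k1)
    k3 = rhs(Q + 0.5 * dt * k2)
    k4 = rhs(Q + dt * k3)
    return Q + dt / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)

Q0 = np.array([1.0, 0.5, -0.3])
Q = Q0.copy()
for _ in range(2000):
    Q = rk4_step(Q, 1e-3)

casimir_drift = abs(Q @ Q - Q0 @ Q0)            # |Q|^2 labels the co-adjoint orbit
entropy_drift = abs(entropy(Q) - entropy(Q0))   # s is constant along the flow
print(casimir_drift, entropy_drift)
```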

We will then define the Poincaré-Cartan integral invariant for Lie group thermodynamics. Classically in mechanics, the Pfaffian form

ω = p \cdot d q - H \cdot d t

is related to Poincaré-Cartan integral invariant [107]. Dedecker has observed, based on the relation [108]:

ω = \partial_{\dot{q}} L \cdot d q - (\partial_{\dot{q}} L \cdot \dot{q} - L) \cdot d t = L \cdot d t + \partial_{\dot{q}} L ϖ with ϖ = d q - \dot{q} \cdot d t

(62)

that the following property is a particular case of the more general Lepage congruence: among all forms χ \equiv L \cdot d t \mod ϖ, the form ω = p \cdot d q - H \cdot d t is the only one satisfying d χ \equiv 0 \mod ϖ.

Analogies between geometric mechanics and geometric Lie group thermodynamics provide the following similarities of structure:

{\begin{cases} \dot{q} \leftrightarrow β \\ p \leftrightarrow Q \end{cases}, {\begin{cases} L (\dot{q}) \leftrightarrow Φ (β) \\ H (p) \leftrightarrow s (Q) \\ H = p \cdot \dot{q} - L \leftrightarrow s = 〈 Q, β 〉 - Φ \end{cases} and {\begin{cases} \dot{q} = \frac{d q}{d t} = \frac{\partial H}{\partial p} \leftrightarrow β = \frac{\partial s}{\partial Q} \\ p = \frac{\partial L}{\partial \dot{q}} \leftrightarrow Q = \frac{\partial Φ}{\partial β} \end{cases}

(63)

We can then consider a similar Poincaré-Cartan-Souriau Pfaffian form:

ω = p \cdot d q - H \cdot d t \leftrightarrow ω = 〈 Q, (β \cdot d t) 〉 - s \cdot d t = (〈 Q, β 〉 - s) \cdot d t = Φ (β) \cdot d t

(64)

This analogy provides an associated Poincaré-Cartan-Souriau integral invariant. The Poincaré-Cartan integral invariant

\int_{C_{a}} p \cdot d q - H \cdot d t = \int_{C_{b}} p \cdot d q - H \cdot d t

is given for Souriau thermodynamics by:

\int_{C_{a}} Φ (β) \cdot d t = \int_{C_{b}} Φ (β) \cdot d t

(65)

We can then deduce an Euler-Poincaré-Souriau variational principle for thermodynamics. The variational principle holds on g, for variations δ β = \dot{η} + [β, η], where η (t) is an arbitrary path that vanishes at the endpoints, η (t_{0}) = η (t_{1}) = 0:

δ \int_{t_{0}}^{t_{1}} Φ (β (t)) \cdot d t = 0

(66)

6. Souriau Affine Representation of Lie Group and Lie Algebra and Comparison with the Koszul Affine Representation

This affine representation of a Lie group/algebra used by Souriau has been intensively studied by Marle [7,100,109,110]. Souriau called the mechanics deduced from this model “affine mechanics”. We will explain affine representations and associated notions such as cocycles, the Souriau moment map and its cocycles, the equivariance of the Souriau moment map, the action of a Lie group on a symplectic manifold, and dual spaces of finite-dimensional Lie algebras. We have observed that these tools were developed in parallel by Jean-Louis Koszul. We will establish close links and summarize the comparison of both approaches in a table.

6.1. Affine Representations and Cocycles

The Souriau model of Lie group thermodynamics is linked with the affine representation of a Lie group and of its Lie algebra. We give below the main elements of this affine representation.

Let G be a Lie group and E a finite-dimensional vector space. A map A : G \to Aff (E) can always be written as:

A (g) (x) = R (g) (x) + θ (g) with g \in G, x \in E

(67)

where the maps R : G \to GL (E) and θ : G \to E are determined by A. When A is a Lie group homomorphism, it is called an affine representation of G in E, and R is then a linear representation of G in E.

The map θ : G \to E is a one-cocycle of G with values in E, for the linear representation R; that is, θ is a smooth map which satisfies, for all g, h \in G:

θ (g h) = R (g) (θ (h)) + θ (g)

(68)

The linear representation R is called the linear part of the affine representation A, and θ is called the one-cocycle of G associated to the affine representation A. A one-coboundary of G with values in E, for the linear representation R, is a map θ : G \to E which can be expressed as:

θ (g) = R (g) (c) - c, g \in G

(69)

where c is a fixed element of E. The cocycle θ associated to A is a coboundary if and only if there exists an element c \in E such that, for all g \in G and x \in E:

A (g) (x) = R (g) (x + c) - c

(70)
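The cocycle and coboundary identities can be checked numerically on a small example. The sketch below assumes G = SO(2) acting on E = R^2 by rotations (group elements are represented by their angle, so the product g h corresponds to g + h) and a hypothetical fixed vector c; the function names are illustrative.

```python
import numpy as np

def R(angle):
    """Linear representation of the rotation group SO(2) on E = R^2."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s], [s, c]])

c_vec = np.array([0.7, -1.2])       # hypothetical fixed element c of E

def theta(angle):
    """One-coboundary theta(g) = R(g) c - c."""
    return R(angle) @ c_vec - c_vec

def A(angle, x):
    """Affine action A(g)(x) = R(g) x + theta(g)."""
    return R(angle) @ x + theta(angle)

g, h = 0.4, 1.1                     # two group elements (rotation angles)
x = np.array([2.0, 3.0])

cocycle_lhs = theta(g + h)                       # theta(g h)
cocycle_rhs = R(g) @ theta(h) + theta(g)         # R(g) theta(h) + theta(g)
affine_direct = A(g, x)                          # R(g) x + theta(g)
affine_shifted = R(g) @ (x + c_vec) - c_vec      # R(g)(x + c) - c
homomorphism_gap = A(g + h, x) - A(g, A(h, x))   # zero if A is a homomorphism

print(cocycle_lhs - cocycle_rhs, affine_direct - affine_shifted, homomorphism_gap)
```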

Let g be a Lie algebra and E a finite-dimensional vector space. A linear map a : g \to aff (E) can always be written as:

a (X) (x) = r (X) (x) + Θ (X) with X \in g, x \in E

(71)

where the linear maps r : g \to gl (E) and Θ : g \to E are determined by a. When a is a Lie algebra homomorphism, it is called an affine representation of g in E. The linear map Θ : g \to E is a one-cocycle of g with values in E, for the linear representation r; that is, Θ satisfies, for all X, Y \in g:

Θ ([X, Y]) = r (X) (Θ (Y)) - r (Y) (Θ (X))

(72)

Θ is called the one-cocycle of g associated to the affine representation a. A one-coboundary of g with values in E, for the linear representation r, is a linear map Θ : g \to E which can be expressed as:

Θ (X) = r (X) (c), X \in g

where c is a fixed element of E. The cocycle Θ associated to a is a coboundary if and only if there exists an element c \in E such that, for all X \in g and x \in E:

a (X) (x) = r (X) (x + c)
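The Lie algebra version can be checked in the same spirit. The sketch below assumes g = gl(3, R) acting on E = R^3 by matrix multiplication (so r(X) = X), random test elements, and a hypothetical vector c; it verifies that the coboundary Θ(X) = r(X)(c) satisfies the one-cocycle identity and that a(X)(x) = r(X)(x + c).

```python
import numpy as np

rng = np.random.default_rng(0)
X, Y = rng.normal(size=(2, 3, 3))   # two elements of gl(3, R); r(X) = X acts on R^3
c = rng.normal(size=3)              # hypothetical fixed element c of E
x = rng.normal(size=3)

def bracket(X, Y):
    """Lie bracket of gl(3, R): the matrix commutator."""
    return X @ Y - Y @ X

def Theta(X):
    """One-coboundary Theta(X) = r(X)(c)."""
    return X @ c

def a(X, x):
    """Affine representation a(X)(x) = r(X)(x) + Theta(X)."""
    return X @ x + Theta(X)

lhs = Theta(bracket(X, Y))              # Theta([X, Y])
rhs = X @ Theta(Y) - Y @ Theta(X)       # r(X) Theta(Y) - r(Y) Theta(X)

print(np.max(np.abs(lhs - rhs)))
```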

Let A : G \to Aff (E) be an affine representation of a Lie group G in a finite-dimensional vector space E, and g be the Lie algebra of G. Let R : G \to GL (E) and θ : G \to E be, respectively, the linear part and the associated cocycle of the affine representation A. Let a : g \to aff (E) be the affine representation of the Lie algebra g associated to the affine representation A of the Lie group G. The linear part of a is the linear representation r : g \to gl (E) associated to the linear representation R, and the associated cocycle Θ : g \to E is related to the one-cocycle θ : G \to E by:

Θ (X) = T_{e} θ (X (e)), X \in g

(73)

This is deduced from:

{\frac{d A (\exp (t X)) (x)}{d t} |}_{t = 0} = {\frac{d (R (\exp (t X)) (x) + θ (\exp (t X)))}{d t} |}_{t = 0} \Rightarrow a (X) (x) = r (X) (x) + T_{e} θ (X)

(74)

Let G be a connected and simply connected Lie group, R : G \to GL (E) be a linear representation of G in a finite-dimensional vector space E, and r : g \to gl (E) be the associated linear representation of its Lie algebra g. For any one-cocycle Θ : g \to E of the Lie algebra g for the linear representation r, there exists a unique one-cocycle θ : G \to E of the Lie group G for the linear representation R such that:

Θ (X) = T_{e} θ (X (e))

(75)

in other words, θ has Θ as its associated Lie algebra one-cocycle. The Lie group one-cocycle θ is a Lie group one-coboundary if and only if the Lie algebra one-cocycle Θ is a Lie algebra one-coboundary. Uniqueness follows from:

{\frac{d θ (g \exp (t X))}{d t} |}_{t = 0} = {\frac{d (θ (g) + R (g) (θ (\exp (t X))))}{d t} |}_{t = 0} \Rightarrow T_{g} θ (T L_{g} (X)) = R (g) (Θ (X))

(76)

which proves that, if it exists, the Lie group one-cocycle θ such that T_{e} θ = Θ is unique.

6.2. Souriau Moment Map and Cocycles

Souriau first introduced the moment map in his book. We will give the link with the previous cocycles of affine representations.

There exists a linear map X \mapsto J_{X} from g to the space of differentiable functions on M:

\begin{array}{l} g \to C^{\infty} (M, R) \\ X \to J_{X} \end{array}

(77)

We can then associate a differentiable map J, called the moment(um) map for the Hamiltonian Lie group action Φ:

\begin{array}{l} J : M \to g^{*} \\ x \mapsto J (x) such that J_{X} (x) = 〈 J (x), X 〉, X \in g \end{array}

(78)

Let J be a moment map. To each (X, Y) \in g \times g, we associate a smooth function \tilde{Θ} (X, Y) : M \to ℜ defined by:

\tilde{Θ} (X, Y) = J_{[X, Y]} - \{J_{X}, J_{Y}\} with \{., .\} : Poisson bracket

(79)

It is a Casimir of the Poisson algebra

C^{\infty} (M, ℜ)

, that satisfies:

\tilde{Θ} ([X, Y], Z) + \tilde{Θ} ([Y, Z], X) + \tilde{Θ} ([Z, X], Y) = 0

(80)

When the Poisson manifold is a connected symplectic manifold, the function \tilde{Θ} (X, Y) is constant on M and the map:

\tilde{Θ} : g \times g \to ℜ

(81)

is a skew-symmetric bilinear form, called the symplectic cocycle of the Lie algebra g associated to the moment map J. Let Θ : g \to g^{*} be the map such that, for all X, Y \in g:

〈 Θ (X), Y 〉 = \tilde{Θ} (X, Y)

(82)

The map Θ is therefore the one-cocycle of the Lie algebra g with values in g^{*} for the coadjoint representation X \mapsto ad_{X}^{*} of g, associated to the affine action of g on its dual:

a_{Θ} (X) (ξ) = a d_{- X}^{*} (ξ) + Θ (X), X \in g, ξ \in g^{*}

(83)

Let G be a Lie group whose Lie algebra is g. The skew-symmetric bilinear form \tilde{Θ} on g = T_{e} G can be extended into a closed differential two-form on G, since the identity on \tilde{Θ} means that its exterior differential d \tilde{Θ} vanishes. In other words, \tilde{Θ} is a 2-cocycle for the restriction of the de Rham cohomology of G to left (or right) invariant differential forms.

6.3. Equivariance of Souriau Moment Map

There exists a unique affine action a whose linear part is the coadjoint representation:

\begin{matrix} a : & G \times g^{*} \to g^{*} \\ a (g, ξ) = A d_{g^{- 1}}^{*} ξ + θ (g) \end{matrix}

(84)

with 〈 Ad_{g^{- 1}}^{*} ξ, X 〉 = 〈 ξ, Ad_{g^{- 1}} X 〉, and which induces the equivariance of the moment map J.

6.4. Action of Lie Group on a Symplectic Manifold

Let Φ : G \times M \to M be an action of a Lie group G on a differentiable manifold M. The fundamental field associated to an element X of the Lie algebra g of the group G is the vector field X_{M} on M:

X_{M} (x) = {\frac{d}{d t} Φ_{\exp (- t X)} (x) |}_{t = 0} with Φ_{g_{1}} (Φ_{g_{2}} (x)) = Φ_{g_{1} g_{2}} (x) and Φ_{e} (x) = x

(85)

Φ is Hamiltonian on a symplectic manifold M if Φ is symplectic and if, for all X \in g, the fundamental field X_{M} is globally Hamiltonian.

There is a unique action a of the Lie group G on the dual g^{*} of its Lie algebra for which the moment map J is equivariant, meaning that it satisfies, for each x \in M:

J (Φ_{g} (x)) = a (g, J (x)) = A d_{g^{- 1}}^{*} (J (x)) + θ (g)

(86)

θ : G \to g^{*} is called the cocycle associated to J. The differential T_{e} θ of this one-cocycle at the neutral element e satisfies:

〈 T_{e} θ (X), Y 〉 = \tilde{Θ} (X, Y) = J_{[X, Y]} - {J_{X}, J_{Y}}

(87)

If instead of J we take the moment map J' (x) = J (x) + μ, x \in M, where μ \in g^{*} is constant, the symplectic cocycle θ is replaced by:

θ' (g) = θ (g) + μ - A d_{g}^{*} μ

(88)

where θ' - θ = μ - Ad_{g}^{*} μ is a one-coboundary of G with values in g^{*}. Therefore, the cohomology class of the symplectic cocycle θ only depends on the Hamiltonian action Φ, not on the choice of its moment map J. We also have:

\tilde{Θ}' (X, Y) = \tilde{Θ} (X, Y) + 〈 μ, [X, Y] 〉

(89)

This property is used by Jean-Marie Souriau [10] to offer a very nice cohomological interpretation of the total mass of a classical (nonrelativistic) isolated mechanical system. He [10] proves that the space of all possible motions of the system is a symplectic manifold on which the Galilean group acts by a Hamiltonian action. The dimension of the symplectic cohomology space of the Galilean group (the quotient of the space of symplectic one-cocycles by the space of symplectic one-coboundaries) is equal to 1. The cohomology class of the symplectic cocycle associated to a moment map of the action of the Galilean group on the space of motions of the system is interpreted as the total mass of the system.

For Hamiltonian actions of a Lie group on a connected symplectic manifold, the equivariance of the moment map with respect to an affine action of the group on the dual of its Lie algebra has been proved by Marle [110]. Marle [110] has also developed the notion of symplectic cocycle and has proved that, given a Lie algebra symplectic cocycle, there exists on the associated connected and simply connected Lie group a unique corresponding Lie group symplectic cocycle. Marle [104] has also proved that there exists a two-parameter family of deformations of these actions (the Hamiltonian actions of a Lie group on its cotangent bundle obtained by lifting the actions of the group on itself by translations) into a pair of mutually symplectically orthogonal Hamiltonian actions whose moment maps are equivariant with respect to an affine action involving any given Lie group symplectic cocycle. Marle [104] has also explained why a reduction occurs for the Euler-Poincaré equation mainly when the Hamiltonian can be expressed as the moment map composed with a smooth function defined on the dual of the Lie algebra; the Euler-Poincaré equation is then equivalent to the Hamilton equation written on the dual of the Lie algebra.

6.5. Dual Spaces of Finite-Dimensional Lie Algebras

Let g be a finite-dimensional Lie algebra, and g^{*} its dual space. The Lie algebra g can be considered as the dual of g^{*}, that is, as the space of linear functions on g^{*}, and the bracket of the Lie algebra g is a composition law on this space of linear functions. This composition law can be extended to the space C^{\infty} (g^{*}, ℜ) by setting:

{f, g} (x) = 〈 x, [d f (x), d g (x)] 〉, f and g \in C^{\infty} (g^{*}, ℜ), x \in g^{*}

(90)

If we apply this formula to Souriau Lie group thermodynamics, for entropy functions s (Q) depending on the geometric heat Q, we obtain:

{s_{1}, s_{2}} (Q) = 〈 Q, [d s_{1} (Q), d s_{2} (Q)] 〉, s_{1} and s_{2} \in C^{\infty} (g^{*}, ℜ), Q \in g^{*}

(91)

This bracket on C^{\infty} (g^{*}, ℜ) defines a Poisson structure on g^{*}, called its canonical Poisson structure. It implicitly appears in the works of Sophus Lie, and was rediscovered by Alexander Kirillov [111], Bertram Kostant and Jean-Marie Souriau.
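A minimal numerical illustration of this canonical bracket is sketched below. It assumes the Lie algebra so(3) identified with R^3, with the cross product as Lie bracket, and hypothetical functions s1, s2 on the dual given through their gradients. It checks the skew-symmetry of the bracket and the fact that the function C(Q) = |Q|^2 / 2 Poisson-commutes with any other function, which is why the spheres |Q| = constant are the symplectic leaves in this example.

```python
import numpy as np

def lie_poisson(grad_f, grad_g, x):
    """{f, g}(x) = <x, [df(x), dg(x)]> on so(3)* ~ R^3, with [u, v] = u x v."""
    return x @ np.cross(grad_f(x), grad_g(x))

# Hypothetical smooth functions on the dual, given through their gradients.
M = np.diag([1.0, 2.0, 3.0])

def grad_s1(Q):
    """Gradient of s1(Q) = 1/2 Q^T M Q."""
    return M @ Q

def grad_s2(Q):
    """Gradient of s2(Q) = sin(Q_1) + Q_2 + Q_3^2."""
    return np.array([np.cos(Q[0]), 1.0, 2.0 * Q[2]])

def grad_casimir(Q):
    """Gradient of C(Q) = 1/2 |Q|^2."""
    return Q

Q = np.array([0.3, -1.1, 0.8])
b12 = lie_poisson(grad_s1, grad_s2, Q)
b21 = lie_poisson(grad_s2, grad_s1, Q)
b_casimir = lie_poisson(grad_casimir, grad_s2, Q)

print(b12 + b21, b_casimir)   # both vanish up to rounding
```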

The canonical Poisson structure on g^{*} defined above can be modified by means of a symplectic cocycle \tilde{Θ} by defining the new bracket:

{f, g}_{\tilde{Θ}} (x) = 〈 x, [d f (x), d g (x)] 〉 - \tilde{Θ} (d f (x), d g (x))

(92)

with \tilde{Θ} a symplectic cocycle of the Lie algebra g, that is, a skew-symmetric bilinear map \tilde{Θ} : g \times g \to ℜ which satisfies:

\tilde{Θ} ([X, Y], Z) + \tilde{Θ} ([Y, Z], X) + \tilde{Θ} ([Z, X], Y) = 0

(93)

This Poisson structure is called the modified canonical Poisson structure by means of the symplectic cocycle \tilde{Θ}. The symplectic leaves of g^{*} equipped with this Poisson structure are the orbits of an affine action whose linear part is the coadjoint action, with an additional term determined by \tilde{Θ}.

6.6. Koszul Affine Representation of Lie Group and Lie Algebra

Previously, we have developed Souriau’s works on the affine representation of a Lie group used to elaborate the Lie group thermodynamics. We will study here another approach to the affine representation of Lie groups and Lie algebras, introduced by Jean-Louis Koszul, and consolidate the link between Koszul's work and the Souriau model. This model uses an affine representation of a Lie group and of a Lie algebra in a finite-dimensional vector space, seen as special examples of actions.

Since the work of Henri Poincaré and Élie Cartan, the theory of differential forms has become an essential instrument of modern differential geometry [112,113,114,115], used by Jean-Marie Souriau for identifying the space of motions as a symplectic manifold. However, as said by Paulette Libermann [116], except for Henri Poincaré, who wrote shortly before his death a report on the work of Élie Cartan during his application for the Sorbonne University, the French mathematicians did not see the importance of Cartan’s breakthroughs. Souriau followed lectures of Élie Cartan in 1945. The second student of Élie Cartan was Jean-Louis Koszul. Koszul introduced the concepts of affine spaces, affine transformations and affine representations [117,118,119,120,121,122,123,124]. We are especially interested in Koszul’s definition of affine representations of Lie groups and Lie algebras. Koszul studied symmetric homogeneous spaces, related invariant flat affine connections to affine representations of Lie algebras, and characterized invariant Hessian metrics by affine representations of Lie algebras [117,118,119,120,121,122,123,124]. Koszul provided a correspondence between symmetric homogeneous spaces with invariant Hessian structures by using affine representations of Lie algebras, and proved that a simply connected symmetric homogeneous space with invariant Hessian structure is a direct product of a Euclidean space and a homogeneous self-dual regular convex cone [117,118,119,120,121,122,123,124]. Let G be a connected Lie group and let G/K be a homogeneous space on which G acts effectively. Koszul gave a bijective correspondence between the set of G-invariant flat connections on G/K and the set of a certain class of affine representations of the Lie algebra of G [117,118,119,120,121,122,123,124].

The main theorem of Koszul is the following: let G/K be a homogeneous space of a connected Lie group G and let g and k be the Lie algebras of G and K. If G/K is endowed with a G-invariant flat connection, then g admits an affine representation (f, q) on the vector space E. Conversely, if G is simply connected and g is endowed with an affine representation, then G/K admits a G-invariant flat connection.

Koszul has proved the following [117,118,119,120,121,122,123,124]. Let Ω be a convex domain in R^{n} containing no complete straight lines, with the associated convex cone V (Ω) = {(λ x, λ) \in R^{n} \times R / x \in Ω, λ \in R^{+}}. Then there exists an affine embedding:

ℓ : x \in Ω \mapsto [\begin{matrix} x \\ 1 \end{matrix}] \in V (Ω)

(94)

If we consider the group homomorphism η of A (n, R) into GL (n + 1, R) given by:

s \in A (n, R) \mapsto [\begin{matrix} f (s) & q (s) \\ 0 & 1 \end{matrix}] \in G L (n + 1, R)

(95)

and associated affine representation of Lie algebra:

[\begin{matrix} f & q \\ 0 & 0 \end{matrix}]

(96)

with A (n, R) the group of all affine transformations of R^{n}, then we have η (G (Ω)) \subset G (V (Ω)), and the pair (η, ℓ) of the homomorphism η : G (Ω) \to G (V (Ω)) and the map ℓ : Ω \to V (Ω) is equivariant.

A Hessian structure (D, g) on a homogeneous space G/K is said to be an invariant Hessian structure if both D and g are G-invariant. A homogeneous space G/K with an invariant Hessian structure (D, g) is called a homogeneous Hessian manifold and is denoted by (G/K, D, g). Another result of Koszul is that a homogeneous self-dual regular convex cone is characterized as a simply connected symmetric homogeneous space admitting an invariant Hessian structure that is defined by the positive definite second Koszul form (we have identified in a previous paper that this second Koszul form is related to the Fisher metric). In parallel, Vinberg [125,126] gave a realization of a homogeneous regular convex domain as a real Siegel domain. Koszul has observed that regular convex cones admit canonical Hessian structures, improving some results of Pyateckii-Shapiro, who studied realizations of homogeneous bounded domains by considering Siegel domains in connection with automorphic forms. Koszul defined a characteristic function ψ_{Ω} of a regular convex cone Ω, and showed that g = D d \log ψ_{Ω} is a Hessian metric on Ω invariant under affine automorphisms of Ω. If Ω is a homogeneous self-dual cone, then the gradient mapping is a symmetry with respect to the canonical Hessian metric, and Ω is a symmetric homogeneous Riemannian manifold. More information on Koszul Hessian geometry can be found in [127,128,129,130,131,132,133,134,135,136].

We will now focus our attention on the Koszul affine representation of a Lie group/algebra. Let G be a connected Lie group and E a real or complex vector space of finite dimension. Koszul has introduced an affine representation of G in E such that [117,118,119,120,121,122,123,124]:

\begin{array}{l} E \to E \\ a \mapsto s a \forall s \in G \end{array}

(97)

is an affine transformation. We denote by A (E) the set of all affine transformations of a vector space E, a Lie group called the affine transformation group of E. The set GL (E) of all regular linear transformations of E is a subgroup of A (E).

We define a linear representation from G to GL (E):

\begin{array}{l} f : G \to G L (E) \\ s \mapsto f (s) a = s a - s o \forall a \in E \end{array}

(98)

and a map from G to E:

\begin{array}{l} q : G \to E \\ s \mapsto q (s) = s o \forall s \in G \end{array}

(99)

Then we have

\forall s, t \in G

f (s) q (t) + q (s) = q (s t)

(100)

deduced from

f (s) q (t) + q (s) = s q (t) - s o + s o = s q (t) = s t o = q (s t)

Conversely, if a map q from G to E and a linear representation f from G to GL (E) verify the previous equation, then we can define an affine representation of G in E, written (f, q):

A f f (s) : a \mapsto s a = f (s) a + q (s) \forall s \in G, \forall a \in E

(101)

The condition f (s) q (t) + q (s) = q (s t) is equivalent to requiring the following mapping to be a homomorphism:

A f f : s \in G \mapsto A f f (s) \in A (E)

(102)
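The equivalence between this condition and the homomorphism property can be made concrete with homogeneous matrices. The sketch below assumes E = R^2 and random test data; the helper `aff_matrix` is a hypothetical name for the block matrix with f in the upper-left corner, q in the last column and (0, 1) in the last row. Multiplying two such matrices reproduces the composition rule: the linear parts multiply and the translation part is f(s) q(t) + q(s).

```python
import numpy as np

def aff_matrix(f, q):
    """Homogeneous (n+1) x (n+1) matrix [[f, q], [0, 1]] of the map a -> f a + q."""
    n = q.size
    M = np.eye(n + 1)
    M[:n, :n] = f
    M[:n, n] = q
    return M

rng = np.random.default_rng(1)
f_s, f_t = rng.normal(size=(2, 2, 2))   # linear parts f(s), f(t) (random test data)
q_s, q_t = rng.normal(size=(2, 2))      # translation parts q(s), q(t)
a = rng.normal(size=2)                  # a point of E = R^2

Aff_s, Aff_t = aff_matrix(f_s, q_s), aff_matrix(f_t, q_t)
Aff_st = Aff_s @ Aff_t                  # matrix of the composed affine map
f_st, q_st = Aff_st[:2, :2], Aff_st[:2, 2]

print(np.max(np.abs(q_st - (f_s @ q_t + q_s))))   # zero up to rounding
```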

We again write f for the linear representation of the Lie algebra g of G defined by f, and q for the restriction to g of the differential of q (that is, f and q also denote the differentials of f and q, respectively). Koszul has proved that:

\begin{array}{l} f (X) q (Y) - f (Y) q (X) = q ([X, Y]) \forall X, Y \in g \\ with f : g \to g l (E) and q : g \to E \end{array}

(103)

where gl (E), the set of all linear endomorphisms of E, is the Lie algebra of GL (E).

Using the computation:

q (A d_{s} Y) = {\frac{d q (s \cdot e^{t Y} \cdot s^{- 1})}{d t} |}_{t = 0} = f (s) f (Y) q (s^{- 1}) + f (s) q (Y)

(104)

We can obtain:

q ([X, Y]) = {\frac{d q (Ad_{e^{t X}} Y)}{d t} |}_{t = 0} = f (X) f (Y) q (e) + f (e) f (Y) (- q (X)) + f (X) q (Y)

(105)

where e is the unit element in G. Since f (e) is the identity mapping and q (e) = 0, we have the equality:

f (X) q (Y) - f (Y) q (X) = q ([X, Y])

A pair (f, q) of a linear representation f of a Lie algebra g on E and a linear mapping q from g to E is an affine representation of g on E if it satisfies:

f (X) q (Y) - f (Y) q (X) = q ([X, Y])

Conversely, if we assume that g admits an affine representation (f, q) on E, then, using an affine coordinate system {x^{1}, ..., x^{n}} on E, we can express the affine mapping v \mapsto f (X) v + q (X) by an (n + 1) \times (n + 1) matrix representation:

a f f (X) = [\begin{matrix} f (X) & q (X) \\ 0 & 0 \end{matrix}]

(106)

where f (X) is an n \times n matrix and q (X) is a column vector of size n. X \mapsto aff (X) is an injective Lie algebra homomorphism from g into the Lie algebra gl (n + 1, R) of all (n + 1) \times (n + 1) matrices:

\begin{array}{l} g \to g l (n + 1, R) \\ X \mapsto a f f (X) \end{array}

(107)
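The homomorphism property of X ↦ aff(X) can be read directly on the matrix commutator. The sketch below assumes E = R^2 and random test matrices standing for f(X), f(Y), q(X), q(Y); the helper `aff` is a hypothetical name for the block matrix with last row equal to zero. The commutator of two such matrices has linear block [f(X), f(Y)] and translation column f(X) q(Y) - f(Y) q(X), which is exactly the pair of conditions required of an affine representation.

```python
import numpy as np

def aff(fX, qX):
    """(n+1) x (n+1) matrix [[f(X), q(X)], [0, 0]] of an infinitesimal affine map."""
    n = qX.size
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = fX
    M[:n, n] = qX
    return M

rng = np.random.default_rng(2)
fX, fY = rng.normal(size=(2, 2, 2))   # random test data standing for f(X), f(Y)
qX, qY = rng.normal(size=(2, 2))      # random test data standing for q(X), q(Y)

comm = aff(fX, qX) @ aff(fY, qY) - aff(fY, qY) @ aff(fX, qX)
linear_block = comm[:2, :2]           # expected: f(X) f(Y) - f(Y) f(X)
translation_column = comm[:2, 2]      # expected: f(X) q(Y) - f(Y) q(X)

print(np.max(np.abs(translation_column - (fX @ qY - fY @ qX))))
```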

If we denote g_{aff} = aff (g), we write G_{aff} for the linear Lie subgroup of GL (n + 1, R) generated by g_{aff}. An element s \in G_{aff} is expressed by:

A f f (s) = [\begin{matrix} f (s) & q (s) \\ 0 & 1 \end{matrix}]

(108)

Let M_{aff} be the orbit of G_{aff} through the origin o; then M_{aff} = q (G_{aff}) = G_{aff} / K_{aff}, where K_{aff} = {s \in G_{aff} / q (s) = 0} = Ker (q).

Example.

Let Ω be a convex domain in R^{n} containing no complete straight lines. We define a convex cone V (Ω) in R^{n + 1} = R^{n} \times R by V (Ω) = {(λ x, λ) \in R^{n} \times R / x \in Ω, λ \in R^{+}}. Then there exists an affine embedding:

ℓ : x \in Ω \mapsto [\begin{matrix} x \\ 1 \end{matrix}] \in V (Ω)

(109)

If we consider the group homomorphism η of A (n, R) into GL (n + 1, R) given by:

s \in A (n, R) \mapsto [\begin{matrix} f (s) & q (s) \\ 0 & 1 \end{matrix}] \in G L (n + 1, R)

(110)

with A (n, R) the group of all affine transformations of R^{n}, then we have η (G (Ω)) \subset G (V (Ω)), and the pair (η, ℓ) of the homomorphism η : G (Ω) \to G (V (Ω)) and the map ℓ : Ω \to V (Ω) is equivariant:

ℓ \circ s = η (s) \circ ℓ and d ℓ \circ s = η (s) \circ d ℓ

(111)

6.7. Comparison of Koszul and Souriau Affine Representation of Lie Group and Lie Algebra

We will compare, in the following Table 1, affine representation of Lie group and Lie algebra from Souriau and Koszul approaches:

6.8. Additional Elements on Koszul Affine Representation of Lie Group and Lie Algebra

Let {x^{1}, x^{2}, ..., x^{n}} be a local coordinate system on M. The Christoffel symbols Γ_{i j}^{k} of the connection D are defined by:

D_{\frac{\partial}{\partial x^{i}}} \frac{\partial}{\partial x^{j}} = \sum_{k = 1}^{n} Γ_{i j}^{k} \frac{\partial}{\partial x^{k}}

(112)

The torsion tensor T of D is given by:

T (X, Y) = D_{X} Y - D_{Y} X - [X, Y]

(113)

T (\frac{\partial}{\partial x^{i}}, \frac{\partial}{\partial x^{j}}) = \sum_{k = 1}^{n} T_{i j}^{k} \frac{\partial}{\partial x^{k}} with T_{i j}^{k} = Γ_{i j}^{k} - Γ_{j i}^{k}

(114)

The curvature tensor R of D is given by:

R (X, Y) Z = D_{X} D_{Y} Z - D_{Y} D_{X} Z - D_{[X, Y]} Z

(115)

R (\frac{\partial}{\partial x^{k}}, \frac{\partial}{\partial x^{l}}) \frac{\partial}{\partial x^{j}} = \sum_{i} R_{j k l}^{i} \frac{\partial}{\partial x^{i}} with R_{j k l}^{i} = \frac{\partial Γ_{l j}^{i}}{\partial x^{k}} - \frac{\partial Γ_{k j}^{i}}{\partial x^{l}} + \sum_{m} (Γ_{l j}^{m} Γ_{k m}^{i} - Γ_{k j}^{m} Γ_{l m}^{i})

(116)

The Ricci tensor Ric of D is given by:

R i c (Y, Z) = T r {X \to R (X, Y) Z}

(117)

R_{j k} = R i c (\frac{\partial}{\partial x^{j}}, \frac{\partial}{\partial x^{k}}) = \sum_{i} R_{k i j}^{i}

(118)

In the following, we consider a homogeneous space G/K endowed with a G-invariant flat connection D (a homogeneous flat manifold), written (G/K, D). Koszul proved a bijective correspondence between the set of G-invariant flat connections on G/K and the set of affine representations of the Lie algebra of G. Let (G, K) be a pair consisting of a connected Lie group

G

and its closed subgroup

K

. Let

g

be the Lie algebra of

G

and

k

be the Lie subalgebra of

g

corresponding to

K

. For

X \in g

,

X^{*}

denotes the vector field on

M = G / K

induced by the 1-parameter group of transformations

e^{- t X}

. We denote

A_{X^{*}} = L_{X^{*}} - D_{X^{*}}

, with

L_{X^{*}}

the Lie derivative.

Let

V

be the tangent space of

G / K

at

o = {K}

, and consider the following values at

o

:

f (X) = A_{X^{*}, o}

(119)

q (X) = X_{o}^{*}

(120)

where

A_{X^{*}} Y^{*} = - D_{Y^{*}} X^{*}

(where

D

is a locally flat linear connection: its torsion and curvature tensors vanish identically), then:

f ([X, Y]) = [f (X), f (Y)]

(121)

f (X) q (Y) - f (Y) q (X) = q ([X, Y])

(122)

where

\ker (q) = k

, and

(f, q)

is an affine representation of the Lie algebra

g

, with associated vector field on V:

\forall X \in g, X_{a} = \sum_{i} (\sum_{j} f {(X)}_{j}^{i} x^{j} + q {(X)}^{i}) \frac{\partial}{\partial x^{i}}

(123)

The 1-parameter transformation group generated by

X_{a}

is an affine transformation group of V, with linear parts given by

e^{- t . f (X)}

and translation vector parts:

\sum_{n = 1}^{\infty} \frac{{(- t)}^{n}}{n!} f {(X)}^{n - 1} q (X)

(124)
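The linear part and the translation series (124) can be checked against the one-parameter group of the homogeneous matrix [[f, q], [0, 0]]. The following is a minimal sketch under stated assumptions: the sample values of `f`, `q`, and `t` are arbitrary, and `expm_series` is a hypothetical helper that sums a truncated power series rather than calling a library routine.

```python
import numpy as np

def expm_series(a, terms=40):
    """Truncated power series of the matrix exponential (fine for small matrices)."""
    result = np.eye(a.shape[0])
    term = np.eye(a.shape[0])
    for k in range(1, terms):
        term = term @ a / k
        result = result + term
    return result

def translation_part(f, q, t, terms=40):
    """Series (124): sum over n >= 1 of (-t)^n / n! * f^(n-1) q."""
    total = np.zeros_like(q)
    power_q = q.copy()   # holds f^(n-1) q
    coeff = 1.0          # holds (-t)^n / n!
    for n in range(1, terms):
        coeff = coeff * (-t) / n
        total = total + coeff * power_q
        power_q = f @ power_q
    return total

f = np.array([[0.2, -0.5], [0.4, 0.1]])
q = np.array([1.0, 2.0])
t = 0.8

# Homogeneous generator [[f, q], [0, 0]] and the group element exp(-t * generator).
generator = np.zeros((3, 3))
generator[:2, :2] = f
generator[:2, 2] = q
group_element = expm_series(-t * generator)

linear_part = expm_series(-t * f)
shift = translation_part(f, q, t)
```

The upper-left block of `group_element` matches `linear_part` and its last column matches `shift`, so the affine map x -> linear_part x + shift is the time-t element of the one-parameter group generated by the vector field x -> -(f x + q).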

These relations are proved by using:

{\begin{cases} A_{X^{*}} Y^{*} - A_{Y^{*}} X^{*} = [X^{*}, Y^{*}] \\ [A_{X^{*}}, A_{Y^{*}}] = A_{[X^{*}, Y^{*}]} \end{cases} with A_{X^{*}} Y^{*} = - D_{Y^{*}} X^{*}

(125)

based on the property that the connection D is locally flat, so that there are local coordinate systems on M such that

D_{\frac{\partial}{\partial x^{i}}} \frac{\partial}{\partial x^{j}} = 0

with a vanishing torsion and curvature:

T (X, Y) = 0 \Rightarrow D_{X} Y - D_{Y} X = [X, Y]

(126)

R (X, Y) Z = 0 \Rightarrow D_{X} D_{Y} Z - D_{Y} D_{X} Z = D_{[X, Y]} Z

(127)

Both identities follow from the fact that D is a locally flat linear connection (vanishing torsion and curvature).

Let

ω

be an invariant volume element on

G / K

, expressed in an affine local coordinate system

{x^{1}, x^{2}, ..., x^{n}}

in a neighborhood of

o

as:

ω = Φ \cdot d x^{1} \land ... \land d x^{n}

(128)

We can write

X^{*} = \sum_{i} χ^{i} \frac{\partial}{\partial x^{i}}

and develop the Lie derivative of the volume element

ω

L_{X^{*}} ω = (L_{X^{*}} Φ) . d x^{1} \land ... \land d x^{n} + \sum_{j} Φ . d x^{1} \land \dots \land L_{X^{*}} d x^{j} \land \dots \land d x^{n} = (X^{*} Φ + (\sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}) Φ) d x^{1} \land ... \land d x^{n}

(129)

Since the volume element

ω

is invariant by G:

L_{X^{*}} ω = 0 \Rightarrow X^{*} Φ + (\sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}) Φ = 0 \Rightarrow X^{*} \log Φ = - \sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}

(130)

By using

A_{X^{*}} Y^{*} = - D_{Y^{*}} X^{*}

, we have:

(D_{\frac{\partial}{\partial x^{i}}} (A_{X^{*}})) (\frac{\partial}{\partial x^{j}}) = D_{\frac{\partial}{\partial x^{i}}} (A_{X^{*}} (\frac{\partial}{\partial x^{j}})) - A_{X^{*}} (D_{\frac{\partial}{\partial x^{i}}} \frac{\partial}{\partial x^{j}}) = - D_{\frac{\partial}{\partial x^{i}}} D_{\frac{\partial}{\partial x^{j}}} (\sum_{k} χ^{k} \frac{\partial}{\partial x^{k}}) = - \sum_{k} \frac{\partial^{2} χ^{k}}{\partial x^{i} \partial x^{j}} \frac{\partial}{\partial x^{k}}

(131)

But as D is locally flat and

X^{*}

is an infinitesimal affine transformation with respect to D:

D_{\frac{\partial}{\partial x^{i}}} (A_{X^{*}}) = 0 \Rightarrow \frac{\partial^{2} χ^{k}}{\partial x^{i} \partial x^{j}} = 0

(132)

The Koszul form and canonical bilinear form are given by:

α = \sum_{i} \frac{\partial \log Φ}{\partial x^{i}} d x^{i} = D \log Φ

(133)

D α = \sum_{i, j} \frac{\partial^{2} \log Φ}{\partial x^{i} \partial x^{j}} d x^{i} d x^{j} = D d \log Φ

(134)

L_{X^{*}} α = L_{X^{*}} D \log Φ = D L_{X^{*}} \log Φ = D X^{*} \log Φ = - D (\sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}) = - \sum_{i, j} \frac{\partial^{2} χ^{j}}{\partial x^{i} \partial x^{j}} d x^{i} = 0

(135)

Then,

L_{X^{*}} α = 0 \forall X \in g

. By using

X^{*} \log Φ = - \sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}

, we can obtain:

α (X^{*}) = (D \log Φ) (X^{*}) \underset{L_{X^{*}} α = 0}{\Rightarrow} D_{X^{*}} \log Φ = - \sum_{j} \frac{\partial χ^{j}}{\partial x^{j}}

(136)

By using

A_{X^{*}} Y^{*} = - D_{Y^{*}} X^{*}

, we can develop:

A_{X^{*}} (\frac{\partial}{\partial x^{j}}) = - D_{\frac{\partial}{\partial x^{j}}} X^{*} = - \sum_{i} \frac{\partial χ^{i}}{\partial x^{j}} \frac{\partial}{\partial x^{i}}

(137)

With

f (X) = A_{X^{*}, o}

and

q (X) = X_{o}^{*}

, we obtain:

T r (f (X)) = T r (A_{X^{*}, o}) = - \sum_{i} \frac{\partial χ^{i}}{\partial x^{i}} (o) = α (X_{o}^{*}) = α_{o} (q (X))

(138)

If we use that

L_{X^{*}} α = 0 \forall X \in g

, then we obtain:

(D α) (X^{*}, Y^{*}) = (D_{Y^{*}} α) (X^{*}) = - (A_{Y^{*}} α) (X^{*}) = - A_{Y^{*}} (α (X^{*})) + α (A_{Y^{*}} X^{*}) = α (A_{Y^{*}} X^{*})

(139)

D α_{o} (q (X), q (Y)) = α_{o} (f (Y) q (X))

(140)

To summarize the result proved by Jean-Louis Koszul, if

α_{o}

and

D α_{o}

are the values of

α

and

D α

at

o

, then:

α_{o} (q (X)) = T r (f (X)) \forall X \in g

(141)

D α_{o} (q (X), q (Y)) = {〈 q (X), q (Y) 〉}_{o} = α_{o} (f (X) q (Y)) \forall X, Y \in g

(142)

Jean-Louis Koszul has also proved that the inner product

〈 ., . 〉

on V, given by the Riemannian metric

g_{i j}

, satisfies the following conditions:

〈 f (X) q (Y), q (Z) 〉 + 〈 q (Y), f (X) q (Z) 〉 = 〈 f (Y) q (X), q (Z) 〉 + 〈 q (X), f (Y) q (Z) 〉

(143)

To make the link with the Souriau model of thermodynamics, the first Koszul form

α = D \log Φ = T r (f (X))

will play the role of the geometric heat

Q

and the second Koszul form

D α = D d \log Φ = {〈 q (X), q (Y) 〉}_{o}

will be the equivalent of the Souriau-Fisher metric, which is G-invariant.

Koszul theory is wider and integrates “information geometry” in its corpus. Koszul [117,118,119,120,121,122,123,124] has proved general results; for example, on a complex homogeneous space, an invariant volume defines, together with the complex structure, an invariant Hermitian form. If this space is a bounded domain, then this Hermitian form is positive definite and coincides with the classical Bergman metric of this domain. During his stay at the Institute for Advanced Study in Princeton, Koszul [117,118,119,120,121,122,123,124] also demonstrated the converse for a class of complex homogeneous spaces, defined by open orbits of complex affine transformation groups. Koszul and Vey [137,138] have also developed extended results, with the following theorem for connected Hessian manifolds:

Theorem 3 (Koszul-Vey Theorem).

Let

M

be a connected hessian manifold with hessian metric

g

. Suppose that M admits a closed 1-form

α

such that

D α = g

and there exists a group

G

of affine automorphisms of

M

preserving

α

. Then:

If $M / G$ is quasi-compact, then the universal covering manifold of M is affinely isomorphic to a convex domain $Ω$ of an affine space not containing any full straight line.
If $M / G$ is compact, then $Ω$ is a sharp convex cone.

On this basis, Koszul has given a Lie group construction of a homogeneous cone that has been developed and applied in information geometry by Shima and Boyom in the framework of Hessian geometry. The results of Koszul are also fundamental in the framework of Souriau thermodynamics.

7. Souriau Lie Group Model and Koszul Hessian Geometry Applied in the Context of Information Geometry for Multivariate Gaussian Densities

We will shed light on the Souriau model with Koszul Hessian geometry applied in information geometry [117,118,119,120,121,122,123,124], recently studied in [3,9,139]. We have previously shown that information geometry could be founded on the notion of the Koszul-Vinberg characteristic function

ψ_{Ω} (x) = \int_{Ω^{*}} e^{- 〈 x, ξ 〉} d ξ, \forall x \in Ω

where Ω is a convex cone and Ω^∗ is the dual cone with respect to the Cartan-Killing inner product

〈 x, y 〉 = - B (x, θ (y))

invariant by automorphisms of Ω, with

B (., .)

the Killing form and

θ (.)

the Cartan involution. We can develop the Koszul characteristic function:

ψ_{Ω} (x + λ u) = ψ_{Ω} (x) - λ 〈 x^{*}, u 〉 + \frac{λ^{2}}{2} 〈 K (x) u, u 〉 + ...

(144)

with x^{*} = \frac{d Φ (x)}{d x}, Φ (x) = - \log ψ_{Ω} (x) and K (x) = \frac{d^{2} Φ (x)}{d x^{2}}

(145)

This characteristic function is the cornerstone of the modern concept of information geometry, defining the Koszul density as the solution of the maximum Koszul-Shannon entropy problem [140]:

\underset{p}{\max} [- \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) \cdot d ξ] such that \int_{Ω^{*}} p_{\hat{ξ}} (ξ) d ξ = 1 and \int_{Ω^{*}} ξ \cdot p_{\hat{ξ}} (ξ) d ξ = \hat{ξ}

(146)

\begin{array}{l} p_{\hat{ξ}} (ξ) = \frac{e^{- 〈 Θ^{- 1} (\hat{ξ}), ξ 〉}}{\int_{Ω^{*}} e^{- 〈 Θ^{- 1} (\hat{ξ}), ξ 〉} \cdot d ξ} \\ \hat{ξ} = Θ (β) = \frac{\partial Φ (β)}{\partial β} where Φ (β) = - \log ψ_{Ω} (β) \\ ψ_{Ω} (β) = \int_{Ω^{*}} e^{- 〈 β, ξ 〉} d ξ, S (\hat{ξ}) = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) \cdot d ξ and β = Θ^{- 1} (\hat{ξ}) \\ S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β) \end{array}

(147)

This last relation is a Legendre transform between the logarithm of characteristic function and the entropy:

\begin{array}{l} \log p_{\hat{ξ}} (ξ) = - 〈 ξ, β 〉 + Φ (β) \\ S (\hat{ξ}) = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \cdot \log p_{\hat{ξ}} (ξ) \cdot d ξ = - E [\log p_{\hat{ξ}} (ξ)] \\ S (\hat{ξ}) = 〈 E [ξ], β 〉 - Φ (β) = 〈 \hat{ξ}, β 〉 - Φ (β) \end{array}

(148)

The inversion

Θ^{- 1} (\hat{ξ})

is given by the Legendre transform based on the property that the Koszul-Shannon entropy is given by the Legendre transform of minus the logarithm of the characteristic function:

S (\hat{ξ}) = 〈 β, \hat{ξ} 〉 - Φ (β) with Φ (β) = - \log \int_{Ω^{*}} e^{- 〈 ξ, β 〉} d ξ \forall β \in Ω and \forall ξ, \hat{ξ} \in Ω^{*}

(149)
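To make the Legendre structure of (146)-(149) concrete, the following minimal sketch treats the simplest one-dimensional case, assuming the cone is the half-line (0, +infinity) with the ordinary product as inner product. In that case the characteristic function is 1/β and the Koszul density is the exponential law β exp(-βξ). The quadrature grid and the value of `beta` are illustrative assumptions.

```python
import numpy as np

beta = 1.7

# Midpoint quadrature grid on [0, 60 / beta]; the tail beyond is negligible.
n_pts = 400_000
upper = 60.0 / beta
dxi = upper / n_pts
xi = (np.arange(n_pts) + 0.5) * dxi

psi = np.sum(np.exp(-beta * xi)) * dxi        # characteristic function, close to 1 / beta
phi = -np.log(psi)                            # Phi(beta) = -log psi(beta)
log_p = -beta * xi + phi                      # log of the Koszul density
p = np.exp(log_p)

xi_hat = np.sum(xi * p) * dxi                 # mean, close to 1 / beta
entropy_integral = -np.sum(p * log_p) * dxi   # Shannon entropy computed as an integral
entropy_legendre = xi_hat * beta - phi        # Legendre form <xi_hat, beta> - Phi(beta)
```

The two entropy values coincide and are close to 1 - log(β), the known entropy of an exponential law with rate β.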

We can observe the fundamental property that

E [S (ξ)] = S (E [ξ]), ξ \in Ω^{*}

, and also, as observed by Maurice Fréchet, that “distinguished functions” (densities with estimator reaching the Fréchet-Darmois bound) are solutions of the Clairaut equation introduced by Alexis Clairaut in 1734 [141], as illustrated in Figure 8:

S (\hat{ξ}) = 〈 Θ^{- 1} (\hat{ξ}), \hat{ξ} 〉 - Φ [Θ^{- 1} (\hat{ξ})] \forall \hat{ξ} \in {Θ (β) / β \in Ω}

(150)

Details of Fréchet’s elaboration of this Clairaut(-Legendre) equation for “distinguished functions” are given in Appendix A, and other elements are available in Fréchet’s papers [141,142,143,144].

In this structure, the Fisher metric

I (x)

naturally gives rise to a Koszul Hessian geometry [145,146], if we observe that

\begin{array}{l} \log p_{\hat{ξ}} (ξ) = - 〈 ξ, β 〉 + Φ (β) \\ S (\hat{ξ}) = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \cdot \log p_{\hat{ξ}} (ξ) \cdot d ξ = - E [\log p_{\hat{ξ}} (ξ)] \\ S (\hat{ξ}) = 〈 E [ξ], β 〉 - Φ (β) = 〈 \hat{ξ}, β 〉 - Φ (β) \end{array}

(151)

Then we can recover the relation with Fisher metric:

\begin{array}{l} I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}] = - E [\frac{\partial^{2} (- 〈 ξ, β 〉 + Φ (β))}{\partial β^{2}}] = - \frac{\partial^{2} Φ (β)}{\partial β^{2}} \\ \hat{ξ} = \frac{\partial Φ (β)}{\partial β} \\ I (β) = E [\frac{\partial \log p_{β} (ξ)}{\partial β} {\frac{\partial \log p_{β} (ξ)}{\partial β}}^{T}] = E [(ξ - \hat{ξ}) {(ξ - \hat{ξ})}^{T}] = E [ξ^{2}] - E {[ξ]}^{2} = V a r (ξ) \end{array}

(152)

with the Crouzeix relation established in 1977 [147,148],

\frac{\partial^{2} Φ}{\partial β^{2}} = {[\frac{\partial^{2} S}{\partial {\hat{ξ}}^{2}}]}^{- 1}

giving the dual metric, in dual space, where entropy

S

and (minus) logarithm of characteristic function,

Φ

, are dual potential functions.
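The identities relating the Fisher information, the variance, and the Crouzeix relation can be checked by finite differences in a one-dimensional case. This is a minimal sketch, assuming the half-line cone for which Φ(β) = log β and the density is β exp(-βξ); the step size and the value of `beta` are arbitrary illustrative choices.

```python
import numpy as np

def phi(beta):
    """Phi(beta) = -log psi(beta), with psi(beta) = 1 / beta on the half-line."""
    return np.log(beta)

def entropy(xi_hat):
    """Legendre transform <xi_hat, beta> - Phi(beta) evaluated at beta = 1 / xi_hat."""
    beta = 1.0 / xi_hat
    return xi_hat * beta - phi(beta)

def second_derivative(func, x, h=1e-4):
    """Central finite-difference estimate of the second derivative."""
    return (func(x + h) - 2.0 * func(x) + func(x - h)) / h**2

beta = 1.7
xi_hat = 1.0 / beta

hess_phi = second_derivative(phi, beta)       # d^2 Phi / d beta^2
hess_s = second_derivative(entropy, xi_hat)   # d^2 S / d xi_hat^2
fisher = -hess_phi                            # Fisher information

# Variance of p(xi) = beta * exp(-beta * xi) by midpoint quadrature.
n_pts = 400_000
upper = 60.0 / beta
dxi = upper / n_pts
xi = (np.arange(n_pts) + 0.5) * dxi
p = beta * np.exp(-beta * xi)
variance = np.sum((xi - xi_hat) ** 2 * p) * dxi
```

Here `fisher` agrees with `variance`, and the product `hess_phi * hess_s` is close to 1, which is the scalar form of the Crouzeix relation.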

The first metric of information geometry [149,150], the Fisher metric, is given by the Hessian of the logarithm of the characteristic function:

I (β) = - E [\frac{\partial^{2} \log p_{β} (ξ)}{\partial β^{2}}] = - \frac{\partial^{2} Φ (β)}{\partial β^{2}} = \frac{\partial^{2} \log ψ_{Ω} (β)}{\partial β^{2}}

(153)

d s_{g}^{2} = d β^{T} I (β) d β = \sum_{i j} g_{i j} d β_{i} d β_{j} with g_{i j} = {[I (β)]}_{i j}

(154)

The second metric of information geometry is given by hessian of the Shannon entropy:

\frac{\partial^{2} S (\hat{ξ})}{\partial {\hat{ξ}}^{2}} = {[\frac{\partial^{2} Φ (β)}{\partial β^{2}}]}^{- 1} with S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β)

(155)

d s_{h}^{2} = d {\hat{ξ}}^{T} [\frac{\partial^{2} S (\hat{ξ})}{\partial {\hat{ξ}}^{2}}] d \hat{ξ} = \sum_{i j} h_{i j} d {\hat{ξ}}_{i} d {\hat{ξ}}_{j} with h_{i j} = {[\frac{\partial^{2} S (\hat{ξ})}{\partial {\hat{ξ}}^{2}}]}_{i j}

(156)

Both metrics will provide the same distance:

d s_{g}^{2} = d s_{h}^{2}

(157)

From the Cartan inner product, we can generate logarithm of the Koszul characteristic function, and its Legendre transform to define Koszul entropy, Koszul density and Koszul metric, as explained in the following Figure 9:

This information geometry has been intensively studied for structured matrices [151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166] and in statistics [167] and is linked to the seminal work of Siegel [168] on symmetric bounded domains.

We can apply this Koszul geometry framework to cones of symmetric positive definite matrices. Let the inner product

〈 η, ξ 〉 = T r (η^{T} ξ), \forall η, ξ \in S y m (n)

be given by the Cartan-Killing form. The set

Ω

of symmetric positive definite matrices is an open convex cone and is self-dual:

Ω^{*} = Ω

\begin{array}{l} 〈 η, ξ 〉 = T r (η^{T} ξ), \forall η, ξ \in S y m (n) \\ ψ_{Ω} (β) = \int_{Ω^{*}} e^{- 〈 β, ξ 〉} d ξ = \det {(β)}^{- \frac{n + 1}{2}} ψ_{Ω} (I_{d}) \\ \hat{ξ} = \frac{\partial Φ (β)}{\partial β} = \frac{\partial (- \log ψ_{Ω} (β))}{\partial β} = \frac{n + 1}{2} β^{- 1} \end{array}

(158)

p_{\hat{ξ}} (ξ) = e^{- 〈 Θ^{- 1} (\hat{ξ}), ξ 〉 + Φ (Θ^{- 1} (\hat{ξ}))} = {ψ_{Ω} (I_{d})}^{- 1} \cdot {[\det (α {\hat{ξ}}^{- 1})]}^{α} \cdot e^{- T r (α {\hat{ξ}}^{- 1} ξ)} with α = \frac{n + 1}{2}

(159)

In the following, we illustrate information geometry for the multivariate Gaussian density [169]:

p_{\hat{ξ}} (ξ) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2}} e^{- \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m)}

(160)

If we develop:

\begin{matrix} \frac{1}{2} {(z - m)}^{T} R^{- 1} (z - m) & = \frac{1}{2} [z^{T} R^{- 1} z - m^{T} R^{- 1} z - z^{T} R^{- 1} m + m^{T} R^{- 1} m] \\ & = \frac{1}{2} z^{T} R^{- 1} z - m^{T} R^{- 1} z + \frac{1}{2} m^{T} R^{- 1} m \end{matrix}

(161)

We can write the density as a Gibbs density:

\begin{array}{l} p_{\hat{ξ}} (ξ) = \frac{1}{{(2 π)}^{n / 2} \det {(R)}^{1 / 2} e^{\frac{1}{2} m^{T} R^{- 1} m}} e^{- [- m^{T} R^{- 1} z + \frac{1}{2} z^{T} R^{- 1} z]} = \frac{1}{Z} e^{- 〈 ξ, β 〉} \\ ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}] and β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] = [\begin{matrix} a \\ H \end{matrix}] with 〈 ξ, β 〉 = a^{T} z + z^{T} H z = T r [z a^{T} + H^{T} z z^{T}] \end{array}

(162)

We can then rewrite density with canonical variables:

\begin{array}{l} p_{\hat{ξ}} (ξ) = \frac{1}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} . d ξ} e^{- 〈 ξ, β 〉} = \frac{1}{Z} e^{- 〈 ξ, β 〉} with \log (Z) = \frac{n}{2} \log (2 π) + \frac{1}{2} \log \det (R) + \frac{1}{2} m^{T} R^{- 1} m \\ ξ = [\begin{matrix} z \\ z z^{T} \end{matrix}], \hat{ξ} = [\begin{matrix} E [z] \\ E [z z^{T}] \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}], β = [\begin{matrix} a \\ H \end{matrix}] = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}] with 〈 ξ, β 〉 = T r [z a^{T} + H^{T} z z^{T}] \\ R = E [(z - m) {(z - m)}^{T}] = E [z z^{T} - m z^{T} - z m^{T} + m m^{T}] = E [z z^{T}] - m m^{T} \end{array}

(163)

The first potential function (free energy/logarithm of characteristic function) is given by:

ψ_{Ω} (β) = \int_{Ω^{*}} e^{- 〈 ξ, β 〉} \cdot d ξ and Φ (β) = - \log ψ_{Ω} (β) = \frac{1}{2} [- \frac{1}{2} T r [H^{- 1} a a^{T}] + \log [{(2)}^{n} \det H] - n \log (2 π)]

(164)

We verify the relation between the first potential function and moment:

\begin{array}{l} \frac{\partial Φ (β)}{\partial β} = \frac{\partial [- \log ψ_{Ω} (β)]}{\partial β} = \int_{Ω^{*}} ξ \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} \cdot d ξ} \cdot d ξ = \int_{Ω^{*}} ξ \cdot p_{\hat{ξ}} (ξ) \cdot d ξ = \hat{ξ} \\ \frac{\partial Φ (β)}{\partial β} = [\begin{matrix} \frac{\partial Φ (β)}{\partial a} \\ \frac{\partial Φ (β)}{\partial H} \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}] = \hat{ξ} \end{array}

(165)
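The relation between the first potential and the moments can be verified by finite differences. The sketch below is a hedged illustration: it evaluates Φ from the standard closed-form Gaussian integral of exp(-aᵀz - zᵀHz) over Rⁿ, namely (πⁿ / det H)^{1/2} exp(¼ aᵀH⁻¹a), and differentiates it numerically with respect to a and to each entry of H taken as independent. The values of `m` and `r` are arbitrary sample data.

```python
import numpy as np

def phi(a, h_mat):
    """Phi = -log of the integral of exp(-a^T z - z^T H z) over R^n (closed form)."""
    n = a.size
    log_psi = (0.5 * n * np.log(np.pi)
               - 0.5 * np.log(np.linalg.det(h_mat))
               + 0.25 * a @ np.linalg.solve(h_mat, a))
    return -log_psi

m = np.array([0.5, -1.0])
r = np.array([[2.0, 0.3], [0.3, 1.0]])
r_inv = np.linalg.inv(r)
a = -r_inv @ m          # canonical parameter a = -R^{-1} m
h_mat = 0.5 * r_inv     # canonical parameter H = R^{-1} / 2

eps = 1e-6
grad_a = np.zeros(2)
for i in range(2):
    d = np.zeros(2)
    d[i] = eps
    grad_a[i] = (phi(a + d, h_mat) - phi(a - d, h_mat)) / (2 * eps)

# Partial derivatives with respect to each entry H_ij, perturbed independently.
grad_h = np.zeros((2, 2))
for i in range(2):
    for j in range(2):
        d = np.zeros((2, 2))
        d[i, j] = eps
        grad_h[i, j] = (phi(a, h_mat + d) - phi(a, h_mat - d)) / (2 * eps)
```

Up to finite-difference error, `grad_a` reproduces the mean m and `grad_h` reproduces the second moment R + m mᵀ.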

The second potential function (Shannon entropy) is given as a Legendre transform of the first one:

\begin{array}{l} S (\hat{ξ}) = 〈 \hat{ξ}, β 〉 - Φ (β) with \frac{\partial Φ (β)}{\partial β} = \hat{ξ} and \frac{\partial S (\hat{ξ})}{\partial \hat{ξ}} = β \\ S (\hat{ξ}) = - \int_{Ω^{*}} \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} \cdot d ξ} \log \frac{e^{- 〈 ξ, β 〉}}{\int_{Ω^{*}} e^{- 〈 ξ, β 〉} \cdot d ξ} \cdot d ξ = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) \cdot d ξ \end{array}

(166)

S (\hat{ξ}) = - \int_{Ω^{*}} p_{\hat{ξ}} (ξ) \log p_{\hat{ξ}} (ξ) \cdot d ξ = \frac{1}{2} [\log [{(2)}^{- n} \det [H^{- 1}]] + n \log (2 π \cdot e)] = \frac{1}{2} [\log \det [R] + n \log (2 π \cdot e)]

(167)

This remark was made by Jean-Marie Souriau in his book [10] as early as 1969. He observed, as illustrated in Figure 10, that if we take a vector with tensor components

ξ = (\begin{matrix} z \\ z \otimes z \end{matrix})

, components of

\hat{ξ}

will provide moments of the first and second order of the density of probability

p_{\hat{ξ}} (ξ)

. He used this change of variable

z' = H^{1 / 2} z + H^{- 1 / 2} a

, to compute the logarithm of the characteristic function

Φ (β)

. We can finally compute the metric from the matrix

g_{i j}

:

d s^{2} = \sum_{i j} g_{i j} d θ_{i} d θ_{j} = d m^{T} R^{- 1} d m + \frac{1}{2} T r [{(R^{- 1} d R)}^{2}]

(168)
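One way to sanity-check the metric (168) is to compare it with the second-order expansion of the Kullback-Leibler divergence between two nearby Gaussian densities, which equals half the squared Fisher length to leading order. This is a minimal sketch using the standard closed-form divergence between Gaussians; the function names and sample values are assumptions for illustration.

```python
import numpy as np

def kl_gauss(m0, r0, m1, r1):
    """Kullback-Leibler divergence KL(N(m0, r0) || N(m1, r1)), closed form."""
    n = m0.size
    r1_inv = np.linalg.inv(r1)
    diff = m1 - m0
    _, logdet0 = np.linalg.slogdet(r0)
    _, logdet1 = np.linalg.slogdet(r1)
    return 0.5 * (np.trace(r1_inv @ r0) + diff @ r1_inv @ diff - n + logdet1 - logdet0)

def fisher_ds2(r, dm, dr):
    """Squared length dm^T R^{-1} dm + (1/2) Tr[(R^{-1} dR)^2] of a tangent vector."""
    r_inv = np.linalg.inv(r)
    s = r_inv @ dr
    return dm @ r_inv @ dm + 0.5 * np.trace(s @ s)

m = np.array([0.5, -1.0])
r = np.array([[2.0, 0.3], [0.3, 1.0]])
dm = np.array([0.3, -0.1])
dr = np.array([[0.2, 0.1], [0.1, -0.3]])   # symmetric perturbation of R

eps = 1e-3
kl = kl_gauss(m, r, m + eps * dm, r + eps * dr)
ratio = 2.0 * kl / eps**2 / fisher_ds2(r, dm, dr)
```

The value of `ratio` tends to 1 as `eps` shrinks, which is consistent with (168) being the Fisher metric of the Gaussian family.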

and, from the classical expression of the Euler-Lagrange equation:

\sum_{i = 1}^{n} g_{i k} {\ddot{θ}}_{i} + \sum_{i, j = 1}^{n} Γ_{i j k} {\dot{θ}}_{i} {\dot{θ}}_{j} = 0, k = 1, ..., n with Γ_{i j k} = \frac{1}{2} [\frac{\partial g_{j k}}{\partial θ_{i}} + \frac{\partial g_{i k}}{\partial θ_{j}} - \frac{\partial g_{i j}}{\partial θ_{k}}]

(169)

which is given explicitly by [170]:

{\begin{cases} \ddot{R} + \dot{m} {\dot{m}}^{T} - \dot{R} R^{- 1} \dot{R} = 0 \\ \ddot{m} - \dot{R} R^{- 1} \dot{m} = 0 \end{cases}

(170)
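Although no closed form is used here, the system (170) can be integrated numerically from an initial point and velocity. The sketch below assumes a generic fourth-order Runge-Kutta scheme (a swapped-in standard integrator, not a method taken from this article) and monitors the squared speed for the metric (168), which should stay constant along a geodesic. All initial values and the step size are illustrative assumptions.

```python
import numpy as np

def rhs(state):
    """First-order form of (170): returns time derivatives of (m, R, m_dot, R_dot)."""
    m, r, m_dot, r_dot = state
    r_inv = np.linalg.inv(r)
    m_ddot = r_dot @ r_inv @ m_dot
    r_ddot = r_dot @ r_inv @ r_dot - np.outer(m_dot, m_dot)
    return (m_dot, r_dot, m_ddot, r_ddot)

def speed2(state):
    """Squared speed m_dot^T R^{-1} m_dot + (1/2) Tr[(R^{-1} R_dot)^2]."""
    _, r, m_dot, r_dot = state
    r_inv = np.linalg.inv(r)
    s = r_inv @ r_dot
    return m_dot @ r_inv @ m_dot + 0.5 * np.trace(s @ s)

def shifted(state, k, h):
    """Return state + h * k, component by component."""
    return tuple(s + h * d for s, d in zip(state, k))

def rk4_step(state, h):
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, h / 2))
    k3 = rhs(shifted(state, k2, h / 2))
    k4 = rhs(shifted(state, k3, h))
    return tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

# Shoot from (m, R) = (0, I) with a sample initial velocity.
state = (np.zeros(2), np.eye(2),
         np.array([0.3, -0.2]), np.array([[0.1, 0.05], [0.05, -0.2]]))
e0 = speed2(state)
for _ in range(1000):
    state = rk4_step(state, 1e-3)
m1, r1, m_dot1, r_dot1 = state
e1 = speed2(state)
```

In this run the covariance `r1` stays symmetric positive definite and `e1` matches `e0` to high accuracy, as expected for a constant-speed geodesic.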

We cannot integrate this Euler-Lagrange equation. We will see that Lie group theory provides a new reduced equation, the Euler-Poincaré equation, using the Souriau theorem.

We make reference to the book of Deza that gives a survey about distance and metric space [171].

The case of natural exponential families that are invariant under an affine group has been studied by Casalis (in a 1999 paper and in her Ph.D. thesis) [172,173,174,175,176,177,178] and by Letac [179,180,181]. We give the details of Casalis’ development in Appendix C. Barndorff-Nielsen has also studied transformation models for exponential families [182,183,184,185,186]. In this section, we will only consider the case of multivariate Gaussian densities.

8. Affine Group Action for Multivariate Gaussian Densities and Souriau’s Moment Map: Computation of Geodesics by Geodesic Shooting

To understand the Koszul and Souriau Lie group models of information geometry more deeply, we will illustrate their tools for multivariate Gaussian densities.

Consider the general linear group

G L (n)

consisting of the invertible n × n matrices, which is a topological group acting linearly on

R^{n}

by:

\begin{array}{l} G L (n) \times R^{n} \to R^{n} \\ (A, x) \mapsto A x \end{array}

(171)

The group

G L (n)

is a Lie group and a subgroup of the general affine group

G A (n)

, composed of all pairs

(A, υ)

where

A \in G L (n)

and

υ \in R^{n}

, with the group operation given by:

(A_{1}, υ_{1}) (A_{2}, υ_{2}) = (A_{1} A_{2}, A_{1} υ_{2} + υ_{1})

(172)

G L (n)

is an open subset of

R^{n^{2}}

, and may be considered as an n²-dimensional differentiable manifold with the same differentiable structure as

R^{n^{2}}

. Multiplication and inversion are infinitely differentiable mappings. Consider the vector space

g l (n)

of real n × n matrices and the commutator product:

\begin{array}{l} g l (n) \times g l (n) \to g l (n) \\ (A, B) \mapsto A B - B A = [A, B] \end{array}

(173)

This is a Lie product making

g l (n)

into a Lie algebra. The exponential map is then the mapping defined by:

\begin{matrix} \exp : & g l (n) \to G L (n) \\ & A \mapsto \exp (A) = \sum_{k = 0}^{\infty} \frac{A^{k}}{k!} \end{matrix}

(174)

Restricting

A

to have positive determinant, one obtains the positive general affine group

G A_{+} (n)

that acts transitively on

R^{n}

by:

((A, υ), x) \mapsto A x + υ

(175)

In case of symmetric positive definite matrices

S y m^{+} (n)

, we can use the Cholesky decomposition:

R = L L^{T}

(176)

where

L

is a lower triangular matrix with real and positive diagonal entries, and

L^{T}

denotes the transpose of

L

, to define the square root of

R

. Given a positive semidefinite matrix

R

, according to the spectral theorem, the continuous functional calculus can be applied to obtain a matrix

R^{1 / 2}

such that

R^{1 / 2}

is itself positive and

R^{1 / 2} R^{1 / 2} = R

. The operator

R^{1 / 2}

is the unique non-negative square root of

R

. The class

N_{n} = {ℵ (μ, Σ) / μ \in R^{n}, Σ \in S y m^{+}_{n}}

of regular multivariate normal distributions, where

μ

is the mean vector and

Σ

is the (symmetric positive definite) covariance matrix, is invariant under the transitive action of

G A (n)

. The induced action of

G A (n)

on

R^{n} \times S y m^{+}_{n}

is then given by:

\begin{array}{l} G A (n) \times (R^{n} \times S y m^{+}_{n}) \to R^{n} \times S y m^{+}_{n} \\ ((A, υ), (μ, Σ)) \mapsto (A μ + υ, A Σ A^{T}) \end{array}

(177)

and

\begin{array}{l} G A (n) \times R^{n} \to R^{n} \\ ((A, υ), x) \mapsto A x + υ \end{array}

(178)
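The group law (172) and the induced action (177) can be checked to be compatible, that is, acting with two elements in succession equals acting with their product. A minimal sketch with assumed helper names `compose` and `act` and arbitrary sample values:

```python
import numpy as np

def compose(g1, g2):
    """Group law (172): (A1, v1)(A2, v2) = (A1 A2, A1 v2 + v1)."""
    (a1, v1), (a2, v2) = g1, g2
    return (a1 @ a2, a1 @ v2 + v1)

def act(g, point):
    """Induced action (177) on a pair (mean, covariance)."""
    a, v = g
    mu, sigma = point
    return (a @ mu + v, a @ sigma @ a.T)

g1 = (np.array([[2.0, 1.0], [0.0, 1.0]]), np.array([1.0, -1.0]))
g2 = (np.array([[1.0, 0.5], [0.3, 2.0]]), np.array([0.0, 3.0]))
point = (np.array([0.5, -1.0]), np.array([[2.0, 0.3], [0.3, 1.0]]))

two_steps = act(g1, act(g2, point))
one_step = act(compose(g1, g2), point)

# Transitivity from the reference point (0, I): a square root of R sends it to (m, R).
m, r = point
chol = np.linalg.cholesky(r)                 # lower-triangular factor with R = L L^T
reached = act((chol, m), (np.zeros(2), np.eye(2)))
```

Here `two_steps` equals `one_step`, and `reached` equals `(m, r)`, which illustrates why every regular Gaussian law lies on the orbit of the standard one.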

As the isotropy group of

(0, I_{n})

is equal to

O (n)

, we can observe that:

N_{n} = G A (n) / O (n)

(179)

N_{n}

is an open subset of the vector space

T_{n} = {(η, Ω) / η \in R^{n}, Ω \in S y m_{n}}

and is a differentiable manifold, where the tangent space at any point may be identified with

T_{n}

. The Fisher information defines a metric giving

N_{n}

a Riemannian manifold structure. The inner product of two tangent vectors

(η_{1}, Ω_{1}) \in T_{n}

and

(η_{2}, Ω_{2}) \in T_{n}

at the point

(μ, Σ) \in N_{n}

is given by:

g_{(μ, Σ)} ((η_{1}, Ω_{1}), (η_{2}, Ω_{2})) = η_{1}^{T} Σ^{- 1} η_{2} + \frac{1}{2} T r (Σ^{- 1} Ω_{1} Σ^{- 1} Ω_{2})

(180)

Niels Christian Bang Jespersen has proved that the transformation models on

R^{n}

with parameter set

R^{n} \times S y m^{+}_{n}

are exactly those of the form

p_{μ, Σ} = f_{μ, Σ} λ

where

λ

is the Lebesgue measure,

f_{μ, Σ} (x) = h ({(x - μ)}^{T} Σ^{- 1} (x - μ)) / \det {(Σ)}^{1 / 2}

and

h : [0, + \infty [\to R^{+}

is a continuous function with

\int_{0}^{+ \infty} h (s) s^{\frac{n}{2} - 1} d s < + \infty

. Distributions with densities of this form are called elliptic distributions.

To better understand these tools, we will consider

G A (n)

as a subgroup of the affine group that can be defined by a matrix Lie group

G_{a f f}

, acting on multivariate Gaussian laws, as illustrated in Figure 11:

\begin{array}{l} [\begin{matrix} Y \\ 1 \end{matrix}] = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] [\begin{matrix} X \\ 1 \end{matrix}] = [\begin{matrix} R^{1 / 2} X + m \\ 1 \end{matrix}], {\begin{cases} (m, R) \in R^{n} \times S y m^{+} (n) \\ M = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] \in G_{a f f} \end{cases} \\ X \approx ℵ (0, I) \to Y \approx ℵ (m, R) \end{array}

(181)

We can verify that the set of such matrices M has the classical properties of a Lie group: the product preserves the structure and is associative and non-commutative, and a neutral element exists:

\begin{array}{l} M_{1} \cdot M_{2} = [\begin{matrix} R_{1}^{1 / 2} & m_{1} \\ 0 & 1 \end{matrix}] [\begin{matrix} R_{2}^{1 / 2} & m_{2} \\ 0 & 1 \end{matrix}] = [\begin{matrix} R_{1}^{1 / 2} R_{2}^{1 / 2} & R_{1}^{1 / 2} m_{2} + m_{1} \\ 0 & 1 \end{matrix}] \\ M_{2} \cdot M_{1} = [\begin{matrix} R_{2}^{1 / 2} & m_{2} \\ 0 & 1 \end{matrix}] [\begin{matrix} R_{1}^{1 / 2} & m_{1} \\ 0 & 1 \end{matrix}] = [\begin{matrix} R_{2}^{1 / 2} R_{1}^{1 / 2} & R_{2}^{1 / 2} m_{1} + m_{2} \\ 0 & 1 \end{matrix}] \end{array}} \Rightarrow {\begin{cases} M_{1} \cdot M_{2} \in G_{a f f} \\ M_{2} \cdot M_{1} \in G_{a f f} \\ M_{1} \cdot M_{2} \neq M_{2} \cdot M_{1} \\ M_{1} \cdot (M_{2} \cdot M_{3}) = (M_{1} \cdot M_{2}) \cdot M_{3} \\ M_{1} \cdot I = M_{1} \end{cases}

(182)

We can also observe that the inverse preserves the structure:

M = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}] \Rightarrow M_{R}^{- 1} = M_{L}^{- 1} = M^{- 1} = [\begin{matrix} R^{- 1 / 2} & - R^{- 1 / 2} m \\ 0 & 1 \end{matrix}] \in G_{a f f}

(183)
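The product and inverse formulas (182)-(183) can be checked numerically. This is a minimal sketch under one explicit assumption: the square root R^{1/2} is taken to be the lower-triangular Cholesky factor of (176), so that products of square-root blocks stay lower triangular with positive diagonal. The helper `embed` and the sample values are hypothetical.

```python
import numpy as np

def embed(sqrt_r, m):
    """Homogeneous matrix [[sqrt_r, m], [0, 1]]."""
    n = m.size
    mat = np.eye(n + 1)
    mat[:n, :n] = sqrt_r
    mat[:n, n] = m
    return mat

r1 = np.array([[2.0, 0.3], [0.3, 1.0]])
r2 = np.array([[1.0, -0.4], [-0.4, 3.0]])
m1 = np.array([0.5, -1.0])
m2 = np.array([2.0, 0.1])

l1 = np.linalg.cholesky(r1)   # assumed choice of square root
l2 = np.linalg.cholesky(r2)
big1 = embed(l1, m1)
big2 = embed(l2, m2)

prod12 = big1 @ big2
prod21 = big2 @ big1
expected12 = embed(l1 @ l2, l1 @ m2 + m1)            # block formula of (182)

l1_inv = np.linalg.inv(l1)
inverse1 = embed(l1_inv, -l1_inv @ m1)               # block formula of (183)
```

The block formula reproduces the matrix product, the two products differ (non-commutativity), and `inverse1` is a two-sided inverse of `big1`.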

To this Lie group we can associate a Lie algebra whose underlying vector space is the tangent space of the Lie group at the identity element and which completely captures the local structure of the group. This Lie group acts smoothly on the manifold, and acts on the vector fields. Any tangent vector at the identity of a Lie group can be extended to a left (respectively right) invariant vector field by left (respectively right) translating the tangent vector to other points of the manifold. This identifies the tangent space at the identity

g = T_{I} (G)

with the space of left invariant vector fields, and therefore makes the tangent space at the identity into a Lie algebra, called the Lie algebra of G.

L_{G} : {\begin{cases} G_{a f f} \to G_{a f f} \\ M \mapsto L_{M} N = M \cdot N \end{cases} and R_{G} : {\begin{cases} G_{a f f} \to G_{a f f} \\ M \mapsto R_{M} N = N \cdot M \end{cases}

(184)

Considering the curve

γ (t)

and its derivative

\dot{γ} (t)

γ (t) = [\begin{matrix} R^{1 / 2} (t) & m (t) \\ 0 & 1 \end{matrix}] and \dot{γ} (t) = [\begin{matrix} {\dot{R}}^{1 / 2} (t) & \dot{m} (t) \\ 0 & 0 \end{matrix}]

(185)

We can consider the curve with the point

γ (0)

moved to the identity element by left or right translation. Then, the tangent space at the identity element provides the Lie algebra:

Γ_{L} (t) = L_{M^{- 1}} (γ (t)) = [\begin{matrix} R^{- 1 / 2} R^{1 / 2} (t) & R^{- 1 / 2} (m (t) - m) \\ 0 & 1 \end{matrix}]

(186)

{{\dot{Γ}}_{L} (t) |}_{t = 0} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} (0) & R^{- 1 / 2} \dot{m} (0) \\ 0 & 0 \end{matrix}] = {\frac{d}{d t} (L_{M^{- 1}} (γ (t))) |}_{t = 0} = d L_{M^{- 1}} \dot{γ} (0) = d L_{M^{- 1}} \dot{M}

(187)

The Lie algebras on the left and on the right are then defined by:

\begin{matrix} d L_{M^{- 1}} : & T_{M} (G) \to g_{L} \\ \dot{M} \mapsto Ω_{L} = d L_{M^{- 1}} \dot{M} = M^{- 1} \dot{M} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & R^{- 1 / 2} \dot{m} \\ 0 & 0 \end{matrix}] \end{matrix}

(188)

\begin{matrix} d R_{M^{- 1}} : & T_{M} (G) \to g_{R} \\ \dot{M} \mapsto Ω_{R} = d R_{M^{- 1}} \dot{M} = \dot{M} M^{- 1} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & \dot{m} - R^{- 1 / 2} {\dot{R}}^{1 / 2} m \\ 0 & 0 \end{matrix}] \end{matrix}

(189)

We can then observe the velocities in two different ways: either from a fixed external frame, or from the reference frame attached to the moving element.

[\begin{matrix} X (t) \\ 1 \end{matrix}] = M [\begin{matrix} x \\ 1 \end{matrix}] \Rightarrow [\begin{matrix} \dot{X} (t) \\ 0 \end{matrix}] = Ω_{R} [\begin{matrix} X (t) \\ 1 \end{matrix}] with x fixed

(190)

[\begin{matrix} x (t) \\ 1 \end{matrix}] = M^{- 1} [\begin{matrix} X \\ 1 \end{matrix}] \Rightarrow [\begin{matrix} \dot{x} (t) \\ 0 \end{matrix}] = - Ω_{L} [\begin{matrix} x (t) \\ 1 \end{matrix}] with X fixed

(191)
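The two velocity descriptions can be compared numerically on a sample curve M(t). The sketch below assumes a hypothetical straight-line curve in the parameters and uses central finite differences; it relies on the identities d/dt(M x) = (dM/dt M⁻¹)(M x) for fixed x and d/dt(M⁻¹) = -M⁻¹ (dM/dt) M⁻¹ for fixed X.

```python
import numpy as np

def embed(sqrt_r, m):
    """Homogeneous matrix [[sqrt_r, m], [0, 1]]."""
    n = m.size
    mat = np.eye(n + 1)
    mat[:n, :n] = sqrt_r
    mat[:n, n] = m
    return mat

# Hypothetical straight-line curve of parameters (illustration only).
s0 = np.array([[1.5, 0.0], [0.4, 0.8]])      # plays the role of R^{1/2}(0)
s_dot = np.array([[0.2, 0.0], [-0.1, 0.3]])
m0 = np.array([0.5, -1.0])
m_dot = np.array([0.7, 0.2])

def curve(t):
    return embed(s0 + t * s_dot, m0 + t * m_dot)

big_m = curve(0.0)
big_m_dot = np.zeros((3, 3))
big_m_dot[:2, :2] = s_dot
big_m_dot[:2, 2] = m_dot

omega_left = np.linalg.inv(big_m) @ big_m_dot    # M^{-1} dM/dt
omega_right = big_m_dot @ np.linalg.inv(big_m)   # dM/dt M^{-1}

h = 1e-6
x_body = np.array([0.3, 0.9, 1.0])               # point fixed in the moving frame
x_space = big_m @ x_body
space_velocity = (curve(h) @ x_body - curve(-h) @ x_body) / (2 * h)

x_fixed = np.array([1.2, -0.4, 1.0])             # point fixed in the outside frame
body_point = np.linalg.inv(big_m) @ x_fixed
body_velocity = (np.linalg.inv(curve(h)) @ x_fixed
                 - np.linalg.inv(curve(-h)) @ x_fixed) / (2 * h)
```

In this run `space_velocity` equals `omega_right @ x_space`, `body_velocity` equals `-omega_left @ body_point`, and the two algebra elements are conjugate through M.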

In the following, we complete this global view with the operators that link the left and right algebras to each other and to their duals. We first consider the inner automorphisms, that is, the action by conjugation of the Lie group on itself:

\begin{matrix} A D : & G \times G \to G \\ M, N \mapsto A D_{M} N = M . N . M^{- 1} \end{matrix}

(192)

{\begin{cases} M_{1} = [\begin{matrix} R_{1}^{1 / 2} & m_{1} \\ 0 & 1 \end{matrix}], M_{2} = [\begin{matrix} R_{2}^{1 / 2} & m_{2} \\ 0 & 1 \end{matrix}] \\ A D_{M_{1}} M_{2} = [\begin{matrix} R_{2}^{1 / 2} & - R_{2}^{1 / 2} m_{1} + R_{1}^{1 / 2} m_{2} + m_{1} \\ 0 & 1 \end{matrix}] \end{cases}

(193)

Consider now a curve N(t) on the manifold passing through the identity at t = 0. Its image under the previous operator is the curve

γ (t) = M \cdot N (t) \cdot M^{- 1}

, which also passes through the identity element at t = 0. As

\dot{N} (0)

is an element of the Lie algebra, so is its image under the differential of the conjugation; this defines the Adjoint operator:

\begin{matrix} A d : & G \times g \to g \\ M, n \mapsto A d_{M} n = M . n . M^{- 1} = {\frac{d}{d t} |}_{t = 0} (A D_{M} N (t)) with {\begin{cases} N (0) = I \\ \dot{N} (0) = n \in g \end{cases} \end{matrix}

(194)

We can then compute the Adjoint operator for the previous Lie group:

{\begin{cases} n_{2 L} = [\begin{matrix} R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} & R_{2}^{- 1 / 2} {\dot{m}}_{2} \\ 0 & 0 \end{matrix}], n_{2 R} = [\begin{matrix} R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} & - R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} m_{2} + {\dot{m}}_{2} \\ 0 & 0 \end{matrix}] \\ A d_{M_{2}} n_{2 L} = n_{2 R} and A d_{M_{2}} n_{2 R} = [\begin{matrix} R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} & - R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} m_{2} - {\dot{R}}_{2}^{1 / 2} m_{2} + R_{2}^{1 / 2} {\dot{m}}_{2} \\ 0 & 0 \end{matrix}], A d_{M_{2}^{- 1}} n_{2 R} = n_{2 L} \end{cases}

(195)

We recall that the Lie algebra has been defined as the tangent space at the identity of a Lie group. We will then introduce a Lie bracket

[., .]

, the operator expressing the action of the Lie algebra on itself, called the adjoint operator. It is the infinitesimal version of the action by conjugation and is defined by:

\begin{matrix} a d : & g \times g \to g \\ n, m \mapsto a d_{m} n = m \cdot n - n \cdot m = {\frac{d}{d t} |}_{t = 0} (A d_{M (t)} n) = [m, n] with {\begin{cases} \dot{N} (0) = n \in g \\ \dot{M} (0) = m \in g \end{cases} \end{matrix}

(196)
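The statement that the bracket is the derivative of the Adjoint action at the identity can be checked by finite differences. The following sketch assumes a curve M(t) = exp(t m) through the identity, with the exponential computed by a hypothetical truncated-series helper; the two algebra elements are arbitrary matrices with a zero last row.

```python
import numpy as np

def expm_series(a, terms=30):
    """Truncated power series of the matrix exponential."""
    result = np.eye(a.shape[0])
    term = np.eye(a.shape[0])
    for k in range(1, terms):
        term = term @ a / k
        result = result + term
    return result

def adjoint_action(group_elem, n):
    """Ad_M n = M n M^{-1}."""
    return group_elem @ n @ np.linalg.inv(group_elem)

# Two sample elements of the affine Lie algebra (last row zero).
m_alg = np.array([[0.2, 0.1, 0.5],
                  [0.0, -0.3, 1.0],
                  [0.0, 0.0, 0.0]])
n_alg = np.array([[0.4, -0.2, 0.3],
                  [0.1, 0.6, -0.7],
                  [0.0, 0.0, 0.0]])

h = 1e-5
derivative = (adjoint_action(expm_series(h * m_alg), n_alg)
              - adjoint_action(expm_series(-h * m_alg), n_alg)) / (2 * h)
bracket = m_alg @ n_alg - n_alg @ m_alg
```

The finite-difference `derivative` matches the commutator `bracket`, whose last row is zero, so the bracket stays inside the algebra.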

We can then compute this operator for our use case:

n_{1 L} = [\begin{matrix} R_{1}^{- 1 / 2} {\dot{R}}_{1}^{1 / 2} & R_{1}^{- 1 / 2} {\dot{m}}_{1} \\ 0 & 0 \end{matrix}], n_{2 L} = [\begin{matrix} R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} & R_{2}^{- 1 / 2} {\dot{m}}_{2} \\ 0 & 0 \end{matrix}]

(197)

a d_{n_{1 L}} n_{2 L} = [n_{1 L}, n_{2 L}] = [\begin{matrix} 0 & R_{1}^{- 1 / 2} ({\dot{R}}_{1}^{1 / 2} {\dot{m}}_{2} - {\dot{R}}_{2}^{1 / 2} {\dot{m}}_{1}) R_{2}^{- 1 / 2} \\ 0 & 0 \end{matrix}]

(198)

a d_{n_{1 R}} n_{2 R} = [n_{1 R}, n_{2 R}] = [\begin{matrix} 0 & R_{1}^{- 1 / 2} {\dot{R}}_{1}^{1 / 2} (- R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} m_{2} + {\dot{m}}_{2}) - R_{2}^{- 1 / 2} {\dot{R}}_{2}^{1 / 2} (- R_{1}^{- 1 / 2} {\dot{R}}_{1}^{1 / 2} m_{1} + {\dot{m}}_{1}) \\ 0 & 0 \end{matrix}]

(199)

To study the geodesic trajectories of the group, we consider the Lagrangian given by the total kinetic energy (a quadratic form on speeds). It may in particular be written in the left algebra, with the scalar product associated with the metric:

E_{L} = \frac{1}{2} 〈 n_{L}, n_{L} 〉 = \frac{1}{2} T r [n_{L}^{T} n_{L}]

(200)

If we consider as scalar product:

\begin{matrix} 〈 ., . 〉 : & g^{*} \times g \to R \\ k, n \mapsto 〈 k, n 〉 = T r (k^{T} n) \end{matrix}

(201)

and left algebra:

n_{L} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & R^{- 1 / 2} \dot{m} \\ 0 & 0 \end{matrix}]

(202)

we obtain for the total kinetic energy

E_{L} = \frac{1}{2} (T r (R^{- 1} \dot{R}) + {\dot{m}}^{T} R^{- 1} \dot{m})

(203)

We will then introduce the coadjoint operator, which will enable us to work on the elements of the dual of the Lie algebra defined above. Whereas the algebra is physically the space of instantaneous speeds, the dual algebra is the space of moments. For the dual of the left algebra, the moment is given by:

Π_{L} = \frac{\partial E_{L}}{\partial n_{L}} = n_{L}

(204)

where

E_{L}

is the kinetic energy of the system, and the associated moment

Π_{L}

is identified with an element of the left algebra. The moment space is the dual algebra, denoted

g^{*}

, associated with the Lie algebra

g

. This value is deduced from the computation:

\begin{array}{l} 〈 \frac{\partial E_{L}}{\partial n_{L}}, δ U 〉 = \underset{ε \to 0}{\lim} \frac{E_{L} (n_{L} + ε \cdot δ U) - E_{L} (n_{L})}{ε} \\ with E_{L} (n_{L} + ε \cdot δ U) = \frac{1}{2} 〈 n_{L} + ε \cdot δ U, n_{L} + ε \cdot δ U 〉 = \frac{1}{2} T r [{(n_{L} + ε \cdot δ U)}^{T} (n_{L} + ε \cdot δ U)] \\ 〈 \frac{\partial E_{L}}{\partial n_{L}}, δ U 〉 = 2 \cdot \frac{1}{2} T r (n_{L}^{T} δ U) = 〈 n_{L}, δ U 〉 \Rightarrow \frac{\partial E_{L}}{\partial n_{L}} = n_{L} \end{array}

(205)

Then the moment map is given by:

\begin{matrix} α_{M} : & g \to g^{*} \\ n_{L} \mapsto Π_{L} = n_{L} \end{matrix}

(206)

We can observe that the map from the left algebra to its dual is the identity map; physically, however, elements of the algebra are instantaneous speeds whereas elements of the dual are moments.

We can also define the moment

Π_{R}

associated to the right algebra

η_{R}

by:

〈 Π_{L}, n_{L} 〉 = 〈 Π_{L}, M^{- 1} n_{R} M 〉 = 〈 Π_{R}, n_{R} 〉

(207)

But as

Π_{L} = n_{L}

, we can deduce that:

\begin{array}{l} 〈 n_{L}, M^{- 1} n_{R} M 〉 = 〈 Π_{R}, n_{R} 〉 \\ with M = [\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}], n_{L} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & R^{- 1 / 2} \dot{m} \\ 0 & 0 \end{matrix}] and n_{R} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} & \dot{m} - R^{- 1 / 2} {\dot{R}}^{1 / 2} m \\ 0 & 0 \end{matrix}] \\ \Rightarrow Π_{R} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} + R^{- 1} \dot{m} m^{T} & R^{- 1} \dot{m} \\ 0 & 0 \end{matrix}] \end{array}

(208)

Then, the operator that transforms the right algebra into its dual algebra is given by:

\begin{matrix} β_{M} : & g \to g^{*} \\ n_{R} = [\begin{matrix} η_{R 1} & η_{R 2} \\ 0 & 0 \end{matrix}] \mapsto Π_{R} = [\begin{matrix} η_{R 1} (1 + m^{T} R^{- 1} m) + η_{R 2} m^{T} R^{- 1} & η_{R 1} R^{- 1} m + R^{- 1} η_{R 2} \\ 0 & 0 \end{matrix}] \end{matrix}

(209)

Just as the adjoint operator changes the point of view (left or right) on the algebra, there is an operator that does the same on the dual algebra. It is called the co-adjoint operator, and it is the conjugation action of the Lie group on its dual algebra:

{\begin{matrix} A d^{*} : & G \times g^{*} \to g^{*} \\ M, η \mapsto A d_{M}^{*} η \end{matrix} with 〈 A d_{M}^{*} η, n 〉 = 〈 η, A d_{M} n 〉 where n \in g

(210)

We can then develop this expression for our use case, an affine subgroup. We find:

{\begin{cases} M = [\begin{matrix} A & b \\ 0 & 1 \end{matrix}] \in G \\ η = [\begin{matrix} η_{1} & η_{2} \\ 0 & 0 \end{matrix}] \in g^{*} \\ n = [\begin{matrix} n_{1} & n_{2} \\ 0 & 0 \end{matrix}] \in g \end{cases} \Rightarrow {\begin{cases} 〈 A d_{M}^{*} η, n 〉 = 〈 η, A d_{M} n 〉 = 〈 η, M n M^{- 1} 〉 \\ 〈 A d_{M}^{*} η, n 〉 = 〈 [\begin{matrix} η_{1} - η_{2} b^{T} & A η_{2} \\ 0 & 0 \end{matrix}], n 〉 \end{cases} \Rightarrow A d_{M}^{*} η = [\begin{matrix} η_{1} - η_{2} b^{T} & A η_{2} \\ 0 & 0 \end{matrix}]

(211)
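As a sanity check of (211), the defining duality can be tested numerically. The following is a minimal sketch, assuming the scalar case n = 1 (so that A, b, η_1, η_2 are real numbers and the blocks commute); the function names and the numerical values are illustrative assumptions only.

```python
def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def pairing(eta, n):
    """<eta, n> = Tr(eta^T n), the pairing of (201)."""
    return sum(eta[i][j] * n[i][j] for i in range(2) for j in range(2))

def Ad(A, b, n):
    """Ad_M n = M n M^{-1} for M = [[A, b], [0, 1]] (scalar blocks)."""
    M = [[A, b], [0.0, 1.0]]
    M_inv = [[1.0 / A, -b / A], [0.0, 1.0]]
    return matmul2(matmul2(M, n), M_inv)

def coAd(A, b, eta):
    """Closed form of (211) in the scalar case."""
    eta1, eta2 = eta[0]
    return [[eta1 - eta2 * b, A * eta2], [0.0, 0.0]]

A, b = 1.7, -0.4                       # arbitrary group element
eta = [[0.9, 0.25], [0.0, 0.0]]        # arbitrary element of the dual algebra
n = [[-0.6, 1.1], [0.0, 0.0]]          # arbitrary element of the algebra

lhs = pairing(coAd(A, b, eta), n)      # <Ad*_M eta, n>
rhs = pairing(eta, Ad(A, b, n))        # <eta, Ad_M n>

# M^{-1} = [[1/A, -b/A], [0, 1]], so applying the same closed form with
# these parameters should undo the first co-adjoint action.
round_trip = coAd(1.0 / A, -b / A, coAd(A, b, eta))
print(lhs, rhs, round_trip)
```

Both pairings coincide, and the round trip returns the original η, as expected for a group action.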

and we can also observe that:

A d_{M^{- 1}}^{*} η = [\begin{matrix} η_{1} + A^{- 1} η_{2} b^{T} & A^{- 1} η_{2} \\ 0 & 0 \end{matrix}]

(212)

Similarly, the following relations hold between the moments of the left and right algebras:

A d_{M}^{*} Π_{R} = Π_{L} and A d_{M^{- 1}}^{*} Π_{L} = Π_{R}

(213)

As we have defined a commutator on the Lie algebra, it is possible to define one on its dual algebra. This commutator on the dual algebra can also be defined using the operator expressing the action of the algebra on its dual algebra. This operator is also called the co-adjoint operator:

{\begin{matrix} a d^{*} : & g \times g^{*} \to g^{*} \\ n, η \mapsto a d_{n}^{*} η \end{matrix} with 〈 a d_{n}^{*} η, κ 〉 = 〈 η, a d_{n} κ 〉 where κ \in g

(214)

We can develop this co-adjoint operator on its dual algebra for our use-case:

{\begin{cases} κ = [\begin{matrix} κ_{1} & κ_{2} \\ 0 & 0 \end{matrix}] \in g \\ η = [\begin{matrix} η_{1} & η_{2} \\ 0 & 0 \end{matrix}] \in g^{*} \\ n = [\begin{matrix} n_{1} & n_{2} \\ 0 & 0 \end{matrix}] \in g \end{cases} \Rightarrow {\begin{cases} 〈 a d_{n}^{*} η, κ 〉 = 〈 η, a d_{n} κ 〉 = 〈 η, n κ - κ n 〉 \\ 〈 a d_{n}^{*} η, κ 〉 = 〈 [\begin{matrix} - η_{2} n_{2}^{T} & n_{1} η_{2} \\ 0 & 0 \end{matrix}], κ 〉 \end{cases} \Rightarrow {\begin{cases} a d_{n}^{*} η = [\begin{matrix} - η_{2} n_{2}^{T} & n_{1} η_{2} \\ 0 & 0 \end{matrix}] \\ a d_{n}^{*} η = {n, η} \end{cases}

(215)
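The closed form in (215) can be verified in the same way. This sketch again assumes the scalar case n = 1 and uses hypothetical helper names; it also evaluates the rate of change of the kinetic energy along the vector field ad*_η η, which should vanish because ⟨η, ad_η η⟩ = 0.

```python
def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def pairing(eta, n):
    """<eta, n> = Tr(eta^T n)."""
    return sum(eta[i][j] * n[i][j] for i in range(2) for j in range(2))

def ad(n, kappa):
    """ad_n kappa = n kappa - kappa n."""
    nk, kn = matmul2(n, kappa), matmul2(kappa, n)
    return [[nk[i][j] - kn[i][j] for j in range(2)] for i in range(2)]

def co_ad(n, eta):
    """Closed form of (215) in the scalar case."""
    n1, n2 = n[0]
    eta2 = eta[0][1]
    return [[-eta2 * n2, n1 * eta2], [0.0, 0.0]]

n = [[0.4, -1.2], [0.0, 0.0]]
eta = [[0.9, 0.25], [0.0, 0.0]]
kappa = [[-0.7, 0.3], [0.0, 0.0]]

lhs = pairing(co_ad(n, eta), kappa)    # <ad*_n eta, kappa>
rhs = pairing(eta, ad(n, kappa))       # <eta, ad_n kappa>

# With Pi_L = n_L = eta, the vector field is eta_dot = ad*_eta eta, and the
# kinetic energy rate is <eta, eta_dot>.
energy_rate = pairing(eta, co_ad(eta, eta))
print(lhs, rhs, energy_rate)
```

The vanishing energy rate is consistent with the kinetic energy being constant along solutions of the Euler-Poincaré equation.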

This co-adjoint operator will give the Euler-Poincaré equation. While the Euler-Lagrange equations are defined on the tangent bundle (union of the tangent spaces at each point) of the manifold and give the geodesics, the Euler-Poincaré equation gives a differential system on the dual Lie algebra of the group associated with the manifold.

We can also complete these maps with additional ones. First, we introduce

p \in T_{M}^{*} G

, the moment associated with

\dot{M} \in T_{M} G

in the tangent space of

G

at

M

, together with two other maps that send elements of the dual algebra to the dual tangent space, respectively on the left and on the right:

{\begin{cases} 〈 Π_{L}, n_{L} 〉 = 〈 d L_{M^{- 1}}^{*} Π_{L}, \dot{M} 〉 \\ 〈 Π_{L}, d L_{M^{- 1}} \dot{M} 〉 = 〈 Π_{L}, M^{- 1} \dot{M} 〉 \end{cases} \Rightarrow p = {(M^{- 1})}^{T} Π_{L}

(216)

where

\begin{matrix} d L_{M^{- 1}}^{*} : & g_{L}^{*} \to T_{M}^{*} G \\ Π_{L} \mapsto p = {(M^{- 1})}^{T} Π_{L} \end{matrix} and \begin{matrix} d R_{M^{- 1}}^{*} : & g_{R}^{*} \to T_{M}^{*} G \\ Π_{R} \mapsto p = Π_{R} {(M^{- 1})}^{T} \end{matrix}

(217)

From these relations, we can also observe that:

\begin{array}{l} Π_{L} = n_{L} = M^{- 1} \dot{M} \\ \Rightarrow {\begin{cases} p = {(M^{- 1})}^{T} M^{- 1} \dot{M} \\ p = Ξ_{M} \cdot \dot{M} with Ξ_{M} = {(M^{- 1})}^{T} M^{- 1} \end{cases} \end{array}

(218)

All these maps are summarized in Figure 12.

Henri Poincaré proved that when a Lie algebra acts locally and transitively on the configuration space of a Lagrangian mechanical system, the Euler-Lagrange equations are equivalent to a new system of differential equations defined on the product of the configuration space with the Lie algebra.

If we consider that the following functional is stationary for a Lagrangian l(.) invariant with respect to the action of a group on the left:

S (η_{L}) = \int_{a}^{b} l (η_{L}) d t with δ S (η_{L}) = 0 and l : g \to R

(219)

The solution is given by the Euler-Poincaré equation:

\begin{array}{l} \frac{d}{d t} \frac{δ l}{δ η_{L}} = a d_{η_{L}}^{*} \frac{δ l}{δ η_{L}} \\ δ η_{L} = \dot{Γ} + a d_{η_{L}} Γ where Γ (t) \in g \end{array}

(220)

If we take for the function l(.), the total kinetic energy

E_{L}

, using

Π_{L} = M^{- 1} \dot{M} = \frac{\partial E_{L}}{\partial n_{L}} \in g_{L}

, then the Euler-Poincaré equation is given by:

\frac{d Π_{L}}{d t} = a d_{n_{L}}^{*} Π_{L} with \frac{δ l}{δ η_{L}} = \frac{\partial E_{L}}{\partial n_{L}} = Π_{L} \in g_{L}

(221)

The following quantities are conserved:

\frac{d Π_{R}}{d t} = 0

(222)

With this second theorem, it is possible to write the geodesic not in terms of its coordinate system but in terms of the momentum, and in addition to determine explicitly which quantities are conserved along the geodesic (conservation laws are related to the symmetries of the manifold and hence to the invariance of the Lagrangian under the action of the group).

For our use-case, the Euler-Poincaré equation is given by:

{\begin{cases} {\dot{η}}_{L 1} = - η_{L 2} η_{L 2}^{T} \\ {\dot{η}}_{L 2} = η_{L 1} η_{L 2} \end{cases} with {\begin{cases} η_{L 1} = R^{- 1 / 2} {\dot{R}}^{1 / 2} \\ η_{L 2} = R^{- 1 / 2} \dot{m} \end{cases} \Rightarrow {\begin{cases} {(R^{- 1 / 2} {\dot{R}}^{1 / 2})}^{•} = - R^{- 1 / 2} \dot{m} {\dot{m}}^{T} R^{- 1 / 2} \\ {(R^{- 1 / 2} \dot{m})}^{•} = R^{- 1 / 2} {\dot{R}}^{1 / 2} R^{- 1 / 2} \dot{m} \end{cases}

(223)

If we note that

R^{- 1 / 2} {\dot{R}}^{1 / 2} = R^{- 1 / 2} (R^{- 1 / 2} \dot{R}) = R^{- 1} \dot{R}

, then the conserved Souriau moment could be given by:

Π_{R} = [\begin{matrix} R^{- 1 / 2} {\dot{R}}^{1 / 2} + R^{- 1} \dot{m} m^{T} & R^{- 1} \dot{m} \\ 0 & 0 \end{matrix}] = [\begin{matrix} R^{- 1} \dot{R} + R^{- 1} \dot{m} m^{T} & R^{- 1} \dot{m} \\ 0 & 0 \end{matrix}]

(224)

The components of the Souriau moment give the conserved quantities, which are the classical invariants given by Emmy Noether's theorem (the Souriau moment is a geometrization of Emmy Noether's theorem):

\frac{d Π_{R}}{d t} = [\begin{matrix} \frac{d (R^{- 1} \dot{R} + R^{- 1} \dot{m} m^{T})}{d t} & \frac{d (R^{- 1} \dot{m})}{d t} \\ 0 & 0 \end{matrix}] = 0 \Rightarrow {\begin{cases} R^{- 1} \dot{R} + R^{- 1} \dot{m} m^{T} = B = const \\ R^{- 1} \dot{m} = b = const \end{cases}

(225)

From these constants, we obtain a reduced geodesic equation:

{\begin{cases} \dot{m} = R b \\ \dot{R} = R (B - b m^{T}) \end{cases}

(226)

These are the Euler-Poincaré equations of the geodesic. We can observe that they are a reduction of the following Euler-Lagrange equations [27,156,187]:

{\begin{cases} \ddot{R} + \dot{m} {\dot{m}}^{T} - \dot{R} R^{- 1} \dot{R} = 0 \\ \ddot{m} - \dot{R} R^{- 1} \dot{m} = 0 \end{cases}

associated to the information geometry metric

d s^{2} = d m^{T} R^{- 1} d m + \frac{1}{2} T r ({(R^{- 1} d R)}^{2})
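As a short worked check (a sketch, not part of the original derivation), differentiating the reduced first-order system (226) once in time and substituting the constants b = R^{-1} \dot{m} and B - b m^{T} = R^{-1} \dot{R} from (225) recovers the second-order Euler-Lagrange system above:

```latex
\begin{aligned}
\dot{m} = R b
  \;&\Rightarrow\; \ddot{m} = \dot{R}\, b = \dot{R} R^{-1} \dot{m}
  \;\Rightarrow\; \ddot{m} - \dot{R} R^{-1} \dot{m} = 0 \\
\dot{R} = R (B - b m^{T})
  \;&\Rightarrow\; \ddot{R} = \dot{R} (B - b m^{T}) - R b\, \dot{m}^{T}
   = \dot{R} R^{-1} \dot{R} - \dot{m} \dot{m}^{T}
  \;\Rightarrow\; \ddot{R} + \dot{m} \dot{m}^{T} - \dot{R} R^{-1} \dot{R} = 0
\end{aligned}
```

The second line uses R b = \dot{m}, so the two integration constants (b, B) absorb exactly one order of differentiation in each equation.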

The Fisher information defines a metric turning

N_{n} = {(m, R) \in R^{n} \times S y m^{+} (n)}

into a Riemannian manifold. The inner product of two tangent vectors

(m_{1}, R_{1}) \in T_{n}

and

(m_{2}, R_{2}) \in T_{n}

at the point

(μ, Σ) \in N_{n}

is given by:

g_{(μ, Σ)} ((m_{1}, R_{1}), (m_{2}, R_{2})) = m_{1}^{T} Σ^{- 1} m_{2} + \frac{1}{2} t r (Σ^{- 1} R_{1} Σ^{- 1} R_{2})

(227)

and geodesics are the curves χ that minimize the length:

l (χ) = \int_{t_{0}}^{t_{1}} \sqrt{g_{χ (t)} (\dot{χ} (t), \dot{χ} (t))} d t

(228)

We can also observe that the manifold of multivariate Gaussian densities is homogeneous with respect to the positive affine group

G A^{+} (n)

d s_{Y}^{2} = d s_{X}^{2} for Y = Σ^{1 / 2} X + μ with G A^{+} (n) = {(μ, Σ) \in R^{n} \times G L_{n} (R) / \det (Σ) > 0}

(229)

characterized by the action of the group

(m, R) \mapsto ρ . (m, R) = (Σ^{1 / 2} m + μ, Σ^{1 / 2} R Σ^{1 / 2 T}), ρ \in G A^{+} (n)

with [\begin{matrix} Y \\ 1 \end{matrix}] = [\begin{matrix} Σ^{1 / 2} & μ \\ 0 & 1 \end{matrix}] [\begin{matrix} X \\ 1 \end{matrix}]

(230)

\begin{array}{l} d s_{Y}^{2} = d {(Σ^{1 / 2} m + μ)}^{T} {(Σ^{1 / 2} R Σ^{1 / 2 T})}^{- 1} d (Σ^{1 / 2} m + μ) + \frac{1}{2} T r ({({(Σ^{1 / 2} R Σ^{1 / 2 T})}^{- 1} d (Σ^{1 / 2} R Σ^{1 / 2 T}))}^{2}) \\ d s_{Y}^{2} = d m^{T} R^{- 1} d m + \frac{1}{2} T r ({(R^{- 1} d R)}^{2}) = d s_{X}^{2} \end{array}

(231)
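The invariance (231) can be illustrated numerically. The sketch below assumes n = 2, writes S for Σ^{1/2}, and pushes a tangent vector (dm, dR) forward to (S dm, S dR S^T), which is the differential of the group action (230); all names and numerical values are illustrative assumptions.

```python
def matmul(a, b):
    """Product of two matrices stored as nested lists (any compatible shapes)."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def inv2(a):
    """Inverse of a 2 x 2 matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]

def ds2(R, dm, dR):
    """ds^2 = dm^T R^{-1} dm + (1/2) Tr((R^{-1} dR)^2) at the point (m, R)."""
    R_inv = inv2(R)
    quadratic = matmul(matmul(transpose(dm), R_inv), dm)[0][0]
    W = matmul(R_inv, dR)
    WW = matmul(W, W)
    return quadratic + 0.5 * (WW[0][0] + WW[1][1])

def act(S, R, dm, dR):
    """Image of the point R and tangent vector (dm, dR) under (m, R) -> (S m + mu, S R S^T).

    The translation mu does not enter the metric, so it is omitted here.
    """
    St = transpose(S)
    return matmul(matmul(S, R), St), matmul(S, dm), matmul(matmul(S, dR), St)

R = [[2.0, 0.5], [0.5, 1.0]]            # symmetric positive definite
dm = [[0.3], [-0.1]]                    # tangent vector, mean part
dR = [[0.2, 0.1], [0.1, -0.4]]          # tangent vector, covariance part
S = [[1.5, 0.2], [-0.3, 0.8]]           # det(S) = 1.26 > 0

before = ds2(R, dm, dR)
after = ds2(*act(S, R, dm, dR))
print(before, after)
```

The two values agree to rounding error for any invertible S, which is the homogeneity property used to restrict geodesic computations to a single base point.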

Since the special orthogonal group

S O (n) = {δ \in G L_{n} (R) / δ δ^{T} = I_{n}, \det (δ) = 1}

is the stabilizer subgroup of

(0, I_{n})

, we have the following isomorphism:

\begin{array}{l} G A^{+} (n) / S O (n) \to N_{n} = {(m, R) \in R^{n} \times S y m^{+} (n)} \\ ρ = (μ, Σ) \mapsto ρ . (0, I_{n}) = (μ, Σ^{1 / 2} Σ^{1 / 2 T}) = (μ, Σ) \end{array}

(232)

We can therefore restrict the computation to geodesics starting from

(0, I_{n})

and partially integrate the system of equations:

{\begin{cases} \dot{m} = R b \\ \dot{R} = R (B - b m^{T}) \end{cases}

(233)

where

(R^{- 1} (0) \dot{m} (0), R^{- 1} (0) (\dot{R} (0) + \dot{m} (0) m {(0)}^{T})) = (b, B) \in R^{n} \times S y m_{n} (R)

are the integration constants.

From this Euler-Poincaré equation, we can compute geodesics by geodesic shooting [188,189,190,191] using classical Eriksen equations [192,193,194,195], by the following change of parameters:

{\begin{cases} Δ (t) = R^{- 1} (t) \\ δ (t) = R^{- 1} (t) m (t) \end{cases} \Rightarrow {\begin{cases} \dot{Δ} = - B Δ + b δ^{T} \\ \dot{δ} = - B δ + (1 + δ^{T} Δ^{- 1} δ) b \\ Δ (0) = I_{n}, δ (0) = 0 \end{cases} with {\begin{cases} \dot{Δ} (0) = - B \\ \dot{δ} (0) = b \end{cases}

(234)

The initial speed of the geodesic is given by

(\dot{δ} (0), \dot{Δ} (0))

. The geodesic shooting is given by the exponential map:

Λ (t) = \exp (t A) = \sum_{n = 0}^{\infty} \frac{{(t A)}^{n}}{n!} = (\begin{matrix} Δ & δ & Φ \\ δ^{T} & ε & γ^{T} \\ Φ^{T} & γ & Γ \end{matrix}) with A = (\begin{matrix} - B & b & 0 \\ b^{T} & 0 & - b^{T} \\ 0 & - b & B \end{matrix})

(235)

This equation can be interpreted by group theory.

A

could be considered as an element of Lie algebra

s o (n + 1, n)

of the special Lorentz group

S O_{O} (n + 1, n)

and more specifically as the element

p

of Cartan Decomposition

l + p

where

l

is the Lie algebra of a maximal compact sub-group

K = S (O (n + 1) \times O (n))

of the group

G = S O_{O} (n + 1, n)

. We know that its exponential map defines a geodesic on the Riemannian symmetric space

G / K

. This equation can be established by the following developments:

\dot{Λ} (t) = A . Λ (t) \Rightarrow (\begin{matrix} \dot{Δ} & \dot{δ} & \dot{Φ} \\ {\dot{δ}}^{T} & \dot{ε} & {\dot{γ}}^{T} \\ {\dot{Φ}}^{T} & \dot{γ} & \dot{Γ} \end{matrix}) = (\begin{matrix} - B & b & 0 \\ b^{T} & 0 & - b^{T} \\ 0 & - b & B \end{matrix}) . (\begin{matrix} Δ & δ & Φ \\ δ^{T} & ε & γ^{T} \\ Φ^{T} & γ & Γ \end{matrix})

(236)

We can then deduce that:

{\begin{cases} \dot{Δ} = - B Δ + b δ^{T} \\ \dot{δ} = - B δ + ε b \end{cases}

(237)

If

ε = 1 + δ^{T} Δ^{- 1} δ

, then

(Δ, δ)

is a solution to the geodesic equation previously defined. Since

ε (0) = 1

and δ (0) = 0, it suffices to demonstrate that

\dot{ε} = \dot{τ}

where

τ = δ^{T} Δ^{- 1} δ

. From

\dot{Λ} (t) = Λ (t) . A

, using that

{\dot{δ}}^{T} = b^{T} Δ - b^{T} Φ^{T}

, we can deduce:

{\begin{cases} \dot{ε} = b^{T} δ - b^{T} γ \\ \dot{τ} = b^{T} δ - b^{T} ((τ - ε) Δ^{- 1} δ + Φ^{T} Δ^{- 1} δ) \end{cases}

(238)

Then

\dot{ε} = \dot{τ}

, if

γ = (τ - ε) Δ^{- 1} δ + Φ^{T} Δ^{- 1} δ

, that could be verified using relation

Λ . Λ^{- 1} = I

, by observing that:

Λ^{- 1} = \exp (- t A) = Λ (- t) = [\begin{matrix} Γ & γ & Φ^{T} \\ γ^{T} & ε & δ^{T} \\ Φ & δ & Δ \end{matrix}]

(239)

Λ . Λ^{- 1} = I \Rightarrow {\begin{cases} Δ γ + ε δ + Φ δ = 0 \\ Δ Φ^{T} + δ δ^{T} + Φ Δ = 0 \end{cases} \Rightarrow {\begin{cases} γ = - ε Δ^{- 1} δ - Δ^{- 1} Φ δ \\ Φ^{T} Δ^{- 1} + Δ^{- 1} δ δ^{T} Δ^{- 1} + Δ^{- 1} Φ = 0 \end{cases} \Rightarrow {\begin{cases} γ = - ε Δ^{- 1} δ - Δ^{- 1} Φ δ \\ Φ^{T} Δ^{- 1} δ + τ Δ^{- 1} δ + Δ^{- 1} Φ δ = 0 \end{cases}

(240)

We can then compute

γ

from the last two equations:

γ = (τ - ε) Δ^{- 1} δ + Φ^{T} Δ^{- 1} δ

(241)

\dot{τ} = b^{T} δ - b^{T} ((τ - ε) Δ^{- 1} δ + Φ^{T} Δ^{- 1} δ)

then we can deduce that

\dot{τ} = b^{T} δ - b^{T} γ

and then

\dot{τ} = \dot{ε}
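The result just established can be illustrated numerically in the univariate case n = 1, where A in (235) is a 3 × 3 matrix. The following is a sketch under that assumption, with a truncated power series standing in for the matrix exponential; function names and the values of b, B, t are hypothetical.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

def expm(a, terms=40):
    """Matrix exponential by truncated power series (adequate for small norms)."""
    size = len(a)
    result = [[float(i == j) for j in range(size)] for i in range(size)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        product = matmul(term, a)
        term = [[x / k for x in row] for row in product]
        result = [[x + y for x, y in zip(rr, rt)] for rr, rt in zip(result, term)]
    return result

def blocks(t, b, B):
    """Return (Delta, delta, epsilon) read from Lambda(t) = exp(t A) for n = 1."""
    A = [[-B, b, 0.0], [b, 0.0, -b], [0.0, -b, B]]
    Lam = expm([[t * x for x in row] for row in A])
    return Lam[0][0], Lam[0][1], Lam[1][1]

b, B, t = 0.5, 0.3, 0.8                 # arbitrary initial data and time
Delta, delta, eps = blocks(t, b, B)

# Constraint epsilon = 1 + delta^T Delta^{-1} delta (scalar form).
constraint_gap = abs(eps - (1.0 + delta * delta / Delta))

# First equation of (237), checked by a central finite difference in t.
h = 1e-5
Delta_plus, _, _ = blocks(t + h, b, B)
Delta_minus, _, _ = blocks(t - h, b, B)
ode_gap = abs((Delta_plus - Delta_minus) / (2 * h) - (-B * Delta + b * delta))

# Back to the Gaussian parameters: R = Delta^{-1}, m = Delta^{-1} delta.
R_t, m_t = 1.0 / Delta, delta / Delta
print(constraint_gap, ode_gap, m_t, R_t)
```

Both gaps are at the level of rounding or discretization error, so the pair (Δ, δ) read from the exponential does follow the reduced geodesic system in this scalar example.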

To interpret the remaining elements of

Λ

, we observe that

(Γ (t), γ (t)) = (Δ (- t), δ (- t))

are the points opposite to

(Δ (t), δ (t))

, and that

ε = 1 + δ^{T} Δ^{- 1} δ = 1 + γ^{T} Γ^{- 1} γ

Then the geodesic that goes through the origin

(0, I_{n})

with initial tangent vector

(b, - B)

is the curve given by

(δ (t), Δ (t))

. The distance computation then reduces to estimating the initial tangent vector, which is related to the integration constants by

(R^{- 1} (0) \dot{m} (0), R^{- 1} (0) (\dot{R} (0) + \dot{m} (0) m {(0)}^{T})) = (b, B) \in R^{n} \times S y m_{n} (R)

The distance will be then given by the initial tangent vector:

d = \sqrt{\dot{m} {(0)}^{T} R^{- 1} (0) \dot{m} (0) + \frac{1}{2} T r [{(R^{- 1} (0) \dot{R} (0))}^{2}]}

(242)
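To make (242) concrete, the sketch below shoots a geodesic from (0, 1) in the univariate case by integrating the reduced system with a classical Runge-Kutta scheme, then compares the initial speed with the known closed-form distance between univariate Gaussians. That closed form (a hyperbolic half-plane distance rescaled by √2) is a standard result that is not stated in this paper and is used here only as an independent reference; the integrator, the function names, and the numerical values are assumptions.

```python
import math

def rhs(m, R, b, B):
    """Reduced geodesic system for n = 1: m' = R b, R' = R (B - b m)."""
    return R * b, R * (B - b * m)

def shoot(b, B, steps=2000):
    """Integrate from (m, R) = (0, 1) over t in [0, 1] with classical RK4."""
    m, R = 0.0, 1.0
    h = 1.0 / steps
    for _ in range(steps):
        k1 = rhs(m, R, b, B)
        k2 = rhs(m + 0.5 * h * k1[0], R + 0.5 * h * k1[1], b, B)
        k3 = rhs(m + 0.5 * h * k2[0], R + 0.5 * h * k2[1], b, B)
        k4 = rhs(m + h * k3[0], R + h * k3[1], b, B)
        m += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        R += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return m, R

def distance_from_tangent(b, B):
    """Formula (242) at (0, 1), where m'(0) = b and R'(0) = B."""
    return math.sqrt(b * b + 0.5 * B * B)

def closed_form(m0, s0, m1, s1):
    """Reference distance for ds^2 = dm^2 / s^2 + 2 ds^2 / s^2, with s = sqrt(R)."""
    ratio = (0.5 * (m1 - m0) ** 2 + (s1 - s0) ** 2) / (2.0 * s0 * s1)
    return math.sqrt(2.0) * math.acosh(1.0 + ratio)

b, B = 0.6, -0.4
m_end, R_end = shoot(b, B)
d_shooting = distance_from_tangent(b, B)
d_reference = closed_form(0.0, 1.0, m_end, math.sqrt(R_end))
print(m_end, R_end, d_shooting, d_reference)
```

In practice the end point is given and the tangent vector (b, B) is the unknown, so a geodesic shooting method would iterate on (b, B) until `shoot` lands on the target; the sketch only checks the forward direction.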

This initial tangent vector will be identified by “Geodesic Shooting”. Let

V = \log_{A} B

{\begin{cases} \frac{d V_{m}}{d t} = \frac{1}{2} (\frac{d R}{d t}) R^{- 1} V_{m} + \frac{1}{2} V_{R} R^{- 1} (\frac{d m}{d t}) \\ \frac{d V_{R}}{d t} = \frac{1}{2} ((\frac{d R}{d t}) R^{- 1} V_{m} + V_{R} R^{- 1} (\frac{d R}{d t})) - \frac{1}{2} ((\frac{d m}{d t}) V_{m}^{T} + V_{m}^{T} (\frac{d m}{d t})) \end{cases}

(243)

Geodesic Shooting is corrected by using Jacobi Field J and parallel transport:

J (t) = {\frac{\partial χ_{α} (t)}{\partial α} |}_{α = 0}

solution to

\frac{d^{2} J (t)}{d t^{2}} + R (J (t), \dot{χ} (t)) \dot{χ} (t) = 0

with R the Riemann curvature tensor.

We consider a geodesic

χ

between

θ_{0}

and

θ_{1}

with an initial tangent vector

V

, and we suppose that

V

is perturbed by

W

, to

V + W

. The variation of the final point

θ_{1}

can be determined thanks to the Jacobi field with

J (0) = 0

and

\dot{J} (0) = W

. In terms of the exponential map, this can be written:

J (t) = {\frac{d}{d α} \exp_{θ_{0}} (t (V + α W)) |}_{α = 0}

(244)

This is illustrated in Figure 13.

Figure 14 illustrates geodesic shooting used to compute the distance between multivariate Gaussian densities for the case n = 2.

9. Souriau Riemannian Metric for Multivariate Gaussian Densities

To illustrate the Souriau-Fisher metric, we consider the family of multivariate Gaussian densities and make explicit some of the elements that were developed purely theoretically in the previous sections.

For the family of multivariate Gaussian densities, which we have identified as a homogeneous manifold associated with the subgroup of the affine group

[\begin{matrix} R^{1 / 2} & m \\ 0 & 1 \end{matrix}]

, we have seen that if we consider them as elements of exponential families, we can write

\hat{ξ}

(element of the dual Lie algebra) that plays the role of geometric heat

Q

in Souriau Lie group thermodynamics, and

β

the geometric (Planck) temperature.

\hat{ξ} = [\begin{matrix} E [z] \\ E [z z^{T}] \end{matrix}] = [\begin{matrix} m \\ R + m m^{T} \end{matrix}], β = [\begin{matrix} - R^{- 1} m \\ \frac{1}{2} R^{- 1} \end{matrix}]

(245)

These elements can be identified with the following matrices in the matrix Lie algebra and its dual:

\hat{ξ} = [\begin{matrix} R + m m^{T} & m \\ 0 & 0 \end{matrix}] \in g^{*}, β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] \in g

(246)
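The pair (ξ̂, β) of (245) can be checked against the Massieu characteristic function in the univariate case. The sketch below assumes n = 1, the convention Φ(β) = -log ∫ exp(-⟨β, U(z)⟩) dz with U(z) = (z, z²), and the closed form of the Gaussian integral; the function names and the finite-difference step are illustrative choices.

```python
import math

def massieu(beta1, beta2):
    """Phi(beta) = -log of the integral of exp(-(beta1 z + beta2 z^2)) over R.

    Uses the Gaussian integral sqrt(pi / beta2) * exp(beta1^2 / (4 beta2)),
    valid for beta2 > 0.
    """
    return -0.5 * math.log(math.pi / beta2) - beta1 ** 2 / (4.0 * beta2)

def temperature(m, R):
    """Geometric temperature of (245) for n = 1: (-m / R, 1 / (2 R))."""
    return -m / R, 1.0 / (2.0 * R)

def heat(m, R):
    """Geometric heat of (245) for n = 1: (E[z], E[z^2]) = (m, R + m^2)."""
    return m, R + m * m

def gradient(f, x, y, h=1e-6):
    """Central finite-difference gradient of a function of two variables."""
    return ((f(x + h, y) - f(x - h, y)) / (2 * h),
            (f(x, y + h) - f(x, y - h)) / (2 * h))

m, R = 0.7, 2.0
beta1, beta2 = temperature(m, R)
Q_numeric = gradient(massieu, beta1, beta2)   # Q = dPhi / dbeta
Q_expected = heat(m, R)
print(Q_numeric, Q_expected)
```

The gradient of Φ with respect to β reproduces (m, R + m²), in line with the role of ξ̂ as the heat associated with the temperature β.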

If we consider

M = [\begin{matrix} R'^{1 / 2} & m' \\ 0 & 1 \end{matrix}]

, then we can compute the co-adjoint operator:

A d_{M}^{*} \hat{ξ} = [\begin{matrix} R + m m^{T} - m m'^{T} & R^{' 1 / 2} m \\ 0 & 0 \end{matrix}]

(247)

We can also compute the adjoint operator:

\begin{array}{l} A d_{M} β = M \cdot β \cdot M^{- 1} = [\begin{matrix} R'^{1 / 2} & m' \\ 0 & 1 \end{matrix}] [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] [\begin{matrix} R'^{- 1 / 2} & - R'^{- 1 / 2} m' \\ 0 & 1 \end{matrix}] \\ A d_{M} β = [\begin{matrix} \frac{1}{2} R'^{1 / 2} R^{- 1} R'^{- 1 / 2} & - \frac{1}{2} R'^{1 / 2} R^{- 1} R'^{- 1 / 2} m' - R'^{1 / 2} R^{- 1} m \\ 0 & 0 \end{matrix}] \end{array}

(248)

We can rewrite

A d_{M} β

with the following identification:

\begin{array}{l} A d_{M} β = [\begin{matrix} \frac{1}{2} Ω^{- 1} & - Ω^{- 1} n \\ 0 & 0 \end{matrix}] \\ with Ω = R'^{1 / 2} R R'^{- 1 / 2} and n = (\frac{1}{2} m' + R'^{1 / 2} m) \end{array}

(249)

We have then to develop

\hat{ξ} (A d_{M} (β))

, that is to say

\hat{ξ} (β)

after action of the group on the Lie algebra for

β

, given by

A d_{M} (β)

. By analogy of structure between

\hat{ξ} (β)

and

β

, we can write:

\begin{array}{l} β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] \\ \hat{ξ} (β) = [\begin{matrix} R + m m^{T} & m \\ 0 & 0 \end{matrix}] \end{array}} \Rightarrow {\begin{cases} A d_{M} β = [\begin{matrix} \frac{1}{2} Ω^{- 1} & - Ω^{- 1} n \\ 0 & 0 \end{matrix}] \\ \hat{ξ} (A d_{M} (β)) = [\begin{matrix} Ω + n n^{T} & n \\ 0 & 0 \end{matrix}] \end{cases}

(250)

We have then to identify the cocycle

θ (M)

from

\hat{ξ} (A d_{M} (β)) = A d_{M}^{*} (\hat{ξ}) + θ (M)

\Rightarrow θ (M) = \hat{ξ} (A d_{M} (β)) - A d_{M}^{*} \hat{ξ}

where:

A d_{M}^{*} \hat{ξ} = [\begin{matrix} R + m m^{T} - m m'^{T} & R^{' 1 / 2} m \\ 0 & 0 \end{matrix}]

(251)

\hat{ξ} (A d_{M} (β)) = [\begin{matrix} R'^{1 / 2} R R'^{- 1 / 2} + (\frac{1}{2} m' + R'^{1 / 2} m) {(\frac{1}{2} m' + R'^{1 / 2} m)}^{T} & (\frac{1}{2} m' + R'^{1 / 2} m) \\ 0 & 0 \end{matrix}]

(252)

The cocycle is then given by:

\begin{array}{l} θ (M) = [\begin{matrix} R'^{1 / 2} R R'^{- 1 / 2} + (\frac{1}{2} m' + R'^{1 / 2} m) {(\frac{1}{2} m' + R'^{1 / 2} m)}^{T} & (\frac{1}{2} m' + R'^{1 / 2} m) \\ 0 & 0 \end{matrix}] - [\begin{matrix} R + m m^{T} - m m'^{T} & R^{' 1 / 2} m \\ 0 & 0 \end{matrix}] \\ θ (M) = [\begin{matrix} (R'^{1 / 2} R R'^{- 1 / 2} - R) + (R^{' 1 / 2} m m^{T} R^{' 1 / 2 T} - m m^{T}) + (\frac{1}{2} m' m^{T} R^{' 1 / 2 T} + \frac{1}{2} R^{' 1 / 2} m m'^{T} + m m'^{T}) + \frac{1}{4} m' m'^{T} & \frac{1}{2} m' \\ 0 & 0 \end{matrix}] \end{array}

(253)
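The chain of computations (248) to (253) can be followed numerically in the univariate case. The sketch below assumes n = 1, writes s for R'^{1/2} and mp for m', and reads (Ω, n) off the matrix Ad_M β as in (249); it only checks the first line of (253), that is, the difference of the two matrices. All names and values are hypothetical.

```python
def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def Ad_beta(s, mp, m, R):
    """Ad_M beta = M beta M^{-1} with M = [[s, mp], [0, 1]] (scalar blocks)."""
    M = [[s, mp], [0.0, 1.0]]
    M_inv = [[1.0 / s, -mp / s], [0.0, 1.0]]
    beta = [[1.0 / (2.0 * R), -m / R], [0.0, 0.0]]
    return matmul2(matmul2(M, beta), M_inv)

def read_parameters(X):
    """Identify X = [[1/(2 Omega), -n / Omega], [0, 0]] as in (249)."""
    Omega = 1.0 / (2.0 * X[0][0])
    n = -X[0][1] * Omega
    return Omega, n

def heat(m, R):
    """Heat matrix of (246) for n = 1."""
    return [[R + m * m, m], [0.0, 0.0]]

def coAd_heat(s, mp, m, R):
    """Ad*_M applied to the heat matrix, formula (247) for n = 1."""
    return [[R + m * m - m * mp, s * m], [0.0, 0.0]]

s, mp = 1.3, 0.8          # group element M
m, R = 0.7, 2.0           # Gaussian parameters

Omega, n = read_parameters(Ad_beta(s, mp, m, R))
transformed = heat(n, Omega)
shifted = coAd_heat(s, mp, m, R)
theta = [[transformed[i][j] - shifted[i][j] for j in range(2)] for i in range(2)]
print(Omega, n, theta)
```

In this scalar example Ω reduces to R, n equals m'/2 + R'^{1/2} m, and the upper-right entry of θ(M) equals m'/2, as in (253).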

From

θ (M) = \hat{ξ} (A d_{M} (β)) - A d_{M}^{*} \hat{ξ}

, we can compute cocycle in Lie algebra

Θ = T_{e} θ

(254)

used to define the tensor:

\begin{matrix} \tilde{Θ} (X, Y) : & g \times g \to ℜ \\ X, Y \mapsto 〈 Θ (X), Y 〉 \end{matrix}

(255)

In this second part, we will compute the Souriau-Fisher metric given by:

g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}])

(256)

with

{\tilde{Θ}}_{β} (Z_{1}, Z_{2}) = \tilde{Θ} (Z_{1}, Z_{2}) + 〈 \hat{ξ}, a d_{Z_{1}} Z_{2} 〉 = 〈 Θ (Z_{1}), Z_{2} 〉 + 〈 \hat{ξ}, [Z_{1}, Z_{2}] 〉

(257)

\begin{matrix} g_{β} ([β, Z_{1}], [β, Z_{2}]) = {\tilde{Θ}}_{β} (Z_{1}, [β, Z_{2}]) = \tilde{Θ} (Z_{1}, [β, Z_{2}]) + 〈 \hat{ξ}, [Z_{1}, [β, Z_{2}]] 〉 \\ = 〈 Θ (Z_{1}), [β, Z_{2}] 〉 + 〈 \hat{ξ}, [Z_{1}, [β, Z_{2}]] 〉 \end{matrix}

(258)

where

β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] and \hat{ξ} = [\begin{matrix} R + m m^{T} & m \\ 0 & 0 \end{matrix}]

(259)

If we set Z_{1} = [\begin{matrix} \frac{1}{2} Ω_{1}^{- 1} & - Ω_{1}^{- 1} n_{1} \\ 0 & 0 \end{matrix}] and Z_{2} = [\begin{matrix} \frac{1}{2} Ω_{2}^{- 1} & - Ω_{2}^{- 1} n_{2} \\ 0 & 0 \end{matrix}]

(260)

With

〈 ..., ... 〉

the inner product given by

〈 ξ, β 〉 = T r [b a^{T} + H^{T} L] with ξ = [\begin{matrix} L & b \\ 0 & 0 \end{matrix}], β = [\begin{matrix} H & a \\ 0 & 0 \end{matrix}]

(261)

\begin{array}{l} [β, Z_{2}] = β Z_{2} - Z_{2} β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] [\begin{matrix} \frac{1}{2} Ω_{2}^{- 1} & - Ω_{2}^{- 1} n_{2} \\ 0 & 0 \end{matrix}] - [\begin{matrix} \frac{1}{2} Ω_{2}^{- 1} & - Ω_{2}^{- 1} n_{2} \\ 0 & 0 \end{matrix}] [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}] \\ [β, Z_{2}] = [\begin{matrix} \frac{1}{4} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) & - \frac{1}{2} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m) \\ 0 & 0 \end{matrix}] \end{array}

(262)

\begin{matrix} [Z_{1}, [β, Z_{2}]] = [\begin{matrix} \frac{1}{2} Ω_{1}^{- 1} & - Ω_{1}^{- 1} n_{1} \\ 0 & 0 \end{matrix}] [\begin{matrix} \frac{1}{4} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) & - \frac{1}{2} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m) \\ 0 & 0 \end{matrix}] \\ - [\begin{matrix} \frac{1}{4} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) & - \frac{1}{2} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m) \\ 0 & 0 \end{matrix}] [\begin{matrix} \frac{1}{2} Ω_{1}^{- 1} & - Ω_{1}^{- 1} n_{1} \\ 0 & 0 \end{matrix}] \\ = [\begin{matrix} \frac{1}{8} (Ω_{1}^{- 1} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) - (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) Ω_{1}^{- 1}) & - \frac{1}{4} (Ω_{1}^{- 1} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m) - (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) Ω_{1}^{- 1} n_{1}) \\ 0 & 0 \end{matrix}] \end{matrix}

(263)

We can then compute:

\begin{array}{l} 〈 \hat{ξ}, [Z_{1}, [β, Z_{2}]] 〉 = T r [\frac{1}{4} m {((R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) Ω_{1}^{- 1} n_{1} - Ω_{1}^{- 1} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m))}^{T}] \\ + T r [(\frac{1}{8} (Ω_{1}^{- 1} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) - (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) Ω_{1}^{- 1})) (R + m m^{T})] \end{array}

(264)

The Souriau-Fisher metric is defined in Lie algebra

g_{β} ([β, Z_{1}], [β, Z_{2}])

where:

\begin{array}{l} [β, Z_{1}] = [\begin{matrix} \frac{1}{4} (R^{- 1} Ω_{1}^{- 1} - Ω_{1}^{- 1} R^{- 1}) & - \frac{1}{2} (R^{- 1} Ω_{1}^{- 1} n_{1} - Ω_{1}^{- 1} R^{- 1} m) \\ 0 & 0 \end{matrix}] = [\begin{matrix} \frac{1}{2} G_{1}^{- 1} & - G_{1}^{- 1} g_{1} \\ 0 & 0 \end{matrix}] \\ with G_{1} = 2 (Ω_{1} R - R Ω_{1}) and g_{1} = (I - R Ω_{1} R^{- 1} Ω_{1}^{- 1}) n_{1} + (Ω_{1} R Ω_{1}^{- 1} R^{- 1} - I) m \\ [β, Z_{2}] = [\begin{matrix} \frac{1}{4} (R^{- 1} Ω_{2}^{- 1} - Ω_{2}^{- 1} R^{- 1}) & - \frac{1}{2} (R^{- 1} Ω_{2}^{- 1} n_{2} - Ω_{2}^{- 1} R^{- 1} m) \\ 0 & 0 \end{matrix}] = [\begin{matrix} \frac{1}{2} G_{2}^{- 1} & - G_{2}^{- 1} g_{2} \\ 0 & 0 \end{matrix}] \\ with G_{2} = 2 (Ω_{2} R - R Ω_{2}) and g_{2} = (I - R Ω_{2} R^{- 1} Ω_{2}^{- 1}) n_{2} + (Ω_{2} R Ω_{2}^{- 1} R^{- 1} - I) m \end{array}

(265)

and

β = [\begin{matrix} \frac{1}{2} R^{- 1} & - R^{- 1} m \\ 0 & 0 \end{matrix}]

(266)

Another approach to develop the Souriau-Fisher metric

g_{β} ([β, Z_{1}], [β, Z_{2}])

is to compute the tensor

\tilde{Θ} (X, Y)

from the moment map

J

\tilde{Θ} (X, Y) = J_{[X, Y]} - {J_{X}, J_{Y}} with {., .} Poisson Bracket and J the Moment Map

(267)

\tilde{Θ} (X, Y) : g \times g \to ℜ

(268)

We can then write the Souriau-Fisher metric as:

{\tilde{Θ}}_{β} (Z_{1}, Z_{2}) = J_{[Z_{1}, Z_{2}]} - {J_{Z_{1}}, J_{Z_{2}}} + 〈 \hat{ξ}, [Z_{1}, Z_{2}] 〉

(269)

where the associated differentiable map

J

, called the moment map, is:

\begin{matrix} J : & M \to g^{*} such that J_{X} (x) = 〈 J (x), X 〉, X \in g \\ x \mapsto J (x) \end{matrix}

(270)

This moment map could be identified with the operator that transforms the right algebra to an element of its dual algebra given by:

\begin{matrix} β_{M} : & g \to g^{*} \\ Z = [\begin{matrix} N & η \\ 0 & 0 \end{matrix}] \mapsto J = [\begin{matrix} N (1 + m^{T} R^{- 1} m) + η m^{T} R^{- 1} & N R^{- 1} m + R^{- 1} η \\ 0 & 0 \end{matrix}] \end{matrix}

(271)

10. Conclusions

In this paper, we have developed a Souriau model of Lie group thermodynamics that recovers the symmetry broken by the lack of covariance of the Gibbs density in classical statistical mechanics with respect to the action of dynamic groups in physics (Galileo and Poincaré groups, subgroups of the affine group). The ontological model of Souriau gives geometric status to (Planck) temperature (element of the Lie algebra), heat (element of the dual Lie algebra) and entropy. Souriau said in one of his papers [30] on this new “Lie group thermodynamics” that “these formulas are universal, in that they do not involve the symplectic manifold, but only group G, the symplectic cocycle. Perhaps this Lie group thermodynamics could be of interest for mathematics”.

For this new covariant thermodynamics, the fundamental notion is the coadjoint orbit that is linked to positive definite KKS (Kostant–Kirillov–Souriau) 2-form [196]:

ω_{w} (X, Y) = 〈 w, [U, V] 〉 with X = a d_{w} U \in T_{w} M and Y = a d_{w} V \in T_{w} M

(272)

that is the Kähler form of a G-invariant Kähler structure compatible with the canonical complex structure of M, and determines a canonical symplectic structure on M. When the cocycle is equal to zero, the KKS 2-form and the Souriau-Fisher metric coincide. This 2-form introduced by Jean-Marie Souriau is linked to the coadjoint action and the coadjoint orbits of the group on its moment space. Souriau provided a classification of the homogeneous symplectic manifolds with this moment map. The coadjoint representation of a Lie group G is the dual of the adjoint representation. If

g

denotes the Lie algebra of G, the corresponding action of G on

g^{*}

, the dual space to

g

, is called the coadjoint action. Souriau proved, based on the moment map, that a symplectic manifold is always an affine coadjoint orbit of its group of Hamiltonian transformations, deducing that coadjoint orbits are the universal models of symplectic manifolds: a symplectic manifold that is homogeneous under the action of a Lie group is isomorphic, up to a covering, to a coadjoint orbit. The link between the Souriau-Fisher metric and the KKS 2-form will therefore provide a symplectic structure and foundation for information manifolds. For Souriau thermodynamics, the Souriau-Fisher metric is the canonical structure linked to the KKS 2-form, modified by the cocycle (its symplectic leaves are the orbits of the affine action that makes the moment map equivariant). This last property allows us to determine all homogeneous spaces of a Lie group admitting a symplectic structure invariant under the action of this group: for example, the orbits of the coadjoint representation of this group or of a central extension of this group (the central extension making it possible to suppress the cocycle). For affine coadjoint orbits, we refer to the Ph.D. work of Alice Tumpach [197,198,199], who has extended previous works of Neeb [200], Biquard and Gauduchon [201,202,203,204].

Other promising domains of research are theory of generating maps [205,206,207,208] and the link with Poisson geometry through affine Poisson group. As observed by Pierre Dazord [209] in his paper “Groupe de Poisson Affines”, the extension of a Poisson group to an affine Poisson group due to Drinfel’d [210] includes the affine structures of Souriau on dual Lie algebra. For an affine Poisson group, its universal covering could be identified to a vector space with an associated affine structure. If this vector space is an abelian affine Poisson group, we can find the affine structure of Souriau. For the abelian group (R³,+), affine Poisson groups are the affine structures of Souriau.

Souriau model of Lie group thermodynamics could be a promising way to achieve René Thom’s dream to replace thermodynamics by geometry [211,212], and could be extended to the second order extension of the Gibbs state [213,214].

We could explore the links between “stochastic mechanics” (mécanique aléatoire) developed by Jean-Michel Bismut based on Malliavin calculus (stochastic calculus of variations) and Souriau “Lie group thermodynamics”, especially to extend the covariant Souriau Gibbs density to the stochastic symplectic manifold (e.g., to model a centrifuge with a randomly vibrating axis and the associated Gibbs density).

We have seen that Souriau replaced the classical Maximum Entropy approach, substituting for the Lagrange parameters a single geometric “temperature vector”, an element of the Lie algebra. In parallel, as noted in [15], Ingarden introduced [213,214] second and higher order temperatures of the Gibbs state that could be extended to the Souriau theory of thermodynamics. Ingarden higher order temperatures can be defined in the case where no variational principle is considered, but where the probability distribution depends on more than one parameter. It has been observed by Ingarden that the Gibbs hypothesis can fail if the following assumptions are not fulfilled: the number of components of the sum goes to infinity and the components of the sum are stochastically independent. The Gibbs hypothesis can also fail if stochastic interactions with the environment are not sufficiently weak. In all these cases, we never observe absolute thermal equilibrium of Gibbs type but only flows or turbulence. Nonequilibrium thermodynamics could be indirectly addressed by means of the concept of high order temperatures. Momentum

Q = \frac{\partial Φ (β)}{\partial β}

should be replaced by higher order moments given by the relation

Q_{k} = \frac{\partial Φ (β_{1}, ..., β_{n})}{\partial β_{k}} = \frac{\int_{M} U^{k} (ξ) \cdot e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉} d ω}{\int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉} d ω}

defined by extended Massieu characteristic function

Φ (β_{1}, ..., β_{n}) = - \log \int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉} d ω

. Entropy is defined by Legendre transform of this Massieu characteristic function

S (Q_{1}, ..., Q_{n}) = \sum_{k = 1}^{n} 〈 β_{k}, Q_{k} 〉 - Φ (β_{1}, ..., β_{n})

where

β_{k} = \frac{\partial S (Q_{1}, ..., Q_{n})}{\partial Q_{k}}

. We can also define high order thermal capacities given by

K_{k} = - \frac{\partial Q_{k}}{\partial β_{k}}

. The Gibbs density could be then extended with respect to high order temperatures by

p_{Gibbs} (ξ) = e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉 + Φ (β_{1}, ..., β_{n})} = \frac{e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉}}{\int_{M} e^{- \sum_{k = 1}^{n} 〈 β_{k}, U^{k} (ξ) 〉} d ω}
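The relations above can be checked numerically on a toy model. The following is a minimal sketch, not taken from Ingarden or Souriau: it assumes a hypothetical discrete state space in place of the manifold M, uniform weights in place of dω, and two illustrative observables U^1(ξ) = ξ and U^2(ξ) = ξ². It verifies that the moments Q_k are the partial derivatives of the extended Massieu function, that the Legendre transform gives the Shannon entropy of the extended Gibbs density, and that the capacities K_k are positive.

```python
import math

# Hypothetical discrete stand-in for the manifold M: 61 states with uniform
# weights playing the role of the measure d(omega) (assumption of this sketch).
STATES = [i / 10.0 for i in range(-30, 31)]


def observables(xi):
    """Two illustrative observables U^1, U^2 (assumed here: xi and xi**2)."""
    return (xi, xi * xi)


def boltzmann_weights(beta):
    """Unnormalized weights exp(-sum_k beta_k * U^k(xi)) for every state."""
    return [math.exp(-sum(b * u for b, u in zip(beta, observables(xi))))
            for xi in STATES]


def massieu(beta):
    """Extended Massieu function Phi(beta) = -log sum exp(-sum_k beta_k U^k)."""
    return -math.log(sum(boltzmann_weights(beta)))


def gibbs_density(beta):
    """Normalized extended Gibbs weights on the discrete states."""
    w = boltzmann_weights(beta)
    z = sum(w)
    return [wi / z for wi in w]


def moments(beta):
    """Q_k = expectation of U^k under the extended Gibbs density."""
    p = gibbs_density(beta)
    return [sum(pi * observables(xi)[k] for pi, xi in zip(p, STATES))
            for k in range(len(beta))]


def grad_massieu(beta, eps=1e-6):
    """Central finite-difference gradient of Phi, to compare with Q_k."""
    grad = []
    for k in range(len(beta)):
        up, dn = list(beta), list(beta)
        up[k] += eps
        dn[k] -= eps
        grad.append((massieu(up) - massieu(dn)) / (2 * eps))
    return grad


def entropy_legendre(beta):
    """S = sum_k <beta_k, Q_k> - Phi(beta)."""
    return sum(b * q for b, q in zip(beta, moments(beta))) - massieu(beta)


def entropy_shannon(beta):
    """-sum p log p for the extended Gibbs density."""
    return -sum(pi * math.log(pi) for pi in gibbs_density(beta))


def capacities(beta, eps=1e-5):
    """K_k = -dQ_k/dbeta_k, estimated by central finite differences."""
    caps = []
    for k in range(len(beta)):
        up, dn = list(beta), list(beta)
        up[k] += eps
        dn[k] -= eps
        caps.append(-(moments(up)[k] - moments(dn)[k]) / (2 * eps))
    return caps
```

In this toy setting each capacity K_k is the variance of U^k under the Gibbs density, which is why it comes out positive.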

We also have to make reference to the works of Streater [16], Nencka [215] and Burdet [216]. Nencka and Streater [215], for certain unitary representations of a Lie algebra

g

, define the statistical manifold

ℳ

of states as the convex cone of

X \in g

for which the partition function

Z = T r [\exp (- X)]

is finite. The Hessian of

\log Z

defines a Riemannian metric g on dual Lie algebra

g^{*}

. They observe that

g^{*}

foliates into the union of coadjoint orbits, each of which can be given a complex structure (that of Kostant).
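As an illustration only, the construction can be sketched on the smallest non-trivial case. The example below assumes the spin-1/2 representation of su(2), with X = b_1 σ_1 + b_2 σ_2 + b_3 σ_3 built from the Pauli matrices; this choice and all function names are assumptions of the sketch, not taken from [215]. It computes Z = Tr[exp(−X)] from the matrix exponential series and checks that the Hessian of log Z is positive definite, hence usable as a Riemannian metric.

```python
import math

# Pauli matrices as nested lists (spin-1/2 representation, assumed for illustration).
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def element(b):
    """X = b_1 s_1 + b_2 s_2 + b_3 s_3, a Hermitian 2x2 matrix."""
    return [[sum(b[k] * PAULI[k][i][j] for k in range(3)) for j in range(2)]
            for i in range(2)]


def matmul(a, c):
    return [[sum(a[i][k] * c[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def trace_exp_minus(x, terms=60):
    """Tr exp(-X), summing the Taylor series of the matrix exponential."""
    scaled = x
    term = [[1, 0], [0, 1]]          # (-X)^0 / 0!
    total = 2.0                      # trace of the identity
    for n in range(1, terms):
        step = [[-v / n for v in row] for row in scaled]
        term = matmul(term, step)    # now (-X)^n / n!
        total += (term[0][0] + term[1][1]).real
    return total


def log_partition(b):
    return math.log(trace_exp_minus(element(b)))


def hessian(f, b, eps=1e-3):
    """Finite-difference Hessian of f at b."""
    n = len(b)
    h = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(si, sj):
                c = list(b)
                c[i] += si * eps
                c[j] += sj * eps
                return f(c)
            h[i][j] = (shifted(1, 1) - shifted(1, -1)
                       - shifted(-1, 1) + shifted(-1, -1)) / (4 * eps * eps)
    return h


def is_positive_definite(h):
    """Sylvester criterion for a symmetric 3x3 matrix."""
    m1 = h[0][0]
    m2 = h[0][0] * h[1][1] - h[0][1] * h[1][0]
    m3 = (h[0][0] * (h[1][1] * h[2][2] - h[1][2] * h[2][1])
          - h[0][1] * (h[1][0] * h[2][2] - h[1][2] * h[2][0])
          + h[0][2] * (h[1][0] * h[2][1] - h[1][1] * h[2][0]))
    return m1 > 0 and m2 > 0 and m3 > 0
```

For this representation the eigenvalues of X are ±|b|, so log Z = log(2 cosh|b|), which gives an independent check of the series.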

To conclude, we will make reference to Alain Berthoz [217] at the Collège de France, who has studied brain coding of movement. The most recent study on this topic, the Ph.D. thesis of Alexandre Afgoustidis [218], “Invariant Harmonic Analysis and Geometry in the Workings of the Brain”, supervised by Daniel Bennequin, consolidates the idea that the brain's vestibular channels and otoliths code the Lie algebra of the homogeneous Galileo group, as illustrated in the following Figure 15.

Souriau gave the same ideas in this direction regarding how the brain could code invariants [219]:

Lorsqu’il y a un tremblement de terre, nous assistons à la mort de l’Espace. … Nous vivons avec nos habitudes que nous pensons universelles. … La neuroscience s’occupe rarement de la géométrie … Pour les singes qui vivent dans les arbres, certaines propriétés du groupe d’Euclide sont mieux câblées dans leurs cerveaux (When there is an earthquake, we witness the death of Space … We live with our habits, which we think are universal … Neuroscience is rarely interested in geometry … For the monkeys that live in trees, some properties of the Euclid group are better wired in their brains).

Souriau also recounted an anecdote from a discussion with a student of Bohr [220]:

L’élève demanda à Bohr qu’il ne comprenait pas le principe de correspondance. Bohr lui demanda de s’assoir et il tourna autour de lui. Bohr lui dit tu dois commencer à avoir mal au cœur, c’est que tu commences à comprendre ce qu’est le principe de correspondance (The student told Bohr that he did not understand the correspondence principle. Bohr asked him to sit down and walked around him. Bohr said: you should be starting to feel sick, which means that you are beginning to understand what the correspondence principle is).

Acknowledgments

I would like to thank Charles-Michel Marle and Gery de Saxcé for the fruitful discussions on the Souriau model of statistical physics, which helped me to understand the fundamental notions of affine representation of Lie groups and Lie algebras, moment map and coadjoint orbits. I would also like to thank Michel Boyom, who introduced me to Jean-Louis Koszul's works on affine representation of Lie groups and Lie algebras.

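Si on ajoute que la critique qui accoutume l’esprit, surtout en matière de faits, à recevoir de simples probabilités pour des preuves, est, par cet endroit, moins propre à le former, que ne le doit être la géométrie qui lui fait contracter l’habitude de n’acquiescer qu’à l’évidence; nous répliquerons qu’à la rigueur on pourrait conclure de cette différence même, que la critique donne, au contraire, plus d’exercice à l’esprit que la géométrie: parce que l’évidence, qui est une et absolue, le fixe au premier aspect sans lui laisser ni la liberté de douter, ni le mérite de choisir; au lieu que les probabilités étant susceptibles du plus et du moins, il faut, pour se mettre en état de prendre un parti, les comparer ensemble, les discuter et les peser. Un genre d’étude qui rompt, pour ainsi dire, l’esprit à cette opération, est certainement d’un usage plus étendu que celui où tout est soumis à l’évidence; parce que les occasions de se déterminer sur des vraisemblances ou probabilités, sont plus fréquentes que celles qui exigent qu’on procède par démonstrations: pourquoi ne dirions-nous pas que souvent elles tiennent aussi à des objets beaucoup plus importants? (If one adds that criticism, which accustoms the mind, especially in matters of fact, to accept mere probabilities as proofs, is in this respect less suited to forming it than geometry, which gives it the habit of assenting only to evidence, we shall reply that, strictly speaking, one could conclude from this very difference that criticism, on the contrary, exercises the mind more than geometry: because evidence, which is one and absolute, fixes the mind at first sight, leaving it neither the freedom to doubt nor the merit of choosing; whereas probabilities, being susceptible of more and less, must be compared with one another, discussed and weighed before one can take a side. A kind of study that, so to speak, breaks the mind in to this operation is certainly of wider use than one in which everything is subject to evidence; because the occasions for deciding on likelihoods or probabilities are more frequent than those that require proceeding by demonstration: why should we not say that they often also concern far more important matters?)
—Joseph de Maistre in L’Esprit de Finesse [221]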

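Le cadavre qui s’accoutre se méconnaît et imaginant l’éternité s’en approprie l’illusion … C’est pourquoi j’abandonnerai ces frusques et jetant le masque de mes jours, je fuirai le temps où, de concert avec les autres, je m’éreinte à me trahir. (The corpse that dresses itself up fails to know itself and, imagining eternity, appropriates its illusion … That is why I shall abandon these rags and, casting off the mask of my days, I shall flee the time in which, in concert with the others, I wear myself out betraying myself.)
—Emile Cioran in Précis de décomposition [222]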

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Clairaut(-Legendre) Equation of Maurice Fréchet Associated to “Distinguished Functions” as Fundamental Equation of Information Geometry

Before Rao [223,224], in 1943, Maurice Fréchet [141] wrote a seminal paper introducing what was later called the Cramér-Rao bound. This paper in fact contains much more than this important discovery. In particular, Maurice Fréchet introduces more general notions relative to “distinguished functions”, densities whose estimator reaches the bound, defined through a function that is a solution of Clairaut’s equation. The solutions “envelope of the Clairaut’s equation” are equivalent to the standard Legendre transform without convexity constraints, under a smoothness assumption only. Fréchet’s analysis can be revisited on the basis of Jean-Louis Koszul’s works as a seminal foundation of “information geometry”.

We will use Maurice Fréchet’s notation and consider the estimator:

T = H (X_{1}, ..., X_{n})

(A1)

and the random variable

A (X) = \frac{\partial \log p_{θ} (X)}{\partial θ}

(A2)

that are associated to:

U = \sum_{i} A (X_{i})

(A3)

The normalizing constraint

\int_{- \infty}^{+ \infty} p_{θ} (x) d x = 1

implies that:

\int_{- \infty}^{+ \infty} ... \int_{- \infty}^{+ \infty} \prod_{i} p_{θ} (x_{i}) d x_{i} = 1

If we consider the derivative of this last expression with respect to

θ

, then

\int_{- \infty}^{+ \infty} ... \int_{- \infty}^{+ \infty} [\sum_{i} A (x_{i})] \prod_{i} p_{θ} (x_{i}) d x_{i} = 0 gives : E_{θ} [U] = 0

(A4)

Similarly, if we assume that

E_{θ} [T] = θ

, then

\int_{- \infty}^{+ \infty} ... \int_{- \infty}^{+ \infty} H (x_{1}, ..., x_{n}) \prod_{i} p_{θ} (x_{i}) d x_{i} = θ

, and we obtain, by differentiating with respect to

θ

E [(T - θ) U] = 1

(A5)

But as

E [T] = θ

and

E [U] = 0

, we immediately deduce that:

E [(T - E [T]) (U - E [U])] = 1

(A6)

From Schwarz inequality, we can develop the following relations:

{[E (Z T)]}^{2} \leq E [Z^{2}] E [T^{2}] \Rightarrow 1 \leq E [{(T - E [T])}^{2}] E [{(U - E [U])}^{2}] = {(σ_{T} σ_{U})}^{2}

(A7)

U being the summation of independent variables, Bienaymé equality could be applied:

{(σ_{U})}^{2} = \sum_{i} {[σ_{A (X_{i})}]}^{2} = n {(σ_{A})}^{2}

(A8)

From this, Fréchet deduced the bound, rediscovered by Cramér and Rao two years later:

{(σ_{T})}^{2} \geq \frac{1}{n {(σ_{A})}^{2}}

(A9)
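The bound (A9) can be evaluated numerically for a concrete density. The sketch below is an illustration under stated assumptions, not part of Fréchet's paper: it assumes a Gaussian density with unknown mean θ and known standard deviation σ, computes (σ_A)² by trapezoidal quadrature of the squared score, and returns the bound 1/(n (σ_A)²). The function names and integration settings are hypothetical.

```python
import math


def log_density(x, theta, sigma):
    """log p_theta(x) for the assumed Gaussian location model."""
    return (-0.5 * ((x - theta) / sigma) ** 2
            - math.log(sigma * math.sqrt(2.0 * math.pi)))


def score(x, theta, sigma, eps=1e-5):
    """A(x) = d log p_theta(x) / d theta, by central finite difference."""
    return (log_density(x, theta + eps, sigma)
            - log_density(x, theta - eps, sigma)) / (2.0 * eps)


def fisher_information(theta, sigma, half_width=12.0, steps=20000):
    """(sigma_A)^2 = integral of A(x)^2 p_theta(x) dx, by the trapezoid rule."""
    lo = theta - half_width * sigma
    dx = 2.0 * half_width * sigma / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * score(x, theta, sigma) ** 2 * math.exp(log_density(x, theta, sigma))
    return total * dx


def frechet_bound(n, theta, sigma):
    """Lower bound 1 / (n (sigma_A)^2) on the variance of an unbiased estimator."""
    return 1.0 / (n * fisher_information(theta, sigma))
```

For this model the bound equals σ²/n, which is exactly the variance of the sample mean, so the sample mean reaches the bound.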

Fréchet [141] observed that this is a remarkable inequality: the right-hand side is independent of the choice of the function H defining the “empirical value” T, while the left-hand side can be taken for any empirical value

T = H (X_{1}, ..., X_{n})

subject to the unique condition

E_{θ} [T] = θ

whatever the value of

θ

. The classic condition that the Schwarz inequality becomes an equality helps us to determine when

σ_{T}

reaches its lower bound

\frac{1}{\sqrt{n} σ_{A}}

. The previous inequality becomes an equality if there are two numbers

α

and

β

(not random and not both zero) such that

α (H' - θ) + β U = 0

, with

H'

being a particular function among eligible

H

such that we have an equality. This equality is rewritten

H' = θ + λ' U

with

λ'

being a non-random number.

If we use the previous equation, then:

E [(T - E [T]) (U - E [U])] = 1 \Rightarrow E [(H' - θ) U] = λ' E_{θ} [U^{2}] = 1

(A10)

We obtain:

U = \sum_{i} A (X_{i}) \Rightarrow λ' n E_{θ} [A^{2}] = 1

(A11)

From which we obtain

λ'

and the form of the associated estimator

H'

λ' = \frac{1}{n E [A^{2}]} \Rightarrow H' = θ + \frac{1}{n E [A^{2}]} \sum_{i} \frac{\partial \log p_{θ} (X_{i})}{\partial θ}

(A12)

It is therefore deduced that the estimator that reaches the bound is of the form:

H' = θ + \frac{\sum_{i} \frac{\partial \log p_{θ} (X_{i})}{\partial θ}}{n \int_{- \infty}^{+ \infty} {[\frac{\partial p_{θ} (x)}{\partial θ}]}^{2} \frac{d x}{p_{θ} (x)}}

(A13)

with

E [H'] = θ + λ' E [U] = θ

. The function

H'

would be one of the eligible functions if it were independent of

θ

. Indeed, if we consider a value

θ_{0}

for which

E_{θ_{0}} [H'] = θ_{0}

and

E_{θ_{0}} [{(H' - θ_{0})}^{2}] \leq E_{θ_{0}} [{(H - θ_{0})}^{2}] \forall H such that E_{θ_{0}} [H] = θ_{0}

, then the constant function

H = θ_{0}

satisfies the constraint with zero variance, and the inequality shows that H' is almost surely equal to

θ_{0}

. So, to look for

θ_{0}

, we should know

θ_{0}

beforehand.

At this stage, Fréchet [141] looked for “distinguished functions” (“densités distinguées” in French), as any probability density

p_{θ} (x)

such that the function:

h (x) = θ + \frac{\frac{\partial \log p_{θ} (x)}{\partial θ}}{\int_{- \infty}^{+ \infty} {[\frac{\partial p_{θ} (x)}{\partial θ}]}^{2} \frac{d x}{p_{θ} (x)}}

(A14)

is independent of

θ

. The objective of Fréchet is then to determine the minimizing function

T = H' (X_{1}, ..., X_{n})

that reaches the bound. We can deduce from previous relations that:

λ (θ) \frac{\partial \log p_{θ} (x)}{\partial θ} = h (x) - θ

(A15)
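The defining property of a distinguished function, namely that h(x) in (A14) does not depend on θ, can be tested numerically. The sketch below assumes, purely for illustration, the exponential density p_θ(x) = (1/θ) e^{−x/θ} on x > 0 with mean θ; it evaluates h(x) from the score and a quadrature of the denominator of (A14) for several values of θ. All names and numerical settings are assumptions of the sketch.

```python
import math


def log_density(x, theta):
    """log p_theta(x) for the assumed exponential density with mean theta (x > 0)."""
    return -math.log(theta) - x / theta


def score(x, theta, eps=1e-6):
    """d log p_theta(x) / d theta, by central finite difference."""
    return (log_density(x, theta + eps) - log_density(x, theta - eps)) / (2.0 * eps)


def fisher_information(theta, upper=60.0, steps=60000):
    """Denominator of (A14): integral of score^2 * p_theta over (0, upper * theta)."""
    dx = upper * theta / steps
    total = 0.0
    for i in range(steps + 1):
        x = i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * score(x, theta) ** 2 * math.exp(log_density(x, theta))
    return total * dx


def h(x, theta):
    """h(x) = theta + score / Fisher information, as in (A14)."""
    return theta + score(x, theta) / fisher_information(theta)
```

For this density the computation returns h(x) ≈ x for every θ, so the family is distinguished and the estimator reaching the bound is the sample mean.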

But as

λ (θ) > 0

, we can consider

\frac{1}{λ (θ)}

as the second derivative of a function

Φ (θ)

such that:

\frac{\partial \log p_{θ} (x)}{\partial θ} = \frac{\partial^{2} Φ (θ)}{\partial θ^{2}} [h (x) - θ]

(A16)

From which we deduce that:

ℓ (x) = \log p_{θ} (x) - \frac{\partial Φ (θ)}{\partial θ} [h (x) - θ] - Φ (θ)

(A17)

is a quantity independent of

θ

. A distinguished function will be then given by:

p_{θ} (x) = e^{\frac{\partial Φ (θ)}{\partial θ} [h (x) - θ] + Φ (θ) + ℓ (x)}

(A18)

with the normalizing constraint

\int_{- \infty}^{+ \infty} p_{θ} (x) d x = 1
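As a worked check of the form (A18), consider the Gaussian density with unknown mean θ and known variance σ². This example is an assumption chosen for illustration and does not come from Fréchet's text; the functions h, Φ and ℓ below are one admissible choice for that density.

```latex
% Assumed example: Gaussian density with unknown mean \theta and known variance \sigma^2.
\begin{align*}
p_\theta(x) &= \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\theta)^2}{2\sigma^2}}, \qquad
\frac{\partial \log p_\theta(x)}{\partial\theta} = \frac{x-\theta}{\sigma^2}, \qquad
\int_{-\infty}^{+\infty}\Big[\frac{\partial p_\theta(x)}{\partial\theta}\Big]^2\frac{dx}{p_\theta(x)} = \frac{1}{\sigma^2},\\
h(x) &= \theta + \sigma^2\,\frac{x-\theta}{\sigma^2} = x, \qquad
\Phi(\theta) = \frac{\theta^2}{2\sigma^2}, \qquad
\ell(x) = -\frac{x^2}{2\sigma^2} - \log\big(\sigma\sqrt{2\pi}\big),\\
\frac{\partial\Phi(\theta)}{\partial\theta}\,[h(x)-\theta] + \Phi(\theta) + \ell(x)
 &= \frac{\theta(x-\theta)}{\sigma^2} + \frac{\theta^2}{2\sigma^2} - \frac{x^2}{2\sigma^2} - \log\big(\sigma\sqrt{2\pi}\big)
 = -\frac{(x-\theta)^2}{2\sigma^2} - \log\big(\sigma\sqrt{2\pi}\big) = \log p_\theta(x).
\end{align*}
```

Here the second derivative of Φ equals 1/σ², which is positive and coincides with the integral in the denominator of (A14).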

These two conditions are sufficient. Indeed, conversely, let

Φ (θ)

h (x)

and

ℓ (x)

be three functions such that, for any

θ : \int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} [h (x) - θ] + Φ (θ) + ℓ (x)} d x = 1

(A19)

Then the function is distinguished:

θ + \frac{\frac{\partial \log p_{θ} (x)}{\partial θ}}{\int_{- \infty}^{+ \infty} {[\frac{\partial p_{θ} (x)}{\partial θ}]}^{2} \frac{d x}{p_{θ} (x)}} = θ + λ (θ) \frac{\partial^{2} Φ (θ)}{\partial θ^{2}} [h (x) - θ]

(A20)

If λ (θ) \frac{\partial^{2} Φ (θ)}{\partial θ^{2}} = 1, when \frac{1}{λ (θ)} = \int_{- \infty}^{+ \infty} {[\frac{\partial \log p_{θ} (x)}{\partial θ}]}^{2} p_{θ} (x) d x = {(σ_{A})}^{2}

(A21)

The function is reduced to

h (x)

and is then independent of

θ

. We then have the following relation:

\frac{1}{λ (θ)} = \int_{- \infty}^{+ \infty} {(\frac{\partial^{2} Φ (θ)}{\partial θ^{2}})}^{2} {[h (x) - θ]}^{2} e^{\frac{\partial Φ (θ)}{\partial θ} (h (x) - θ) + Φ (θ) + ℓ (x)} d x

(A22)

Since the normalizing constraint (A19) is valid for any

θ

, we can differentiate it with respect to

θ

\int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} (h (x) - θ) + Φ (θ) + ℓ (x)} (\frac{\partial^{2} Φ (θ)}{\partial θ^{2}}) [h (x) - θ] d x = 0

(A23)

We can divide by

\frac{\partial^{2} Φ (θ)}{\partial θ^{2}}

because it does not depend on

x

. If we differentiate again with respect to

θ

, we will have:

\int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} (h (x) - θ) + Φ (θ) + ℓ (x)} (\frac{\partial^{2} Φ (θ)}{\partial θ^{2}}) {[h (x) - θ]}^{2} d x = \int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} (h (x) - θ) + Φ (θ) + ℓ (x)} d x = 1

(A24)

Combining this relation with that of

\frac{1}{λ (θ)}

, we can deduce that

λ (θ) \frac{\partial^{2} Φ (θ)}{\partial θ^{2}} = 1

and, as

λ (θ) > 0

, then

\frac{\partial^{2} Φ (θ)}{\partial θ^{2}} > 0

At this step, Fréchet [141] emphasizes another way to approach the problem. We can select arbitrarily

h (x)

and

ℓ (x)

and then

Φ (θ)

is determined by:

\int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} [h (x) - θ] + Φ (θ) + ℓ (x)} d x = 1

(A25)

That could be rewritten:

e^{θ . \frac{\partial Φ (θ)}{\partial θ} - Φ (θ)} = \int_{- \infty}^{+ \infty} e^{\frac{\partial Φ (θ)}{\partial θ} h (x) + ℓ (x)} d x

(A26)

If we then fix arbitrarily

h (x)

and

ℓ (x)

and let s be an arbitrary variable, the following function will be an explicit positive function given by

e^{Ψ (s)}

\int_{- \infty}^{+ \infty} e^{s . h (x) + ℓ (x)} d x = e^{Ψ (s)}

(A27)

Fréchet obtained finally the function

Φ (θ)

as solution of the equation [141]:

Φ (θ) = θ \cdot \frac{\partial Φ (θ)}{\partial θ} - Ψ (\frac{\partial Φ (θ)}{\partial θ})

(A28)

Fréchet noted that this is the Alexis Clairaut equation [141].

The case

\frac{\partial Φ (θ)}{\partial θ} = const

would reduce the density to a function that would be independent of

θ

, and so

Φ (θ)

is given by a singular solution of this Clairaut equation, which is unique and could be computed by eliminating the variable s between:

Φ = θ \cdot s - Ψ (s) and θ = \frac{\partial Ψ (s)}{\partial s}

(A29)

Or between:

e^{θ \cdot s - Φ (θ)} = \int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} d x and \int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} [h (x) - θ] d x = 0

(A30)

Φ (θ) = - \log \int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} d x + θ \cdot s

where s is given implicitly by

\int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} [h (x) - θ] d x = 0

Then we know the distinguished function,

H'

among functions

H (X_{1}, ..., X_{n})

verifying

E_{θ} [H] = θ

and such that

σ_{H}

reaches for each value of

θ

, an absolute minimum, equal to

\frac{1}{\sqrt{n} σ_{A}}

From the previous equation:

h (x) = θ + \frac{\frac{\partial \log p_{θ} (x)}{\partial θ}}{\int_{- \infty}^{+ \infty} {[\frac{\partial p_{θ} (x)}{\partial θ}]}^{2} \frac{d x}{p_{θ} (x)}}

(A31)

We can rewrite the estimator as:

H' (X_{1}, ..., X_{n}) = \frac{1}{n} [h (X_{1}) + ... + h (X_{n})]

(A32)

and compute the associated empirical value:

t = H' (x_{1}, ..., x_{n}) = \frac{1}{n} \sum_{i} h (x_{i}) = θ + \frac{λ (θ)}{n} \sum_{i} \frac{\partial \log p_{θ} (x_{i})}{\partial θ}

If we take

θ = t

, then, since

λ (θ) > 0

, we have:

\sum_{i} \frac{\partial \log p_{t} (x_{i})}{\partial t} = 0

(A33)

When

p_{θ} (x)

is a distinguished function, the empirical value

t

of

θ

corresponding to a sample

x_{1}, ..., x_{n}

is a root of previous equation in

t

. This equation has a root and only one when X is a distinguished variable. Indeed, as we have:

p_{θ} (x) = e^{\frac{\partial Φ (θ)}{\partial θ} [h (x) - θ] + Φ (θ) + ℓ (x)}

(A34)

\sum_{i} \frac{\partial \log p_{t} (x_{i})}{\partial t} = n \frac{\partial^{2} Φ (t)}{\partial t^{2}} [\frac{\sum_{i} h (x_{i})}{n} - t] with \frac{\partial^{2} Φ (t)}{\partial t^{2}} > 0

(A35)

We can then recover the unique root:

t = \frac{\sum_{i} h (x_{i})}{n}
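The uniqueness of the root of the likelihood equation (A33) can be illustrated numerically. The sketch below again assumes the exponential density with mean θ, for which h(x) = x, and solves the equation by bisection; the sample values, bracket and function names are hypothetical choices made for the example.

```python
def score_sum(t, sample):
    """Left-hand side of (A33) for the assumed density p_t(x) = exp(-x/t) / t."""
    return sum(-1.0 / t + x / (t * t) for x in sample)


def solve_likelihood_equation(sample, lo=1e-6, hi=1e6, iterations=200):
    """Bisection on t: the score sum is positive below the root and negative above it."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if score_sum(mid, sample) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def empirical_mean_of_h(sample):
    """t = (1/n) sum h(x_i), with h(x) = x for this density."""
    return sum(sample) / len(sample)
```

On any positive sample the bisection root coincides with the arithmetic mean of the h(x_i), in agreement with the sign pattern of (A35).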

This function

T \equiv H' (X_{1}, ..., X_{n}) = \frac{1}{n} \sum_{i} h (X_{i})

can have an arbitrary form only through h: it is a sum of functions that each depend on only one of the quantities, and it is even the arithmetic average of n values of the same auxiliary random variable

Y = h (X)

. The dispersion is given by:

{(σ_{T_{n}})}^{2} = \frac{1}{n {(σ_{A})}^{2}} = \frac{1}{n \int_{- \infty}^{+ \infty} {[\frac{\partial p_{θ} (x)}{\partial θ}]}^{2} \frac{d x}{p_{θ} (x)}} = \frac{1}{n \frac{\partial^{2} Φ (θ)}{\partial θ^{2}}}

(A36)

and

T_{n}

follows, for large n, the Gaussian probability density with the variance given by (A36):

p_{θ} (t) = \sqrt{n} \frac{σ_{A}}{\sqrt{2 π}} e^{- \frac{n σ_{A}^{2} {(t - θ)}^{2}}{2}} with {(σ_{A})}^{2} = \frac{\partial^{2} Φ (θ)}{\partial θ^{2}}

(A37)

Clairaut Equation and Legendre Transform

We have just observed that Fréchet shows that distinguished functions depend on a function

Φ (θ)

, solution of the Clairaut equation:

Φ (θ) = θ \cdot \frac{\partial Φ (θ)}{\partial θ} - Ψ (\frac{\partial Φ (θ)}{\partial θ})

(A38)

Or given by the Legendre transform:

Φ = θ \cdot s - Ψ (s) and θ = \frac{\partial Ψ (s)}{\partial s}

(A39)

Fréchet also observed that this function

Φ (θ)

could be rewritten:

Φ (θ) = - \log \int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} d x + θ \cdot s

where s is given implicitly by

\int_{- \infty}^{+ \infty} e^{s \cdot h (x) + ℓ (x)} [h (x) - θ] d x = 0

This equation is the fundamental equation of information geometry.
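The Legendre construction of Φ from Ψ can be sketched numerically. The example below is an illustration under assumptions: it takes h(x) = x and ℓ(x) = −x²/2 − log √(2π), so that Ψ(s) = s²/2 and Φ(θ) = θ²/2 in closed form, computes Ψ by quadrature, solves θ = ∂Ψ/∂s by bisection, and evaluates the residual of the Clairaut equation (A38). The integration bounds and step sizes are hypothetical settings.

```python
import math


def h(x):
    """Assumed sufficient statistic h(x) = x."""
    return x


def ell(x):
    """Assumed base term l(x): log of the standard Gaussian density."""
    return -0.5 * x * x - 0.5 * math.log(2.0 * math.pi)


def psi(s, lo=-15.0, hi=15.0, steps=2000):
    """Psi(s) = log of the integral of exp(s h(x) + l(x)) dx (trapezoid rule)."""
    dx = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(s * h(x) + ell(x))
    return math.log(total * dx)


def dpsi(s, eps=1e-4):
    """dPsi/ds by central finite difference."""
    return (psi(s + eps) - psi(s - eps)) / (2.0 * eps)


def solve_s(theta, lo=-8.0, hi=8.0, iterations=50):
    """Solve theta = dPsi/ds for s by bisection (Psi is convex, so dPsi increases)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if dpsi(mid) < theta:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def phi(theta):
    """Phi(theta) = theta * s - Psi(s), with s eliminated through theta = dPsi/ds."""
    s = solve_s(theta)
    return theta * s - psi(s)


def clairaut_residual(theta, eps=1e-3):
    """Phi(theta) - [theta * Phi'(theta) - Psi(Phi'(theta))], expected to be near zero."""
    dphi = (phi(theta + eps) - phi(theta - eps)) / (2.0 * eps)
    return phi(theta) - (theta * dphi - psi(dphi))
```

The bisection relies on Ψ being convex, which holds here because Ψ is a log-Laplace transform.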

The “Legendre” transform was introduced by Adrien-Marie Legendre in 1787 [225] to solve a minimal surface problem posed by Gaspard Monge in 1784. Using a result of Jean Baptiste Meusnier, a student of Monge, he solved the problem by a change of variables corresponding to the transform that now bears his name. Legendre wrote: “I have just arrived by a change of variables that can be useful in other occasions.” About this transformation, Darboux [226] in his book gives an interpretation of Chasles: “This amounts, following a remark by Mr. Chasles, to substituting for the surface its polar reciprocal with respect to a paraboloid.” The equation of Clairaut was introduced about 50 years earlier, in 1734, by Alexis Clairaut [225]. Solutions “envelope of the Clairaut equation” are equivalent to the Legendre transform without any convexity condition, under a differentiability constraint only. Indeed, for a non-convex function, the Legendre transformation is not defined where the Hessian of the function vanishes, whereas the equation of Clairaut only makes the hypothesis of differentiability. The passage from the strictly convex function g in the Clairaut equation y = px − g(p) to the function f giving the envelope solutions by the formula y = f(x) is precisely the Legendre transformation. The approach of Fréchet may be reconsidered in a more general context on the basis of the work of Jean-Louis Koszul.

Appendix B. Balian Gauge Model of Thermodynamics and its Compliance with Souriau Model

Supported by the industrial group TOTAL (previously Elf-Aquitaine), Roger Balian has introduced a gauge theory of thermodynamics [103] and has also developed information geometry in statistical physics and quantum physics [103,227,228,229,230,231,232,233,234,235]. Balian has observed that the entropy

S

(we use Balian's notation, contrary to the previous sections, where we use

- S

as neg-entropy) can be regarded as an extensive variable

q^{0} = S (q^{1}, ..., q^{n})

, with

q^{i} (i = 1, ..., n)

, n independent quantities, usually extensive and conservative, characterizing the system. The n intensive variables

γ_{i}

are defined as the partial derivatives:

γ_{i} = \frac{\partial S (q^{1}, ..., q^{n})}{\partial q^{i}}

(B1)

Balian has introduced a non-vanishing gauge variable

p_{0}

, without physical relevance, which multiplies all the intensive variables, defining a new set of variables:

p_{i} = - p_{0} . γ_{i}, i = 1, ..., n

(B2)

The 2n + 1-dimensional space is thereby extended into a 2n + 2-dimensional thermodynamic space

T

spanned by the variables

p_{i}, q^{i} with i = 0, 1, ..., n

, where the physical system is associated with a n + 1-dimensional manifold

M

in

T

, parameterized for instance by the coordinates

q^{1}, ..., q^{n}

and

p_{0}

. A gauge transformation which changes the extra variable

p_{0}

while keeping the ratios

p_{i} / p_{0} = - γ_{i}

invariant is not observable, so that a state of the system is represented by any point of a one-dimensional ray lying in

M

, along which the physical variables

q^{0}, ..., q^{n}, γ_{1}, ..., γ_{n}

are fixed. Then, the relation between contact and canonical transformations is a direct outcome of this gauge invariance: the contact structure

\tilde{ω} = d q^{0} - \sum_{i = 1}^{n} γ_{i} \cdot d q^{i}

in 2n + 1 dimension can be embedded into a symplectic structure in 2n + 2 dimension, with 1-form:

ω = \sum_{i = 0}^{n} p_{i} \cdot d q^{i}

(B3)

as symplectization, with geometric interpretation in the theory of fiber bundles.

The n + 1-dimensional thermodynamic manifolds

M

are characterized by the vanishing of this form

ω = 0

. The 1-form then induces a symplectic structure on

T

d ω = \sum_{i = 0}^{n} d p_{i} \land d q^{i}

(B4)

Any thermodynamic manifold

M

belongs to the set of the so-called Lagrangian manifolds in

T

, which are the integral submanifolds of

d ω

with maximum dimension (n + 1). Moreover,

M

is gauge invariant, which is implied by

ω = 0

. The extensivity of the entropy function

S (q^{1}, ..., q^{n})

is expressed by the Gibbs-Duhem relation

S = \sum_{i = 1}^{n} q^{i} \frac{\partial S}{\partial q^{i}}

, rewritten with previous relation

\sum_{i = 0}^{n} p_{i} q^{i} = 0

, defining a 2n + 1-dimensional extensivity sheet in

T

, where the thermodynamic manifolds

M

should lie. Considering an infinitesimal canonical transformation, generated by the Hamiltonian

h (q^{0}, q^{1}, ..., q^{n}, p_{0}, p_{1}, ..., p_{n})

, with

{\dot{q}}^{i} = \frac{\partial h}{\partial p_{i}}

and

{\dot{p}}_{i} = - \frac{\partial h}{\partial q^{i}}

, Hamilton’s equations are given by the Poisson bracket:

\dot{g} = {g, h} = \sum_{i = 0}^{n} (\frac{\partial g}{\partial q^{i}} \frac{\partial h}{\partial p_{i}} - \frac{\partial h}{\partial q^{i}} \frac{\partial g}{\partial p_{i}})

(B5)

The concavity of the entropy

S (q^{1}, ..., q^{n})

, as function of the extensive variables, expresses the stability of equilibrium states. This property produces constraints on the physical manifolds

M

in the 2n + 2-dimensional space. It entails the existence of a metric structure in the n-dimensional space

q^{i}

relying on the quadratic form:

d s^{2} = - d^{2} S = - \sum_{i, j = 1}^{n} \frac{\partial^{2} S}{\partial q^{i} \partial q^{j}} d q^{i} d q^{j}

(B6)

which defines a distance between two neighboring thermodynamic states.

As d γ_{i} = \sum_{j = 1}^{n} \frac{\partial^{2} S}{\partial q^{i} \partial q^{j}} d q^{j}, then : d s^{2} = - \sum_{i = 1}^{n} d γ_{i} d q^{i} = \frac{1}{p_{0}} \sum_{i = 0}^{n} d p_{i} d q^{i}

(B7)
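The gauge relations above can be checked on a small example. The sketch below assumes a hypothetical extensive and concave entropy S(q¹, q²) = √(q¹ q²), which is not taken from Balian's papers; it verifies the extensivity relation Σ p_i q^i = 0 and the equality of the two expressions of ds² in (B7) for several values of the gauge variable p_0 and of its variation dp_0. All function names are assumptions of the sketch.

```python
import math


def entropy(q):
    """Assumed entropy S(q1, q2) = sqrt(q1 * q2): homogeneous of degree 1 and concave."""
    q1, q2 = q
    return math.sqrt(q1 * q2)


def intensive(q):
    """gamma_i = dS/dq^i for the assumed entropy."""
    q1, q2 = q
    return [0.5 * math.sqrt(q2 / q1), 0.5 * math.sqrt(q1 / q2)]


def hessian(q):
    """Second derivatives of the assumed entropy."""
    q1, q2 = q
    h11 = -0.25 * math.sqrt(q2) / q1 ** 1.5
    h22 = -0.25 * math.sqrt(q1) / q2 ** 1.5
    h12 = 0.25 / math.sqrt(q1 * q2)
    return [[h11, h12], [h12, h22]]


def extensivity_sum(q, p0):
    """sum_{i=0..n} p_i q^i with q^0 = S and p_i = -p0 * gamma_i (should vanish)."""
    p = [p0] + [-p0 * g for g in intensive(q)]
    q_full = [entropy(q)] + list(q)
    return sum(pi * qi for pi, qi in zip(p, q_full))


def metric_entropy(q, dq):
    """ds^2 = -d^2 S = -sum_ij S_ij dq^i dq^j."""
    h = hessian(q)
    return -sum(h[i][j] * dq[i] * dq[j] for i in range(2) for j in range(2))


def metric_gauge(q, dq, p0, dp0):
    """ds^2 = (1/p0) sum_{i=0..n} dp_i dq^i on the thermodynamic manifold."""
    gam = intensive(q)
    h = hessian(q)
    dgam = [sum(h[i][j] * dq[j] for j in range(2)) for i in range(2)]
    dq0 = sum(g * d for g, d in zip(gam, dq))  # dq^0 = sum gamma_i dq^i
    dp = [dp0] + [-dp0 * gam[i] - p0 * dgam[i] for i in range(2)]
    dq_full = [dq0] + list(dq)
    return sum(a * b for a, b in zip(dp, dq_full)) / p0
```

Because this entropy is homogeneous of degree 1, the quadratic form is degenerate along the scaling direction dq proportional to q and positive in the transverse direction.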

The factor

1 / p_{0}

ensures gauge invariance. In a continuous transformation generated by

h

, the metric evolves according to:

\frac{d}{d τ} (d s^{2}) = \frac{1}{p_{0}} \frac{\partial h}{\partial q^{0}} d s^{2} + \frac{1}{p_{0}} \sum_{i, j = 0}^{n} (\frac{\partial^{2} h}{\partial p_{i} \partial p_{j}} d p_{i} d p_{j} - \frac{\partial^{2} h}{\partial q^{i} \partial q^{j}} d q^{i} d q^{j})

(B8)

We can observe that this gauge theory of thermodynamics is compatible with Souriau Lie group thermodynamics, where we have to consider the Souriau vector

β = [\begin{matrix} γ_{1} \\ ⋮ \\ γ_{n} \end{matrix}]

, transformed in a new vector:

p_{i} = - p_{0} . γ_{i}, p = [\begin{matrix} - p_{0} γ_{1} \\ ⋮ \\ - p_{0} γ_{n} \end{matrix}] = - p_{0} \cdot β

(B9)

Appendix C. Casalis-Letac Affine Group Invariance for Natural Exponential Families

The characterization of the natural exponential families of R^d which are preserved by a group of affine transformations has been examined by Muriel Casalis in her Ph.D. [173] and her different papers [172,174,175,176,177,178]. Her method has consisted of translating the invariance property of the family into a property concerning the measures which generate it, and to characterize such measures.

Let

E

be a finite-dimensional vector space,

E^{*}

its dual, and

〈 θ, x 〉

the duality bracket with

(θ, x) \in E^{*} \times E

. For

μ

a positive Radon measure on

E

, the Laplace transform is:

L_{μ} : E^{*} \to [0, \infty] with θ \mapsto L_{μ} (θ) = \int_{E} e^{〈 θ, x 〉} μ (d x)

(C1)

Let the transformation

k_{μ} (θ)

be defined on

Θ (μ) interior of D_{μ} = {θ \in E^{*}, L_{μ} (θ) < \infty}

k_{μ} (θ) = \log L_{μ} (θ)

(C2)

Natural exponential families are given by:

F (μ) = {P (θ, μ) (d x) = e^{〈 θ, x 〉 - k_{μ} (θ)} μ (d x), θ \in Θ (μ)}

(C3)

with injective function (domain of means):

k'_{μ} (θ) = \int_{E} x P (θ, μ) (d x)

(C4)

the inverse function:

ψ_{μ} : M_{F} \to Θ (μ) with M_{F} = Im (k'_{μ} (Θ (μ)))

(C5)

and the Covariance operator:

V_{F} (m) = k_{μ}^{''} (ψ_{μ} (m)) = {(ψ_{μ}^{'} (m))}^{- 1}, m \in M_{F}

(C6)
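The objects (C1) to (C6) can be computed explicitly on a simple family. The sketch below is an illustration under assumptions: it takes E = R and the measure μ with mass 1/x! at each non-negative integer x, which generates the Poisson family, and evaluates the Laplace transform by a truncated series. The truncation length, the bisection bracket and all names are hypothetical choices.

```python
import math


def weighted_terms(theta, terms=150):
    """Terms exp(theta * x) / x! of the Laplace transform, for x = 0 .. terms - 1."""
    out = []
    term = 1.0
    for x in range(terms):
        if x > 0:
            term *= math.exp(theta) / x
        out.append(term)
    return out


def laplace(theta):
    """L_mu(theta) for the assumed measure mu = sum_x delta_x / x!."""
    return sum(weighted_terms(theta))


def cumulant(theta):
    """k_mu(theta) = log L_mu(theta)."""
    return math.log(laplace(theta))


def mean(theta):
    """k'_mu(theta): mean of P(theta, mu)."""
    w = weighted_terms(theta)
    return sum(x * wx for x, wx in enumerate(w)) / sum(w)


def variance(theta):
    """k''_mu(theta): variance of P(theta, mu)."""
    w = weighted_terms(theta)
    z = sum(w)
    m = sum(x * wx for x, wx in enumerate(w)) / z
    return sum((x - m) ** 2 * wx for x, wx in enumerate(w)) / z


def psi(m, lo=-20.0, hi=3.0, iterations=100):
    """Inverse of the mean map, psi_mu(m), by bisection (valid for means below about 20)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mean(mid) < m:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def variance_function(m):
    """V_F(m) = k''_mu(psi_mu(m))."""
    return variance(psi(m))
```

For this measure k_μ(θ) = e^θ, ψ_μ(m) = log m and V_F(m) = m, which also equals the inverse of ψ'_μ(m) as stated in (C6).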

The measures generating a given family

F

is then given by:

F (μ) = F (μ') \Leftrightarrow \exists (a, b) \in E^{*} \times R, such that μ' (d x) = e^{〈 a, x 〉 + b} μ (d x)

(C7)

Let

F

be an exponential family of

E

generated by

μ

and

φ : x \mapsto g_{φ} x + v_{φ}

with

g_{φ} \in G L (E)

automorphisms of

E

and

v_{φ} \in E

, then the family

φ (F) = {φ (P (θ, μ)), θ \in Θ (μ)}

is an exponential family of

E

generated by

φ (μ)

Definition C1.

An exponential family

F

is invariant by a group

G

(affine group of

E

), if

\forall φ \in G, φ (F) = F : \forall μ, F (φ (μ)) = F (μ)

(C8)

(the converse could be false)

Then Muriel Casalis has established the following theorem:

Theorem C1 (Casalis).

Let

F = F (μ)

an exponential family of

E

and

G

affine group of

E

, then

F

is invariant by

G

if and only if:

\begin{array}{l} \exists a : G \to E^{*}, \exists b : G \to R, such that : \\ \forall (φ, φ') \in G^{2}, \begin{cases} a (φ φ') = {}^{t} g_{φ}^{- 1} a (φ') + a (φ) \\ b (φ φ') = b (φ) + b (φ') - 〈 a (φ'), g_{φ}^{- 1} v_{φ} 〉 \end{cases} \\ \forall φ \in G, φ (μ) (d x) = e^{〈 a (φ), x 〉 + b (φ)} μ (d x) \end{array}

(C9)

When

G

is a linear subgroup,

b

is a character of

G

and

a

could be obtained by the help of cohomology of Lie groups.

If we define the action of

G

on

E^{*}

by:

g \cdot x = {}^{t} g^{- 1} x, g \in G, x \in E^{*}

(C10)

It can be verified that:

a (g_{1} g_{2}) = g_{1} \cdot a (g_{2}) + a (g_{1})

(C11)

so the map a is an inhomogeneous 1-cocycle. More generally,

\forall n > 0

, let the set of all functions from

G^{n}

to

E^{*}

be denoted

ℑ (G^{n}, E^{*})

and called inhomogeneous n-cochains; then we can define the operators

d^{n} : ℑ (G^{n}, E^{*}) \to ℑ (G^{n + 1}, E^{*})

by:

\begin{matrix} d^{n} F (g_{1}, \dots, g_{n + 1}) = g_{1} . F (g_{2}, \dots, g_{n + 1}) + \sum_{i = 1}^{n} {(- 1)}^{i} F (g_{1}, g_{2}, \dots, g_{i} g_{i + 1}, \dots, g_{n + 1}) \\ + {(- 1)}^{n + 1} F (g_{1}, g_{2}, \dots, g_{n}) \end{matrix}

(C12)

Let

Z^{n} (G, E^{*}) = Ker (d^{n}), B^{n} (G, E^{*}) = Im (d^{n - 1})

, with

Z^{n}

the inhomogeneous n-cocycles, the quotient:

H^{n} (G, E^{*}) = Z^{n} (G, E^{*}) / B^{n} (G, E^{*})

(C13)

is the Cohomology group of

G

with value in

E^{*}

. We have:

\begin{matrix} d^{0} : & E^{*} \to ℑ (G, E^{*}) \\ x \mapsto (g \mapsto g \cdot x - x) \end{matrix}

(C14)

Z^{0} = {x \in E^{*}; g \cdot x = x, \forall g \in G}

(C15)

\begin{matrix} d^{1} : & ℑ (G, E^{*}) \to ℑ (G^{2}, E^{*}) \\ F \mapsto d^{1} F, d^{1} F (g_{1}, g_{2}) = g_{1} \cdot F (g_{2}) - F (g_{1} g_{2}) + F (g_{1}) \end{matrix}

(C16)

Z^{1} = {F \in ℑ (G, E^{*}); F (g_{1} g_{2}) = g_{1} \cdot F (g_{2}) + F (g_{1}), \forall (g_{1}, g_{2}) \in G^{2}}

(C17)

B^{1} = {F \in ℑ (G, E^{*}); \exists x \in E^{*}, F (g) = g \cdot x - x}

(C18)
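The cocycle condition (C17) and the coboundaries (C18) can be checked on explicit matrices. The sketch below is an illustration under assumptions: it takes E = R², invertible 2×2 matrices as group elements, and the action g · x = ᵗg⁻¹ x of (C10); the helper names are hypothetical. It verifies that a coboundary F(g) = g · x − x is annihilated by d¹, while a constant non-zero map is not.

```python
def inverse(g):
    """Inverse of an invertible 2x2 matrix given as nested lists."""
    (a, b), (c, d) = g
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def transpose(g):
    return [[g[0][0], g[1][0]], [g[0][1], g[1][1]]]


def matmul(g, k):
    return [[sum(g[i][m] * k[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]


def act(g, x):
    """Action of (C10): g . x = transpose(inverse(g)) applied to x."""
    m = transpose(inverse(g))
    return [m[0][0] * x[0] + m[0][1] * x[1], m[1][0] * x[0] + m[1][1] * x[1]]


def coboundary(x):
    """Element of B^1 as in (C18): F(g) = g . x - x."""
    return lambda g: [u - v for u, v in zip(act(g, x), x)]


def d1(F, g1, g2):
    """(d^1 F)(g1, g2) = g1 . F(g2) - F(g1 g2) + F(g1), as in (C16)."""
    first = act(g1, F(g2))
    second = F(matmul(g1, g2))
    third = F(g1)
    return [a - b + c for a, b, c in zip(first, second, third)]
```

The vanishing of d¹ on coboundaries rests on the compatibility (g₁ g₂) · x = g₁ · (g₂ · x), which holds because transposition and inversion each reverse the order of a product.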

When the Cohomology group

H^{1} (G, E^{*}) = 0

then:

Z^{1} (G, E^{*}) = B^{1} (G, E^{*})

(C19)

Then if

F = F (μ)

is an exponential family invariant by

G

, its generating measure

μ

satisfies:

\forall g \in G, g (μ) (d x) = e^{〈 c, x 〉 - 〈 c, g^{- 1} x 〉 + b (g)} μ (d x)

(C20)

\forall g \in G, g (e^{〈 c, x 〉} μ (d x)) = e^{b (g)} e^{〈 c, x 〉} μ (d x) with μ_{0} (d x) = e^{〈 c, x 〉} μ (d x)

(C21)

For any compact group,

H^{1} (G, E^{*}) = 0

and we can express a:

\begin{matrix} A : & G \to G A (E) \\ g \mapsto A_{g}, A_{g} (θ) = {}^{t} g^{- 1} θ + a (g) \end{matrix}

(C22)

\begin{array}{l} \forall (g, g') \in G^{2}, A_{g g'} = A_{g} A_{g'} \\ A (G) compact sub - group of G A (E) \end{array}

(C23)

\exists fixed point \Rightarrow \forall g \in G, A_{g} (c) = {}^{t} g^{- 1} c + a (g) = c \Rightarrow a (g) = (I_{d} - {}^{t} g^{- 1}) c

(C24)

References and Notes

Bernard, C. Introduction à l’Étude de la Médecine Expérimentale. Available online: http://classiques.uqac.ca/classiques/bernard_claude/intro_etude_medecine_exp/intro_medecine_exper.pdf (accessed on 17 October 2016).
Thom, R. Logos et Théorie des Catastrophes; Editions Patiño: Genève, Switzerland, 1988. [Google Scholar]
Barbaresco, F. Symplectic structure of information geometry: Fisher metric and Euler-Poincaré equation of souriau lie group thermodynamics. In Geometric Science of Information, Second International Conference GSI 2015; Nielsen, F., Barbaresco, F., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 529–540. [Google Scholar]
De Saxcé, G.; Vallée, C. Galilean Mechanics and Thermodynamics of Continua; Wiley-ISTE: London, UK, 2016. [Google Scholar]
Vallée, C. Relativistic thermodynamics of continua. Int. J. Eng. Sci. 1981, 19, 589–601. [Google Scholar] [CrossRef]
Vallée, C.; Lerintiu, C. Convex analysis and entropy calculation in statistical mechanics. Proc. A Razmadze Math. Inst. 2005, 137, 111–129. [Google Scholar]
Marle, C.M. From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy 2016, 18, 370. [Google Scholar] [CrossRef]
De Saxcé, G. Link between lie group statistical mechanics and thermodynamics of continua. In Special Issue MDPI Entropy “Differential Geometrical Theory of Statistics”; MDPI: Basel, Switzerland, 2016; Volume 18, p. 254. [Google Scholar]
Barbaresco, F. Koszul information geometry and souriau geometric temperature/capacity of lie group thermodynamics. Entropy 2014, 16, 4521–4565. [Google Scholar] [CrossRef]
Souriau, J.M. Structure des Systèmes Dynamiques; Editions Jacques Gabay: Paris, France, 1970. (In French) [Google Scholar]
Souriau, J.M. Structure of Dynamical Systems, volume 149 of Progress in Mathematics. In A Symplectic View of Physics; Birkhäuser: Basel, Switzerland, 1997. [Google Scholar]
Nielsen, F.; Barbaresco, F. Geometric Science of Information; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar]
Kosmann-Schwarzbach, Y. La géométrie de Poisson, création du XXième siècle. In Siméon-Denis Poisson; Ecole Polytechnique: Paris, France, 2013; pp. 129–172. [Google Scholar]
Bismut, J.M. Mécanique Aléatoire; Springer: Berlin/Heidelberg, Germany, 1981; Volume 866. [Google Scholar]
Casas-Vázquez, J.; Jou, D. Temperature in non-equilibrium states: A review of open problems and current proposals. Rep. Prog. Phys. 2003, 66, 1937–2023. [Google Scholar] [CrossRef]
Streater, R.F. The information manifold for relatively bounded potentials. Tr. Mat. Inst. Steklova 2000, 228, 217–235. [Google Scholar]
Arnold, V.I. Sur la géométrie différentielle des groupes de Lie de dimension infinie et ses applications à l’hydrodynamique des fluides parfaits. Ann. Inst. Fourier 1966, 16, 319–361. [Google Scholar] [CrossRef]
Arnold, V.I.; Givental, A.B. Symplectic geometry. In Dynamical Systems IV: Symplectic Geometry and Its Applications, Encyclopaedia of Mathematical Sciences; Arnol’d, V.I., Novikov, S.P., Eds.; Springer: Berlin, Germany, 1990; Volume 4, pp. 1–136. [Google Scholar]
Marle, C.M.; de Saxcé, G.; Vallée, C. L’oeuvre de Jean-Marie Souriau, Gazette de la SMF, Hommage à Jean-Marie Souriau. 2012; Published by SMF, Paris. [Google Scholar]
Patrick Iglesias, Itinéraire d’un Mathématicien: Un entretien Avec Jean-Marie Souriau, Le Journal de Maths des Elèves. Available online: http://www.lutecium.fr/jp-petit/science/gal_port/interview_Souriau.pdf (accessed on 27 October 2016). (In French)
Iglesias, P. Symétries et Moment; Hermann: Paris, France, 2000. [Google Scholar]
Kosmann-Schwarzbach, Y. Groupes et Symmetries; Ecole Polytechnique: Paris, France, 2006. [Google Scholar]
Kosmann-Schwarzbach, Y. En homage à Jean-Marie Souriau, quelques souvenirs. Gazette des Mathématiciens 2012, 133, 105–106. [Google Scholar]
Ghys, E. Actions localement libres du groupe affine. Invent. Math. 1985, 82, 479–526. [Google Scholar] [CrossRef]
Rais, M. La representation coadjointe du groupe affine. Ann. Inst. Fourier 1978, 28, 207–237. (In French) [Google Scholar] [CrossRef]
Souriau, J.M. Mécanique des états condensés de la matière. In Proceedings of the 1st International Seminar of Mechanics Federation of Grenoble, Grenoble, France, 19–21 May 1992. (In French)
Souriau, J.M. Géométrie de l’espace de phases. Commun. Math. Phys. 1966, 374, 1–30. (In French) [Google Scholar]
Souriau, J.M. Définition covariante des équilibres thermodynamiques. Nuovo Cimento 1966, 1, 203–216. (In French) [Google Scholar]
Souriau, J.M. Mécanique Statistique, Groupes de Lie et Cosmologie; Colloques Internationaux du CNRS Numéro 237: Paris, France, 1974; pp. 59–113. (In French) [Google Scholar]
Souriau, J.M. Géométrie Symplectique et Physique Mathématique; Éditions du C.N.R.S.: Paris, France, 1975. (In French) [Google Scholar]
Souriau, J.M. Thermodynamique Relativiste des Fluides; Centre de Physique Théorique: Marseille, France, 1977. (In French) [Google Scholar]
Souriau, J.M. Interprétation Géométrique des États Quantiques; Springer: Berlin/Heidelberg, Germany, 1977; Volume 570. (In French) [Google Scholar]
Souriau, J.M. Thermodynamique et géométrie. In Differential Geometrical Methods in Mathematical Physics II; Bleuler, K., Reetz, A., Petry, H.R., Eds.; Springer: Berlin/Heidelberg, Germany, 1978; pp. 369–397. (In French) [Google Scholar]
Souriau, J.M. Dynamic Systems Structure, Chapters 16–19. Unpublished work. 1980.
Souriau, J.M.; Iglesias, P. Heat Cold and Geometry. Differential Geometry and Mathematical Physics, Mathematical Physics Studies Volume; Springer: Amsterdam, The Netherlands, 1983; pp. 37–68. [Google Scholar]
Souriau, J.M. Mécanique classique et géométrie symplectique. CNRS Marseille. Cent. Phys. Théor. Report ref. CPT-84/PE-1695 1984. (In French)
Souriau, J.M. On Geometric Mechanics. Discret. Cont. Dyn. Syst. J. 2007, 19, 595–607. [Google Scholar] [CrossRef]
Laplace, P.S. Mémoire sur la probabilité des causes par les événements. In Mémoires de Mathématique et de Physique; De l’Imprimerie Royale: Paris, France, 1774. (In French) [Google Scholar]
Gibbs, J.W. Elementary principles in statistical mechanics. In The Rational Foundation of Thermodynamics; Scribner: New York, NY, USA, 1902. [Google Scholar]
Ruelle, D.P. Thermodynamic Formalism; Addison-Wesley: New York, NY, USA, 1978. [Google Scholar]
Ruelle, D.P. Extending the definition of entropy to nonequilibrium steady states. Proc. Natl. Acad. Sci. USA 2003, 100, 3054–3058. [Google Scholar] [CrossRef] [PubMed]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev. 1957, 108, 171–190. [Google Scholar] [CrossRef]
Jaynes, E.T. Prior probabilities. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 227–241. [Google Scholar] [CrossRef]
Jaynes, E.T. The well-posed problem. Found. Phys. 1973, 3, 477–493. [Google Scholar] [CrossRef]
Jaynes, E.T. Where do we stand on maximum entropy? In The Maximum Entropy Formalism; Levine, R.D., Tribus, M., Eds.; MIT Press: Cambridge, MA, USA, 1979; pp. 15–118. [Google Scholar]
Jaynes, E.T. The minimum entropy production principle. Annu. Rev. Phys. Chem. 1980, 31, 579–601. [Google Scholar] [CrossRef]
Jaynes, E.T. On the rationale of maximum entropy methods. IEEE Proc. 1982, 70, 939–952. [Google Scholar] [CrossRef]
Jaynes, E.T. Papers on Probability, Statistics and Statistical Physics; Reidel: Dordrecht, The Netherlands, 1982. [Google Scholar]
Ollivier, Y. Aspects de l’entropie en Mathématiques et en Physique (Théorie de l’information, Systèmes Dynamiques, Grandes Déviations, Irréversibilité). Available online: http://www.yann-ollivier.org/entropie/entropie.pdf (accessed on 7 August 2015). (In French)
Villani, C. (Ir)réversibilité et Entropie. Available online: http://www.bourbaphy.fr/villani.pdf (accessed on 5 August 2015). (In French)
Godement, R. Introduction à la Théorie des Groupes de Lie; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
Guichardet, A. Cohomologie des Groupes Topologiques et des Algèbres de Lie; Cedic/Fernand Nathan: Paris, France, 1980. [Google Scholar]
Guichardet, A. La méthode des orbites: Historique, principes, résultats. In Leçons de Mathématiques D’aujourd’hui; Cassini: Paris, France, 2010; Volume 4, pp. 33–59. [Google Scholar]
Guichardet, A. Le Problème de Kepler, Histoire et Théorie; Ecole Polytechnique: Paris, France, 2012. [Google Scholar]
Dubois, J.G.; Dufour, J.P. La théorie des catastrophes. V. Transformées de Legendre et thermodynamique. In Annales de l’IHP Physique Théorique; Institut Henri Poincaré: Paris, France, 1978; Volume 29, pp. 1–50. [Google Scholar]
Monge, G. Sur le Calcul Intégral des Equations aux Différences Partielles; Mémoires de l’Académie des Sciences: Paris, France, 1784; pp. 118–192. (In French) [Google Scholar]
Moreau, J.J. Fonctions convexes duales et points proximaux dans un espace hilbertien. C. R. Acad. Sci. 1962, 255, 2897–2899. (In French) [Google Scholar]
Plastino, A.; Plastino, A.R. On the Universality of thermodynamics’ Legendre transform structure. Phys. Lett. A 1997, 226, 257–263. [Google Scholar] [CrossRef]
Friedrich, T. Die Fisher-Information und symplektische Strukturen. Math. Nachr. 1991, 153, 273–296. (In German) [Google Scholar] [CrossRef]
Massieu, F. Sur les Fonctions caractéristiques des divers fluides. C. R. Acad. Sci. 1869, 69, 858–862. (In French) [Google Scholar]
Massieu, F. Addition au précédent Mémoire sur les Fonctions caractéristiques. C. R. Acad. Sci. 1869, 69, 1057–1061. (In French) [Google Scholar]
Massieu, F. Exposé des Principes Fondamentaux de la Théorie Mécanique de la Chaleur (note Destinée à Servir D’introduction au Mémoire de L’auteur sur les Fonctions Caractéristiques des Divers Fluides et la Théorie des Vapeurs); Académie des Sciences: Paris, France, 1873; p. 31. (In French) [Google Scholar]
Massieu, F. Thermodynamique: Mémoire sur les Fonctions Caractéristiques des Divers Fluides et sur la Théorie des Vapeurs; Académie des Sciences: Paris, France, 1876; p. 92. (In French) [Google Scholar]
Massieu, F. Sur les Intégrales Algébriques des Problèmes de Mécanique. Suivie de Sur le Mode de Propagation des Ondes Planes et la Surface de L’onde Elémentaire dans les Cristaux Biréfringents à Deux Axes. Ph.D. Thesis, Faculté des Sciences de Paris, Paris, France, 1861. [Google Scholar]
Nivoit, E. Notice sur la vie et les Travaux de M. Massieu, Inspecteur Général des Mines. Available online: http://facultes19.ish-lyon.cnrs.fr/fiche.php?indice=1153 (accessed 27 October).
Gibbs, J.W. Graphical Methods in the Thermodynamics of Fluids. In The Scientific Papers of J. Willard Gibbs; Dover: New York, NY, USA, 1961. [Google Scholar]
Brillouin, L. Science and Information Theory; Academic Press: New York, NY, USA, 1956. [Google Scholar]
Brillouin, L. Maxwell’s demon cannot operate: Information and entropy. J. Appl. Phys. 1951, 22, 334–337. [Google Scholar] [CrossRef]
Brillouin, L. Physical entropy and information. J. Appl. Phys. 1951, 22, 338–343. [Google Scholar] [CrossRef]
Brillouin, L. Negentropy principle of information. J. Appl. Phys. 1953, 24, 1152–1163. [Google Scholar] [CrossRef]
Duhem, P. Sur les équations générales de la thermodynamique. In Annales scientifiques de l’École Normale Supérieure; Ecole Normale Supérieure: Paris, France, 1891; Volume 8, pp. 231–266. (In French) [Google Scholar]
Duhem, P. Commentaire aux principes de la Thermodynamique—Première partie. J. Math. Appl. 1892, 8, 269–330. (In French) [Google Scholar]
Duhem, P. Commentaire aux principes de la Thermodynamique—Troisième partie. J. Math. Appl. 1894, 10, 207–286. (In French) [Google Scholar]
Duhem, P. Les théories de la chaleur. Revue des deux Mondes 1895, 130, 851–868. [Google Scholar]
Carathéodory, C. Untersuchungen über die Grundlagen der Thermodynamik (Examination of the foundations of thermodynamics). Math. Ann. 1909, 67, 355–386. [Google Scholar] [CrossRef]
Carnot, S. Réflexions sur la Puissance Motrice du feu; Dover: New York, NY, USA, 1960. [Google Scholar]
Clausius, R. On the Mechanical Theory of Heat; Browne, W.R., Translator; Macmillan: London, UK, 1879. [Google Scholar]
Darrigol, O. The Origins of the Entropy Concept. Available online: http://www.bourbaphy.fr/darrigol.pdf (accessed on 5 August 2015). (In French)
Gromov, M. In a Search for a Structure, Part 1: On Entropy. Available online: http://www.ihes.fr/~gromov/PDF/structre-serch-entropy-july5-2012.pdf (accessed on 6 August 2015).
Gromov, M. Six Lectures on Probability, Symmetry, Linearity. Available online: http://www.ihes.fr/~gromov/PDF/probability-huge-Lecture-Nov-2014.pdf (accessed on 6 August 2015).
Gromov, M. Metric Structures for Riemannian and Non-Riemannian Spaces (Modern Birkhäuser Classics), 3rd ed.; Lafontaine, J., Pansu, P., Eds.; Birkhäuser: Basel, Switzerland, 2006. [Google Scholar]
Kozlov, V.V. Heat equilibrium by Gibbs and Poincaré. Dokl. RAN 2002, 382, 602–606. (In French) [Google Scholar]
Poincaré, H. Sur les tentatives d’explication mécanique des principes de la thermodynamique. C. R. Acad. Sci. 1889, 108, 550–553. [Google Scholar]
Poincaré, H. Thermodynamique, Cours de Physique Mathématique. Available online: http://gallica.bnf.fr/ark:/12148/bpt6k2048983 (accessed on 24 October 2016). (In French)
Poincaré, H. Calcul des Probabilités; Gauthier-Villars: Paris, France, 1896. (In French) [Google Scholar]
Poincaré, H. Réflexions sur la théorie cinétique des gaz. J. Phys. Theor. Appl. 1906, 5, 369–403. [Google Scholar] [CrossRef]
Fourier, J. Théorie Analytique de la Chaleur; Chez Firmin Didot: Paris, France, 1822. (In French) [Google Scholar]
Clausius, R. Théorie Mécanique de la Chaleur; Lacroix: Paris, France, 1868. (In French) [Google Scholar]
Poisson, S.D. Théorie Mathématique de la Chaleur; Bachelier: Paris, France, 1835. (In French) [Google Scholar]
Kosmann-Schwarzbach, Y. Siméon-Denis Poisson: Les Mathématiques au Service de la Science; Ecole Polytechnique: Paris, France, 2013. (In French) [Google Scholar]
Smale, S. Topology and Mechanics. Invent. Math. 1970, 10, 305–331. [Google Scholar] [CrossRef]
Cushman, R.; Duistermaat, J.J. The quantum mechanical spherical pendulum. Bull. Am. Math. Soc. 1988, 19, 475–479. [Google Scholar] [CrossRef]
Guillemin, V.; Sternberg, S. The moment map and collective motion. Ann. Phys. 1980, 127, 220–253. [Google Scholar] [CrossRef]
De Saxcé, G.; Vallée, C. Bargmann group, momentum tensor and Galilean invariance of Clausius-Duhem Inequality. Int. J. Eng. Sci. 2012, 50, 216–232. [Google Scholar] [CrossRef]
De Saxcé, G. Entropy and structure for the thermodynamic systems. In Geometric Science of Information, Second International Conference GSI 2015 Proceedings; Nielsen, F., Barbaresco, F., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 519–528. [Google Scholar]
Kapranov, M. Thermodynamics and the moment map. 2011; arXiv:1108.3472v1. [Google Scholar]
Pavlov, V.P.; Sergeev, V.M. Thermodynamics from the differential geometry standpoint. Theor. Math. Phys. 2008, 157, 1484–1490. [Google Scholar] [CrossRef]
Cartier, P.; DeWitt-Morette, C. Functional Integration. Action and Symmetries; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Libermann, P.; Marle, C.M. Symplectic Geometry and Analytical Mechanics; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1987. [Google Scholar]
Lichnerowicz, A. Espaces homogènes Kähleriens. In Colloque de Géométrie Différentielle; CNRSP: Paris, France, 1953; pp. 171–184. (In French) [Google Scholar]
Lichnerowicz, A. Représentation Coadjointe Quotient et Espaces Homogènes de Contact, cours du Collège de France; Springer: Berlin/Heidelberg, Germany, 1986. (In French) [Google Scholar]
Balian, R.; Valentin, P. Hamiltonian structure of thermodynamics with gauge. Eur. Phys. J. B 2001, 21, 269–282. [Google Scholar] [CrossRef]
Marle, C.M. On Henri Poincaré’s note “Sur une forme nouvelle des équations de la mécanique”. J. Geom. Symmetry Phys. 2013, 29, 1–38. [Google Scholar]
Poincaré, H. Sur une forme nouvelle des équations de la Mécanique. C. R. Acad. Sci. 1901, 7, 369–371. (In French) [Google Scholar]
Sternberg, S. Symplectic homogeneous spaces. Trans. Am. Math. Soc. 1975, 212, 113–130. [Google Scholar] [CrossRef]
Bourguignon, J.P. Calcul Variationnel; Ecole Polytechnique: Paris, France, 2007. (In French) [Google Scholar]
Dedecker, P. A property of differential forms in the calculus of variations. Pac. J. Math. 1957, 7, 1545–1549. [Google Scholar] [CrossRef]
Marle, C.M. On mechanical systems with a Lie group as configuration space. In Jean Leray ’99 Conference Proceedings; De Gosson, M., Ed.; Springer: Berlin/Heidelberg, Germany, 2003; pp. 183–203. [Google Scholar]
Marle, C.M. Symmetries of Hamiltonian systems on symplectic and Poisson manifolds. In Similarity and Symmetry Methods; Springer: Berlin/Heidelberg, Germany, 2014; pp. 185–269. [Google Scholar]
Kirillov, A.A. Merits and demerits of the orbit method. Bull. Am. Math. Soc. 1999, 36, 433–488. [Google Scholar] [CrossRef]
Cartan, E. La structure des groupes de transformations continus et la théorie du trièdre mobile. Bull. Sci. Math. 1910, 34, 250–284. (In French) [Google Scholar]
Cartan, E. Leçons sur les Invariants Intégraux; Hermann: Paris, France, 1922. (In French) [Google Scholar]
Cartan, E. Les récentes généralisations de la notion d’espace. Bull. Sci. Math. 1924, 48, 294–320. (In French) [Google Scholar]
Cartan, E. Le rôle de la Théorie des Groupes de Lie dans L’évolution de la Géométrie Moderne; C.R. Congrès International: Oslo, Norway, 1936; Volume 1, pp. 92–103. (In French) [Google Scholar]
Libermann, P. La géométrie différentielle d’Elie Cartan à Charles Ehresmann et André Lichnerowicz. In Géométrie au XXe siècle, 1930-2000: Histoire et Horizons; Hermann: Paris, France, 2005. (In French) [Google Scholar]
Koszul, J.L. Sur la forme hermitienne canonique des espaces homogènes complexes. Can. J. Math. 1955, 7, 562–576. (In French) [Google Scholar] [CrossRef]
Koszul, J.L. Exposés sur les Espaces Homogènes Symétriques; Publicação da Sociedade de Matematica de São Paulo: São Paulo, Brazil, 1959. (In French) [Google Scholar]
Koszul, J.L. Domaines bornés homogènes et orbites de groupes de transformations affines. Bull. Soc. Math. Fr. 1961, 89, 515–533. (In French) [Google Scholar]
Koszul, J.L. Ouverts convexes homogènes des espaces affines. Math. Z. 1962, 79, 254–259. (In French) [Google Scholar] [CrossRef]
Koszul, J.L. Variétés localement plates et convexité. Osaka J. Math. 1965, 2, 285–290. (In French) [Google Scholar]
Koszul, J.L. Lectures on Groups of Transformations; Tata Institute of Fundamental Research: Bombay, India, 1965. [Google Scholar]
Koszul, J.L. Déformations des variétés localement plates. Ann. Inst. Fourier 1968, 18, 103–114. (In French) [Google Scholar] [CrossRef]
Koszul, J.L. Trajectoires convexes de groupes affines unimodulaires. In Essays on Topology and Related Topics; Springer: Berlin, Germany, 1970; pp. 105–110. [Google Scholar]
Vinberg, E.B. The theory of homogeneous convex cones. Trudy Moskovskogo Matematicheskogo Obshchestva 1963, 12, 303–358. [Google Scholar]
Vinberg, E.B. Structure of the group of automorphisms of a homogeneous convex cone. Trudy Moskovskogo Matematicheskogo Obshchestva 1965, 13, 56–83. (In Russian) [Google Scholar]
Byande, P.M.; Ngakeu, F.; Boyom, M.N.; Wolak, R. KV-cohomology and differential geometry of affinely flat manifolds. Information geometry. Afr. Diaspora J. Math. 2012, 14, 197–226. [Google Scholar]
Byande, P.M. Des Structures Affines à la Géométrie de L’information; Omniscriptum: Saarbrücken, Germany, 2012. [Google Scholar]
Nguiffo Boyom, M. Sur les structures affines homotopes à zéro des groupes de Lie. J. Differ. Geom. 1990, 31, 859–911. (In French) [Google Scholar]
Nguiffo Boyom, M. Structures localement plates dans certaines variétés symplectiques. Math. Scand. 1995, 76, 61–84. (In French) [Google Scholar] [CrossRef]
Nguiffo Boyom, M. The cohomology of Koszul-Vinberg algebras. Pac. J. Math. 2006, 225, 119–153. [Google Scholar] [CrossRef]
Nguiffo Boyom, M. Some Lagrangian Invariants of Symplectic Manifolds, Geometry and Topology of Manifolds; Banach Center Institute of Mathematics, Polish Academy of Sciences: Warsaw, Poland, 2007; Volume 76, pp. 515–525. [Google Scholar]
Nguiffo Boyom, M. Métriques kählériennes affinement plates de certaines variétés symplectiques. I. Proc. Lond. Math. Soc. 1993, 2, 358–380. (In French) [Google Scholar] [CrossRef]
Nguiffo Boyom, M.; Byande, P.M. KV Cohomology in Information Geometry Matrix Information Geometry; Springer: Heidelberg, Germany, 2013; pp. 69–92. [Google Scholar]
Nguiffo Boyom, M. Transversally Hessian foliations and information geometry I. Am. Inst. Phys. Proc. 2014, 1641, 82–89. [Google Scholar]
Nguiffo Boyom, M.; Wolak, R. Transverse Hessian metrics information geometry MaxEnt 2014. AIP. Conf. Proc. Am. Inst. Phys. 2015. [Google Scholar] [CrossRef]
Vey, J. Sur une Notion D’hyperbolicité des Variétés Localement Plates. Thèse de Troisième Cycle de Mathématiques Pures; Faculté des Sciences de l’université de Grenoble: Grenoble, France, 1969. (In French) [Google Scholar]
Vey, J. Sur les automorphismes affines des ouverts convexes saillants. Annali della Scuola Normale Superiore di Pisa. Classe Sci. 1970, 24, 641–665. (In French) [Google Scholar]
Barbaresco, F. Koszul information geometry and Souriau Lie group thermodynamics. In AIP Conference Proceedings, Proceedings of MaxEnt’14 Conference, Amboise, France, 21–26 September 2014.
Lesne, A. Shannon entropy: A rigorous notion at the crossroads between probability, information theory, dynamical systems and statistical physics. Math. Struct. Comput. Sci. 2014, 24, e240311. [Google Scholar] [CrossRef]
Fréchet, M.R. Sur l’extension de certaines évaluations statistiques au cas de petits échantillons. Rev. Inst. Int. Stat. 1943, 11, 182–205. (In French) [Google Scholar] [CrossRef]
Fréchet, M.R. Les espaces abstraits topologiquement affines. Acta Math. 1925, 47, 25–52. [Google Scholar] [CrossRef]
Fréchet, M.R. Les éléments aléatoires de nature quelconque dans un espace distancié. Ann. Inst. Henri Poincaré 1948, 10, 215–310. [Google Scholar]
Fréchet, M.R. Généralisations de la loi de probabilité de Laplace. Ann. Inst. Henri Poincaré 1951, 12, 1–29. (In French) [Google Scholar]
Shima, H. The Geometry of Hessian Structures; World Scientific: Singapore, 2007. [Google Scholar]
Shima, H. Geometry of Hessian Structures. In Springer Lecture Notes in Computer Science; Nielsen, F., Barbaresco, F., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 8085, pp. 37–55. [Google Scholar]
Crouzeix, J.P. A relationship between the second derivatives of a convex function and of its conjugate. Math. Program. 1977, 3, 364–365. [Google Scholar] [CrossRef]
Hiriart-Urruty, J.B. A new set-valued second-order derivative for convex functions. In Mathematics for Optimization; Elsevier: Amsterdam, The Netherlands, 1986. [Google Scholar]
Bakhvalov, N.S. Memorial: Nikolai Nikolaevitch Chentsov. Theory Probab. Appl. 1994, 38, 506–515. [Google Scholar] [CrossRef]
Chentsov, N.N. Statistical Decision Rules and Optimal Inference; American Mathematical Society: Providence, RI, USA, 1982. [Google Scholar]
Berezin, F. Quantization in complex symmetric spaces. Izv. Akad. Nauk SSSR Ser. Math. 1975, 9, 363–402. [Google Scholar] [CrossRef]
Bhatia, R. Positive Definite Matrices; Princeton University Press: Princeton, NJ, USA, 2007. [Google Scholar]
Bhatia, R. The bipolar decomposition. Linear Algebra Appl. 2013, 439, 3031–3037. [Google Scholar] [CrossRef]
Bini, D.A.; Garoni, C.; Iannazzo, B.; Capizzano, S.S.; Sesana, D. Asymptotic Behaviour and Computation of Geometric-Like Means of Toeplitz Matrices. In SLA14 Conference, Kalamata, Greece, September 2014; Available online: http://noether.math.uoa.gr/conferences/sla2014/sites/default/files/Iannazzo.pdf (accessed on 8–12 September 2014).
Bini, D.A.; Garoni, C.; Iannazzo, B.; Capizzano, S.S. Geometric means of Toeplitz matrices by positive parametrizations. 2016, in press. [Google Scholar]
Calvo, M.; Oller, J.M. An explicit solution of information geodesic equations for the multivariate normal model. Stat. Decis. 1991, 9, 119–138. [Google Scholar] [CrossRef]
Calvo, M.; Oller, J.M. A distance between multivariate normal distributions based in an embedding into the Siegel group. J. Multivar. Anal. Arch. 1990, 35, 223–242. [Google Scholar] [CrossRef]
Calvo, M.; Oller, J.M. A distance between elliptical distributions based in an embedding into the Siegel group. J. Comput. Appl. Math. 2002, 145, 319–334. [Google Scholar] [CrossRef]
Chevallier, E.; Barbaresco, F.; Angulo, J. Probability density estimation on the hyperbolic space applied to radar processing. In Geometric Science of Information Proceedings; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 753–761. [Google Scholar]
Chevallier, E.; Forget, T.; Barbaresco, F.; Angulo, J. Kernel Density Estimation on the Siegel Space Applied to Radar Processing. Available online: https://hal-ensmp.archives-ouvertes.fr/hal-01344910/document (accessed on 24 October 2016).
Costa, S.I.R.; Santos, S.A.; Strapasson, J.E. Fisher information distance: A geometrical reading. Discret. Appl. Math. 2015, 197, 59–69. [Google Scholar] [CrossRef]
Jeuris, B.; Vandebril, R.; Vandereycken, B. A survey and comparison of contemporary algorithms for computing the matrix geometric mean. Electron. Trans. Numer. Anal. 2012, 39, 379–402. [Google Scholar]
Jeuris, B. Riemannian Optimization for Averaging Positive Definite Matrices. Ph.D. Thesis, Katholieke Universiteit Leuven, Leuven, Belgium, 2015. [Google Scholar]
Jeuris, B.; Vandebril, R. The Kähler Mean of Block-Toeplitz Matrices with Toeplitz Structured Blocks; Department of Computer Science, KU Leuven: Leuven, Belgium, 2015. [Google Scholar]
Malliavin, P. Invariant or quasi-invariant probability measures for infinite dimensional groups, Part II: Unitarizing measures or Berezinian measures. Jpn. J. Math. 2008, 3, 19–47. [Google Scholar] [CrossRef]
Strapasson, J.E.; Porto, J.P.S.; Costa, S.I.R. On bounds for the Fisher-Rao distance between multivariate normal distributions. AIP Conf. Proc. 2015, 1641, 313–320. [Google Scholar]
Hua, L.K. Harmonic Analysis of Functions of Several Complex Variables in the Classical Domains; American Mathematical Society: Providence, RI, USA, 1963. [Google Scholar]
Siegel, C.L. Symplectic geometry. Am. J. Math. 1943, 65, 1–86. [Google Scholar] [CrossRef]
Yoshizawa, S.; Tanabe, K. Dual differential geometry associated with the Kullback-Leibler information on the Gaussian distributions and its 2-parameters deformations. SUT J. Math. 1999, 35, 113–137. [Google Scholar]
Skovgaard, L.T. A Riemannian Geometry of the Multivariate Normal Model; Technical Report for Stanford University: Stanford, CA, USA, April 1981. [Google Scholar]
Deza, M.M.; Deza, E. Encyclopedia of Distances, 3rd ed.; Springer: Berlin/Heidelberg, Germany, 2013; p. 242. [Google Scholar]
Casalis, M. Familles exponentielles naturelles invariantes par un groupe de translations. C. R. Acad. Sci. Ser. I Math. 1988, 307, 621–623. (In French) [Google Scholar]
Casalis, M. Familles Exponentielles Naturelles Invariantes par un Groupe. Ph.D. Thesis, Thèse de l’Université Paul Sabatier, Toulouse, France, 1990. [Google Scholar]
Casalis, M. Familles exponentielles naturelles sur R^d invariantes par un groupe. Int. Stat. Rev. 1991, 59, 241–262. (In French) [Google Scholar] [CrossRef]
Casalis, M. Les familles exponentielles à variance quadratique homogène sont des lois de Wishart sur un cône symétrique. C. R. Acad. Sci. Ser. I Math. 1991, 312, 537–540. (In French) [Google Scholar]
Casalis, M.; Letac, G. Characterization of the Jørgensen set in generalized linear models. Test 1994, 3, 145–162. [Google Scholar] [CrossRef]
Casalis, M.; Letac, G. The Lukacs-Olkin-Rubin characterization of the Wishart distributions on symmetric cone. Ann. Stat. 1996, 24, 763–786. [Google Scholar] [CrossRef]
Casalis, M. The 2d + 4 simple quadratic natural exponential families on R^d. Ann. Stat. 1996, 24, 1828–1854. [Google Scholar]
Letac, G. A characterization of the Wishart exponential families by an invariance property. J. Theor. Probab. 1989, 2, 71–86. [Google Scholar] [CrossRef]
Letac, G. Lectures on Natural Exponential Families and Their Variance Functions, Volume 50 of Monografias de Matematica (Mathematical Monographs); Instituto de Matematica Pura e Aplicada (IMPA): Rio de Janeiro, Brazil, 1992. [Google Scholar]
Letac, G. Les familles exponentielles statistiques invariantes par les groupes du cône et du paraboloïde de révolution. In Journal of Applied Probability, Volume 31, Studies in Applied Probability; Takacs, L., Galambos, J., Gani, J., Eds.; Applied Probability Trust: Sheffield, UK, 1994; pp. 71–95. [Google Scholar]
Barndorff-Nielsen, O.E. Differential geometry and statistics: Some mathematical aspects. Indian J. Math. 1987, 29, 335–350. [Google Scholar]
Barndorff-Nielsen, O.E.; Jupp, P.E. Yokes and symplectic structures. J. Stat. Plan Inference 1997, 63, 133–146. [Google Scholar] [CrossRef]
Barndorff-Nielsen, O.E.; Jupp, P.E. Statistics, yokes and symplectic geometry. Annales de la Faculté des sciences de Toulouse: Mathématiques 1997, 6, 389–427. [Google Scholar] [CrossRef]
Barndorff-Nielsen, O.E. Information and Exponential Families in Statistical Theory; Wiley: New York, NY, USA, 2014. [Google Scholar]
Jespersen, N.C.B. On the structure of transformation models. Ann. Stat. 1999, 17, 195–208. [Google Scholar]
Skovgaard, L.T. A Riemannian geometry of the multivariate normal model. Scand. J. Stat. 1984, 11, 211–223. [Google Scholar]
Han, M.; Park, F.C. DTI segmentation and fiber tracking using metrics on multivariate normal distributions. J. Math. Imaging Vis. 2014, 49, 317–334. [Google Scholar] [CrossRef]
Imai, T.; Takaesu, A.; Wakayama, M. Remarks on geodesics for multivariate normal models. J. Math. Ind. 2011, 3, 125–130. [Google Scholar]
Inoue, H. Group theoretical study on geodesics for the elliptical models. In Geometric Science of Information Proceedings; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 605–614. [Google Scholar]
Pilté, M.; Barbaresco, F. Tracking quality monitoring based on information geometry and geodesic shooting. In Proceedings of the 17th International Radar Symposium (IRS), Krakow, Poland, 10–12 May 2016; pp. 1–6.
Eriksen, P.S. (k, 1) Exponential transformation models. Scand. J. Stat. 1984, 11, 129–145. [Google Scholar]
Eriksen, P. Geodesics Connected with the Fisher Metric on the Multivariate Normal Manifold; Technical Report 86-13; Institute of Electronic Systems, Aalborg University: Aalborg, Denmark, 1986. [Google Scholar]
Eriksen, P.S. Geodesics connected with the Fisher metric on the multivariate normal manifold. In Proceedings of the GST Workshop, Lancaster, UK, 28–31 October 1987.
Feragen, A.; Lauze, F.; Hauberg, S. Geodesic exponential kernels: When curvature and linearity conflict. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 8–10 June 2015; pp. 3032–3042.
Besse, A.L. Einstein Manifolds, Ergebnisse der Mathematik und ihrer Grenzgebiete; Springer: Berlin/Heidelberg, Germany, 1986. [Google Scholar]
Tumpach, A.B. Infinite-dimensional hyperkähler manifolds associated with Hermitian-symmetric affine coadjoint orbits. Ann. Inst. Fourier 2009, 59, 167–197. [Google Scholar] [CrossRef]
Tumpach, A.B. Classification of infinite-dimensional Hermitian-symmetric affine coadjoint orbits. Forum Math. 2009, 21, 375–393. [Google Scholar] [CrossRef]
Tumpach, A.B. Variétés Kählériennes et Hyperkählériennes de Dimension Infinie. Ph.D. Thesis, Ecole Polytechnique, Paris, France, 26 July 2005. [Google Scholar]
Neeb, K.-H. Infinite-dimensional groups and their representations. In Lie Theory; Birkhäuser: Basel, Switzerland, 2004. [Google Scholar]
Gauduchon, P. Calabi’s Extremal Kähler Metrics: An Elementary Introduction. Available online: germanio.math.unifi.it/wp-content/uploads/2015/03/dercalabi.pdf (accessed on 27 October 2016).
Biquard, O.; Gauduchon, P. Hyperkähler Metrics on Cotangent Bundles of Hermitian Symmetric Spaces. Available online: https://www.math.ens.fr/~biquard/aarhus96.pdf (accessed on 27 October 2016).
Biquard, O.; Gauduchon, P. La métrique hyperkählérienne des orbites coadjointes de type symétrique d’un groupe de Lie complexe semi-simple. Comptes Rendus de l’Académie des Sciences 1996, 323, 1259–1264. (In French) [Google Scholar]
Biquard, O.; Gauduchon, P. Géométrie hyperkählérienne des espaces hermitiens symétriques complexifiés. Séminaire de Théorie Spectrale et Géométrie 1998, 16, 127–173. [Google Scholar] [CrossRef]
Chaperon, M. Jets, Transversalité, Singularités: Petite Introduction aux Grandes Idées de René Thom. In Géométrie au Vingtième Siècle, Histoire et Horizons; Kouneiher, J., Flament, D., Nabonnand, P., Szczeciniarz, J.-J., Eds.; Hermann: Paris, France, 2005; pp. 246–256. [Google Scholar]
Chaperon, M. Generating maps, invariant manifolds, conjugacy. J. Geom. Phys. 2015, 87, 76–85. [Google Scholar] [CrossRef]
Viterbo, C. Symplectic topology as the geometry of generating functions. Math. Ann. 1992, 292, 685–710. [Google Scholar] [CrossRef]
Viterbo, C. Generating functions, symplectic geometry and applications. In Proceedings of the International Congress of Mathematics, Zürich, Switzerland, 3–11 August 1994.
Dazord, P.; Weinstein, A. Symplectic, Groupoids, and Integrable Systems; Springer: Berlin/Heidelberg, Germany, 1991; pp. 99–128. [Google Scholar]
Drinfeld, V.G. Hamiltonian structures on Lie groups. Sov. Math. Dokl. 1983, 27, 68–71. [Google Scholar]
Thom, R. Une théorie dynamique de la Morphogenèse. In Towards a Theoretical Biology I; Waddington, C.H., Ed.; University of Edinburgh Press: Edinburgh, UK, 1966; pp. 52–166. [Google Scholar]
Thom, R. Stabilité Structurelle et Morphogénèse, 2nd ed.; Inter Editions: Paris, France, 1977. [Google Scholar]
Ingarden, R.S.; Nakagomi, T. The second order extension of the Gibbs state. Open Syst. Inf. Dyn. 1992, 1, 243–258. [Google Scholar] [CrossRef]
Ingarden, R.S.; Meller, J. Temperatures in Linguistics as a Model of Thermodynamics. Open Syst. Inf. Dyn. 1994, 2, 211–230. [Google Scholar] [CrossRef]
Nencka, H.; Streater, R.F. Information Geometry for some Lie algebras. Infin. Dimens. Anal. Quantum Probab. Relat. Top. 1999, 2, 441–460. [Google Scholar] [CrossRef]
Burdet, G.; Perrin, M.; Perroud, M. Generating functions for the affine symplectic group. Comm. Math. Phys. 1978, 3, 241–254. [Google Scholar] [CrossRef]
Berthoz, A. Le Sens du Mouvement; Odile Jacob: Paris, France, 1997. (In French) [Google Scholar]
Afgoustidis, A. Invariant Harmonic Analysis and Geometry in the Workings of the Brain. Available online: https://hal-univ-diderot.archives-ouvertes.fr/tel-01343703 (accessed on 17 October 2016).
Souriau, J.M. Innovaxiom—Interview of Jean-Marie Souriau. Available online: https://www.youtube.com/watch?v=Lb_TWYqBUS4 (accessed on 27 October 2016).
Souriau, J.M. Quantique ? Alors c’est Géométrique. Available online: http://www.ahm.msh-paris.fr/Video.aspx?domain=84fa1a68-95c0-4c74-aed7-06055edaca16&language=fr&metaDescriptionId=dd3bd275-8372-4130-976b-847c36156a83&mediatype=VideoWithShots (accessed on 27 October 2016).
Masseau, D. Les marges des Lumières Françaises (1750–1789); Dix-huitième Siècle Année: Paris, France, 2005; Volume 37, pp. 638–639. (In French)
Cioran, E. Précis de Décomposition Poche; Gallimard: Paris, France, 1977.
Rao, C.R. Information and the accuracy attainable in the estimation of statistical parameters. Bull. Calcutta Math. Soc. 1945, 37, 81–91.
Burbea, J.; Rao, C.R. Entropy differential metric, distance and divergence measures in probability spaces: A unified approach. J. Multivar. Anal. 1982, 12, 575–596.
Legendre, A.M. Mémoire Sur L’intégration de Quelques Equations aux Différences Partielles; Mémoires de l’Académie des Sciences: Paris, France, 1787; pp. 309–351. (In French)
Darboux, G. Leçons sur la Théorie Générale des Surfaces et les Applications Géométriques du Calcul Infinitésimal: Premiere Partie (Généralités, Coordonnées Curvilignes, Surface Minima); Gauthier-Villars: Paris, France, 1887. (In French)
Balian, R.; Alhassid, Y.; Reinhardt, H. Dissipation in many-body systems: A geometric approach based on information theory. Phys. Rep. 1986, 131, 1–146.
Balian, R.; Balazs, N. Equiprobability, inference and entropy in quantum theory. Ann. Phys. 1987, 179, 97–144.
Balian, R. On the principles of quantum mechanics. Am. J. Phys. 1989, 57, 1019–1027.
Balian, R. From Microphysics to Macrophysics: Methods and Applications of Statistical Physics; Springer: Heidelberg, Germany, 1991 & 1992; Volumes I and II.
Balian, R. Incomplete descriptions and relevant entropies. Am. J. Phys. 1999, 67, 1078–1090.
Balian, R. Entropy, a protean concept. In Poincaré Seminar 2003; Dalibard, J., Duplantier, B., Rivasseau, V., Eds.; Birkhäuser: Basel, Switzerland, 2004; pp. 119–144.
Balian, R. Information in statistical physics. In Studies in History and Philosophy of Modern Physics, Part B; Elsevier: Amsterdam, The Netherlands, 2005.
Balian, R. The entropy-based quantum metric. Entropy 2014, 16, 3878–3888.
Balian, R. François Massieu et les Potentiels Thermodynamiques, Évolution des Disciplines et Histoire des Découvertes; Académie des Sciences: Paris, France, April 2015.

Figure 1. Souriau’s scheme of the mysterious “affine group” of a true thermodynamics, situated between the Galileo group of classical mechanics, the Poincaré group of relativistic mechanics, and the smooth group of general relativity.

Figure 2. Extract from the second paper of François Massieu to the French Academy of Sciences [61,62].

Figure 3. Remark of Massieu in his 1876 paper [64], in which he explains why he followed the “good advice” of Bertrand to replace the variable 1/T, used in his initial paper of 1869, with the variable T.

Figure 4. “Théorie analytique de la chaleur” (analytical theory of heat) by Jean Baptiste Joseph Fourier [88], “théorie mécanique de la chaleur” (mechanical theory of heat) by Rudolf Clausius [89], and “théorie mathématique de la chaleur” (mathematical theory of heat) by Siméon-Denis Poisson [90].

Figure 5. Global Souriau scheme of Lie group thermodynamics.

Figure 6. Broken symmetry on geometric heat Q due to adjoint action of the group on temperature β as an element of the Lie algebra.

Figure 7. Fourier heat equation in the seminal manuscript of Joseph Fourier [88].

Figure 8. Clairaut-Legendre equation introduced by Maurice Fréchet in his 1943 paper [141].

Figure 9. Generation of Koszul elements from Cartan inner product.

Figure 10. Introduction of the potential function for the multivariate Gaussian law in Souriau’s book [10].

Figure 11. Affine Lie group action for multivariate Gaussian law.

Figure 12. Maps between algebras.

Figure 13. Geodesic shooting principle.

Figure 14. Geodesic shooting between two multivariate Gaussians in the case n = 2.

Figure 15. Coding of the homogeneous Galileo algebra by the vestibular system and otoliths.

**Table 1.** Comparison of the Souriau and Koszul affine representations of Lie groups and Lie algebras.
| Souriau Model of Affine Representation of Lie Groups and Algebra | Koszul Model of Affine Representation of Lie Groups and Algebra |
| --- | --- |
| $A(g)(x) = R(g)(x) + \theta(g)$ with $g \in G$, $x \in E$; $R : G \to GL(E)$ and $\theta : G \to E$ | $\mathrm{Aff}(s) : a \mapsto s a = f(s) a + q(s)$, $\forall s \in G$, $\forall a \in E$; $f : G \to GL(E)$, $s \mapsto f(s) a = s a - s o$, $\forall a \in E$; $q : G \to E$, $s \mapsto q(s) = s o$, $\forall s \in G$ |
| $\theta(g h) = R(g)(\theta(h)) + \theta(g)$ with $g, h \in G$; $\theta : G \to E$ is a one-cocycle of $G$ with values in $E$ | $q(s t) = f(s) q(t) + q(s)$ |
| $a(X)(x) = r(X)(x) + \Theta(X)$ with $X \in \mathfrak{g}$, $x \in E$; the linear map $\Theta : \mathfrak{g} \to E$ is a one-cocycle of $\mathfrak{g}$ with values in $E$: $\Theta(X) = T_{e}\theta(X(e))$, $X \in \mathfrak{g}$ | $v \mapsto f(X) v + q(X)$, where $f$ and $q$ here denote the differentials of the group-level maps $f$ and $q$, respectively |
| $\Theta([X, Y]) = r(X)(\Theta(Y)) - r(Y)(\Theta(X))$ | $q([X, Y]) = f(X) q(Y) - f(Y) q(X)$, $\forall X, Y \in \mathfrak{g}$, with $f : \mathfrak{g} \to \mathfrak{gl}(E)$ and $q : \mathfrak{g} \to E$ |
| none | $\mathrm{aff}(X) = \begin{bmatrix} f(X) & q(X) \\ 0 & 0 \end{bmatrix}$ |
| none | $\mathrm{Aff}(s) = \begin{bmatrix} f(s) & q(s) \\ 0 & 1 \end{bmatrix}$ |
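
As a hedged illustration of the Koszul column of Table 1, the following minimal Python sketch checks, on one hand-picked example, that multiplying the block matrices $\begin{bmatrix} f(s) & q(s) \\ 0 & 1 \end{bmatrix}$ reproduces the composition rule $q(s t) = f(s) q(t) + q(s)$, with $q(s) = s o$ taken as the image of the origin. The helper names (`matvec`, `matmul`, `apply_affine`, `homogeneous`) and the integer matrices are assumptions made for illustration; they do not come from the article.

```python
# Illustrative sketch only: an affine map a -> F a + q on R^2 is stored as a pair (F, q).
# All helper names and numerical values below are hypothetical.

def matvec(F, v):
    """Matrix-vector product F v for list-of-lists F and list v."""
    return [sum(F[i][j] * v[j] for j in range(len(v))) for i in range(len(F))]

def matmul(A, B):
    """Matrix product A B for list-of-lists matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def apply_affine(s, a):
    """Action s a = f(s) a + q(s) of the affine map s = (F, q) on the point a."""
    F, q = s
    return [x + y for x, y in zip(matvec(F, a), q)]

def homogeneous(s):
    """Block matrix [[f(s), q(s)], [0, 1]] encoding the affine map s."""
    F, q = s
    n = len(q)
    return [F[i] + [q[i]] for i in range(n)] + [[0] * n + [1]]

s = ([[2, 1], [0, 1]], [3, -1])
t = ([[1, 0], [4, 1]], [5, 2])
origin = [0, 0]

# q(st) computed from its definition q(st) = (st) o = s (t o)
q_st = apply_affine(s, apply_affine(t, origin))

# Right-hand side of the cocycle rule: f(s) q(t) + q(s)
rhs = [x + y for x, y in zip(matvec(s[0], t[1]), s[1])]

# The matrix product carries f(s) f(t) in its upper-left block and q(st) in its last column
product = matmul(homogeneous(s), homogeneous(t))
last_column = [row[-1] for row in product[:-1]]

print(q_st, rhs, last_column)
```

Integer entries keep the comparisons exact; with floating-point data a tolerance-based comparison would be the safer choice.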

© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

