© Michel Petitjean

Researcher, Dr., Habil.

Member of the Executive Board of the International Symmetry Association

Author's professional address:

MTi, INSERM UMR-S 973, Université Paris 7

35 rue Hélène Brion, 75205 Paris Cedex 13, France.

petitjean.chiral@gmail.com

Formerly: CEA/DSV/iBiTec-S/SB2SM (CNRS URA 2096), Saclay, France

(and formerly: ITODYS, UMR 7086, CNRS, Université Paris 7).

JE SUIS CHARLIE

My Erdös number is 3 (computed with the MR calculator of the AMS).

The content of this page deals with chirality, symmetry, and probability theory, but most parts are readable by non specialists of these fields. A rigorous presentation is available on the papers cited in the text. These papers are available in PDF format upon request to me. The theory below is not related to physical interactions between light and matter.

Related topics:

- Symmetry, Chirality, Symmetry Measures and Chirality Measures: General Definitions
- An Asymmetry Coefficient for Multivariate Distributions
- A Simple Dataset to Test and Compare Chirality Measures
- The Random-Disorder Paradox
- Is There a Most Scalene Triangle?

- Procrustes Methods, Shape Recognition, Similarity and Docking
- Classification: Computing an Optimal Partition
- Mining Data in Graph Databases
- The Parity Phenomenon in Large Chemical Databases
- The Graphic Mendeleev Table
- The Radius Diameter Diagram
- Virtual Screening of 3D Chemical Data Bases
- [6.6]Chiralane: The chemical nomenclature challenge
- Molecular Symmetry and Chirality
- Molecular shape descriptors: the cylindrical model
- Spheres Unions and Intersections
- Analytical Computation of van der Waals Surfaces and Volumes
- Freewares

A modern definition of chirality based on group theory exists. It works in non-orientable spaces and recovers the historical definition. This latter is due to Lord Kelvin, and is reproduced below:

<< I call any geometrical figure, or group of points,

Lord Kelvin,

Appendix H., § 22, footnote p. 619.

C.J. Clay and Sons, Cambridge University Press Warehouse, London 1904.

Lord Kelvin's definition of chirality is in accordance with the ancient and intuitive idea that indirect symmetry is, just as direct symmetry, a dichotomous property of an euclidean set. In other words, a set IS or IS NOT chiral, and a set IS or IS NOT symmetric.

Consider the four vertices of a square in the 3D space. It is an achiral set. Even a slight perturbation of the coordinates of the vertices could lead to a chiral figure, delimiting a tetrahedron. If we need to recognize this perturbated figure as being achiral, how could it be idealized? Moreover, any 4-tuple of points may be considered as a perturbated square or a perturbated regular tetrahedron. Does it mean that all chiral 4-tuple of points are both chiral and achiral? No. So, there is a need to quantify chirality as a "continuous" measure, and a definition of a chirality scale should be provided: a set is LESS or MORE chiral. Achirality is just a limit situation.

A chiral index measuring quantitatively the amount of chirality should handle various situations in the euclidean space:

- Some points could have the same coordinates, then weighted points could exist.

The weights of the points could take any non negative real value.

- The number of points is not limited.

"Continuous" geometric domains and homogeneous solids should be considered.

- Chirality is sensitive to the dimension of the space: a non isoscele triangle is chiral in the plane, and is achiral in the 3D space.
The chirality measure should be defined for a space having any dimension.

The chiral index should not depend of which particular mirror is selected to generate the mirror image, and the chiral index should be insensitive to any isometric transformation of the distribution. A practical way to look at the coincidence of a distribution with its inverted image is to perform translations and rotations of the inverted image to superpose "at best" the two distributions, and to see how similar they are. In other words, the problem of measuring chirality is just a particular instance of the problem of measuring how similar are two classes of equivalence of distributions: the set of all translated and rotated images of the original distribution, and the set of all its translated and rotated and inverted images. Thus, any probability metric measuring the distance between two distributions could be used to measure this similarity. It is why the quantitative measure of chirality is a problem related to the theory of probability metrics [1], and also to the Monge-Kantorovitch transportation problem [2], for which the cost of transporting a distribution of mass to an other one has to be minimized.

What is a "continuous" measure of chirality? Intuitively, two "close" distributions are expected to have "close" chiral indices. In other words, it means that when a sequence of distributions is converging to a limit, the associated sequence of chiral indices should converge to the chiral index of the limiting distribution. Any theory dealing with a "continuous" measure of chirality should specify which kind of convergence is considered in the space of the chiral entities.

Using a probability metric to evaluate the amount of chirality of a distribution does not suffice to handle situations where particles are discernable. Let us consider the eight vertices of a cube, such that each vertex is painted with a different color. Clearly, this cube cannot coincide with its inverted image, and is thus chiral, although the non colored cube is achiral. The same phenomenon arises for the bromo-chloro-fluoro-methane molecule

An other situation is encountered in physics. We assume a simplified model such that the molecule is a mixture of a negative charge distribution, a positive charge distribution, and a mass distribution. The degree of chirality of each distribution could be evaluated separately and discarding the weights (i.e. the ratios of the physical units), although the mixture has its own chirality content. This situation is not different from the previous one. When a color is located on N points, it means in fact that the distribution is concentrated on these N points, or, alternatively, that all the space has the specified color; but most points have a null weight, thus they are invisible.

The adequate formal model is called the colored mixture model [3] (see also the first reference cited in my web page about shape recognition). A random variable X taking values in the product of the d-dimensional euclidean space by the space of the colors (assumed to be measurable) is considered. Its distribution is a mixture of d-variate distributions, each of them being attached to a color. The color may be viewed as a supplementary value on a (d+1)

The chiral index CHI is defined as follows:

X is a colored mixture in R

Y is a colored mixture distributed as an inverted image of X, and submitted to a rotation R and a translation t

W is a joint distribution of X and Y such that the marginals of X and Y in the space of colors are almost surely equal, i.e. the couple (X,Y) has a null probability when X and Y have different colors

T is the inertia of X (or Y), and E denotes the mathematical expectation:

The chiral index depends only on the probability law of X, and is insensitive to isometries.

The normalizing coefficient (d/4T) is such that the chiral index is scale independant, and takes values onto the interval [0..1].

The value 0 indicates that X is achiral.

The optimal translation is known to be such that X and Y should have a null expectation.

Although the definition of the chiral index seems awkward, simpler expressions are available.

c

CHI = (d/2) · ( 1 - [Sup

When X is a mixture of almost surely constant vectors, i.e., when we consider a set of points such that not two of them have the same color, the chiral index is proportional to smallest percentage of inertia of the covariance matrix of the set. L

CHI = d L

In the unidimensional case (with or without colors), the chiral index depends only on the maximal correlation coefficient r between X and Y, Y being distributed as -X:

CHI = ( 1 - Sup

Alternatively, if X and Y are identically distributed on the real line, the chiral index depends only on the minimal correlation r between X and Y:

CHI = ( 1 + Inf

Now, we return back to the multidimensional space, and we consider the non colored situation, i.e., X is now an ordinary random vector. The chiral index is a shape coefficient of the distribution. Note that most statisticians consider that distributions such as the unidimensional gaussian, are symmetric. These latter should be better recognized as being achiral, because achirality is related to indirect symmetry, and because direct symmetry has no sense on the real line.

The chiral index is a better asymmetry coefficient than the skewness, because there are asymmetric distributions having a null skewness, just as for the symmetric (i.e. achiral) distributions. This problem does not occur with the chiral index, because its properties are induced from those of a probability metric.

The general colored mixture model does not handle some situations. When the total mass of a system is infinite, such as for infinite lattices, the model is not adequate. Only sequences of finite lattices could be considered. When the colored mixture model is adequate, the chiral index exists if and only if the inertia is finite and non null. E.g. the Cauchy distribution has no chiral index.

The null inertia occurs if and only if all the mass is concentrated on a single point. This trivial situation should not be related to any achiral or chiral situation, because it can occur either as the limit of a sequence of chiral distributions or as the limit of a sequence of achiral distributions.

The analytical calculation of the chiral index is sometimes feasible. Some examples follow [3-5,7].

- The chiral index of an achiral set or an achiral colored mixture, is null.

- The chiral index of the Bernoulli distribution with parameter p=1-q, is equal to 1-1/2p when p>1/2, and is equal to 1-1/2q when p<1/2.
It is of course null when p=q=1/2.

- The chiral index of a set of colored points in R
^{d}such that not two of them have the same color, is equal to d·V_{d}/T, V_{d}being the smallest eigenvalue of the covariance matrix.

It follows that these sets have the maximal chiral index CHI=1 when d=1, or when the covariance matrix is proportional to the identity matrix.

This latter situation occurs when the points are the vertices of a regular planar polygon, or are the vertices of a full d-dimensional cube, regular d-octahedron, or regular d-simplex.

- The most chiral triangle, i.e. the most chiral set of 3 non colored equally weighted points in the plane, has a chiral index equal to 1-2/5
^{1/2}. This triangle pertains to a family of triangles having a particular geometrical property (see section VII-C in ref.[4]).

- For the non colored model and a fixed rotation, computing CHI is an instance of the Monge-Kantorovitch transportation problem.
Thus, analytical results are available for continuous unidimensional distributions having an invertible distribution function (see chap.3 in ref. [2]).
E.g. the chiral index of an exponential distribution is equal to 1-Pi
^{2}/12.

Computing the chiral index of a set of N equally weighted points in R, colored or not, is simple (see appendix 1 in ref. [5]). Thus, computing CHI for the non colored model is quite easy on a pocket calculator: compute the correlation coefficient between the set of values sorted in inceeasing order and the set sorted in decreasing order, add 1, then divide by 2. Note that this correlation coefficient cannot be positive, and the chiral index is upper bounded by 1/2.

Now, we consider N colored and weighted points in R

When the N colored points are equally weighted, computing the optimal joint density leads to enumerate the permutations of the N points [5,7]. This is practically impossible abouve ten points with the same color. Despite this combinatorial difficulty, there are situations where the combinatoric can be drastically reduced. E.g., many chemists consider that the 3D molecular model of a conformer is also a non directed graph such that the atoms are the vertices and the edges are the chemical bonds. Atoms and bonds have their own colors. It follows that this graph induces constraints on the correspondences allowed between atoms, i.e. a carbon cannot match an oxygen, and an hydrogen of a methyl group cannot match an hydrogen of an hydroxy group. In fact, among the N! correspondences (or permutations, or joint densities), only those generated by the graph automorphisms enumeration are allowed [5]. It is why the chiral index is easily computable for many heavy molecules [7]. As an example of reduction of the combinatoric, consider N equally weighted non colored points, such that the N nodes of the graph define a single ring containing N edges: clearly, there are only 2N graph automorphisms, and the chiral index is computable even for large values of N.

The colored graph model is a generalization of the colored mixture model for which constraints on the joint density are added to the constraints coming from the colors themselves.

The freeware QCM computes the chiral index of a conformer, and is downloadable from the software page.

Many scientists have attempted to define their own chirality measures [8]. In order to compare these measures and evaluate their degree of generality, I propose to look to what they return for the simplest chiral figure: the unidimensional set containing 3 points, so that the ratio of the lengths of the two adjacent segments is the unique parameter of this set (no color here). This parameter is noted "a", and may take any non negative real value. The chiral index is computed from the formula in appendix 1 of [5]:

CHI = (1-a)

E.g., for the set (-1,-1,+2), CHI=1/4, and for the set (+1,+2,+4), CHI=1/28.

Observe the following properties of the function CHI(a):

CHI is function of only the unique parameter of the set, CHI is continuous, CHI(1)=0 and CHI=0 only for the symmetric set, and CHI(a)=CHI(1/a) (due to the invariance through scaling).

For the unidimensional three points set, all these four properties are required to build a safe chirality measure.

As proposed in [9] and [10], those who would like to build their own chirality measure are invited to check if all properties above indeed stand in this very simple situation.

Modeling chirality has required us to work with colored mixtures rather than with random vectors, and calculating the chiral index is intended to evaluate quantitatively how the distribution of a colored mixture differs from that of its inverted image.

A more general problem is to evaluate quantitatively how different are the distributions of any two colored mixtures X and Y. The mixtures are assumed to share the same space of colors, and, as previously, the couple (X,Y) offers a null probability to get different colors for X and Y. The similarity between the distributions is still expressed as a Wasserstein distance:

When each of the two distributions is represented by a set of N colored points, the N colors being all different, and the two sets of N colors being identical, it means that there is a pairwise correspondence between the two sets of points, and the distance is called the pure rotation Procrustes distance. This name comes from a least squares method to superpose by rotation and translation two sets of N points pairwise associated [3]. The optimal translation is get by centering the sets, and the optimal rotation is known in the plane (sect. 3 in ref.[5]) and in the 3D space (appendix of ref.[4]). When all colors are identical, we have the pure rotation Procrustes algorithm without prefixed correspondence.

More details about the applications of the colored mixture model: see the page about Procrustes Methods, Shape Recognition, Similarity and Docking

Chirality is known to be related to indirect symmetry, and has been quantified to evaluate how a set is similar to its inverted image, via an optimal superposition method. Now, how to quantify direct symmetry? In other words, how a set is similar to itself? For a finite number of equally weighted colored points, a direct symmetry index has been defined [4]. Unfortunately, when the number of points tends to be infinite, the trivial optimal superposition of the distribution on itself is unavoidable. The problem may be reformulated as a local minima problem, but this latter is much more difficult to solve.

See also the general definition of symmetry measures to understand why direct symmetry measures are difficult to exhibit.

A transdisciplinary review of methods dealing with chirality and symmetry measures has been published [8].

Generating a symmetric figure by perturbation of a non-symmetric one is, in general, an open problem. For a finite set of colored points, there are simple conditions to get an achiral figure by averaging the set and its optimally superposed mirror image [11].

A relation between chirality and gravitation has been proposed: see the parity Eötvös experiment

The relation between random and order/disorder may be enlighted by symmetry arguments. E.g., assume that N random points are generated following the uniform law over a segment. It is shown that, in this situation, more random points there are, more symmetry there is (see section 6 in [8], or [9]). Now, it is commonly thought that more symmetry there is, more order there is, meaning that more random points there are, more order there is. But it is also commonly though that more random points there are, more disorder there is. Here is the paradox... Theory

You know how it is difficult for the teacher to draw a scalene triangle on the blackboard: most are nearly isosceles or nearly rectangle. Thus I tried to define what could be the "most scalene" triangle. Of course there is an infinite number of criteria to do that. I retained two of them, based on the present theory:

- The least isosceles one (thus, cannot be equilateral): however it appears to be an obtuse triangle and thus may be not satisfactory. See Fig. 1 there.
- The least isosceles one which is not obtuse, and so not rectangle. For that, I considered the disphenoids, also called isoceles or equifacial tetrahedra. These tetrahedra have their four faces congruent to the same triangle, and it is known that this triangle cannot be obtuse. A right triangle defines a degenerated tetrahedron, which is thus achiral. An isosceles triangle defines an achiral tetrahedron, too. Thus, I proposed to retain the triangular face of the most chiral disphenoid as the required "most scalene triangle". See Fig. 2 there. Planar coordinates of this triangle are explicited in [12].

Thank you for having read parts or all the content of this page. The main apects of the theory appear also in the PDF of my invited lectures at MaxEnt 2008 in São Paulo [13], and at the Workshop on Rigidity and Symmetry at the Fields Institute in 2011 [14]. A paper summarizing the main results was published in 2013 in the Proceedings of the SOR13 International Symposium [10]. If you have remarks, criticisms, suggestions, ideas, or you want to discuss about some aspects of chirality or symmetry, please email to me (

- RACHEV S.T.

*Probability Metrics and the Stability of Stochastic Models.*

Wiley, New-York 1981.

- RACHEV S.T., RÜSCHENDORF L.

*Mass Transportation Problems.*Vol. I and II.

Springer-Verlag, New-York 1998.

- PETITJEAN M.

*Chiral Mixtures.*

J. Math. Phys. 2002,**43**[8],4147-4157. (DOI 10.1063/1.1484559)

Copyright AIP (American Institute of Physics), 2002. Download PDF paper (for personal use only. Other uses: see copyright information)

- PETITJEAN M.

*On the Root Mean Square Quantitative Chirality and Quantitative Symmetry Measures.*

J. Math. Phys. 1999,**40**[9],4587-4595. (DOI 10.1063/1.532988)

Copyright AIP (American Institute of Physics), 1999. Download PDF paper (for personal use only. Other uses: see copyright information)

- PETITJEAN M.

*About Second Kind Continuous Chirality Measures. 1. Planar Sets.*

J. Math. Chem. 1997,**22**[2-4],185-201.

(DOI 10.1023/A:1019132116175)

- PETITJEAN M.

*Chiralité quantitative: le modèle des moindres carrés pondérés.*

Compt. Rend. Acad. Sci. Paris, série IIc 2001,**4**[5],331-333.

(DOI 10.1016/S1387-1609(01)01241-5)

- PETITJEAN M.

*Calcul de chiralité quantitative par la méthode des moindres carrés.*

Compt. Rend. Acad. Sci. Paris, série IIc 1999,**2**[1],25-28.

(DOI 10.1016/S1387-1609(99)80034-6)

- PETITJEAN M.

*Chirality and Symmetry Measures: A Transdisciplinary Review.*

Entropy 2003,**5**[3],271-312 (Open Access paper).

(DOI 10.3390/e5030271; Zbl 1078.00503)

- PETITJEAN M.

*Minimal Symmetry, Random and Disorder.*

Symmetry: Culture and Science 2006,**17**[1-2],197-205.

Download PDF file of the associated lecture at the Symmetry Festival 2006.

- PETITJEAN M.

*The Chiral Index: Applications to Multivariate Distributions and to 3D molecular graphs.*

Proceedings of 12th International Symposium on Operations Research in Slovenia, SOR'13, pp.11-16,

Dolenjske Toplice, Slovenia, 25-27 September 2013.

L. Zadnik Stirn, J. Zerovnik, J. Povh, S. Drobne, A. Lisec, Eds.

Slovenian Society Informatika (SDI), Section for Operations Research (SOR), Ljubljana, 2013.

(ISBN 978-961-6165-40-2)

Download PDF paper in preprint form (posted with permission from Society Informatika, Section for Operations Research)

Download PDF file of the lecture.

- PETITJEAN M.

*À propos de la référence achirale.*

Compt. Rend. Chim. 2006,**9**[10],1249-1251.

(DOI 10.1016/j.crci.2006.03.003)

- BOUISSOU C., PETITJEAN M.

*Asymmetric exchanges.*

JIMIS 2018,**4**,1-18.

(DOI 10.18713/JIMIS-230718-4-1)

- PETITJEAN M.

*An Asymmetry Coefficient for Multivariate Distributions.*

MaxEnt 2008, 6-11 July 2008, Boracéia, São Paulo.

(28th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering)

Download PDF file of the lecture.

- PETITJEAN M.

*Chirality and Symmetry Measures: Some Open Problems.*

Workshop on Rigidity and Symmetry, 17-21 October 2011, The Fields Institute, Toronto. Abstracts.

Download PDF file of the lecture.