Plotting the number of molecules having a given number of carbon atoms shows that the even values are preferred to the odd values in large chemical files. This parity phenomenon was discovered in 1990 [1,2]. It does not mean that most chemists around the world have cooperated in order to obtain this. A semi quantitative explanation of the phenomenon has been proposed [2]. It is based on graph theory, and on the statistical occurrences of the elements of the periodic table. |
|
The distribution of the chemical elements (atoms) in a chemical database can be visualized graphically. The periodic table of the elements is viewed as a rectangular array, and each element of this array contains the occurrence of its associated atom. The array is submitted to a factorial correspondence analysis. Thus, each atom of the Mendeleev table get coordinates in a 6-dimensional space: this is the "Graphic Mendeleev Table" [1,3]. In order to visualize the Graphic Mendeleev Table, the successive convex hulls of the points are drawn in the principal planes [4,5]. |
|
There are little statistical tools to perform exploratory data analysis in graphs databases.
The basic observation is the graph, but it may be also a connex component, or simply a node, an edge, a cycle, a path, a concentric layer, etc. [2].
Many univariate and multivariate distributions may be generated from these populations.
One of the most useful distributions is the number of connex components having a given radius R and a given diameter D.
It is recalled that the radius is the lower bound of the eccentricities of the nodes of a connex component, and the diameter is the upper bound, the eccentricity of a node being the upper bound of the distances from the node to all the nodes of the connex component.
It is known that D takes values between R and 2R, such that the bivariate distribution in the (R,D) plane takes place in an angular sector limited by the two lines D=R and D=2R.
Displaying the clusters in this bivariate distribution offers a schematic graphical summary of the population which is called the Radius Diameter Diagram [6,7].
|
|
[6.6]Chiralane (C27H28) is a polycycloalkane designed in 2004 by
A. Schwartz [13].
|
|
; downloads, prints and copies are for subscribers only)