HDF5 Schema for ChemGP Plots¶
HDF5 Layout¶
ChemGP Rust examples write HDF5 files with a fixed layout. The plt-gp
subcommands each read specific groups and attributes from this schema.
Top-Level Groups¶
Group |
Contents |
|---|---|
|
2D arrays with attrs |
|
Group of same-length 1D arrays (DataFrame columns) |
|
Ordered point sequences ( |
|
Point sets ( |
Root attrs |
Metadata scalars ( |
Group Details¶
grids¶
Each grid is a 2D float64 dataset. Attributes store axis info:
x_range[x_min, x_max](float64[2])y_range[y_min, y_max](float64[2])x_lengthNumber of x grid points (int)
y_lengthNumber of y grid points (int)
Named grids: energy (true PES), gp_mean_N (GP at N training points),
variance, nll, gradient_norm.
table¶
A group whose children are same-length 1D arrays, analogous to DataFrame columns. Common columns:
oracle_callsCumulative oracle evaluations
max_fatom,max_force,force_norm,ci_forceForce convergence metrics
energyTotal energy at each step
methodString array identifying the optimizer
d_rff,energy_mae,gradient_maeRFF sweep data
paths¶
Each path is a group with x and y (or rAB and rBC) arrays tracing a
reaction path on the 2D surface.
points¶
Each point set is a group with coordinate arrays:
x,yCartesian coordinates
pc1,pc2PCA projections (for FPS scatter)
Named sets: minima, saddles, endpoints, training, selected, pruned.
Schema per Plot Type¶
Subcommand |
Required groups/attrs |
|---|---|
convergence |
|
surface |
|
quality |
|
rff |
|
nll |
|
sensitivity |
|
trust |
|
variance |
|
fps |
|
profile |
|
landscape |
.con files in |
Filename Conventions¶
The surface and quality subcommands auto-detect energy clamping from the
input filename:
Pattern in filename |
|
|
|---|---|---|
|
-200 |
50 |
|
-5 |
5 |
(other) |
data min |
data max |