Skip to content

Commit aedc10b

Browse files
updated notes
1 parent 09a1982 commit aedc10b

File tree

3 files changed

+24
-35
lines changed

3 files changed

+24
-35
lines changed

setup.py

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -17,10 +17,3 @@
1717
"Operating System :: OS Independent",
1818
],
1919
)
20-
21-
# new
22-
# theforce.cl.relax
23-
# theforce.cl.init_model
24-
# theforce.calculator.meta -> Qlvar
25-
# theforce.calculator.gaussian
26-
# theforce.cl.md -> Langevin

theforce/calculator/README.md

Lines changed: 17 additions & 26 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,7 @@ The following parameters can be passed to `ActiveCalculator`
3939
```
4040
# inputs
4141
covariance: either a kernel or path to a saved/pickled model
42+
kernel_kw e.g. {'cutoff': 6.}
4243
calculator: any ASE calculator or SocketCalculator
4344
process_group: None or the value of mpi_init()
4445
meta: meta energy calculator for metadynamics
@@ -51,16 +52,11 @@ test: intervals for single point tests during MD
5152
5253
# sampling and optimization
5354
ediff: energy sensitivity for sampling LCEs
54-
ediff_tot: total energy sensitivity for sampling DFT data
5555
fdiff: forces sensitivity for sampling DFT data
56-
noise_e: bias noise for total energies
5756
noise_f: bias noise for forces
58-
59-
#
60-
ignore_forces: dumps forces data from the regression
6157
```
6258

63-
#### covariance
59+
#### covariance, kernel_kw
6460
This parameter can be used for passing a kernel
6561
or a saved/pickled model to the calculator
6662
```python
@@ -92,6 +88,9 @@ automatically saved and loaded in consecutive
9288
runs.
9389
If `None`, the default kernel will be used
9490
(see *Kernels*) with an empty initial model.
91+
In that case `kernel_kw` can be used for
92+
passing some parameters (e.g. cutoff) to
93+
the kernel instantiation.
9594

9695
#### calculator
9796
The main DFT calculator can which be any ASE
@@ -150,12 +149,11 @@ and the ML predictions will be saved in `'active_ML.traj'`.
150149
A single point calculation is triggered if `test`
151150
steps have passed from the last one.
152151

153-
#### ediff, ediff_tot, fdiff
152+
#### ediff, fdiff
154153
These parameters control the sampling.
155154
`ediff` is mainly used for sampling of the LCEs
156155
as the inducing data for the sparse representation.
157-
`ediff_tot` and `fdiff` control the sampling
158-
of DFT data.
156+
`fdiff` control the sampling of DFT data.
159157
The default parameters should be appropriate for
160158
starting the simulation.
161159
One can set `fdiff` equal to the desired accuracy
@@ -166,25 +164,18 @@ For global exploration, we recommend increasing the
166164
accuracy gradually/iteratively rather than choosing
167165
small values for these parameters from the beginning.
168166

169-
If `ediff_tot=float('inf')`, this parameter becomes
170-
irrelevent, which might be the desired behaviour in
171-
certain applications (similar for `fdiff`).
172-
173-
#### noise_e, noise_f
174-
In optimization of hyper-parameters, the errors
175-
(e.g. RMSE) are minimized towards these values.
176-
They can be set to 0 for simple minimization of
177-
RMSE but there is a chance for overfitting.
178-
For instance choosing smaller `noise_f` may
167+
#### noise_f
168+
In optimization of hyper-parameters, the
169+
mean absolute error (MAE) of forces in fitting
170+
is tuned to this value: MAE$\sim$`noise_f`.
171+
`noise_f` of 0 is also acceptable but
172+
there is a chance for overfitting.
173+
For instance during on-the-fly training,
174+
choosing smaller `noise_f` may
179175
cause more sampling of DFT data without a
180176
meaningful increase of the models accuracy.
181-
The value of 0 maybe used for fitting a static
182-
data set with a high accuracy.
183-
For more control see the following options
184-
* If noise_e = None -> noise_e = ediff_tot (<- default)
185-
* If noise_e < 0 -> RMSE of energies is omitted from the loss function.
186-
* If noise_f = None -> noise_f = fdiff (<- default)
187-
* If noise_f < 0 -> RMSE of forces is omitted from the loss function.
177+
But the value of 0 maybe used for fitting a
178+
static data set whithout any issues.
188179

189180
### Training with existing data
190181
If some DFT data already exists, one can train a

theforce/cl/README.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@ The following tags are available
2525
```
2626
# inputs
2727
covariance: 'pckl', None, a kernal, folder-name for loading a pickled model (default='pckl')
28+
kernel_kw: e.g. {'cutoff': 6.}
2829
calculator: None, 'VASP', 'Gaussian' (default=None)
2930
3031
# outputs
@@ -35,9 +36,7 @@ test: integer; single-point testing intervals (default=None)
3536
3637
# sampling and optimization
3738
ediff: (eV) energy sensitivity for sampling LCEs (default ~ 2 kcal/mol)
38-
ediff_tot: (eV) total energy sensitivity for sampling DFT data (default ~ 4 kcal/mol)
3939
fdiff: (eV/A) forces sensitivity for sampling DFT data (default ~ 3 kcal/mol)
40-
noise_e: (ev) bias noise for total energies (default=ediff_tot)
4140
noise_f: (ev/A) bias noise for forces (default=fdiff)
4241
```
4342
Note that these parameters are not related to any
@@ -64,6 +63,12 @@ Thus the training is resumed in consecutive runs
6463
(the model is automatically loaded and saved in
6564
the `pckl` folder).
6665

66+
At the beginning of training, if covariance
67+
is not given, the default kernel will be
68+
instantiated.
69+
In this case `kernel_kw` can be used for
70+
defining kernel parameters (e.g. cutoff).
71+
6772
After sufficient training, one might want to use
6873
the result ML potential for fast simulations
6974
without further training.

0 commit comments

Comments
 (0)