Add new category containing interstitial benchmarks#337
Add new category containing interstitial benchmarks#337zhonganr wants to merge 7 commits intoddmms:mainfrom
Conversation
|
Thanks for adding this and sharing the data! It's looking great so far! |
e526ac1 to
fbf8301
Compare
ElliottKasoar
left a comment
There was a problem hiding this comment.
Thanks for addressing so much, @zhonganr, it's looking great!
I've left a few more minor questions/suggestions.
Also note that there are a few conflicts to be resolved with the main branch. Can you try rebasing? We are happy to help with this if you need it.
- Rename FE1SIA to Defectstab across all files and directories - Rename category from interstitial to defect - Split shared DB.zip into separate Defectstab.zip and Relastab.zip - Rename data folders to match benchmark names (Defectstab/, Relastab/) - Add default charge and spin for Orb model compatibility - Replace silent fallbacks with ValueError for unparseable reference energies - Update documentation references (relative_stability -> Relastab) - Remove debug print statements from analysis files
fbf8301 to
8e63dd9
Compare
ElliottKasoar
left a comment
There was a problem hiding this comment.
Thanks for responding to all the comments! It's looking really good!
Just a few more minor suggestions/questions and I think we'll be good to merge.
ElliottKasoar
left a comment
There was a problem hiding this comment.
I think other than the discussion around the metrics/thresholds, this is pretty much good to go!
One other note: I think your documentation was deleted during one of your commits. If you could add back this back in (and update it following whatever decision we make regarding metrics) that would be great!
Follow GMTKN55 pattern: each subset gets its own column in the metrics
table with its own threshold in metrics.yml (weight=0). Total metric
columns keep weight=1 and drive the benchmark score.
- Remove subset dropdowns and update_table_data callbacks from both apps
- Remove inner @build_table loops saving per-subset JSON files
- metrics fixture now returns {total_col: ..., subset_col: ...} directly
- Add per-subset entries to both metrics.yml files with appropriate
good/bad thresholds and weight=0
ElliottKasoar
left a comment
There was a problem hiding this comment.
Thanks for addressing the suggestions so quickly, @zhonganr!
I just spotted a few final minor things we may want to tidy, but otherwise I think we'll probably be good to merge!
| level_of_theory: DFT | ||
| weight: 1 | ||
| fe_sia: | ||
| good: 0.0 | ||
| bad: 1.32 | ||
| unit: eV | ||
| tooltip: "RMSD of formation energies for Fe SIA subset" | ||
| level_of_theory: DFT | ||
| weight: 0 | ||
| boroncarbide_stoichiometry: | ||
| good: 0.0 | ||
| bad: 11.09 | ||
| unit: eV | ||
| tooltip: "RMSD of formation energies for boron carbide stoichiometry subset" | ||
| level_of_theory: DFT | ||
| weight: 0 | ||
| boroncarbide_defects: | ||
| good: 0.0 | ||
| bad: 0.57 | ||
| unit: eV | ||
| tooltip: "RMSD of formation energies for boron carbide defects subset" | ||
| level_of_theory: DFT | ||
| weight: 0 | ||
| mapi_tetragonal: | ||
| good: 0.0 | ||
| bad: 0.39 | ||
| unit: eV | ||
| tooltip: "RMSD of formation energies for MAPI tetragonal subset" | ||
| level_of_theory: DFT |
There was a problem hiding this comment.
| level_of_theory: DFT | |
| weight: 1 | |
| fe_sia: | |
| good: 0.0 | |
| bad: 1.32 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for Fe SIA subset" | |
| level_of_theory: DFT | |
| weight: 0 | |
| boroncarbide_stoichiometry: | |
| good: 0.0 | |
| bad: 11.09 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for boron carbide stoichiometry subset" | |
| level_of_theory: DFT | |
| weight: 0 | |
| boroncarbide_defects: | |
| good: 0.0 | |
| bad: 0.57 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for boron carbide defects subset" | |
| level_of_theory: DFT | |
| weight: 0 | |
| mapi_tetragonal: | |
| good: 0.0 | |
| bad: 0.39 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for MAPI tetragonal subset" | |
| level_of_theory: DFT | |
| level_of_theory: PBE | |
| weight: 1 | |
| fe_sia: | |
| good: 0.0 | |
| bad: 1.32 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for Fe SIA subset" | |
| level_of_theory: PBE | |
| weight: 0 | |
| boroncarbide_stoichiometry: | |
| good: 0.0 | |
| bad: 11.09 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for boron carbide stoichiometry subset" | |
| level_of_theory: PBE | |
| weight: 0 | |
| boroncarbide_defects: | |
| good: 0.0 | |
| bad: 0.57 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for boron carbide defects subset" | |
| level_of_theory: PBE | |
| weight: 0 | |
| mapi_tetragonal: | |
| good: 0.0 | |
| bad: 0.39 | |
| unit: eV | |
| tooltip: "RMSD of formation energies for MAPI tetragonal subset" | |
| level_of_theory: PBE |
Generally we try to give the functional, since it's useful to know when a model has been trained on data using the same functional or not
There was a problem hiding this comment.
Thanks for the remark, indeed it's better.
The functionals used in Defectstab are not all PBE, but I've specified the corresponding functional for each subset.
There was a problem hiding this comment.
Ah I see, sorry I missed that, and thanks for fixing it!
Do you think it's still meaningful to combine them as an average then?
|
|
||
| Reference data: | ||
|
|
||
| * Computed from the DFT total energies provided with the input structures. |
There was a problem hiding this comment.
Could you add a note here either restating the functionals again or noting that the specific functionals are specified with the metrics above?
|
|
||
| Reference data: | ||
|
|
||
| * Computed from the DFT total energies provided with the input structures. |
There was a problem hiding this comment.
Could you add a note here either restating the different functionals again or noting that the specific functionals are specified with the metrics above?
Add a new category
interstitialto assess the models' predictive performance for interstitial defect properties. Two benchmarksFE1SIAandRelastabare included:FE1SIAevaluates the formation energy of a single self-interstitial atom (SIA) in a host lattice for distinct configurations.Relastabevaluates the ability of models to correctly rank the stability of different interstitial configurations.Related to Interstitial benchmarks #339