GMTKN55 Benchmark Evaluator

This project provides a Python-based evaluation framework for computing WTMAD-2 and other statistical metrics on the GMTKN55 benchmark suite. It reads the .res files of each subset, filters molecules by chemical constraints, and computes evaluation metrics such as WTMAD-2 and the mean absolute error (MAE).

📦 Features

Automatically parses GMTKN55 benchmark subsets from a local filesystem
Filters molecules based on charge, number of unpaired electrons, and required/allowed elements
Evaluates reactions using .res or .resRC files and a user-specified method
Computes WTMAD-2 and per-category metrics (e.g. small reactions, NCI, barrier heights)
Exports results to CSV if requested
Includes a progress bar and detailed verbosity levels
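
To make the filtering criteria concrete, here is a minimal sketch of how such a filter could work. The Molecule class, the passes_filter function, and the exact semantics (all atoms must be in the allowed set, every "required-all" element must be present, at least one "required-one" element must be present) are assumptions for illustration and are not taken from the project's source code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Molecule:
    """Hypothetical container; the project's own data structures may differ."""
    name: str
    atomic_numbers: tuple[int, ...]
    charge: int = 0
    uhf: int = 0  # number of unpaired electrons


def passes_filter(mol, *, allowed=None, required_all=(), required_one=(),
                  min_charge=None, max_charge=None, max_uhf=None):
    """Return True if the molecule satisfies every constraint that is set."""
    elements = set(mol.atomic_numbers)
    # Assumed semantics: every atom must belong to the allowed set.
    if allowed is not None and not elements <= set(allowed):
        return False
    # Assumed semantics: all "required-all" elements must be present.
    if not set(required_all) <= elements:
        return False
    # Assumed semantics: at least one "required-one" element must be present.
    if required_one and not elements & set(required_one):
        return False
    if min_charge is not None and mol.charge < min_charge:
        return False
    if max_charge is not None and mol.charge > max_charge:
        return False
    if max_uhf is not None and mol.uhf > max_uhf:
        return False
    return True


water = Molecule("h2o", (8, 1, 1))
methyl = Molecule("ch3", (6, 1, 1, 1), uhf=1)
print(passes_filter(water, required_one=(8, 7)))
print(passes_filter(methyl, max_uhf=0))
```

Under these assumptions, a reaction would presumably be kept only if all of its participating molecules pass the filter; how eval.py actually handles this is not shown here.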

🛠 Requirements

Install dependencies using conda:

conda env create -f environment.yaml
conda activate gmtkn55-env

The main dependencies are:

Python ≥ 3.12
numpy
pandas
tqdm

📁 Project Structure

GMTKN55/
├── eval.py                  # Main entry point for evaluating subsets
├── utils/                   # Contains all Python source code beyond the central eval.py script
│   ├── __init__.py
│   ├── statistics.py        # WTMAD-2 and statistical calculations
│   ├── constants.py         # Constant data
│   └── ...                  # Further Python source files
├── ACONF/                   # Expected location of GMTKN55 subset folders
├── ADIM6/                   # ...
├── .../                     # ...
├── environment.yaml         # Conda environment specification
└── README.md                # This file

🚀 Usage

Run the evaluation on your local GMTKN55 directory:

python eval.py --method YOUR_METHOD_NAME --verbosity 1 --write-to-csv

Further optional arguments

--allowed-elements '1-86'
--required-elements-all '6,1'
--required-elements-one '8,7'
--min-charge -1
--max-charge 2
--max-uhf 2
--format 13 (format of the .res files; default: 13)

Example:

python eval.py --method mydft --verbosity 2 --write-to-csv --allowed-elements '1-20'
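
The element arguments above take atomic numbers either as a range ('1-86') or as a comma-separated list ('6,1'). A minimal sketch of how such a string could be turned into a set of atomic numbers is shown below. The function name parse_element_spec is hypothetical, and support for mixing ranges and single values in one string is an assumption; the parser in eval.py may behave differently.

```python
def parse_element_spec(spec: str) -> set[int]:
    """Parse a string such as '1-20' or '6,1' into a set of atomic numbers."""
    elements: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            # Inclusive range, e.g. '1-20' covers 1 through 20.
            start, end = (int(x) for x in part.split("-", 1))
            if start > end:
                raise ValueError(f"invalid range: {part!r}")
            elements.update(range(start, end + 1))
        else:
            elements.add(int(part))
    return elements


print(sorted(parse_element_spec("6,1")))
print(len(parse_element_spec("1-86")))
```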

📊 Output

With --write-to-csv, the script writes a file named <args.format>.csv containing the following columns:

Subset
Reaction
Stochiometry
ReferenceValue
MethodValue

Statistics

The script prints:

Overall WTMAD-2
WTMAD-2 per category: Small Reactions, Larger Reactions, Barrier Heights, Intermolecular NCI, Intramolecular NCI
Optionally: Mean Absolute Error (MAE) per subset
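
For orientation, WTMAD-2 as defined in the original GMTKN55 publication weights the mean absolute deviation (MAD) of each subset by its number of reactions and by the ratio of 56.84 kcal/mol to the subset's mean absolute reference energy. The sketch below implements that published formula; the SubsetStats class and the wtmad2 function are hypothetical names, and the code in utils/statistics.py may be organized differently.

```python
from dataclasses import dataclass

# Average of the mean absolute reference energies over all 55 subsets,
# in kcal/mol, as used in the published WTMAD-2 definition.
MEAN_ABS_REF_ALL = 56.84


@dataclass(frozen=True)
class SubsetStats:
    """Hypothetical per-subset summary used only in this sketch."""
    n_reactions: int     # number of reactions in the subset
    mean_abs_ref: float  # mean |reference energy| of the subset (kcal/mol)
    mad: float           # mean absolute deviation of the method (kcal/mol)


def wtmad2(subsets):
    """WTMAD-2 = sum_i N_i * (56.84 / mean|dE|_i) * MAD_i / sum_i N_i."""
    total = sum(s.n_reactions for s in subsets)
    weighted = 0.0
    for s in subsets:
        weight = MEAN_ABS_REF_ALL / s.mean_abs_ref
        weighted += s.n_reactions * weight * s.mad
    return weighted / total


# Made-up numbers chosen so the result is easy to check by hand.
demo = [
    SubsetStats(n_reactions=10, mean_abs_ref=56.84, mad=1.0),
    SubsetStats(n_reactions=30, mean_abs_ref=28.42, mad=0.5),
]
print(wtmad2(demo))
```

Applying the same function to only the subsets of one category would be one way to obtain a per-category value, although the script may normalize these differently.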

👥 Authors

Marcel Müller
Contributions welcome!
