Command Line Interface¶
Documentation for the command line interface of SEB.
CLI¶
run¶
Runs the Benchmark either on specified models or on all registered models. Can save the benchmark's results, but also displays them in a table similar to the official website.
Examples: To run all models on all languages and tasks:
{bash} seb run
To run a model on all languages and tasks:
seb run -m sentence-transformers/all-MiniLM-L6-v2
To run multiple models: To run a model on all languages and tasks:
seb run -m sentence-transformers/all-MiniLM-L6-v2,sentence-transformers/all-mpnet-base-v2
if you only want to limit it to a subset of languages or tasks you can use the --languages and --tasks flags.
# Running a model on a subset of languages
seb run sentence-transformers/all-MiniLM-L6-v2 -o results/ -l nb,nn
# Running a model on a subset of tasks
seb run sentence-transformers/all-MiniLM-L6-v2 -o results/ -t DKHate,ScaLA
| Argument | Type | Description | Default |
|---|---|---|---|
--models, -m |
Optional[list[str], NoneType] |
Model names or paths. If a model is not registrered in SEB it will be loaded using SentenceTransformers. If none are specified the whole benchmark is run. | None |
--output-path, -o |
Path |
Directory to save all results to. | None |
--languages, -l |
Optional[list[str], NoneType] |
What languages subsection to run the benchmark on. If left blank it will run it on all languages. | None |
--tasks, -t |
Optional[list[str], NoneType] |
What tasks should model be run on. Default to all tasks within the specified languages. | None |
--ignore-cache |
bool |
Ignores caches models. Note that SEB ships with an existing cache. You can set the cache_dir using the environmental variable SEB_CACHE_DIR | False |
--ignore-errors |
bool |
Should errors be ignored when running a model on a benchmark task. | False |
--code, -c |
Path |
Code to run before executing benchmark. Useful for adding custom model to registries. | None |
--logging-level |
str |
Logging level for the benchmark. | 'INFO' |