A Bayesian hypothesis assessment framework, tailored to comparing pairs of experiments. This package is preliminary
implementation statistical analyses described in this paper. This can also be seen as a complimentary to
testSignificanceNLP package which contains popular Frequentist tests.
We focus on the setup where the performances of two algorithms, applied on a certain dataset, are provided based on an evaluation measure. The comparison is done by studying the posterior probabilities (rather than binary commonly-used binary decisions in other tests.)
For further discussion of the approaches implemented here (and their comparison to other techniques), refer to the paper or the extended manual.
For running this code, you will need Python (version >=3.6). You have two options for installation:
- If you don't want to modify any code, you can just install the package and use it in commandline:
> pip install HyBayes
- Alternatively, if you'd like to make modifications to the code (e.g., the underlying model) you can clone this project. Before running the code, make sure that you have all the requirements by running the following line to install all the necessary dependencies:
> pip install -r requirements.txt
In case you have issue with installing GraphViz, consider using conda installer:
> conda install -c anaconda graphviz --yes
If you are using MacOS, brew
command can used:
> brew install graphviz
To analyze your data, you need to prepare the a configuration file that specifies the information needed for the analysis (e.g., the address to your observation files, the type of plots and files you want to be stored).
You can find more details on this in the extended manual. Additionally, there are examples of complete configurations in configs
folder, after running make_configs.py
.
Let's assume that you have followed the above instructions and:
- (1) you have included the performances observations in two separate files,
- (2) and you have prepared a config file.
To run the analysis, you pass the name of the config file as argument:
> python -m HyBayes --config my_config_file.ini --verbose
When the flag --verbose
is on, the details of the analysis will be printed in standard output.
This flag does not affect the log file that is stored along side other outputs.
And here is the general usage template which can be accessed using --help
flag at any time:
> python -m HyBayes --help
usage: __main__.py [-h] [-c CONFIG] [-v] [--make_configs] [--make_data]
Run Bayesian Statistics Tailored towardsanalysing the experiment results
specially in NLP area.Email [email protected] for comments.
optional arguments:
-h, --help show this help message and exit
-c CONFIG, --config CONFIG
address of Config file
-v, --verbose prints the report of the steps
--make_configs if on, example configuration files will be made.
--make_data if on, example data files will be made.
To demonstrate everything with a quick example, run the following command to generate toy data:
> python -m HyBayes --make_data
You can see that some artificial data is made in directory artificial_data
. Moreover either use the config files provided in the repository [], or run:
> python -m HyBayes --make_configs
And then execute the analysis on any of the newly made config files. For example:
> python -m HyBayes --config configs/config_metric.ini --verbose
If you want to learn more about this package, please refer to the extended manual. If you want to learn about the concepts discusssed here, refer to the paper below.
This code is published under Apache 2.0 license.
If you used this package in your research feel free to cite the following paper:
@inproceedings{hypothesisAssessment20,
title={Not All Claims are Created Equal:Choosing the Right Statistical Approach to Assess Hypotheses},
author={{Sadeqi Azer}, Erfan and Khashabi, Daniel and Sabharwal, Ashish and Roth, Dan},
booktitle={Annual Meeting of the Association for Computational Linguistics (ACL)},
year={2020}
}