This implementation covers all the current state-of-the-art evaluation metrics for Generative Adversarial Networks (GANs), including: Inception Score (IS), Fréchet Inception Distance (FID), Kernel Inception Distance (KID), and Precision and Recall (PR).
Although there are many existing implementations of these metrics in other repositories, they have different usage and requirements, which may cause inconvenience during the evaluation process. This package aims to simplify the usage by wrapping all the metrics into one setting.
Inception Score (IS) evaluates the reality and diversity of generated images based on the Kullback-Leibler (KL) divergence between the probability outputs of the Inception model for each fake image and the marginal distribution (the cumulative distribution of all samples). The original Tensorflow implementation can be found here.
Fréchet Inception Distance (FID) compares the Fréchet distance between two datasets by comparing the mean and covariance of the features extracted from the last average pooling layer of the Inception model. The original Tensorflow implementation can be found here.
Kernel Inception Distance (KID) compares the Maximum Mean Discrepancy (MMD) between two datasets by comparing the features extracted using the same method as FID. The original Tensorflow implementation can be found here.
Precision and Recall (PR) of GANs evaluate the quality and diversity of generated images, respectively. The method uses VGG16 as the feature extractor, and the real and fake domains are decided by k-nearest neighbor in the Euclidean space. The precision and recall values are calculated by the intersection between the two domains. The original Tensorflow implementation can be found here.
The Inception and FID models have been modified from the unofficial Pytorch FID implementation. The VGG model uses the timm package, and the implementation of IS, FID, and PR has been modified from the Pytorch StyleGAN2-ADA.
Install from pip:
pip install gans-eval
Requirements:
- numpy
- opencv-python
- pillow
- scipy
- tqdm
- timm
- torch>=1.7.0
- torchvision>=0.8.0
Except for IS, other metrics need two individual datasets to process, the simple usage:
gans-eval --fake-dir path_to_syn_dataset --real-dir path_to_real_dataset --metrics is fid kid pr
All available arguments:
-
--fake-dir
Synthesis dataset directory. -
--real-dir
Real dataset directory. -
--data-name
the statistic features of dataset will be save as pickle file in['HOME']/.cache/gans_metrics/dataset_name.pkl
. Default:default
. -
--save-json
path to save result as json file. Default:None
. -
--metrics
evaluation method. Available optionsis, fid, kid, pr
. Default:fid
. -
--num-splits
parameter to calculate IS, number of divisions to calculate statistics. This parameter together with the number of samples can cause the IS to change. Default:10
. -
--max-real
parameter to calculate KID, the maximum number of real samples. Default:1000000
. -
--num-subsets
parameter to calculate KID, the number of subsets while compute KID. Default:100
. -
--max-subset-size
parameter to calculate KID, the maximum number of sample in one subset. Default:1000
. -
--k-nearest
parameter to calculate PR, the kth nearest neighbor. Default:3
. -
--row-batch-size
parameter to calculate PR, number of samples per once Euclidean distance computing. Default:10000
. -
--col-batch-size
parameter to calculate PR, number of samples per once Euclidean distance computing. Default:10000
. -
--dims
as pytorch-fid package, there are four available positions to extract features from Inception module. Available options:64, 192, 768, 2048
. Default:2048
.- 64: first max pooling features.
- 192: second max pooling features.
- 768: pre-aux classifier features.
- 2048: final average pooling features.
-
--batch-size
Batch size. Default: 50. -
--num-workers
The number of workers use for dataloder. Default: 8. -
--not-save-cache
Store true parameter. Call it if you don't want to save dataset statistic as cache. -
--verbose
Store true parameter. Call it if you want to see the process bar while computing statistics information from the image folder.