Skip to content

DiagSet: a dataset for prostate cancer histopathological image classification

Notifications You must be signed in to change notification settings

michalkoziarski/DiagSet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

DiagSet: a dataset for prostate cancer histopathological image classification

Description

The dataset consists of three different partitions: DiagSet-A, containing over 2.6 million tissue patches extracted from 430 fully annotated scans; DiagSet-B, containing 4675 scans with assigned binary diagnosis; and DiagSet-C, containing 46 scans with diagnosis given independently by a group of histopathologists.

DiagSet-A samples Samples from DiagSet-A dataset, containing patches from different tissue classes extracted at different magnifications.

Access

The data is publicly available and can be accessed by anyone (after registration) at https://ai-econsilio.diag.pl. Note that after registration site administrator will need to activate your account, which should typically take less than 24 hours.

In case of any problems with accessing the website you can either email the website administrator directly or, if that fails, create an issue here.

DiagSet-A container

For easier access to the data we prepared a Python container for loading the patches from raw data files. It can be found here.

Publication

More detailed description about the dataset and the conducted experiments can be found at https://www.nature.com/articles/s41598-024-52183-4. To cite the dataset, you can use:

@article{koziarski2024diagset,
  title={DiagSet: a dataset for prostate cancer histopathological image classification},
  author={Koziarski, Micha{\l} and Cyganek, Bogus{\l}aw and Niedziela, Przemys{\l}aw and Olborski, Bogus{\l}aw and Antosz, Zbigniew and {\.Z}ydak, Marcin and Kwolek, Bogdan and W{\k{a}}sowicz, Pawe{\l} and Buka{\l}a, Andrzej and Swad{\'z}ba, Jakub and others},
  journal={Scientific Reports},
  volume={14},
  number={1},
  pages={6780},
  year={2024},
  publisher={Nature Publishing Group UK London}
}

About

DiagSet: a dataset for prostate cancer histopathological image classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages