This is the official repository for the paper named in the title.
Multiple studies within the medical field have highlighted the remarkable effectiveness of convolutional neural networks for predicting medical conditions, sometimes even surpassing that of medical professionals. Despite their strong performance, convolutional neural networks operate as black boxes, potentially arriving at correct conclusions for incorrect reasons or areas of focus. Our work explores the possibility of mitigating this phenomenon by identifying and occluding confounding variables within images. Specifically, we focused on the prediction of osteopenia, a serious medical condition, using the publicly available GRAZPEDWRI-DX dataset. After detecting the confounding variables in the dataset, we generated masks that occlude the regions of the images associated with those variables. By doing so, models were forced to focus on different parts of the images for classification. Model evaluation using F1-score, precision, and recall showed that models trained on non-occluded images typically outperformed models trained on occluded images. However, a test in which radiologists had to choose a model based on the focused regions extracted by the Grad-CAM method showed a different outcome: the radiologists' preference shifted towards models trained on the occluded images. These results suggest that while occluding confounding variables may degrade model performance, it enhances interpretability, providing more reliable insights into the reasoning behind predictions.
Mateo Mikulić, Dominik Vičević, Eszter Nagy, Mateja Napravnik, Ivan Štajduhar, Sebastian Tschauner, and Franko Hržić
To test our minimal working example, please clone this repository:
git clone https://github.com/mikulicmateo/osteopenia.git
Download our models from the provided URL.
Place the models in the OsteopeniaMinimumWorkingExample/models folder.
You can create a virtual environment with:
python3 -m venv venv
Don't forget to activate it:
source venv/bin/activate
We prepared minimum_requirements.txt to get you started as quickly as possible.
Just run:
pip install -r minimum_requirements.txt
To test our minimal working code example, just open OsteopeniaMinimumWorkingExample/ExampleNotebook.ipynb.
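If you prefer a scripted version of the minimal example, the sketch below shows the general idea: load one of the downloaded checkpoints and visualize where the model focuses using Grad-CAM, as done in the paper. The checkpoint name (osteopenia_model.pt), the ResNet-18 backbone, the single-logit head, and the preprocessing are illustrative assumptions only; adjust them to match the actual models.

```python
# Minimal Grad-CAM sketch. Assumptions (not the paper's exact setup):
# a ResNet-18 checkpoint named "osteopenia_model.pt" with a single-logit
# head and 224x224 three-channel inputs.
import torch
import torch.nn.functional as F
from torchvision import models, transforms
from PIL import Image

model = models.resnet18()
model.fc = torch.nn.Linear(model.fc.in_features, 1)  # assumed binary head
state = torch.load("OsteopeniaMinimumWorkingExample/models/osteopenia_model.pt",
                   map_location="cpu")
model.load_state_dict(state)
model.eval()

# Capture the activations and gradients of the last convolutional block.
feats, grads = {}, {}
model.layer4.register_forward_hook(lambda m, i, o: feats.update(a=o))
model.layer4.register_full_backward_hook(lambda m, gi, go: grads.update(g=go[0]))

preprocess = transforms.Compose([
    transforms.Grayscale(num_output_channels=3),  # X-rays are single-channel
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
])
x = preprocess(Image.open("example_xray.png")).unsqueeze(0)

score = model(x)[0, 0]  # logit for the osteopenia class
model.zero_grad()
score.backward()

# Weight each feature map by its average gradient, then ReLU and normalize.
w = grads["g"].mean(dim=(2, 3), keepdim=True)
cam = F.relu((w * feats["a"]).sum(dim=1, keepdim=True))
cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # heatmap in [0, 1]
```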
To reproduce our results, please clone this repository:
git clone https://github.com/mikulicmateo/osteopenia.git
Download the original dataset (dataset paper: Nagy, E., Janisch, M., Hržić, F. et al. A pediatric wrist trauma X-ray dataset (GRAZPEDWRI-DX) for machine learning. Sci Data 9, 222 (2022). https://doi.org/10.1038/s41597-022-01328-z).
You can create a virtual environment with:
python3 -m venv venv
Don't forget to activate it:
source venv/bin/activate
We prepared all_requirements.txt to get you started as quickly as possible.
Just run:
pip install -r all_requirements.txt
Extract all the images from the downloaded original dataset into the dataset/images folder.
Extract all the *.json annotations from the downloaded dataset (found in folder_structure.zip under supervisely/wrist/ann) into the dataset/annotations_all folder.
Your dataset folder should now look like this:
|---dataset
|   |---annotations_all
|   |   |---0001_1297860395_01_WRI-L1_M014.json
|   |   |---...
|   |---images
|   |   |---0001_1297860395_01_WRI-L1_M014.png
|   |   |---...
Edit config.json, setting osteopenia_dataset_csv_path and additional_annotations_path to your own paths.
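For orientation, the relevant part of config.json might look like the sketch below. Only the two keys above are taken from this README; the example paths are placeholders you must replace with your own, and the actual file may contain further keys.

```json
{
    "osteopenia_dataset_csv_path": "/path/to/dataset/osteopenia_dataset.csv",
    "additional_annotations_path": "/path/to/dataset/annotations_all"
}
```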
Change directory to metadata:
cd metadata
Run the create_filtered_dataset_files_and_csv.py script to generate the new filtered dataset:
python3 create_filtered_dataset_files_and_csv.py
Next, run mask_dataset_images.py to create masks ("dummy" and real) for each image in the filtered dataset:
python3 mask_dataset_images.py
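To illustrate what the masking step does, the sketch below occludes an image with a binary mask in the spirit of the paper (confounder regions blacked out). The mask file name and the convention that non-zero mask pixels mark confounding regions are assumptions for illustration; the actual logic lives in mask_dataset_images.py.

```python
# Illustrative occlusion sketch (assumed mask file name and convention:
# non-zero mask pixels mark confounding regions, e.g. laterality markers).
import numpy as np
from PIL import Image

image = np.array(Image.open("dataset/images/0001_1297860395_01_WRI-L1_M014.png"))
mask = np.array(Image.open("example_mask.png").convert("L"))

occluded = image.copy()
occluded[mask > 0] = 0  # black out the confounding regions
Image.fromarray(occluded).save("example_occluded.png")
```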
Change to the data directory inside the project directory:
cd ../data
Run the gen_train_val_test_data.py script to generate the train, validation, and test splits:
python3 gen_train_val_test_data.py
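For orientation only: a stratified split along the lines of the sketch below is the standard approach for a task like this. The actual ratios, grouping policy (e.g., keeping all images of a patient in one split), and the "label" column name are defined by gen_train_val_test_data.py, so treat this purely as a sketch.

```python
# Generic stratified 70/15/15 split sketch (the real policy is in
# gen_train_val_test_data.py; the CSV path and "label" column are assumed).
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv("dataset/osteopenia_dataset.csv")
train_df, rest_df = train_test_split(
    df, test_size=0.30, stratify=df["label"], random_state=42)
val_df, test_df = train_test_split(
    rest_df, test_size=0.50, stratify=rest_df["label"], random_state=42)
```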
Change back to the main project directory:
cd ..
Run the train_all.py script to train all models:
python3 train_all.py
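train_all.py handles training end to end; purely for orientation, one training epoch in PyTorch looks roughly like the skeleton below. The model, data loader, and binary cross-entropy head are assumptions mirroring the sketches above, not the script's exact configuration.

```python
# Skeleton of one training epoch (illustrative only; see train_all.py
# for the actual models, losses, and hyperparameters).
import torch

def train_one_epoch(model, loader, optimizer, device="cpu"):
    model.train()
    criterion = torch.nn.BCEWithLogitsLoss()  # assumed binary osteopenia label
    for images, labels in loader:
        images, labels = images.to(device), labels.float().to(device)
        optimizer.zero_grad()
        loss = criterion(model(images).squeeze(1), labels)
        loss.backward()
        optimizer.step()
```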