Skip to content

Image classification model for Alzheimer/Dementia brain scans using a Vision Transformer (ViT) fine-tuned on the ImageNet-21k dataset and implemented with PyTorch tools.

Notifications You must be signed in to change notification settings

rmezapi/dementia_vit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

dementia_vit

This project was intended to test the limits of the ViT on a tough dementia dataset. The data used can be found on HuggingFace at: https://huggingface.co/datasets/Falah/Alzheimer_MRI. The dataset has 4 classes:

  • 0: mild_demented
  • 1: moderate_demented
  • 2: non_demented
  • 3: very_mild_demented

The project follows closely the following tutorials:

I modify the code presented in the video and tune all parameters to optimize performance using mostly the same libraries and tools. This is a practice project for myself as I return to coding/designing ML models after dedicating time to AI/ML theory (model architectures, transfer learning)

After hyperparameter tuning, the highest testing accuracy achieved is 66% using a learning rate of 2e-5, a weight decay of 0.001, batch size of 8, and iterating over 5 epochs.

image image image

About

Image classification model for Alzheimer/Dementia brain scans using a Vision Transformer (ViT) fine-tuned on the ImageNet-21k dataset and implemented with PyTorch tools.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published