This repository contains a project focused on analyzing data derived from fine needle aspiration (FNA) images to support breast cancer diagnosis. The main objective is to apply supervised and unsupervised learning techniques to classify breast masses as benign or malignant.
- Claudia Heredia Ceballos
- Manuel Otero Barbasán
- Marta Pineda Gisbert
- Javier Fernández Castillo
University of Seville
The dataset includes 569 samples, categorized as:
- 357 benign cases.
- 212 malignant cases.
Each sample is described by 33 features related to the properties of cell nuclei in the images.
- Supervised Classification: Algorithms such as CART.
- Unsupervised Clustering: Exploration of latent patterns in the data.
- Clone the repository:
git clone https://github.com/JaviFdez7/FNA-BCA-Diagnosis.git cd FNA-BCA-Diagnosis
- Open the R Markdown file in RStudio:
notebook.Rmd
- Execute the code chunks in the specified order to reproduce the analyses.
- Effective classification of samples as benign or malignant.
- Clustering visualizations highlighting relevant patterns in the data.
This project is licensed under the MIT License.