Overview

Diabetes Prediction Machine Learning Project

Overview

This repository contains a machine-learning project focused on predicting diabetes using relevant medical data. This project aims to develop a model that can accurately predict the likelihood of an individual having diabetes based on various input features.

Dataset

The dataset used for this project consists of 2460 samples with 8 features, including age, BMI, insulin, and more. The dataset was obtained from Kaggle.

Approach

In this project, I employed several different machine learning models from Scikit-learn to train and evaluate the predictive model. The steps involved in the process include:

Importing data: Cleaning the dataset, handling missing values, and scaling features.
Understanding data: Understanding data distribution and correlation matrix
Model training: Training the selected algorithm on the preprocessed dataset.
Model evaluation: Assessing the performance of the trained model using appropriate metrics such as accuracy, precision, recall, F1-score, and ROC-AUC.

Results

Currently, the trained model achieved an accuracy score of 80%.

Usage

To run this project locally, follow these steps:

Open this link to access the Google Colab file and make a copy
Download dataset files from this repository
Run the Google Colab to preprocess the data, train the model, make predictions, and save the model.

Technologies Used

Programming languages: Python

Libraries: Scikit-learn Pandas NumPy Matplotlib Seaborn

Future Improvements

Some potential enhancements for this project include:

Exploring additional features or data sources to improve the model's predictive performance.
Tuning hyperparameters to optimize the model's performance.
Deploying the trained model as a web application or API for real-time predictions.

Acknowledgments

Special thanks to Ehab Aboelnaga for providing the dataset used in this project.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Diabetes_Prediction.ipynb		Diabetes_Prediction.ipynb
README.md		README.md
Testing.csv		Testing.csv
Training.csv		Training.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diabetes Prediction Machine Learning Project

Overview

Dataset

Approach

Results

Usage

Technologies Used

Future Improvements

Acknowledgments

About

Releases

Packages

Languages

Ph1so/Diabetes-Prediction

Folders and files

Latest commit

History

Repository files navigation

Diabetes Prediction Machine Learning Project

Overview

Dataset

Approach

Results

Usage

Technologies Used

Future Improvements

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages