AquaSense 🌊

A comprehensive data analysis and machine learning project to predict water potability using various water quality metrics.

📊 Project Overview

AquaSense analyzes water quality data to determine potability using multiple machine learning models. The project includes extensive data visualization, preprocessing, and comparative analysis of different classification algorithms.

🔬 Features

Comprehensive exploratory data analysis (EDA)
Interactive visualizations using Plotly and Seaborn
Missing data handling and preprocessing
Implementation of 7 different machine learning models:
- Logistic Regression
- Decision Tree Classifier
- Random Forest Classifier
- XGBoost Classifier
- K-Nearest Neighbors
- Support Vector Machine
- AdaBoost Classifier
Model performance comparison and evaluation

📈 Dataset

The project uses a water potability dataset with the following features:

pH value
Hardness
Solids
Chloramines
Sulfate
Conductivity
Organic carbon
Trihalomethanes
Turbidity
Potability (target variable)

🛠️ Technical Stack

Python: Core programming language
Data Processing: Pandas, NumPy
Visualization:
- Matplotlib
- Seaborn
- Plotly Express
Machine Learning:
- Scikit-learn
- XGBoost
Development Environment: Jupyter Notebook

📊 Visualizations Include

Correlation heatmaps
Distribution plots
Box plots
Violin plots
Pair plots
Interactive Plotly visualizations
Missing data analysis

🤖 Machine Learning Models Performance

Model	Accuracy Score
Logistic Regression	✓
Decision Tree	✓
Random Forest	✓
XGBoost	✓
K-Nearest Neighbors	✓
SVM	✓
AdaBoost	✓

🚀 Getting Started

Clone the repository:

git clone https://github.com/yourusername/aquasense.git
cd aquasense

Install required packages:

pip install -r requirements.txt

Run Jupyter Notebook:

jupyter notebook

Open AquaSense.ipynb to view the analysis

📋 Prerequisites

Python 3.x
Jupyter Notebook
Required Python packages:
- pandas
- numpy
- matplotlib
- seaborn
- plotly
- scikit-learn
- xgboost

🔍 Key Findings

Comprehensive analysis of water quality parameters
Identification of key factors affecting water potability
Comparative analysis of different machine learning approaches
Model performance evaluation using various metrics

👥 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Original dataset contributors
Soumya Kushwaha - Project Author
GitHub Repository

📧 Contact

For any queries or suggestions, please reach out through GitHub issues.

Developed with ❤️ by Soumya Kushwaha

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
AquaSense.ipynb		AquaSense.ipynb
AquaSense.py		AquaSense.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AquaSense 🌊

📊 Project Overview

🔬 Features

📈 Dataset

🛠️ Technical Stack

📊 Visualizations Include

🤖 Machine Learning Models Performance

🚀 Getting Started

📋 Prerequisites

🔍 Key Findings

👥 Contributing

📜 License

🙏 Acknowledgments

📧 Contact

About

Uh oh!

Releases

Packages

Languages

Soumya-Kushwaha/AquaSense

Folders and files

Latest commit

History

Repository files navigation

AquaSense 🌊

📊 Project Overview

🔬 Features

📈 Dataset

🛠️ Technical Stack

📊 Visualizations Include

🤖 Machine Learning Models Performance

🚀 Getting Started

📋 Prerequisites

🔍 Key Findings

👥 Contributing

📜 License

🙏 Acknowledgments

📧 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages