This project is part of a class assignment where I explore housing data from Cook County and build a machine learning model to predict housing prices.
The project is divided into two main parts:
-
Part 1: Data Exploration
In this part, I conducted an exploratory data analysis (EDA) on the housing dataset, identifying key features, understanding data trends, and performing the necessary data cleaning and preprocessing steps. -
Part 2: Predicting Housing Prices
In the second part, I utilized Pandas, Seaborn, and Scikit-learn to build and evaluate multiple regression models for predicting housing prices. Additionally, I addressed potential biases and ensured fair outcomes in the predictive analysis, using Matplotlib and Plotly for visualization and interpretation.
- Data Exploration: Handling and preprocessing real-world datasets, and deriving useful insights through exploratory analysis.
- Machine Learning: Building and evaluating regression models, tuning hyperparameters, and interpreting model results.
- Bias Mitigation: Understanding and addressing potential biases to ensure fair predictions in machine learning models.
- Python (Jupyter Notebooks)
- Pandas, NumPy for data manipulation
- Seaborn, Matplotlib, and Plotly for data visualization
- Scikit-learn for machine learning