Member-only story

Data Science Project | Will it Rain Tomorrow?

Start To Finish Logistic Regression Model | Data Series | Project 2

Mazen Ahmed
7 min readNov 9, 2020

In this episode we will be expanding on Logistic Regression in Python, implementing much more data pre-processing steps on a larger data set that contains both numerical and categorical data (words).

Objective

Construct a logistic regression model to predict if it will rain tomorrow in a city in Australia.

Link to data and code can be found in the folder project 2 here: Github

1. Importing and Exploring our Data

Importing our data into python

# Read the data
import pandas as pd
import numpy as np

df = pd.read_csv("D:\ProjectData\weatherAus.csv")

print('Size of weather data frame is :',df.shape)
df.head()

We can see here that we are working with over 100,000 rows of data.

--

--

Mazen Ahmed
Mazen Ahmed

No responses yet