Member-only story
Data Science Project | Will it Rain Tomorrow?
Start To Finish Logistic Regression Model | Data Series | Project 2
7 min readNov 9, 2020
In this episode we will be expanding on Logistic Regression in Python, implementing much more data pre-processing steps on a larger data set that contains both numerical and categorical data (words).
Objective
Construct a logistic regression model to predict if it will rain tomorrow in a city in Australia.
Link to data and code can be found in the folder project 2 here: Github
1. Importing and Exploring our Data
Importing our data into python
# Read the data
import pandas as pd
import numpy as np
df = pd.read_csv("D:\ProjectData\weatherAus.csv")
print('Size of weather data frame is :',df.shape)
df.head()
We can see here that we are working with over 100,000 rows of data.