Creation of the Training and Test Data Sets (WML Project)

Let's import all necessary libraries!

From all the available NYC CitiBike Data (105 months, from June 2013 till February 2022), 80% will be kept for training and 20% for test.

  1. Building the first colum and rows (84 months) for the training set (21 months) for the test set in two different data frames with the right date format.
  1. Incorporating the NYC CitiBike Data (Number of Trips per month) to the corresponding data frames (training and test). Source: https://s3.amazonaws.com/tripdata/index.html
  1. Reading and editing NYC monthly historical weather data (precipitation and temperature). Source: https://www.weather.gov/wrh/Climate?wfo=okx
  1. Adding the weather data to the corresponding data frames and exporting it.