Cyclistic Customer Usage
Cyclistic Customer Usage
Cyclistic Customer Usage
There are some issues with the data from 15th Dec 2020
The data has been masked but is from original source
There are 4.46 Million records, this is comprehensive
The data runs until end of June 2021, it was decided to take
the most recent data
The source can be found on slide 4
Licensing, privacy,
security and accessibility
• On 15th December 2020, there are 378 rides with an end date of November 2020.
This may have been a short term issue as it all happened on the same day.
• In total there are 9,872 rides where the time ending is before the time starting
spread throughout the complete dataset. There could be a systemic issue that
should be explored.
• There are 369 rides where the time taken is 0, but the distance is greater than 0.
This could also point to an issue.
• In February 2021, there are 90 rides which are within seconds of 25 hours long.
Perhaps there was a fault in the process for handling unreturned bikes? This
caused a distortion to the results because February is a low month anyway.
ANALYZE
Final Data Set
• Rather than running the analysis on all 4+ million
records, a summary query has been written:
Example Impact
• On the strange results slide, the 4th issue around a spike in
February 2021 was analysed and a partial reason found. Just
these 90 rides in February caused a 5 and 3 minute reduction
in average times for the 2 customer types
SHARE
Trends in the data
Casual riders take a lot longer to complete their journey, even though the journey is
of similar length
Trends in the data
• Members use the bikes
at a very similar level
every day (perhaps
slightly less on Sundays)
• Casual users are very
weekend focussed
Trends in the data
• Casual users use the
bikes more than
members in June. In
winter months, the
members use the bikes
significantly more
How are the bikes used differently?
• Casual users spend much more time with the bike for
the same actual use
• Over the winter months, our regular users use the
bikes far more
ACT
Next steps
• There are some additional questions that arise from
the data
– What is the full operating model of the solution?
• What is the error handling process?
• Why is there such a difference in the quality of the
locations?
• Is there an auto-ending facility if the bike is not
returned properly (giving rise to the 25 hour issue)
Conclusions
• We need to investigate if incentivising casual users to give the
bikes back as soon as possible. The distance travelled is the same,
but time taken is more than double.
– Is there a lack of trust that there will be a bike available when
needed?
– How can this impact the running/maintenance costs of the
bikes if fewer bikes can stretch further?
• Can the seasonal pattern be incentivised?
– Make the summer months cheaper with a membership