Assignment2_Instruction
Use of tools:
• Recommended: Python
• Excel may also be used
Student Name:
Section:
ID:
Kamloops Hotels
Business Situation: TransCanada Traveler, a magazine devoted to cross-Canada travel, published its
Reader’s Choice List of the top 20 places to stay if you find yourself in Kamloops as you journey from
Victoria to St. John’s (or vice-versa). The data shown are the scores received by these hotels based on
the results from TransCanada Traveler’s annual Readers’ Choice Survey. Each score represents the
percentage of respondents who rated a hotel as excellent or very good on one of three criteria
(comfort, amenities, and nearby dining). An overall score was also reported and used to rank the
hotels. The highest ranked hotel, the Sandman Signature, has an overall score of 94.3, the highest
component of which is 97.7 for nearby dining (due to its downtown Kamloops location).
Part-A
a. Perform a simple linear regression with “Overall” (the overall rating of the hotels) as the target
variable and “Amenities” as the independent variable.
b. State the equation of the simple linear regression performed in question “a”, and report the
slope and the intercept.
Your answer to the above question…
Regression equation
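For Python users, the least-squares calculation behind questions “a” and “b” can be sketched as below. This is only a minimal illustration: the function name simple_linear_regression and the four score pairs are hypothetical placeholders, not the survey data, so the printed equation is not the answer to the question.

```python
def simple_linear_regression(x, y):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-deviations and sum of squared deviations of x
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Placeholder values only (assumption): replace with the actual
# "Amenities" and "Overall" columns from the assignment data.
amenities = [80.0, 85.0, 90.0, 95.0]
overall = [82.0, 86.5, 89.0, 94.5]

b0, b1 = simple_linear_regression(amenities, overall)
print(f"Overall = {b0:.3f} + {b1:.3f} * Amenities")
```

The slope is the estimated change in “Overall” for a one-point increase in “Amenities”, and the intercept is the predicted “Overall” when “Amenities” is zero.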
Part-B
e. Perform a multiple linear regression with “Overall” (the overall rating of the hotels) as the target
variable, predicted from the given independent variables “comfort”, “amenities”, and “nearby dining”.
f. State the estimated multiple linear regression equation (fitted in question “e”) that can be used to
predict the overall score given the scores for comfort, amenities, and nearby dining.
Your answer to the above question…
Regression equation
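For Python users, one way to obtain the coefficients asked for in questions “e” and “f” is to solve the normal equations (X'X)b = X'y directly. The sketch below is an illustration under stated assumptions: the helper names solve_linear_system and fit_ols are hypothetical, and the eight rows of scores are placeholders, not the survey data. In Colab, a regression library such as statsmodels would normally be used instead and should give the same coefficients.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are not modified
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_ols(rows, y):
    """Return [b0, b1, ..., bk]; each entry of rows holds one observation's predictors."""
    x = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    k = len(x[0])
    xtx = [[sum(xi[i] * xi[j] for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum(xi[i] * yi for xi, yi in zip(x, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Placeholder values only (assumption): (comfort, amenities, nearby dining)
scores = [
    (90, 85, 95), (88, 80, 90), (92, 90, 85), (85, 75, 88),
    (80, 82, 92), (95, 88, 80), (78, 70, 75), (86, 91, 83),
]
overall = [91.0, 86.5, 89.0, 83.0, 84.0, 88.5, 75.0, 86.0]

b0, b_comfort, b_amenities, b_dining = fit_ols(scores, overall)
print(f"Overall = {b0:.3f} + {b_comfort:.3f} * Comfort "
      f"+ {b_amenities:.3f} * Amenities + {b_dining:.3f} * NearbyDining")
```

Each coefficient is the estimated change in “Overall” for a one-point increase in that predictor while the other two predictors are held constant.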
g. What will be the impact on “Overall” of a unit increase in “Amenities”?
Your answer to the above question…
h. Use the t-statistics to determine the significance of each independent variable. What is the
conclusion for each test at the 0.05 level of significance?
Note:
• Excel can also be used to obtain the t-statistics, as discussed in class.
• If you are using Python, there is no need to paste a screenshot, as you will be providing the link
at the very bottom.
Paste the screenshot here…
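For Python users, the t-statistic of each coefficient in question “h” is the coefficient divided by its standard error, where the standard errors come from the diagonal of s²(X'X)⁻¹ and s² = SSE / (n − k − 1). The sketch below shows the calculation under stated assumptions: the helper names invert and ols_t_statistics are hypothetical, and the scores are placeholders, not the survey data. The critical value 2.776 is the two-tailed 0.05 table value for the 4 degrees of freedom of this placeholder sample; with 20 hotels and 3 predictors the degrees of freedom are 16 and the table value is 2.120.

```python
import math


def invert(a):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [v - f * w for v, w in zip(m[r], m[col])]
    return [row[n:] for row in m]


def ols_t_statistics(rows, y):
    """Return (coefficients, t_statistics, degrees_of_freedom), intercept first."""
    x = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    n, k = len(x), len(x[0])             # k counts the intercept as well
    xtx = [[sum(xi[i] * xi[j] for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum(xi[i] * yi for xi, yi in zip(x, y)) for i in range(k)]
    inv = invert(xtx)
    beta = [sum(inv[i][j] * xty[j] for j in range(k)) for i in range(k)]
    resid = [yi - sum(b * v for b, v in zip(beta, xi)) for xi, yi in zip(x, y)]
    df = n - k                           # n minus predictors minus 1
    s2 = sum(e * e for e in resid) / df  # estimated error variance
    t_stats = [b / math.sqrt(s2 * inv[i][i]) for i, b in enumerate(beta)]
    return beta, t_stats, df


# Placeholder values only (assumption): (comfort, amenities, nearby dining)
scores = [
    (90, 85, 95), (88, 80, 90), (92, 90, 85), (85, 75, 88),
    (80, 82, 92), (95, 88, 80), (78, 70, 75), (86, 91, 83),
]
overall = [91.0, 86.5, 89.0, 83.0, 84.0, 88.5, 75.0, 86.0]

beta, t_stats, df = ols_t_statistics(scores, overall)
t_critical = 2.776  # two-tailed 0.05 critical value for df = 4
for name, b, t in zip(["Intercept", "Comfort", "Amenities", "NearbyDining"], beta, t_stats):
    verdict = "significant" if abs(t) > t_critical else "not significant"
    print(f"{name}: coefficient = {b:.3f}, t = {t:.3f}, {verdict} at 0.05 (df = {df})")
```

A variable is judged significant at the 0.05 level when the absolute value of its t-statistic exceeds the critical value, which is equivalent to its p-value being below 0.05.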
Part-C
i. Remove all independent variables that are not significant at the 0.05 level of significance from the
estimated regression equation. What is your recommended estimated regression equation?
j. What is the impact of this change on the coefficient of determination and why?
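For Python users, questions “i” and “j” can be explored by computing the coefficient of determination, R² = 1 − SSE/SST, for the full model and for the reduced model. The sketch below is an illustration under stated assumptions: the helper names solve and r_squared are hypothetical, the scores are placeholders rather than the survey data, and dropping the second column is an arbitrary choice for demonstration, since which variable (if any) to drop depends on the t-tests on the real data. Adjusted R² is included because it accounts for the number of predictors.

```python
def solve(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def r_squared(rows, y):
    """Fit OLS with an intercept and return (R squared, adjusted R squared)."""
    x = [[1.0] + list(r) for r in rows]
    n, k = len(x), len(x[0])  # k counts the intercept as well
    xtx = [[sum(xi[i] * xi[j] for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum(xi[i] * yi for xi, yi in zip(x, y)) for i in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, xi)) for xi in x]
    mean_y = sum(y) / n
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))  # unexplained variation
    sst = sum((yi - mean_y) ** 2 for yi in y)               # total variation
    r2 = 1 - sse / sst
    adj_r2 = 1 - (1 - r2) * (n - 1) / (n - k)
    return r2, adj_r2


# Placeholder values only (assumption): (comfort, amenities, nearby dining)
scores = [
    (90, 85, 95), (88, 80, 90), (92, 90, 85), (85, 75, 88),
    (80, 82, 92), (95, 88, 80), (78, 70, 75), (86, 91, 83),
]
overall = [91.0, 86.5, 89.0, 83.0, 84.0, 88.5, 75.0, 86.0]

# Hypothetical reduced model: keep columns 0 and 2, drop column 1
reduced = [(c, d) for c, _, d in scores]

full_r2, full_adj = r_squared(scores, overall)
red_r2, red_adj = r_squared(reduced, overall)
print(f"Full model:    R^2 = {full_r2:.4f}, adjusted R^2 = {full_adj:.4f}")
print(f"Reduced model: R^2 = {red_r2:.4f}, adjusted R^2 = {red_adj:.4f}")
```

Removing a predictor from a least-squares model can never raise R², because the full model can always reproduce the reduced fit by setting the removed coefficient to zero; adjusted R² can move in either direction.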
Important Details
Submission Instructions:
• A PDF of this document with your answers
• File naming convention: Assignment2_Name_ID_Section (PDF file)
Python Users:
• The Colab link provided above is sufficient; there is no need to upload a Python code file.
Excel Users:
• Save the CSV file in XLSX format before you start working; otherwise you will lose your work even
after saving.
• Name your Excel file in the same format, Assignment2_Name_ID_Section (XLSX), and submit it
too.
Marking Rubrics:
Activity/Competencies Demonstrated                                        % of Final Grade
Critical Analysis and Research in statistics (10%)
   a. Depth of background summary                                          5
   b. Appropriateness of research                                          5
Statistical Explanation (50%)
   a. Uses statistical keywords clearly and effectively                    20
   b. Information organized intelligently and to the point
      (avoiding redundancy)                                                10
   c. Proper presentation with explanations                                20
Application of Python (40%)
   a. Coding understanding and application                                 20
   b. To-the-point answers (no redundancy)                                 20
Total                                                                      100
*Every question carries equal weight; not attempting all the questions will reduce the marks
proportionally.
Remark: For any questions or doubts, feel free to ask on the discussion forum for Assignment 2.