Real Estate Case Analysis
Real Estate Case Analysis
Real Estate Case Analysis
net/publication/304615871
CITATION READS
1 14,472
2 authors:
All content following this page was uploaded by Ranthilaka Gedara Ariyawansa on 30 June 2016.
R. G Ariyawansa
Professor
Department of Estate Management and Valuation
Faculty of Management Studies and Commerce
University of Sri Jayewardenepura
Sri Lanka
135/60 A, Jaya Mawatha,
Neelammahara Road,
Maharagama
ariyaw71@sjp.ac.lk
W.H.T. Gunawardhana
Lecturer
Department of Estate Management and Valuation
Faculty of Management Studies and Commerce
University of Sri Jayewardenepura
Sri Lanka
Email : terans@sjp.ac.lk
TP : +94713238188
31.01.2016
2
Contents
Content Page No
3
Chapter Four: Real Estate Analysis- Validity and Reliability 88-96
Validity of Research Outcomes
Reliability of Research Outcomes
Bibliography 97-98
Glossary 99-127
4
Purpose of the book
Real Estate Development and Management (REDM) is multi-
disciplinary in nature. Therefore, the scope of Real Estate
Development and Management is very wide and complex. This
discipline relates with several subjects as well. Any decision
taken by a supplier, buyer, investor, policy maker etc is very
crucial and expensive. Changing or altering decisions is
sometimes impossible or very expensive. Therefore, scientific
analysis is a fundamental requirement for the correct decision
making process. Scientific research also helps for innovations
in the industry.
5
General Management (aspects including four functional
areas i.e. Financial Management, Human Resources
Management, Production and Marketing Management).
Strategic Management.
Role of different stakeholders i.e. developer, investors,
managers etc.
Analysis if highest and best use of lands and properties
Cost Benefit Analysis.
Cash flow, Net Present Value, Internal Rate of Return
etc with regard to real estate investment projects.
Investment appraisal and portfolio management.
Real Estate Development process, real estate life cycle.
Property and the macro-economy, globalization of
properties.
Obsolescence of properties and remedies.
Different costs and values in real estates.
Environmental effects on Real Estate and vice versa.
Building facilities & service management.
Management of Condominiums.
Ownerships, tenure types, different interest and legal
aspects of real estates.
Real estate management in non-real estate firms i.e.
Corporate Real Estate Asset Management (CREAM).
Review of real estate industry and public policy (real
estate sector/real estate markets etc).
Real estate research and consultations.
6
Chapter 01
Introduction to Real Estate
7
locations. However, due to the development of the
infrastructure facilities, it is difficult to demarcate urban uses
and non-urban uses easily. Anyway, in this book, more concern
is given to urban property development and management.
Real Estate: Real estate is defined as the land above and below
the earth’s surface, including all things that are permanently
attached to it either natural or artificial. Therefore, the term
“Real Estate” is broader than the term “land”. It includes not
only the natural components of the land but also all artificially
improved immovable features made by the man.
At the same time, it is clear that the land becomes usable when
it is converted into a real estate. This means when a land
becomes a real estate, it is usable for planned activities.
Therefore, it is able to argue that the “usability of the land” is a
more apparent and logical criterion to recognize real estate. In
this way, it is also able to argue that when the man starts to use
the land it becomes a real estate.
8
Real Property: The word “property” has different meanings in
legal term1. In this discipline, it is the land and tangible features
on the land and permanent improvements.
1
Moveable property, immoveable property etc… In this subject, the term
property referred only to immoveable properties.
9
granite etc are examples of limitations of the surface
right.
10
Classification of Properties (Types of Properties)
When it is analysing the real estate cases, it is essential to know
different types of properties. Properties can be classified
according to different criteria as pointed out below.
11
Markets and related socio-economic context may define
properties (especially residential properties) as,
Luxury
Semi-luxury/ middle income...
Low income etc
12
(a) A process of “idea generation”
(b) A process of “physical construction” and
(c) A process of “use of the built property”
13
Real Estate Management (REM)
Property management starts with the property purchase idea
(idea of acquiring/ building/ purchasing or simply having a
property) of the users and investors. Generally, there are two
main areas in property management. They are,
14
proper care and attention of benefits of properties. Thus,
property assets management functions deal with the operational
decisions. These decisions are usually taken by the middle and
lower level managers of organizations.
In case of properties of private sector organizations, it is
expected to have separate property management plans along
with the corporate business plans of organizations. As far as the
management of the public sector properties are concerned, there
may have some approved laws relating to the respective public
sector agency. Sometimes, necessary regulations and circulars
are issued time to time and strategic level and operational level
decisions are taken accordingly. Decisions according to
circulars are not rational as it does not always concern the
professional requirements rather than administrative
requirement.
15
of Public Properties etc are some of areas under the subject of
real estate management.
Some issues relating to the increasing real estate assets of
organizations
Rapid changes in technology: This is beneficial as
it paves the way for innovations, highest and best
uses etc. At the same time, it is a challenge as it
leads to a heavy cost and functional obsolesce of
properties etc.
Variety of social needs: Since social needs and wants
are rapidly changed, adoptability of properties is an
expensive task.
16
Similarly, organizations have to deal with some common
macro-economic issues when managing properties.
Some macro level socio-economic issues
Interest rates over the cost of money.
Inflation on construction cost.
Tax/ subsidies etc.
International trading issues.
Establishing international property market.
Oil crisis/ cost of energy/ electricity/ gas etc.
Scientific development.
War/crisis/ natural and artificial disasters.
Ultimate result of determinants of property prices:
difficult to renew/ difficult to test/ difficult to focus.
17
Preparing real estate strategic plans to support general
business plans of large organizations.
Real estate brokerage when selling or purchasing.
Capital markets/asset advisory for investment
management.
Feasibility studies, property analysis for new investment
or alteration for the existing real estate.
Evaluate customer needs and to solve customer
problems.
Etc...
18
Analysis of economic base of real estates or businesses.
Analysis of economic impact.
Study of highest and best use of land and properties.
Land use study in regions or neighbourhoods.
Market surveys.
Marketability study.
Financial feasibility.
Environmental impact analysis
19
he/she engages in observing properties, auditing properties,
collecting, reporting, and analyzing data for the purpose of
estimating the market value of a property. Hence, providing
objective opinion is a crucial task for a valuer as a human being
who is living among ordinary community. However, it is
believed and also evident that by means of wider experience,
updated scientific knowledge, a range of smart skills as well as
professional ethics, cord of conducts etc. an ordinary/apprentice
valuer (learner) is developed as a capable, confident
professional who is conscious over the existence of the given
property and the property market enabling to take correct
judgments based of proper analysis.
20
Chapter 02
Real Estate Research and Case Study Analysis
Beginning of a Research
Introduction
How do you find an answer(s) for a problem(s)? There are
several approaches for this. Mainly,
21
The third approach is stronger to find solutions for real problem
as it addresses the root of the problems. This approach is;
Base on philosophy/theory.
Least bias.
Least subjective.
Higher ability to generalize.
Higher degree of applicability.
Measurable/responsible/correctable/improvable.
Higher reliability and validity.
What is a research?
In order to find an answer for the above question i.e. what is a
research, try to find answers for the questions mentioned below.
Why do you want a research?
How do you do research?
When do you do research?
22
Research cycle
Origin of research: Societal problems:
There are questions/issues/ These are multidimensional/
difficulties/needs and wants/ interrelated/ interdependent/
motivations/ frustrations/desires/ critical in situations/ multiple
expectations/plans etc. views.
23
Systematic inquiry: This implies a profound design of a
research. Identification of the problem, establishing aims and
objectives, setting research questions, review of literature,
determining variables, research tools, and analysis etc; are the
features of a systematic approach.
24
Least subjective (objective) and nonbiased.
Thinking and action:
This implies that the whole process of designing, collecting
analysing and interpreting data, drawing conclusions, writing
the report, and presenting of the finding.
Problems/issues/needs:
Why/how/how much/how many/ when/ and so on are
problems.
Higher level:
This is very significant in a research. This emphasises that you
are producing a set of new knowledge. Otherwise whatever
your attempt will not be a research.
25
is the beginning of another research(es). The research process is
generally a cyclical one.
Types of research
(1) According to the nature of what do you want to do,
research can be identified as,
a. Descriptive research:
You describe a situation or the nature of a
problem in this approach. You describe what is
the shape of the problem? For instance, (i) Study
on the changes of land use pattern in Colombo
city. (ii) An empirical examination of socio-
economic characteristics of slum dwellers in
Colombo.
b. Correlation research:
You examine the relationship(s) between (or
among) variables. (With what it is related). For
instance, (i) Analysis of the impact of new tax
policy introduced by the 2008 budget on the land
use in the Western Province. (ii) Analysis of the
effectiveness of application of computer
technology for planning in Sri Lanka.
c. Explanatory research:
You study why a certain problem or a
phenomenon or a situation or a relationship etc.
exists. (Why it is related with that). For instance,
(i) Why the rate of increase of land price is
higher in Colombo than major Asian cities? (ii)
26
Factors for the success or failure of privatization
of state owned investment properties.
27
(4) According to the style of inquiry you find types of
research as,
a. Quantitative research:
In a quantitative research, you use data from
highly structured formats and analyse
accordingly. For instance, you quantify
relationships, classify relationships, order
relationships according to the significance,
compute ratios, use specific samples, and so on.
b. Qualitative research:
In qualitative research, you use data and
information in unstructured forms and analyse
accordingly. There, you describe the variables
and their relationships rather than quantifying,
use selected cases for in-depth analysis and so
on.
28
Therefore, it is clear that there is no a hard and fast
method/methodology called “the methodology”. Determine a
suitable method for your particular study. Then that particular
method will be “the methodology” for your particular study for
that particular purpose, in that particular occasion.
Producers Suppliers
Research on
Similarly you can learn the following examples and you will be
aware of formulating research theme as well.
30
The appropriateness of plot size and different land uses.
Researcher
In order to make an appropriate analysis, real estate researcher
will have some qualities such as;
Researcher is (or will be) an expert in a particular area
of interest in the field of real estate.
Lifelong learner.
Self learner.
Good consultant.
Able to diagnose problems accurately.
Rational thinker.
Good negotiator.
Good observer.
Able to provide objective opinions.
Philosopher.
Theory builder.
Opinion maker.
Good rapport builder.
Good investigator.
Able to read and comprehend.
Listener.
Writer.
Analyser and interpreter.
Presenter and convincer.
31
The research process
(1) Identification of a research problem.
Tentative Title
Research problem
32
(3) Preparing data collection tools.
Observation methods/guidelines.
Interview schedules/ list of interviewees/ organizations
to be visited etc.
Interview guidelines.
Questionnaire(s).
Data recording tools.
33
Objectives (General and special).
Hypothesis (if needed).
Research design (methodology).
Limitations/scope.
Chapters of your research report.
(These features are enough for BSc and MSc level
academic research proposals)
Timeframe.
(PhD and other given task, even for MSc programs if
requested)
Budget.
Qualifications of resource persons (CVs).
Any other relevant particulars.
(For funded projects)
34
Learning outcomes
Our attempt Your responsibility Learning
outcomes
We attempt to Knowing answers for
Short term
help you to (1)-(5)
understand You will be
(1) What is researchers and you do
research? research (Research
(2) How do you oriented/analytical/
do research? good observer/
(3) Who is maintaining your own
researcher? data and information
(4) What bank/ innovator/
Long-term
characteristic critical thinker/ theory
s does a good builder)
researcher Accordingly you help
have? to upgrade the social
(5) What is the well-being through
purpose of being a true real estate
doing a professional
research?
Designing of a Research
This part of the chapter 09 covers following areas.
Designing a research proposal.
Title of your research.
Developing an “Introduction”.
35
Methodology.
Designing of your research.
36
Should not include abbreviation/ jargons/ ambiguous
terms (WHO, CMC, UN…).
Should not be two sentences, but able to have one sub
title not more than one.
37
14. Why do people pay lower amount of stamp duty on land
transaction?
15. How does the existing information system help to the
role of the valuer?
16. What is the appropriate plot size for residential
properties?
17. The appropriateness of plot size and the land use
Question 01: Are these appropriate titles for research and can
you do researches on these areas?
38
you can not do a research. You will not go
ahead…!!!!
Examples 01
Suppose that you have roughly identified a research theme as
“Analysis of Housing Market”. This is a relevant researchable
area in your discipline. But, it is too vague. So, you have to
rearticulate it to be a specific and researchable theme. Go on the
following steps and learn the gradual changes made for the
initial theme.
a) Analysis of Housing Market
b) Analysis of Housing Market in Colombo
c) Analysis of Housing Market in Colombo and Suburbs
d) Analysis of Consumer Behaviour of the Housing Market
in Colombo and Suburbs
e) Analysis of Determinants of Consumer Behaviour of the
Housing Market in Colombo and Suburbs
f) Analysis of Determinants of Consumer Behaviour of
Potential Buyers in the Housing Market in Colombo and
Suburbs
g) Analysis of Determinants of Consumer Behaviour of
Potential Buyers in the Housing Market in Colombo and
Suburbs
h) Analysis of Determinants of Consumer Behaviour of the
Housing Market in Colombo and suburbs: Views of
Potential Buyers
39
i) Analysis of Determinants of Consumer Buying
Behaviour of the Housing Market: Views of Potential
Buyers in Colombo and Suburbs
j) Determinants of Consumer Buying Behaviour of the
Housing Market: Views of Potential Buyers in Colombo
and Suburbs
k) Analysis of Determinants of Consumer Satisfaction of
the Housing Market: Views of Potential Buyers in
Colombo and Suburbs
l) Analysis of Determinants of Consumer’s Expected
Needs and Wants from of the Housing Market: Views of
Potential Buyers Evidence of Colombo and Suburbs
Example 02
Initial title
“Impact of global credit crunch on residential property market
in Sri Lanka with special reference to Kaduwela area in
Colombo”
Improved titles
“Impact of global economic downturn on residential property
market in Sri Lanka: Empirical evidence from Colombo”
40
“Failure of global financial market on residential property
market in Colombo: Competition of global and local contexts”
Developing an “Introduction”
Suppose that you have finalized a title as, “Determinants of
Buying Behaviour of the Housing Market: Views of Potential
Buyers in Colombo and Suburbs”. Accordingly you have to
provide an “INTRODUCTION” to your study at the beginning
of the proposal. What does the “introduction mean? What is it
supposed to do?
41
analyzing the background. It is an overview of your study. You
describe all the core areas of your research theme. According to
this example, “buying behaviour”, “determinants of buying
behaviour” “potential buyer” “housing market” are core
concepts in the study theme. You need to develop a core
ideology/ theme/ logic/ rationale or a central argument for your
study by means of the introduction. Through this way, you trace
a “research problem” more specifically for your study. And at
the end of the introduction you can indicate the research
problem of your study under a separate heading called
“Statement of the Problem”.
In the introduction,
You have to discuss all the major concepts of your
research theme/title.
For instance, according to the first example, “Consumer
buying behaviour”, “Housing market in general”,
“Housing market experience of Colombo and suburbs”
are major areas of importance.
Therefore, in your introduction, you have to describe all
these concepts according to their level of significance.
You should identity and discuss the relationship
between/among these concepts (major concepts and
their variables).
You have to input sufficient amount of literature
(evidence) to support your argument along with
citations in an appropriate way.
42
Once you precisely identified a research problem, you can go
ahead with your research by setting aim(s) and objectives
through which you can address the research problem. You can
have one general objective and few specific objectives.
However, this is not a must.
Methodology
You need to describe particularly the data on which you depend
in finding solutions for the research problem. (Instead of that, it
is needed to explain data collection and analysis methods,
limitations etc).
43
limitations relating to the variables, case studies,
samples for data collections, methods of analysis etc.
Examples - 01
Specific Research Yardstic Data/informat
objectives Questions ks of ion (working
(About (Using indicator definitions of
particula indicators s variables)
r of relevant (relevant
concepts concepts) variables
relating of
to the indicator
research s)
problem)
Househol Distance About the About size of
d’s desires to service quality rooms,
or centres of Size of
satisfactio Quality of structure bedrooms
n structure s, Size of
s (What Design kitchen
Objective specific s Size of
is, features Size of bathrooms
To does a rooms Size of
evaluate buyer Qualit
44
household expect y of sitting
’s desires from a finishe rooms
on house in a s etc.
different flat?) Materi (These are the
features of Neighbour al used expected set of
a house. hood etc. data. You can
quality obtain needed
etc. data through
observation,
questionnaire,
interviewing
experts or
secondary
sources)
Examples - 02
Specific Research Yardstic Data/informat
objectives Questions ks of ion (working
(About (Using indicator definitions of
particula indicators s variables)
r of relevant (relevant
concepts concepts) variables
relating of
to the indicator
research s)
problem)
Househol Level of About the About formal
45
ds income savings savings
purchasin Expenditu Forma Saving
g power re pattern l accounts
Savings saving Fix deposits
Objective Family s NRFC
is, supports EPF
To etc… Jewelle Or
examine ries Rs. Less
level of etc… than
household 100,000
s Rs.
purchasin 100,000-
g power in 200,000
demandin
g a house
46
Chapter 03
Real Estate Analysis-Quantitative Approach
Descriptive Analysis
Tables, Graphs, Summary Measures
Hypothesis Testing
47
Sales Consultant might wish to see whether a new promotion
technique is better than a traditional one.
48
computers and high-powered statistical calculators. The third
method, the confidence interval method illustrates the
relationship between hypothesis testing and confidence
intervals.
49
Situation A: A Structural Engineer changes his drawing to
increase the physical life (durability) of a condominium.
50
Situation C: A Real Estate Researcher is interested in finding
out whether a new global economic downturn would have any
desirable effects on land sales. The researcher is particularly
concerned with the sales volume during the global economic
downturn. Will the sales volume increase, decrease, or remain
unchanged after a sales promotion?
Since the researcher knows that the land sales volume for the
Colombo District under study is Rs.100 million per month, the
hypotheses for this situation are;
51
The null and alternative hypotheses are stated together, and the
null hypothesis contains the equal sign, as shown (where k
represents a specified number).
52
Hypothesis-Testing Common Phrases
> <
is greater than is less than
is above is below
is higher than is lower than
is longer than is shorter than
is bigger than is smaller than
is increased is decreased or reduced from
= ≠
is equal to is not equal to
is the same as is different from
has not changed from has changed from
is the same as is not the same as
53
54
Correlation
55
Pearson Correlation - These numbers measure the strength
and direction of the linear relationship between the two
variables. The correlation coefficient can range from -1 to +1,
with -1 indicating a perfect negative correlation, +1 indicating a
perfect positive correlation, and 0 indicating no correlation at
all. (A variable correlated with itself will always have a
correlation coefficient of 1.)
From the scatterplot of the variables land perch price
and distance to town below, it is possible to see that the points
tend no pattern, which is the same as saying that the correlation
is weak negative. The -0.18 is the numerical description of how
tightly around the imaginary line the points lie.
If the correlation was higher, the points would tend to be closer
to the line; if it was smaller, they would tend to be further away
from the line. Also note that, by definition, any variable
correlated with itself has a correlation of 1.
Sig. (2-tailed) - This is the p-value associated with the
correlation.
56
N - This is number of cases that was used in the correlation.
Because data set has no missing data, all correlations were
based on all 200 cases in the data set. However, if some
variables had missing values, the N's would be different for the
different correlations.
Scatterplot
57
Linear Regression
58
variance of the distribution of the dependent variable should
be constant for all values of the independent variable. The
relationship between the dependent variable and each
independent variable should be linear, and all observations
should be independent.
59
Model - SPSS allows to specify multiple models in a
single regression command. This indicates the number of the
model being reported.
Variables Entered – This indicates entered variables into a
regression in blocks, and it allows stepwise regression. Hence,
researchers need to know which variables were entered into the
current regression. If it doesn’t block the independent variables
or use stepwise regression, this column should list all of the
independent variables that the researcher specified.
Variables Removed - This column listed the variables that
were removed from the current regression. Usually, this
column will be empty unless the researcher did a stepwise
regression.
Method - This column shows the method that SPSS used to run
the regression. "Enter" means that each independent variable
was entered in usual fashion. If the researcher did a stepwise
regression, the entry in this column would show that.
60
Model - SPSS allows to specify multiple models in a
single regression command. This indicates the number of the
model being reported.
R - R is the square root of R-Squared and is the correlation
between the observed and predicted values of dependent
variable.
R-Square - This is the proportion of variance in the dependent
variable (land perch price) which can be explained by the
independent variables (distance to town, shape, extent and soli
type etc). This is an overall measure of the strength of
association and does not reflect the extent to which any
particular independent variable is associated with the dependent
variable.
Adjusted R-square - This is an adjustment of the R-squared
that penalizes the addition of extraneous predictors to the
model. Adjusted R-squared is computed using the formula 1 -
((1 – R2) ((N - 1) / (N - k - 1)) where k is the number of
predictors.
Std. Error of the Estimate - This is also referred to as the root
mean squared error. It is the standard deviation of the error
term and the square root of the Mean Square for the Residuals
in the ANOVA table.
61
Model - SPSS allows to specify multiple models in a
single regression command. This indicates the number of the
model being reported.
Regression, Residual, Total - Looking at the breakdown of
variance in the outcome variable, these are the categories to be
examined: Regression, Residual, and Total. The Total variance
is partitioned into the variance which can be explained by the
independent variables (Model) and the variance which is not
explained by the independent variables (Error).
Sum of Squares - These are the Sum of Squares associated
with the three sources of variance, Total, Model and Residual.
The Total variance is partitioned into the variance which can be
explained by the independent variables (Regression) and the
variance which is not explained by the independent variables
(Residual).
df - These are the degrees of freedom associated with the
sources of variance. The total variance has N-1 degrees of
freedom. The Regression degrees of freedom corresponds to
62
the number of coefficients estimated minus 1. Including the
intercept, there are 11 coefficients, so the model has 11-1=10
degrees of freedom. The Error degrees of freedom is the DF
total minus the DF model, 151 - 10 =141.
Mean Square - These are the Mean Squares, the Sum of
Squares divided by their respective DF.
F and Sig. - This is the F-statistic the p-value associated with
it. The F-statistic is the Mean Square (Regression) divided by
the Mean Square (Residual): 11.654/1.072 = 10.86. The p-value
is compared to some alpha level in testing the null hypothesis
that all of the model coefficients are 0.
63
B - These are the values for the regression equation for
predicting the dependent variable from the independent
variable. The regression equation is presented in many different
ways, for example:
Y predicted = b0 + b1*x1 + b2*x2 + b3*x3 + b4*x4……+ e
The column of estimates provides the values for b0, b1, b2, b3…
and b10 for this equation.
Distance to town- The coefficient for Distance to town is
.837. So for every unit increase in Distance to town, a 0.837
thousand rupees increase in land perch price is predicted,
holding all other variables constant ( This positive impact will
be debatable).However it is possible to consider all other
coefficient values of independent variables.
Std. Error - These are the standard errors associated with the
coefficients.
Beta - These are the standardized coefficients. These are the
coefficients that the researcher would obtain if the researcher
standardized all of the variables in the regression, including the
dependent and all of the independent variables, and ran the
regression. By standardizing the variables before running the
regression, researcher has to put all of the variables on the same
scale, and he can compare the magnitude of the coefficients to
see which one has more of an effect. The researcher will also
notice that the larger betas are associated with the larger t-
values and lower p-values.
64
t and Sig. - These are the t-statistics and their associated 2-
tailed p-values used in testing whether a given coefficient is
significantly different from zero. Using an alpha of 0.05: The
coefficient for Distance to town (0.837) is significantly
different from 0 because its p-value is 0.017, which is smaller
than 0.05.
95% Confidence Limit for B Lower Bound and Upper
Bound - These are the 95% confidence intervals for the
coefficients. The confidence intervals are related to the p-
values such that the coefficient will not be statistically
significant if the confidence interval includes 0. These
confidence intervals can help the researcher to put the estimate
from the coefficient into perspective by seeing how much the
value could vary.
Discriminant Analysis
65
Example. On average, people in rural areas think about the
possibility for more social interaction/ neighbourhood
relationships than people in the urban, and a greater proportion
of the people in the rural areas are farmers. A real estate
researcher wants to combine this information into a function to
determine how well an individual can discriminate between
the two groups of areas. The researcher thinks that population
size and economic information, social factors may also be
important.
66
percentage of variance, canonical correlation, Wilks' lambda,
chi-square. For each step: prior probabilities, Fisher's function
coefficients, unstandardized function coefficients, Wilks'
lambda for each canonical function.
Logistic Regression
A Mortgage bank needs to know the loan repayment capacity of
its mortgage loan holding customers. Accordingly, data were
collected on 250 customers.
67
Loan repayment capacity is a dichotomous variable which has
Yes or No answer. Age, Income, Education, Employment and
other loans are the variables in the model.
This indicates about the cases that were included and excluded
from the analysis, the coding of the dependent variable, and
coding of any categorical variables listed on
the categorical subcommand.
N - This is the number of cases in each category (e.g., included
in the analysis, missing, total).
Percent - This is the percent of cases in each category (e.g.,
included in the analysis, missing, total).
Included in Analysis - This row gives the number and percent
of cases that were included in the analysis. This has no missing
data in the data set, this also corresponds to the total number of
cases.
Missing Cases - This row give the number and percent of
missing cases. By default, SPSS logistic regression does a list
wise deletion of missing data. This means that if there is
68
missing value for any variable in the model, the entire case will
be excluded from the analysis.
Total - This is the sum of the cases that were included in the
analysis and the missing cases. In this example, 250 + 0 = 250
69
Step 0 - SPSS allows you to have different steps in your logistic
regression model. The difference between the steps is the
predictors that are included. This is similar to blocking
variables into groups and then entering them into the equation
one group at a time. By default,
SPSS logistic regression is run in two steps. The first step,
called Step 0, includes no predictors and just the intercept.
Often, this model is not interesting to researchers.
Observed - This indicates the number of 0's and 1's that are
observed in the dependent variable.
Predicted - In this null model, SPSS has predicted that all cases
are 0 on the dependent variable.
Overall Percentage - This gives the percent of cases for which
the dependent variables was correctly predicted given the
model. In this part of the output, this is the null model. 76.4 =
191/250.
B - This is the coefficient for the constant (also called the
"intercept") in the null model.
S.E. - This is the standard error around the coefficient for the
constant.
70
Wald and Sig. - This is the Wald chi-square test that tests the
null hypothesis that the constant equals 0. This hypothesis is
rejected because the p-value (listed in the column called "Sig.")
is smaller than the critical p-value of .05 (or .01). Hence, we
conclude that the constant is not 0. Usually, this finding is not
of interest to researchers.
df - This is the degrees of freedom for the Wald chi-square test.
There is only one degree of freedom because there is only one
predictor in the model, namely the constant.
Exp(B) - This is the exponentiation of the B coefficient, which
is an odds ratio. This value is given by default because odds
ratios can be easier to interpret than the coefficient, which is in
log-odds units. This is the odds: 59/191 = 0.3
Score and Sig. - This is a Score test that is used to predict
whether or not an independent variable would be significant in
the model. Looking at the p-values (located in the column
labelled "Sig.").
df - This column lists the degrees of freedom for each variable.
Each variable to be entered into the model, e.g., age, education,
income etc. has one degree of freedom, which leads to the total
of four shown at the bottom of the column.
Overall Statistics - This shows the result of including all of the
predictors into the model.
Block 1: Method = Enter
The section contains what is frequently the most interesting part
of the output: the overall test of the model (in the "Omnibus
71
Tests of Model Coefficients" table) and the coefficients and
odds ratios (in the "Variables in the Equation" table).
72
Step 1 - This is the first step (or model) with predictors in it. In
this case, it is the full model that we specified in the logistic
regression command. You can have more steps if you do
stepwise or use blocking of variables.
Chi-square and Sig. - This is the chi-square statistic and its
significance level. In this example, the statistics for the Step,
Model and Block are the same because we have not used
stepwise logistic regression or blocking. The value given in the
Sig. column is the probability of obtaining the chi-square
statistic given that the null hypothesis is true. In other words,
this is the probability of obtaining this chi-square statistic
(37.635) if there is in fact no effect of the independent
variables, taken together, on the dependent variable. This is, of
course, the p-value, which is compared to a critical value,
perhaps .05 or .01 to determine if the overall model is
statistically significant. In this case, the model is statistically
significant because the p-value is less than .000.
df - This is the number of degrees of freedom for the model.
There is one degree of freedom for each predictor in the model.
In this example, we have five predictors: age, income,
education, employment and other loans
73
-2 Log likelihood - This is the -2 log likelihood for the final
model. By itself, this number is not very informative.
However, it can be used to compare nested (reduced) models.
Cox & Snell R Square and Nagelkerke R Square - These are
pseudo R-squares. Logistic regression does not have an
equivalent to the R-squared that is found in OLS regression;
however, many people have tried to come up with one. There
are a wide variety of pseudo-R-square statistics (these are only
two of them). Because this statistic does not mean what R-
squared means in OLS regression (the proportion of variance
explained by the predictors).
Observed - This indicates the number of 0's and 1's that are
observed in the dependent variable.
Predicted - These are the predicted values of the dependent
variable based on the full logistic regression model. This table
shows how many cases are correctly predicted (184 cases are
observed to be No and are correctly predicted to be No; 50
cases are observed to be Yes and are correctly predicted to be
Yes), and how many cases are not correctly predicted (7 cases
are observed to be No but are predicted to be Yes; 9 cases are
observed to be Yes but are predicted to be No).
Overall Percentage - This gives the overall percent of cases
that are correctly predicted by the model. As you can see, this
percentage has increased from 76.4 for the null model to 77.2
for the full model.
B - These are the values for the logistic regression equation for
predicting the dependent variable from the independent
74
variable. They are in log-odds units. Similar to OLS
regression, the prediction equation is
Log (p/1-p) = b0 + b1*x1 + b2*x2 + b3*x3 + b3*x3+b4*x4
Where p is the probability of being loan payable. Expressed in
terms of the variables used in this example, the logistic
regression equation is
Log (p/1-p) = 0.02 + -0.03*age + 0.24*income – 0.412
education - 0.257* employment + 0.154 other loans
75
Factor Analysis
76
• Three methods of computing factor scores are available,
and scores can be saved as variables for further analysis.
77
transformation matrix. For oblique rotations: rotated pattern
and structure matrices; factor score coefficient matrix and
factor covariance matrix. Plots: scree plot of eigenvalues and
loading plot of first two or three factors.
78
Mean - These are the means of the variables used in the factor
analysis.
Std. Deviation - These are the standard deviations of the
variables used in the factor analysis.
Analysis N - This is the number of cases used in the factor
analysis.
79
Kaiser-Meyer-Olkin Measure of Sampling Adequacy - This
measure varies between 0 and 1, and values closer to 1 are
better. A value of .6 is a suggested minimum.
Bartlett's Test of Sphericity - This tests the null hypothesis
that the correlation matrix is an identity matrix. An identity
matrix is matrix in which all of the diagonal elements are 1 and
all off diagonal elements are 0. You want to reject this null
hypothesis.
Taken together, these tests provide a minimum standard which
should be passed before a factor analysis (or a principal
components analysis) should be conducted.
80
underlying latent continua). It is also noted as h2 and can be
defined as the sum of squared factor loadings for the variables.
Initial - With principal factor axis factoring, the initial values
on the diagonal of the correlation matrix are determined by the
squared multiple correlation of the variable with the other
variables.
Extraction - The values in this column indicate the proportion
of each variable's variance that can be explained by the retained
factors. Variables with high values are well represented in the
common factor space, while variables with low values are not
well represented. (In this example, we don't have any
particularly low values.) They are the reproduced variances
from the factors that you have extracted. You can find these
values on the diagonal of the reproduced correlation matrix.
81
correlation matrix, the variables are standardized, which means
that the each variable has a variance of 1, and the total variance
is equal to the number of variables used in the analysis, in this
case, 7.
Total - This column contains the eigenvalues. The first factor
will always account for the most variance (and hence have the
highest eigenvalue), and the next factor will account for as
much of the left over variance as it can, and so on. Hence, each
successive factor will account for less and less variance.
% of Variance - This column contains the percent of total
variance accounted for by each factor.
Cumulative % - This column contains the cumulative
percentage of variance accounted for by the current and all
preceding factors. For example, the second row shows a value
of 71.81. This means that the first two factors together account
for 71.81% of the total variance.
Extraction Sums of Squared Loadings - The number of rows
in this panel of the table correspond to the number of factors
retained. In this example, we requested that two factors be
retained, so there are two rows, one for each retained factor.
The values in this panel of the table are calculated in the same
way as the values in the left panel, except that here the values
are based on the common variance. The values in this panel of
the table will always be lower than the values in the left panel
of the table, because they are based on the common variance,
which is always smaller than the total variance.
82
Rotation Sums of Squared Loadings - The values in this
panel of the table represent the distribution of the variance after
the varimax rotation. Varimax rotation tries to maximize the
variance of each of the factors, so the total amount of variance
accounted for is redistributed over the three extracted factors.
The scree plot graphs the eigenvalue against the factor number.
You can see these values in the first two columns of the table
immediately above. From the third factor on, you can see that
the line is almost flat, meaning the each successive factor is
83
accounting for smaller and smaller amounts of the total
variance.
84
Factor Transformation Matrix - This is the matrix by which
you multiply the unrotated factor matrix to get the rotated factor
matrix.
85
The plot above shows the items (variables) in the rotated factor
space. While this picture may be particularly helpful, when you
get this graph in the SPSS output, you can interactively rotate
it. This may help you to see how the items (variables) are
organized in the common factor space. Accordingly Shape is
slightly away from the other factors.
86
Chapter 04
Real Estate Analysis- Validity and Reliability
87
Each type of measure has specific types of issues that need to
be addressed to make the measurement meaningful, accurate,
and efficient.
88
It is important to bear in mind that validity and
reliability are not an all or none issue but a matter of
degree.
Content Validity:
When researcher wants to find out if the entire content
of the behaviour/construct/area is represented in the test we
compare the test task with the content of the behaviour. This is
a logical method, not an empirical one.
89
Face Validity:
Basically face validity refers to the degree to which a
test appears to measure what it purports to measure.
Researchers need to maintain/understand the link among
research problem, objective/s, hypothesis and statistical test/s
properly.
Concurrent Validity:
An example in Real Estate Studies, Concurrent validity
is the degree to which the records on a land sale are related to
the records on another, already established, sale occurred at the
same time, or to some other valid criterion available at the same
time. Logically, predictive and concurrent validation are the
same, the term concurrent validation is used to indicate that no
time elapsed between measures.
Construct Validity:
Construct validity is the degree to which a test measures
an intended hypothetical construct. Many times psychologists
assess/measure abstract attributes or constructs. The process of
90
validating the interpretations about that construct as indicated
by the test score is construct validation.
Reliability Analysis
91
are related to each other, you can get an overall index of the
repeatability or internal consistency of the scale as a whole,
and you can identify problem items that should be excluded
from the scale.
• Split-half. This model splits the scale into two parts and
examines the correlation between the parts.
92
Data. Data can be dichotomous, ordinal, or interval, but the
data should be coded numerically.
Test-retest Reliability:
Test-retest reliability is the degree to which
outcomes/results are consistent over time. It indicates
outcomes/results variation that occurs from
surveying/investigation to surveying/investigation as a result of
errors of measurement.
Split-Half Reliability:
Requires only one administration. Especially
appropriate when the test is very long. The most commonly
93
used method to split the test into two is using the odd-even
strategy. Since longer tests tend to be more reliable, and since
split-half reliability represents the reliability of a test only half
as long as the actual test, a correction formula must be applied
to the coefficient. Spearman-Brown prophecy formula.
Split-half reliability is a form of internal consistency
reliability.
94
Reliability is concerned with questions of stability and
consistency - does the same measurement tool yield stable and
consistent results when repeated over time.
95
Validity and Reliability Convergence
96
Bibliography
1. Ariyawansa R G (2009) Management of Real Estate:
Principles of Real Estate Development & Management
Published by the author, Colombo
2. Ariyawansa R G (2008) “Issues in property development:
Tenure & Ownership, Informality and Price Formation in
Property Markets in Developing Cities”, Published by the
author, Colombo
3. Ariyawansa R G (2008) “Property Market in Colombo:
Evolution and Success”, Published by the author, Colombo
4. Ariyawansa R G (2009) “Housing Market: A Review of
Purchase Decision of Potential Buyers”, published by the
author, Colombo
5. Berk, R., 1979. Generalizability of Behavioral
Observations: A Clarification of Interobserver Agreement
and Interobserver Reliability. American Journal of Mental
Deficiency, Vol. 83, No. 5, p. 460-472.
6. Cronbach, L., 1990. Essentials of psychological testing.
Harper & Row, New York.
7. Cadman, D & Austini, L (1981), Property Development. E
& F.N. Spon Ltd. U.S.A,
8. Chalrles F. Floyd and Marcus T Allen (1993), “Real Estate
Principles”, 4th Edition, Real Estate Education Company,
USA
9. Carmines, E., and Zeller, R., 1979. Reliability and Validity
Assessment. Sage Publications, Beverly Hills, California.
97
10. Fillmore W. Galaty, Wellington J. Allaway, Robert C. Kyle
( 2006) “Modern Real Estate Practice”, 17th Edition,
Dearborn Real Estate Education, USA
11. Gay, L., 1987. Eductional research: competencies for
analysis and application. Merrill Pub. Co., Columbus.
12. Guilford, J., 1954. Psychometric Methods. McGraw-Hill,
New York.
13. Introduction to SAS. UCLA: Statistical Consulting Group.
From http://www.ats.ucla.edu/stat/sas/notes/ (accessed
November 24, 2015).
14. John B. Corgel, David C. Ling, Halbert C. Smith (2001),
“Real Estate Perspectives: An Introduction to Real Estate”,
4th Edition, McGraw-Hill Higher Education, USA
15. Kendell, R; Jablensky, A (2003). "Distinguishing between
the validity and utility of psychiatric diagnoses". The
American Journal of Psychiatry 160 (1): 4–12
16. Kotler P (1999), “Marketing Management: The Millennium
Edition, Prentice Hall, New Delhi
17. Mike E Miles, Richard L Haney, Gayle Berens (1995),
“Real Estate Development: Principle and Process”, 2nd
Edition, urban land institute
18. Nunnally, J., 1978. Psychometric Theory. McGraw-Hill,
New York.
19. Stapleton T (1986), “Estate Management Practice”, 2nd
Edition, The eastern gazette ltd, London
20. Winer, B., Brown, D., and Michels, K., 1991. Statistical
Principles in Experimental Design, Third Edition.
McGraw-Hill, New York.
98
Glossary
99
Biased sample a sample for which some type of systematic
error has been made in the selection of subjects for the sample
100
the mean will be at least1 _ 1_k2, where k is a number greater
than 1
101
Coefficient of determination a measure of the variation of the
dependent variable that is explained by the Regression line and
the independent variable; the ratio of the explained variation to
the total variation
102
Confounding variable a variable that influences the Outcome
variable but cannot be separated from the other Variables that
influence the outcome variable
103
Correlation coefficient a statistic or parameter that measures
the strength and direction of a linear Relationship between two
variables
104
Degrees of freedom the number of values that are free to Vary
after a sample statistic has been computed; used when a
distribution (such as the t distribution) consists Of a family of
curves
105
Double sampling a sampling method in which a Very large
population is given a questionnaire to determine those who
meet the qualifications for a Study; the questionnaire is
reviewed, a second smaller Population is defined, and a sample
is selected from this group
106
Expected frequency the frequency obtained by calculation (as
if there were no preference) and used in the chi-square Test
107
Factors the independent variables in ANOVA tests
108
Hawthorne effect an effect on an outcome variable caused by
the fact that subjects of the study know that they are
participating in the study
109
Inferential statistics a branch of statistics that consists of
Generalizing from samples to populations, performing
Hypothesis testing, determining relationships among Variables,
and making predictions
Interquartile range Q3 _ Q1
110
Probability of an outcome will approach its theoretical
Probability
111
Maximum error of estimate the maximum likely Difference
between the point estimate of a parameter and the actual value
of the parameter
112
Multinomial distribution a probability distribution for an
Experiment in which each trial has more than two Outcomes
113
Nominal level of measurement a measurement level that
Classifies data into mutually exclusive (no overlapping)
Exhaustive categories in which no order or ranking can be
imposed on them. See also interval, ordinal, and ratio Levels of
measurement
114
Observational study a study in which the researcher merely
observes what is happening or what has happened in the past
and draws conclusions based on these observations
115
precise differences between the ranks do not exist. See also
interval, nominal, and ratio levels of Measurement
116
Percentile a location measure of a data value; it divides the
Distribution into 100 groups
117
Positive relationship a relationship between two variables
Such that as one variable increases, the other variable Increases
or as one variable decreases, the other Decreases
118
Qualitative variable a variable that can be placed into Distinct
categories, according to some characteristic or Attribute
Range the highest data value minus the lowest data value
119
Ratio level of measurement a measurement level that
possesses all the characteristics of interval measurement and a
true zero; it also has true ratios between different Units of
measure. See also interval, nominal, and ordinal Levels of
measurement
120
Right-tailed test a test used on a hypothesis when the Critical
region is on the right side of the distribution
121
Sequence sampling a sampling technique used in quality
Control in which successive units are taken from Production
lines and tested to see whether they meet the Standards set by
the manufacturing company
122
Standard error of the mean the standard deviation of the
Sample means for samples taken from the same Population
Stem and leaf plot a data plot that uses part of a data value as
the stem and part of the data value as the leaf to form Groups or
classes
123
homogeneous characteristics and then selecting Members from
each stratum
124
T test a statistical test for the mean of a population, used when
the population is normally distributed and the Population
standard deviation is unknown
Time series graph a graph that represents data that occur over
a specific time
125
Type I error the error that occurs if you reject the null
Hypothesis when it is true
Type II error the error that occurs if you do not reject the Null
hypothesis when it is false
126
Variance the average of the squares of the distance that each
value is from the mean
127
128