Sat Study Guide Problem Solving Data Analysis
Problem Solving and Data Analysis also includes questions that assess
your understanding of essential concepts in statistics. You may be
asked to analyze univariate data presented in dot plots, histograms, box
plots, and frequency tables, or bivariate data presented in scatterplots,
line graphs, and two-way tables. This includes computing, comparing,
and interpreting measures of center, interpreting measures of spread,
describing overall patterns, and recognizing the effects of outliers on
measures of center. These questions may test your understanding of
the conceptual meaning of standard deviation (although you will not
be asked to calculate a standard deviation).
PART 3 | Math
Some questions will present you with a description of a study and ask
you to decide what conclusion is most appropriate based on the design
of the study. Some questions ask about using data from a sample to
draw conclusions about an entire population. These questions might
also assess conceptual understanding of the margin of error (although
you won’t be asked to calculate a margin of error) when a population
mean or proportion is estimated from sample data. Other questions
ask about making conclusions about cause-and-effect relationships
between two variables.
Problem Solving and Data Analysis questions include both multiple-choice
questions and student-produced response questions. The use of a calculator
is allowed for all questions in this domain.

REMEMBER
Problem Solving and Data Analysis questions comprise 17 of the
58 questions (29%) on the Math Test.

Problem Solving and Data Analysis is one of the three SAT Math Test
subscores, reported on a scale of 1 to 15.
Let’s explore the content and skills assessed by Problem Solving and
Data Analysis questions.
Example 1
On Thursday, 240 adults and children attended a show. The ratio of adults to
children was 5 to 1. How many children attended the show?
A) 40
B) 48
C) 192
D) 200
Because the ratio of adults to children was 5 to 1, there were 5 adults
for every 1 child. Thus, of every 6 people who attended the show,
5 were adults and 1 was a child. In fractions, 5/6 of the 240 who attended
were adults and 1/6 were children. Therefore, 1/6 × 240 = 40 children
attended the show, which is choice A.

PRACTICE AT satpractice.org
A ratio represents a relationship between quantities, not the actual
quantities themselves. Fractions are an especially effective way to
represent and work with ratios.

Ratios on the SAT may be expressed in the form 3 to 1, 3:1, 3/1, or
simply 3.
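
The part-to-whole reasoning used in Example 1 can be sketched in Python. This is only an illustration: the function name `split_by_ratio` is hypothetical and does not come from the study guide.

```python
from fractions import Fraction

def split_by_ratio(total, parts):
    """Split `total` into shares proportional to the ratio terms in `parts`."""
    whole = sum(parts)  # for a 5-to-1 ratio, each group has 5 + 1 = 6 people
    return [Fraction(p, whole) * total for p in parts]

adults, children = split_by_ratio(240, [5, 1])
print(adults, children)  # 200 40
```

Exact fractions are used here so that shares such as 1/6 of 240 are not affected by rounding.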
Chapter 17 | Problem Solving and Data Analysis
Example 2
Because 1 inch represents 3 feet, the actual dimensions of the room are
3 × 3.5 = 10.5 feet and 3 × 5 = 15 feet. Therefore, the floor area of the
room is 10.5 × 15 = 157.5 square feet, which is choice D.
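
The scale conversion in Example 2 could be checked with a short Python sketch. The function and constant names are hypothetical; the scale (1 inch represents 3 feet) and the drawing dimensions (3.5 inches by 5 inches) are taken from the solution above.

```python
FEET_PER_INCH = 3  # scale stated in the solution: 1 inch represents 3 feet

def actual_area_sq_ft(width_in, length_in, feet_per_inch=FEET_PER_INCH):
    """Convert each drawing dimension to feet first, then multiply."""
    return (width_in * feet_per_inch) * (length_in * feet_per_inch)

area = actual_area_sq_ft(3.5, 5)
print(area)  # 157.5
```

Because the scale applies to each dimension, the area grows by a factor of 3 × 3 = 9, not 3.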
Many of the questions on the SAT Math Test require you to pay
attention to units. Some questions in Problem Solving and Data
Analysis require you to convert units either between the English
system and the metric system or within those systems.
Example 3
Scientists estimate that the Pacific Plate, one of Earth’s tectonic plates, has
moved about 1,060 kilometers in the past 10.3 million years. What was the
average speed of the Pacific Plate during that time period, in centimeters per
year?
A) 1.03
B) 10.3
C) 103
D) 1,030
Since 1 kilometer = 1,000 meters and 1 meter = 100 centimeters, you get
(1,060 kilometers / 10,300,000 years) × (1,000 meters / 1 kilometer)
× (100 centimeters / 1 meter) ≈ 10.3 centimeters per year.
Therefore, the correct answer is choice B.

PRACTICE AT satpractice.org
Pay close attention to units, and convert units if required by
the question. Writing out the unit conversion as a series of
multiplication steps, as seen here, will help ensure accuracy.
Intermediate units should cancel (as do the kilometers and meters
in Example 3), leaving you with the desired unit (centimeters per year).

Questions may require you to move between unit rates and total
amounts.
Example 4
County Y consists of two districts. One district has an area of 30 square miles
and a population density of 370 people per square mile, and the other district
has an area of 50 square miles and a population density of 290 people per
square mile. What is the population density, in people per square mile, for all
of County Y?
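
One way to approach Example 4 is sketched below in Python, assuming that the density for the whole county means total population divided by total area. The function name `combined_density` is hypothetical.

```python
def combined_density(districts):
    """districts: list of (area_sq_mi, people_per_sq_mi) pairs."""
    total_people = sum(area * density for area, density in districts)
    total_area = sum(area for area, _ in districts)
    return total_people / total_area

density = combined_density([(30, 370), (50, 290)])
print(density)  # 320.0
```

The result differs from the plain average of the two densities, (370 + 290) / 2 = 330, because the districts have different areas.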
The sale price of the table was $299. This is equal to the cost from the
wholesaler plus 15%. Thus, $299 = 1.15(cost from the wholesaler), and
the cost from the wholesaler is $299 / 1.15 = $260. The usual price is
the cost from the wholesaler, $260, plus 75%. Therefore, the usual price
the store charges for the table is 1.75 × $260 = $455, which is choice B.
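
The two percentage steps in this solution can be retraced with a few lines of Python. This is a minimal sketch; the variable names are illustrative only.

```python
sale_price = 299.00
cost = sale_price / 1.15    # the sale price is the wholesale cost plus 15%
usual_price = 1.75 * cost   # the usual price is the wholesale cost plus 75%
print(round(cost, 2), round(usual_price, 2))  # 260.0 455.0
```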
Interpreting Relationships Presented in Scatterplots, Graphs, Tables, and Equations
The behavior of a variable and the relationship between two variables
in a real-world context may be explored by considering data presented
in tables and graphs.
Questions on the SAT Math Test assess your ability to understand and
analyze the relationships between two variables, the properties of the
functions used to model these relationships, and the conditions under
which a model is considered to be an appropriate representation of the
data. Problem Solving and Data Analysis questions focus on linear,
quadratic, and exponential relationships.
Example 6
[Figure not reproduced: scatterplot of weekly raspberry sales with a line of best fit.]
Because the line of best fit has equation y = 233 − 32x, where x is the
price, in dollars, for a pint of raspberries and y is the expected number
of pints of raspberries sold, the number of pints the store would be
predicted to sell in a week where the price of raspberries is $4.50 per pint
is 233 − 32(4.50) = 89 pints.
B. For how many of the 19 weeks shown was the number of pints of
raspberries sold greater than the number predicted by the line of best fit?
C. What is the best interpretation of the slope of the line of best fit in
this context?
D. What is the best interpretation of the y-intercept of the line of best fit
in this context?
The fact that the y-intercept indicates that 233 people would accept
free raspberries is one limitation of the model. Another limitation is
that for a price of $7.50 per pint or above, the model predicts that a
negative number of people would buy raspberries, which is impossible.
In general, you should be cautious about applying a model for values
outside of the given data. In this example, you should only be confident
in the prediction of sales for prices between $2 and $5.
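
The predictions discussed in Example 6 can be reproduced with a small Python sketch of the line of best fit, y = 233 − 32x. The function name `predicted_pints` is hypothetical.

```python
def predicted_pints(price):
    """Line of best fit from Example 6: y = 233 - 32x."""
    return 233 - 32 * price

print(predicted_pints(4.50))  # 89.0
print(predicted_pints(0))     # 233, the y-intercept
print(predicted_pints(7.50))  # -7.0, an impossible (negative) prediction
```

The last two calls use prices outside the $2 to $5 range of the data, which is where the model stops being trustworthy.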
Example 7
The SAT Math Test may have questions on simple and compound
interest, which are important examples of linear and exponential
growth, respectively.
Example 8
A bank has opened a new branch and, as part of a promotion, the bank
branch is offering $1,000 certificates of deposit at simple interest of 4% per
year. The bank is selling certificates with terms of 1, 2, 3, or 4 years. Which of
the following functions gives the total amount, A, in dollars, a customer will
receive when a certificate with a term of k years is finally paid?
A) A = 1,000(1.04k)
B) A = 1,000(1 + 0.04k)
C) A = 1,000(1.04)^k
D) A = 1,000(1 + 0.04^k)
The general formula for simple interest is A = P(1 + rt), where P is the
amount, in dollars, of the original deposit, called the principal; r is the
annual interest rate expressed as a decimal; and t is the time, in years,
the deposit is held. In Example 8, P = 1,000, r = 0.04, and t = k; so A,
in dollars, is given by A = 1,000[1 + (0.04)k], which is choice B.
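
The simple interest formula A = P(1 + rt) can be sketched in Python as follows. The function name is hypothetical; the principal, rate, and terms come from Example 8.

```python
def simple_interest_amount(principal, rate, years):
    """A = P(1 + rt): interest is earned on the principal only."""
    return principal * (1 + rate * years)

for k in (1, 2, 3, 4):
    print(k, round(simple_interest_amount(1000, 0.04, k), 2))
```

Each additional year adds the same $40, which is what makes simple interest an example of linear growth.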
Example 9
A bank has opened a new branch and, as part of a promotion, the bank branch
is offering $1,000 certificates of deposit at an interest rate of 4% per year,
compounded semiannually. The bank is selling certificates with terms of 1, 2, 3,
or 4 years. Which of the following functions gives the total amount, A, in dollars,
a customer will receive when a certificate with a term of k years is finally paid?
A) A = 1,000(1 + 0.04k)
B) A = 1,000(1 + 0.08k)
C) A = 1,000(1.04)^k
D) A = 1,000(1.02)^(2k)
Because the 4% annual rate is compounded semiannually, interest of 2% is
added every 6 months. When the certificate is paid after k years, the value
of the certificate will have been multiplied by the factor (1.02) a total of
2k times. Therefore, the total amount, A, in dollars, a customer will receive
when a certificate with a term of k years is finally paid is
A = 1,000(1.02)^(2k). Choice D is the correct answer.
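
The compounding in Example 9 can be sketched in Python. The function name and its parameters are hypothetical; with 2 periods per year and a 4% annual rate, the factor per period is 1.02.

```python
def compound_amount(principal, annual_rate, periods_per_year, years):
    """Each period multiplies the balance by (1 + annual_rate / periods_per_year)."""
    factor = 1 + annual_rate / periods_per_year
    return principal * factor ** (periods_per_year * years)

for k in (1, 2, 3, 4):
    print(k, round(compound_amount(1000, 0.04, 2, k), 2))
```

Unlike simple interest, the dollar increase grows from one period to the next, which is the mark of exponential growth.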
(or if a question says that the population of a city is decreasing by 3% per
year, it means that (population of the city) / (population of the city a year
ago) = 0.97). Then, if the question asks by what percentage the height of the
plant will increase in 2 months, you can write
Therefore, the answer is that the height of the plant increases by 21% in
2 months.
An SAT Math Test question may ask you to interpret a graph that
shows the relationship between two variables.
Example 10
[Graph not reproduced: Speed (miles per hour), from 0 to 9, plotted against Time (minutes), from 0 to 60.]
Each evening, Maria walks, jogs, and runs for a total of 60 minutes. The graph
above shows Maria’s speed during the 60 minutes. Which segment of the graph
represents the times when Maria’s speed is the greatest?
A) The segment from (17, 6) to (19, 8)
B) The segment from (19, 8) to (34, 8)
C) The segment from (34, 8) to (35, 6)
D) The segment from (35, 6) to (54, 6)
Example 11
A store is deciding whether to install a new security system to prevent
shoplifting. Based on store records, the security manager of the store estimates
that 10,000 customers enter the store each week, 24 of whom will attempt
to shoplift. Based on data provided from other users of the security system,
the manager estimates the results of the new security system in detecting
shoplifters would be as shown in the table below.
                                        Alarm     Alarm does
                                        sounds    not sound     Total
Customer attempts to shoplift               21            3        24
Customer does not attempt to shoplift       35        9,941     9,976
Total                                       56        9,944    10,000
According to the manager’s estimates, if the alarm sounds for a customer, what
is the probability that the customer did not attempt to shoplift?
A) 0.03%
B) 0.35%
C) 0.56%
D) 62.5%
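
The conditional probability asked for in Example 11 could be computed as in the Python sketch below. It assumes that the first column of counts in the table (21, 35, 56) records the customers for whom the alarm sounds; the variable names are illustrative.

```python
# Counts from the table in Example 11 (first column assumed to be "alarm sounds").
alarm_and_shoplift = 21
alarm_and_no_shoplift = 35
alarm_total = alarm_and_shoplift + alarm_and_no_shoplift  # 56

# Restrict attention to the customers for whom the alarm sounds.
p_no_shoplift_given_alarm = alarm_and_no_shoplift / alarm_total
print(f"{p_no_shoplift_given_alarm:.1%}")  # 62.5%
```

The denominator is 56, the number of alarms, rather than 10,000, because the question is conditioned on the alarm sounding.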
REMEMBER
Mean and median are measures of center for a data set, while
range and standard deviation are measures of spread.

You may be asked to answer questions that involve a measure of center
for a data set: the mean or the median. A question may ask you to draw
conclusions about one or more of these measures of center even if the
exact values cannot be calculated. To recall briefly:
The mean of a set of numerical values is the sum of all the values
divided by the number of values in the set.
Example 12
[Histogram not reproduced: Number of workers, from 0 to 18, plotted against Time worked (in hours), from 10 to 70.]
REMEMBER
The distribution of a variable provides the possible values of the
variable and how often they occur.

The histogram above summarizes the distribution of time worked last week,
in hours, by the 40 employees of a landscaping company. In the histogram,
the first bar represents all workers who worked at least 10 hours but less than
20 hours; the second represents all workers who worked at least 20 hours but
less than 30 hours; and so on. Which of the following could be the median and
mean amount of time worked, in hours, for the 40 employees?
A) Median = 22, Mean = 23
B) Median = 24, Mean = 22
C) Median = 26, Mean = 32
D) Median = 32, Mean = 30
(Note: On the SAT, all histograms have the same type of boundary condition. That is,
the values represented by a bar include the left endpoint but do not include the right
endpoint.)
Now let’s find the possible values of the mean. Each of the 6 employees
represented by the first bar worked at least 10 hours but less than
20 hours. Thus, the total number of hours worked by these 6 employees
is at least 60. Similarly, the total number of hours worked by the
17 employees represented by the second bar is at least 340; the total
number of hours worked by the 9 employees represented by the third
bar is at least 270; the total number of hours worked by the 5 employees
represented by the fourth bar is at least 200; the total number of hours
worked by the 1 employee represented by the fifth bar is at least 50;
and the total number of hours worked by the 2 employees represented
by the sixth bar is at least 120. Adding all these hours shows that
the total number of hours worked by all 40 employees is at least
60 + 340 + 270 + 200 + 50 + 120 = 1,040. The mean number of hours
worked by all 40 employees is therefore at least 1,040 / 40 = 26, so only
the values of the mean given in choices C and D are possible. Because
only choice C has possible values for both the median and the mean, it
is the correct answer.
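
The bounding argument for Example 12 can be sketched in Python using the bar counts quoted in the solution (6, 17, 9, 5, 1, and 2 workers). The upper bound on the mean and the helper `bar_containing` are additions for illustration and are not shown in the text above; they rely on each bar including its left endpoint but not its right endpoint.

```python
# (left endpoint in hours, number of workers) for each bar, as quoted in the solution
bars = [(10, 6), (20, 17), (30, 9), (40, 5), (50, 1), (60, 2)]
BAR_WIDTH = 10
n = sum(count for _, count in bars)  # 40 employees

# Every worker in a bar worked at least the left endpoint and less than the right one.
mean_lower = sum(left * count for left, count in bars) / n
mean_upper = sum((left + BAR_WIDTH) * count for left, count in bars) / n

def bar_containing(position):
    """Return the [left, right) interval holding the value at 1-based rank `position`."""
    seen = 0
    for left, count in bars:
        seen += count
        if position <= seen:
            return (left, left + BAR_WIDTH)

print(mean_lower, mean_upper)                  # 26.0 36.0
print(bar_containing(20), bar_containing(21))  # (20, 30) (20, 30)
```

With 40 values, the median is the average of the 20th and 21st values in order, and both fall in the second bar, so the median is at least 20 and less than 30.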
A data set may have a few values that are much larger or smaller than
the rest of the values in the set. These values are called outliers. An
outlier may represent an important piece of data. For example, if a
data set consists of rates of a certain illness in various cities, a data
point with a very high value could indicate a serious health issue to be
investigated.
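
The effect of an outlier on the two measures of center can be illustrated with Python's standard `statistics` module. The data values below are hypothetical and chosen only to make the contrast visible.

```python
from statistics import mean, median

values = [2, 3, 3, 4, 5]      # hypothetical data, not from the guide
with_outlier = values + [40]  # same data plus one unusually large value

print(mean(values), median(values))              # 3.4 3
print(mean(with_outlier), median(with_outlier))  # 9.5 3.5
```

In this sketch a single large value pulls the mean from 3.4 to 9.5, while the median moves only from 3 to 3.5.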
The mean and the median are different ways to describe the center
of a data set. Another key characteristic of a data set is the amount
of variation, or spread, in the data. One measure of spread is the
range, which is equal to the maximum value minus the minimum
value. Another measure of spread is the standard deviation, which is a
measure of how far away the points in the data set are from the mean
value. On the SAT Math Test, you will not be asked to compute the
standard deviation of a data set, but you do need to understand that a
larger standard deviation corresponds to a data set whose values are
more spread out from the mean value.

REMEMBER
You won’t be asked to calculate the standard deviation of a set of data
on the SAT Math Test, but you will be expected to demonstrate an
understanding of what standard deviation measures.
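
The two measures of spread can be compared on a pair of small data sets with Python's `statistics` module. The data and the helper `value_range` are hypothetical; `pstdev` computes the population standard deviation, which is one common convention.

```python
from statistics import pstdev

clustered = [3, 3, 4, 4, 3, 4]   # hypothetical scores close to their mean of 3.5
spread_out = [0, 1, 2, 5, 6, 7]  # hypothetical scores far from their mean of 3.5

def value_range(data):
    """Range = maximum value minus minimum value."""
    return max(data) - min(data)

print(value_range(clustered), value_range(spread_out))  # 1 7
print(pstdev(clustered) < pstdev(spread_out))           # True
```

Both data sets have the same mean, so the difference in standard deviation comes entirely from how far the values sit from that mean.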
Example 13
[Dot plots not reproduced: Quiz score, from 0 to 5, for Class A and for Class B.]
The dot plots above show the distributions of scores on a current events quiz
for two classes of 24 students. Which of the following correctly compares the
standard deviation of the scores in each of the classes?
A) The standard deviation of quiz scores in Class A is smaller.
B) The standard deviation of quiz scores in Class B is smaller.
C) The standard deviations of quiz scores in Class A and Class B are the same.
D) The relationship cannot be determined from the information given.
In Class A, the mean score is between 3 and 4. The large majority of
scores are 3 and 4, with only a few scores of 0, 1, 2, and 5. In Class B,
the mean score is 2.5, and scores are evenly spread across all possible
scores, with many scores not close to the mean score. Because the
scores in Class A are more closely clustered around the mean, the
standard deviation of the scores in Class A is smaller. The correct
answer is choice A.

PRACTICE AT satpractice.org
When asked to compare the standard deviations of two
data sets, first locate the mean approximately. Then, ask yourself
which data set has values that are more closely clustered around the
mean. That data set will have the smaller standard deviation.

A population parameter is a numerical value that describes a
characteristic of a population. For example, the percentage of
registered voters who would vote for a certain candidate is a parameter
describing the population of registered voters in an election. In another
example, the average income of a household in a city is a parameter
describing the population of households in that city. An essential
purpose of statistics is to estimate a population parameter based on
a sample from the population. A common example is election polling,
where researchers will interview a random sample of registered voters
to estimate the proportion of all registered voters who plan to vote
for a certain candidate. The precision of the estimate depends on the
variability of the sample data and the sample size. For instance, if
household incomes in a city vary widely or the sample is small, the
estimate that comes from a sample may differ considerably from the
actual value for the population parameter.
For example, a researcher wants to estimate the mean number of
hours each week that the 1,200 students at a high school spend
on the Internet. Interviewing all 1,200 students would be time
consuming, and it would be more efficient to survey a random
If the example above were an SAT question, you might be given survey
results indicating that, for a random sample of 80 students, the estimated
mean was 14 hours with an associated margin of error of 1.2 hours.
An appropriate interpretation of these data is that it is plausible that the
population parameter, the mean number of hours for all 1,200 students in
the population, is greater than 12.8 hours but less than 15.2 hours.
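
The interval in this interpretation is the estimate minus and plus the margin of error, as the Python sketch below shows. The function name `plausible_interval` is hypothetical.

```python
def plausible_interval(estimate, margin_of_error):
    """Return (low, high) = estimate minus and plus the margin of error."""
    return estimate - margin_of_error, estimate + margin_of_error

low, high = plausible_interval(14, 1.2)
print(round(low, 1), round(high, 1))  # 12.8 15.2
```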
Example 14
• If the subjects in the sample of a study were selected at random
  from the entire population in question, the results can be
  generalized to the entire population because random sampling
  ensures that each individual has the same chance to be selected for
  the sample.
• If the subjects in the sample were randomly assigned to treatments,
  it may be appropriate to make conclusions about cause and effect
  because the treatment groups will be roughly equivalent at the
  beginning of the experiment other than the treatment they receive.

PRACTICE AT satpractice.org
In order for results of a study to be generalized to the entire
population, and for a cause-and-effect relationship to be established,
both random sampling and random assignment of individuals to
treatments are needed.
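
The difference between random sampling and random assignment can be sketched with Python's `random` module. The ID numbers, the sample size of 80, and the two equal groups are hypothetical choices made only for illustration.

```python
import random

rng = random.Random(0)             # fixed seed only so the sketch is repeatable
population = list(range(1, 1201))  # hypothetical ID numbers for 1,200 people

# Random sampling: every individual has the same chance of being selected.
sample = rng.sample(population, 80)

# Random assignment: shuffle the sample, then split it into two treatment groups.
shuffled = sample[:]
rng.shuffle(shuffled)
treatment, control = shuffled[:40], shuffled[40:]
print(len(sample), len(treatment), len(control))  # 80 40 40
```

Sampling decides who is in the study, which bears on generalizing to the population; assignment decides who receives which treatment, which bears on cause and effect.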
Example 15
A community center offers a Spanish course. This year, all students in the
course were offered additional audio lessons they could take at home. The
students who took these additional audio lessons did better in the course than
students who didn’t take the additional audio lessons. Based on these results,
which of the following is the most appropriate conclusion?
A) Taking additional audio lessons will cause an improvement for any student
who takes any foreign language course.
B) Taking additional audio lessons will cause an improvement for any student
who takes a Spanish course.
C) Taking additional audio lessons was the cause of the improvement for the
students at the community center who took the Spanish course.
D) No conclusion about cause and effect can be made regarding students at
the community center who took the additional audio lessons at home and
their performance in the Spanish course.

PRACTICE AT satpractice.org
Be wary of conclusions that claim a cause-and-effect relationship
or that generalize a conclusion to a broader population. Before
accepting a conclusion, assess whether or not the subjects were
selected at random from the broader population and whether
or not subjects were randomly assigned to treatments.