Basic Models in TensorFlow
CS 20: TensorFlow for Deep Learning Research
Lecture 3
1/19/2018
Agenda
Review
Linear regression on birth/life data
Control Flow
tf.data
Optimizers, gradients
Logistic regression on MNIST
Loss functions
Review
Computation graph
TensorFlow separates definition of computations from their execution
Phase 1: assemble a graph
Phase 2: use a session to execute operations in the graph.
TensorBoard
x = 2
y = 3
add_op = tf.add(x, y)
mul_op = tf.multiply(x, y)
useless = tf.multiply(x, add_op)
pow_op = tf.pow(add_op, mul_op)
with tf.Session() as sess:
    z = sess.run(pow_op)

[Graph on TensorBoard: x and y feed add_op and mul_op; useless takes x and add_op; pow_op takes add_op and mul_op]

Create a FileWriter object to write your graph to event files
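A minimal sketch, assuming import tensorflow as tf (TF 1.x):

writer = tf.summary.FileWriter('./graphs', tf.get_default_graph())
# ... run the session as above ...
writer.close()  # flush the events to disk when you're done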
tf.constant and tf.Variable
Constant values are stored in the graph definition
Sessions allocate memory to store variable values
tf.placeholder and feed_dict
Feed values into placeholders with a dictionary (feed_dict)
Easy to use but poor performance
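A minimal sketch of feeding a placeholder:

a = tf.placeholder(tf.float32, shape=[3])  # a placeholder for a vector of 3 elements
b = tf.constant([5, 5, 5], tf.float32)
c = a + b  # use the placeholder as you would any tensor
with tf.Session() as sess:
    print(sess.run(c, feed_dict={a: [1, 2, 3]}))  # >> [6. 7. 8.]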
Avoid lazy loading
1. Separate the assembly of the graph from the execution of ops
2. Use a Python @property to ensure the ops a function creates are added to the graph only the first time it's called (see the sketch below)
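One common pattern (a sketch, not from the starter code): cache the op in an attribute so the graph node is built only once, no matter how many times the property is accessed.

class Model:
    def __init__(self, x):
        self.x = x
        self._op = None

    @property
    def op(self):
        if self._op is None:              # build the node only on first access
            self._op = tf.square(self.x)
        return self._op                   # later accesses reuse the same node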
Download from the class’s GitHub
examples/03_linreg_starter.py
examples/03_logreg_starter.py
examples/utils.py
data/birth_life_2010.txt
Linear Regression
in TensorFlow
Model the linear relationship between:
● dependent variable Y
● explanatory variables X
Visualization by Google, based on data from the World Bank's World Development Indicators dataset
X: birth rate
Y: life expectancy
190 countries
Want
Find a linear relationship between X and Y
to predict Y from X
Model
Inference: Y_predicted = w * X + b
Mean squared error: E[(y - y_predicted)^2]
Interactive Coding
data/birth_life_2010.txt
Interactive Coding
examples/03_linreg_starter.py
Phase 1: Assemble our graph
Step 1: Read in data
I already did that for you
Step 2: Create placeholders for
inputs and labels
tf.placeholder(dtype, shape=None, name=None)
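For the birth/life data, scalar placeholders suffice (as in the starter code shown later):

X = tf.placeholder(tf.float32, name='X')  # birth rate
Y = tf.placeholder(tf.float32, name='Y')  # life expectancy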
Step 3: Create weight and bias
tf.get_variable(
    name,
    shape=None,
    dtype=None,
    initializer=None,
)

No need to specify shape if using a constant initializer.
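For example, a scalar weight and bias with constant initializers (no shape needed):

w = tf.get_variable('weights', initializer=tf.constant(0.0))
b = tf.get_variable('bias', initializer=tf.constant(0.0))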
Step 4: Inference
Y_predicted = w * X + b
Step 5: Specify loss function
loss = tf.square(Y - Y_predicted, name='loss')
Step 6: Create optimizer
opt = tf.train.GradientDescentOptimizer(learning_rate=0.001)
optimizer = opt.minimize(loss)
Phase 2: Train our model
Step 1: Initialize variables
Step 2: Run optimizer
(use a feed_dict to feed data into X and Y placeholders)
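Putting the two steps together, a sketch matching the placeholder code later in this lecture:

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())  # Step 1: initialize variables
    for i in range(100):                         # Step 2: run the optimizer for 100 epochs
        for x, y in data:
            sess.run(optimizer, feed_dict={X: x, Y: y})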
Write log files using a FileWriter
writer = tf.summary.FileWriter('./graphs/linear_reg', sess.graph)
See it on TensorBoard
Step 1: $ python3 03_linreg_starter.py
Step 2: $ tensorboard --logdir='./graphs'
TypeError?
TypeError: Fetch argument 841.0 has invalid type <class 'numpy.float32'>,
must be a string or Tensor.
(Can not convert a float32 into a Tensor or Operation.)
TypeError
for i in range(100):  # train the model for 100 epochs
    total_loss = 0
    for x, y in data:
        # BUG: rebinding `loss` to the fetched numpy value clobbers the loss tensor
        _, loss = sess.run([optimizer, loss], feed_dict={X: x, Y: y})  # can't fetch a numpy array
        total_loss += loss
TypeError
for i in range(100):  # train the model for 100 epochs
    total_loss = 0
    for x, y in data:
        # fetch into loss_, keeping the loss tensor intact
        _, loss_ = sess.run([optimizer, loss], feed_dict={X: x, Y: y})
        total_loss += loss_
Plot the results with matplotlib
Step 1: Uncomment the plotting code at the end of your program
Step 2: Run it again
If you run into matplotlib problems in a virtual environment, go to GitHub/setup and see the file "possible setup problems"
Huber loss
Robust to outliers
If the difference between the predicted value and the real value is small, square it
If it’s large, take its absolute value
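For reference, the standard form (δ is the threshold between the two regimes), matching the code below:

L_δ(y, y_predicted) = 0.5 * (y - y_predicted)^2          if |y - y_predicted| <= δ
L_δ(y, y_predicted) = δ * |y - y_predicted| - 0.5 * δ^2  otherwise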
Implementing Huber loss
Can’t write:
if y - y_predicted < delta:
You could write this if eager mode were enabled. Stay tuned for the next lecture!
Implementing Huber loss
tf.cond(pred, fn1, fn2, name=None)
def huber_loss(labels, predictions, delta=14.0):
    residual = tf.abs(labels - predictions)
    def f1(): return 0.5 * tf.square(residual)
    def f2(): return delta * residual - 0.5 * tf.square(delta)
    return tf.cond(residual < delta, f1, f2)
TF Control Flow
Control Flow Ops: tf.group, tf.count_up_to, tf.cond, tf.case, tf.while_loop, ...
Comparison Ops: tf.equal, tf.not_equal, tf.less, tf.greater, tf.where, ...
Logical Ops: tf.logical_and, tf.logical_not, tf.logical_or, tf.logical_xor
Debugging Ops: tf.is_finite, tf.is_inf, tf.is_nan, tf.Assert, tf.Print, ...
Since TF builds the graph before computation, we have to specify all possible subgraphs beforehand. PyTorch's dynamic graphs and TF's eager execution help overcome this.
tf.data
Placeholder
Pro: puts the data processing outside TensorFlow, making it easy to do in Python
Con: users often end up processing their data in a single thread, creating a data bottleneck that slows execution down
Placeholder
data, n_samples = utils.read_birth_life_data(DATA_FILE)
X = tf.placeholder(tf.float32, name='X')
Y = tf.placeholder(tf.float32, name='Y')
…
with tf.Session() as sess:
    …
    # Step 8: train the model
    for i in range(100):  # run 100 epochs
        for x, y in data:
            # Session runs optimizer to minimize loss
            sess.run(optimizer, feed_dict={X: x, Y: y})
tf.data
Instead of doing inference with placeholders and feeding in data
later, do inference directly with data
tf.data
tf.data.Dataset
tf.data.Iterator
Store data in tf.data.Dataset
● tf.data.Dataset.from_tensor_slices((features, labels))
● tf.data.Dataset.from_generator(gen, output_types, output_shapes)
Store data in tf.data.Dataset
tf.data.Dataset.from_tensor_slices((features, labels))
dataset = tf.data.Dataset.from_tensor_slices((data[:,0], data[:,1]))
print(dataset.output_types) # >> (tf.float32, tf.float32)
print(dataset.output_shapes) # >> (TensorShape([]), TensorShape([]))
Can also create Dataset from files
● tf.data.TextLineDataset(filenames)
● tf.data.FixedLengthRecordDataset(filenames)
● tf.data.TFRecordDataset(filenames)
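For example, a sketch reading the birth/life file line by line (assuming the file starts with a header row):

dataset = tf.data.TextLineDataset('data/birth_life_2010.txt')
dataset = dataset.skip(1)  # skip the header row; each element is one line as a tf.string scalar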
tf.data.Iterator
Create an iterator to iterate through samples in Dataset
tf.data.Iterator
● iterator = dataset.make_one_shot_iterator()
  Iterates through the dataset exactly once. No initialization needed.
● iterator = dataset.make_initializable_iterator()
  Iterates through the dataset as many times as we want. Needs to be initialized at the start of each epoch.
tf.data.Iterator
iterator = dataset.make_one_shot_iterator()
X, Y = iterator.get_next()  # X is the birth rate, Y is the life expectancy
with tf.Session() as sess:
    print(sess.run([X, Y]))  # >> [1.822, 74.82825]
    print(sess.run([X, Y]))  # >> [3.869, 70.81949]
    print(sess.run([X, Y]))  # >> [3.911, 72.15066]
tf.data.Iterator
iterator = dataset.make_initializable_iterator()
...
for i in range(100):
    sess.run(iterator.initializer)  # re-initialize the iterator at the start of each epoch
    total_loss = 0
    try:
        while True:
            sess.run([optimizer])
    except tf.errors.OutOfRangeError:
        pass  # the iterator is exhausted; move on to the next epoch
Handling data in TensorFlow
dataset = dataset.shuffle(1000)
dataset = dataset.repeat(100)
dataset = dataset.batch(128)
dataset = dataset.map(lambda x: tf.one_hot(x, 10))  # convert each element of the dataset to a one-hot vector
Does tf.data really perform better?
Does tf.data really perform better?
With placeholder: 9.05271519 seconds
With tf.data: 6.12285947 seconds
Should we always use tf.data?
● For prototyping, feed_dict can be faster and easier to write (Pythonic)
● tf.data is tricky to use when you have complicated preprocessing or multiple data sources
● NLP data is normally just a sequence of integers. In this case, transferring the data over to GPU is pretty quick, so the speedup of tf.data isn't that large
How does TensorFlow know what variables
to update?
Optimizers
Optimizer
optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.001).minimize(loss)
_, l = sess.run([optimizer, loss], feed_dict={X: x, Y:y})
Session looks at all trainable variables that loss depends on and updates them
Optimizer
Session looks at all trainable variables that optimizer depends on and updates them
Trainable variables
tf.Variable(initial_value=None, trainable=True,...)
Specify if a variable should be trained or not
By default, all variables are trainable
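For example, a step counter that the optimizer shouldn't touch (a common idiom, not from the slides):

global_step = tf.Variable(0, trainable=False, name='global_step')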
List of optimizers in TF
tf.train.GradientDescentOptimizer
tf.train.AdagradOptimizer
tf.train.MomentumOptimizer
tf.train.AdamOptimizer
tf.train.FtrlOptimizer
tf.train.RMSPropOptimizer
...

"Advanced" optimizers work better when tuned, but are generally harder to tune.
Discussion question
1. How do we know that our model is correct?
2. How can we improve our model?
Assignment 1
Out tomorrow
Due 1/31
Optional Interactive Grading
Logistic Regression
in TensorFlow
Then he separated the
light from the darkness
The first logistic regression model
MNIST Database
Each image is a 28x28 array, flattened out to be a 1-d tensor of size 784
MNIST
X: image of a handwritten digit
Y: the digit value
Recognize the digit in the image
Model
Inference: Y_predicted = softmax(X * w + b)
Cross entropy loss: -log(Y_predicted), the negative log probability the model assigns to the true class
Process data
from tensorflow.examples.tutorials.mnist import input_data
MNIST = input_data.read_data_sets('data/mnist', one_hot=True)
MNIST.train: 55,000 examples
MNIST.validation: 5,000 examples
MNIST.test: 10,000 examples
No immediate way to convert Python generators to tf.data.Dataset
Process data
mnist_folder = 'data/mnist'
utils.download_mnist(mnist_folder)
train, val, test = utils.read_mnist(mnist_folder, flatten=True)
Create datasets
train_data = tf.data.Dataset.from_tensor_slices(train)
train_data = train_data.shuffle(10000) # optional
test_data = tf.data.Dataset.from_tensor_slices(test)
Create iterator
iterator = train_data.make_initializable_iterator()
img, label = iterator.get_next()
…
Problem?
>> Can only do inference with train_data.
>> Need to build another subgraph with another iterator for test_data!!!
Create iterator
iterator = tf.data.Iterator.from_structure(train_data.output_types,
                                           train_data.output_shapes)
img, label = iterator.get_next()
train_init = iterator.make_initializer(train_data)  # initializer for train_data
test_init = iterator.make_initializer(test_data)    # initializer for test_data
Initialize iterator with the dataset you want
with tf.Session() as sess:
    ...
    for i in range(n_epochs):
        sess.run(train_init)  # use train_init during the training loop
        try:
            while True:
                _, l = sess.run([optimizer, loss])
        except tf.errors.OutOfRangeError:
            pass
    # test the model
    sess.run(test_init)  # use test_init during testing
    try:
        while True:
            sess.run(accuracy)
    except tf.errors.OutOfRangeError:
        pass
Phase 1: Assemble our graph
Step 1: Read in data
I already did that for you
Step 2: Create datasets and iterator
train_data = tf.data.Dataset.from_tensor_slices(train)
train_data = train_data.shuffle(10000) # optional
train_data = train_data.batch(batch_size)
test_data = tf.data.Dataset.from_tensor_slices(test)
test_data = test_data.batch(batch_size)
iterator = tf.data.Iterator.from_structure(train_data.output_types,
                                           train_data.output_shapes)
img, label = iterator.get_next()
train_init = iterator.make_initializer(train_data)
test_init = iterator.make_initializer(test_data)
Step 3: Create weights and biases
use tf.get_variable()
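A sketch following the model's shapes (784 pixels in, 10 classes out); the initializer choices are an assumption:

w = tf.get_variable('weights', shape=(784, 10), initializer=tf.random_normal_initializer(0, 0.01))
b = tf.get_variable('bias', shape=(1, 10), initializer=tf.zeros_initializer())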
Step 4: Build model to predict Y
logits = tf.matmul(img, w) + b
We don't apply softmax here; we'll do softmax together with the cross-entropy loss.
It's more efficient to compute gradients w.r.t. the logits than w.r.t. the softmax output.
Step 5: Specify loss function
entropy = tf.nn.softmax_cross_entropy_with_logits(labels=label, logits=logits)
loss = tf.reduce_mean(entropy)
Step 6: Create optimizer
optimizer = tf.train.AdamOptimizer(learning_rate=0.01).minimize(loss)
Phase 2: Train our model
Step 1: Initialize variables
Step 2: Run optimizer op
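The test loop earlier runs an accuracy op; a sketch of how it might be defined (assuming one-hot labels, counting correct predictions per batch):

preds = tf.nn.softmax(logits)
correct_preds = tf.equal(tf.argmax(preds, 1), tf.argmax(label, 1))
accuracy = tf.reduce_sum(tf.cast(correct_preds, tf.float32))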
TensorBoard it
Next class
Structure your model in TensorFlow
Example: word2vec
Eager execution
Feedback: huyenn@stanford.edu
Thanks!