[mobile] Mobile Perf Recipe #1031
Conversation
Deploy preview for pytorch-tutorials-preview ready! Built with commit 7b182fe: https://deploy-preview-1031--pytorch-tutorials-preview.netlify.app
recipes_source/mobile_perf.rst (Outdated)

    ::

       from torch.utils.mobile_optimizer import optimize_for_mobile
       traced_model = torch.jit.load("input_model_path")
Total nitpick: not all TorchScript models are traced (see torch.jit.script() and method decoration). I'd name this variable torchscript_model or similar for clarity and accuracy.
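A minimal sketch of the suggested rename, assuming the same torch.jit.load / optimize_for_mobile flow as the snippet above ("input_model_path" is the recipe's placeholder):

    import torch
    from torch.utils.mobile_optimizer import optimize_for_mobile

    # The loaded module may have been produced by either tracing or scripting,
    # so a neutral name avoids implying one or the other.
    torchscript_model = torch.jit.load("input_model_path")
    optimized_model = optimize_for_mobile(torchscript_model)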
recipes_source/mobile_perf.rst (Outdated)

    from torch.utils.mobile_optimizer import optimize_for_mobile
    traced_model = torch.jit.load("input_model_path")
    optimized_model = optimize_for_mobile(traced_model)
There are valid TorchScript models (googlenet & inception_v3 in torchvision) that segfault on this line. Other than that, the instructions work and there's a modest (<=2.2%) improvement in file size for these models:
mobilenet_v2
resnet18
alexnet
squeezenet1_0
vgg16
densenet161
shufflenet_v2_x1_0
mnasnet1_0
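For reference, a rough sketch of how such a file-size comparison could be run; the model list is from above, but the measurement code itself is an assumption rather than what was actually run:

    import os
    import torch
    import torchvision
    from torch.utils.mobile_optimizer import optimize_for_mobile

    names = ["mobilenet_v2", "resnet18", "alexnet", "squeezenet1_0",
             "vgg16", "densenet161", "shufflenet_v2_x1_0", "mnasnet1_0"]

    for name in names:
        model = getattr(torchvision.models, name)(pretrained=True).eval()
        scripted = torch.jit.script(model)  # assumes these models script cleanly
        scripted.save(f"{name}.pt")
        optimize_for_mobile(scripted).save(f"{name}_opt.pt")
        before = os.path.getsize(f"{name}.pt")
        after = os.path.getsize(f"{name}_opt.pt")
        print(f"{name}: {100 * (before - after) / before:.1f}% smaller")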
recipes_source/mobile_perf.rst (Outdated)

    2. Fuse operators using ``torch.quantization.fuse_modules``

    Do not be confused that fuse_modules is in the quantization package.
    It works for all types of torch script modules.
Two things:
- PMM is going to come back and say that we should write it as "TorchScript".
- The code below does not pass a TorchScript module to fuse_modules(); the MobileNet v2 from TorchVision is not a subclass of ScriptModule. Passing in the original torchvision module itself, or a version of it processed by torch.jit.script(), works, with the latter giving a ~2% file size improvement.
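A minimal sketch of the eager-module path, using a toy ConvBnReLU model of my own rather than the recipe's MobileNet v2:

    import torch
    import torch.nn as nn

    class ConvBnReLU(nn.Module):
        def __init__(self):
            super().__init__()
            self.conv = nn.Conv2d(3, 16, 3)
            self.bn = nn.BatchNorm2d(16)
            self.relu = nn.ReLU()

        def forward(self, x):
            return self.relu(self.bn(self.conv(x)))

    m = ConvBnReLU().eval()  # eval() is required for conv+bn fusion
    # fuse_modules takes a plain nn.Module plus the names of the submodules to fuse
    fused = torch.quantization.fuse_modules(m, [["conv", "bn", "relu"]])
    torch.jit.script(fused).save("convbnrelu-fused.pt")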
recipes_source/mobile_perf.rst (Outdated)

    m = torchvision.models.mobilenet_v2(pretrained=True)
    m.eval()
    fuse_model(m)
    torch.jit.trace(m, torch.rand(1, 3, 224, 224)).save("mobilenetV2-bnfused.pt")
Should we be guiding people to torch.jit.trace()? torch.jit.script() preserves control flow, trace() does not. trace() is still there for cases where script() hits an unsupported op.
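A sketch of what the script()-based version of that save line could look like (the output file name here is my own):

    import torch
    import torchvision

    m = torchvision.models.mobilenet_v2(pretrained=True).eval()
    # script() preserves control flow; trace() records only the path taken
    # for the example input.
    torch.jit.script(m).save("mobilenetV2-scripted.pt")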
recipes_source/mobile_perf.rst
Outdated
model.eval() | ||
script_model = torch.jit.script(model) | ||
x = torch.rand(1, 3 , 224, 224) | ||
y = script_model(x) |
y is never used, which means we could do without x as well.
recipes_source/mobile_perf.rst (Outdated)

    supported_qengines = torch.backends.quantized.supported_engines
    print(supported_qengines)
    model = torchvision.models.quantization.__dict__['mobilenet_v2'](pretrained=True, quantize=True)
Two things:
- Is this the quantization workflow we want to show? This looks like it's just pulling down a pre-quantized version of the model, in which case this recipe is only useful if you want to quantize the torchvision models. In the general case, I'd think people would want to be able to quantize their own trained models for mobile deployment.
- On macOS, this line throws a warning:

      /Users/bradheintz/anaconda2/envs/pyto16pre/lib/python3.8/site-packages/torch/nn/quantized/modules/utils.py:8: UserWarning: 0quantize_tensor_per_tensor_affine current rounding mode is not set to round-to-nearest-ties-to-even (FE_TONEAREST). This will cause accuracy issues in quantized models. (Triggered internally at ../aten/src/ATen/native/quantized/affine_quantizer.cpp:25.)
        qweight = torch.quantize_per_tensor(
      /Users/bradheintz/anaconda2/envs/pyto16pre/lib/python3.8/site-packages/torch/quantization/observer.py:134: UserWarning: must run observer before calling calculate_qparams. Returning default scale and zero point
        warnings.warn(

  (And yes, it really looks like that in my terminal.) The warning came up in this env:

      # torch 1.6.0a0+55bcb5d built from master with USE_CUDA=0
      # torchvision 0.7.0a0+148bac2 built from master with USE_CUDA=0
      # python 3.8.0
      # MacOS 10.15.4
We should show a workflow where we start with a floating point model and then do the quantization. The steps are:

    # Start with a fully trained floating point model.
    # The model code is modified to enable eager mode quantization; for more details
    # please see the quantization tutorial at:
    # https://pytorch.org/tutorials/advanced/static_quantization_tutorial.html
    model = torchvision.models.quantization.__dict__['resnet18'](pretrained=True, quantize=False)
    torch.backends.quantized.engine = 'qnnpack'

    # Convert the float model with the appropriate qconfig
    model.eval()
    model.qconfig = torch.quantization.get_default_qconfig('qnnpack')
    model = torch.quantization.prepare(model)
    # Run the model with representative data for calibration
    # model(calibration_data)
    model = torch.quantization.convert(model)
    script_model = torch.jit.script(model)

    # Export to mobile
    script_model._save_for_lite_interpreter("model.bc")
recipes_source/mobile_perf.rst (Outdated)

    model = torchvision.models.quantization.__dict__['mobilenet_v2'](pretrained=True, quantize=True)
    torch.backends.quantized.engine='qnnpack'
    model.eval()
    script_model = torch.jit.script(model)
I don't actually know the answer to this: Is it preferable to do quantization before or after TorchScript conversion, or does it matter at all?
Currently, PyTorch only supports doing quantization prior to scripting. We are working on adding support for quantization after scripting, but it is not part of the 1.6 release yet.
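To make the supported ordering concrete, a small sketch (reusing the prequantized mobilenet_v2 from the snippet above): quantization happens while the model is still an eager nn.Module, and scripting comes last.

    import torch
    import torchvision

    # Quantize (or load an already-quantized) eager-mode model first...
    model = torchvision.models.quantization.mobilenet_v2(pretrained=True, quantize=True).eval()
    # ...then convert to TorchScript. The reverse order (quantizing an
    # already-scripted module) is not supported in 1.6.
    script_model = torch.jit.script(model)
    script_model.save("mobilenet_v2_quantized_scripted.pt")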
This looks great; I'm still tweaking my Android custom build env for the last couple of recipes.
recipes_source/mobile_perf.rst

    import torch
    from torch.utils.mobile_optimizer import optimize_for_mobile

    class AnnotatedConvBnReLUModel(torch.nn.Module):
Including a sample model here is a great idea - this will help users generalize to their own use case.
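For readers following along, a guess at the rough shape such a sample model might take (this is my sketch based on the class name, not the recipe's actual code):

    import torch

    class AnnotatedConvBnReLUModel(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.conv = torch.nn.Conv2d(3, 5, 3, bias=False)
            self.bn = torch.nn.BatchNorm2d(5)
            self.relu = torch.nn.ReLU(inplace=True)
            self.quant = torch.quantization.QuantStub()      # marks where tensors become quantized
            self.dequant = torch.quantization.DeQuantStub()  # ...and where they go back to float

        def forward(self, x):
            x = self.quant(x)
            x = self.relu(self.bn(self.conv(x)))
            return self.dequant(x)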
Added notes for one bug in the quantization step
recipes_source/mobile_perf.rst (Outdated)

    ::

       model.qconfig = torch.quantization.get_default_qconfig('qnnpack')
       torch.quantization.prepare(model)
prepare() has inplace=False by default, and the same goes for convert(), so this whole method is a no-op except for setting model.qconfig.

We either need to do:

    model = torch.quantization.prepare(model)
    # calibration
    return torch.quantization.convert(model)

or:

    torch.quantization.prepare(model, inplace=True)
    # calibration
    torch.quantization.convert(model, inplace=True)
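Either way, wrapped up as a helper it might look something like this (the function name and the calibration_data argument are placeholders of mine):

    import torch

    def quantize_model(model, calibration_data):
        model.eval()
        model.qconfig = torch.quantization.get_default_qconfig('qnnpack')
        model = torch.quantization.prepare(model)
        model(calibration_data)  # calibration pass with representative data
        return torch.quantization.convert(model)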