Fixed mask shape to suit an arbitrary number of feature dimensions #785

Open · wants to merge 3 commits into master

Conversation


@TobyPDE TobyPDE commented Dec 23, 2016

If you use the CustomRecurrentLayer to perform a recurrent convolution, masking does not work properly, because the mask variable assumes the input to have the shape (batch_size, seq_len, num_features). This PR fixes that bug and makes masking work with an arbitrary number of feature dimensions.
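For concreteness, a minimal sketch of the kind of setup this affects, following the recurrent-convolution pattern from Lasagne's CustomRecurrentLayer documentation and tests; the sizes below are made up for illustration:

import lasagne.layers as L

# illustrative sizes only
n_batch, seq_len, n_channels, width, height = 2, 5, 3, 8, 8
n_out_filters, filter_shape = 7, (3, 3)

# 5-D input: (batch_size, seq_len, channels, width, height)
l_in = L.InputLayer((n_batch, seq_len, n_channels, width, height))
# mask: (batch_size, seq_len)
l_mask = L.InputLayer((n_batch, seq_len))

# input-to-hidden and hidden-to-hidden are convolutions, so the hidden
# state has spatial feature dimensions instead of a flat feature axis
l_in_to_hid = L.Conv2DLayer(
    L.InputLayer((None, n_channels, width, height)),
    n_out_filters, filter_shape, pad='same')
l_hid_to_hid = L.Conv2DLayer(
    L.InputLayer((None, n_out_filters, width, height)),
    n_out_filters, filter_shape, pad='same')

# before this PR the mask was only expanded to (seq_len, batch_size, 1),
# which does not line up with a convolutional hidden state
l_rec = L.CustomRecurrentLayer(l_in, l_in_to_hid, l_hid_to_hid,
                               mask_input=l_mask)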

@f0k f0k (Member) left a comment

Thanks, good catch! There's still a problem with your implementation, though; see my comments. Once you're done, could you please squash your commits into one? We'd like to have a clean history to make it easier to track changes. (Let me know if you need help with squashing.) Thank you!

# (seq_len, batch_size, 1, ..., 1)
#                       |---N---|
num_feature_dims = input.ndim - 2
mask = mask.dimshuffle((1, 0) + ('x',) * num_feature_dims)
@f0k f0k (Member) commented:
Good catch! To make the implementation simpler, though, could you instead do:

if mask is not None:
    # expand mask from (seq_len, batch_size) to cover all feature dimensions
    mask = T.shape_padright(mask, input.ndim - 2)

This is easier to read and also doesn't require a long comment to explain!

Reading again, your implementation is actually wrong: the mask needs to match the dimensionality of the hidden states, not of the input. So instead of input.ndim - 2, use len(self.input_to_hidden.output_shape) - 2. Putting this together:

if mask is not None:
    # expand mask from (seq_len, batch_size) to cover all hidden dimensions
    hid_ndim = len(self.input_to_hidden.output_shape)
    mask = T.shape_padright(mask, hid_ndim - 2)

Could you update your PR accordingly? This would be great!
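For reference, a quick sketch of what T.shape_padright does to a (seq_len, batch_size) mask, assuming a working Theano install; the sizes here are arbitrary:

import numpy as np
import theano
import theano.tensor as T

mask = T.matrix('mask')             # (seq_len, batch_size)
padded = T.shape_padright(mask, 3)  # appends 3 broadcastable axes

f = theano.function([mask], padded)
out = f(np.ones((4, 2), dtype=theano.config.floatX))
print(out.shape)                    # (4, 2, 1, 1, 1)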

    n_out_filters, filter_shape, pad='same')
l_hid_to_hid = lasagne.layers.Conv2DLayer(
    lasagne.layers.InputLayer((None, n_out_filters, width, height)),
    n_out_filters, filter_shape, pad='same')
@f0k f0k (Member) commented:
Could you modify this test such that the hidden dimensionality is different from the input dimensionality? This should uncover that your current fix of the mask dimensionality is wrong. I'm not sure there is a sensible use case for this; maybe just reshape l_in_to_hid to 5 dimensions and then use a DenseLayer(..., num_leading_axes=-1) for hid_to_hid so the dimensionality stays at 5.
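A rough sketch of one way such a test setup could look, assuming the master-branch ReshapeLayer and DenseLayer (with num_leading_axes); the shapes are invented for illustration and this is not the actual test code:

import lasagne.layers as L

n_batch, seq_len, n_channels, width, height = 2, 5, 3, 8, 8
n_out_filters, filter_shape = 7, (3, 3)

l_in = L.InputLayer((n_batch, seq_len, n_channels, width, height))
l_mask = L.InputLayer((n_batch, seq_len))

# input-to-hidden: convolve, then reshape so the hidden state is 5-D
# (batch, filters, width, height/2, 2) while the per-step input is 4-D
l_in_to_hid = L.Conv2DLayer(
    L.InputLayer((None, n_channels, width, height)),
    n_out_filters, filter_shape, pad='same')
l_in_to_hid = L.ReshapeLayer(
    l_in_to_hid, (-1, n_out_filters, width, height // 2, 2))

# hidden-to-hidden: a DenseLayer over the last axis only, so the 5-D
# hidden shape is preserved from one step to the next
l_hid_to_hid = L.DenseLayer(
    L.InputLayer((None, n_out_filters, width, height // 2, 2)),
    num_units=2, num_leading_axes=-1)

l_rec = L.CustomRecurrentLayer(l_in, l_in_to_hid, l_hid_to_hid,
                               mask_input=l_mask)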

@f0k f0k added this to the v0.2 milestone Feb 21, 2018