Keras : variable input sequence: padding vs shape None?

Question

I've found two possible solution of handling with variable-size input sequences for RNN in Keras. The solution one:

input = Input(shape=(None, num_classes))

then I can put any sequence size as an input for both training and validation.

The solution two:

input = Input(shape=(max_seq_length, num_classes))
...
pad_sequences(input_data, maxlen=max_seq_length, padding='post')

Which solution is recommended?

I consider benefits of these two. What I can see in the solution two is kind of validation of input size. The input cannot be larger than max_seq_size, moreover I can decide of type of padding (pre/post) and the same for timing of too large sequence.

What kind of padding and trimming is done using the solution one? Default parameters of pad_sequence?

I've benchmarked the time of training model for both solution and it's roughly the same time. I guess, that under the hood it's the same, like the max_seq_length is calculated from max length of training sequence, am I right?

Thank you for any clarification!

Daniel Möller · Accepted Answer · 2017-10-08 20:17:31Z

2

There is simply no padding or trimming in solution one. It takes the sequence as is and processes it. The model is totally independent of sequence length.

In solution two, the best to do is add a Masking layer. It will simply skip processing the padded values.

answered Oct 8, 2017 at 20:17

Daniel Möller

86.8k24 gold badges202 silver badges222 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

DINA TAKLIT Over a year ago

is it a good solution to use "0" padding with masking layer. Will not reduce the accuracy of the model ? What about use them for MLPs models ?

Collectives™ on Stack Overflow

Keras : variable input sequence: padding vs shape None?

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related