[MRG] added leaky_relu activation and derivative to multilayer_perceptron #10665
Conversation
Apart from this needing tests, I see two problems. One is a question of how mature the technology is and whether we should be maintaining various recent inventions in this space. The other is that the user can't set alpha here. Should we support custom (activation, gradient) instead?
I'm -1 on adding this. We don't even have dropout now, right? I feel recent advances are better suited for Keras, and we don't really want or need to reimplement Keras here.
So essentially this adds an activation function and a derivative to the multilayer perceptron module?
There's no way to change the alpha, though? And the question is a bit what "simple" means. Is there evidence that this really helps in practice, in particular for dense networks?
Yes, that is a significant limitation of this approach.
Haven't searched the literature, but this is the third PR or issue about it, so there is certainly interest. However, following the dev meeting today, it seems there was a consensus on not adding new DL features. Another approach, requiring a one-line change to support custom activation functions, can be found in #14815.
I think "delta[Z < 0] = alpha" should be replaced by " delta[Z < 0] *= alpha " in the computation of the derivative in order to be consistent with other derivative computations. For example, for the tanh, the function inplace derivative is computed as : delta *= (1 - Z ** 2). So, delta needs to be multiplied by the derivation function. |
Implementation of the leaky ReLU activation function and its derivative, to be used as an activation in multilayer_perceptron neurons.
This is proposed as a possible solution to the dying ReLU problem, a situation in which a ReLU unit outputs 0 for every input.
The leaky ReLU function allows a small, non-zero gradient when the unit is not active.
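For context, a minimal sketch of the forward activation under the same assumptions (the helper name `inplace_leaky_relu` and the default `alpha=0.01` are illustrative, not part of the merged API):

```python
import numpy as np

def inplace_leaky_relu(X, alpha=0.01):
    """Compute the leaky ReLU in place: x if x > 0, else alpha * x."""
    # Scale only the negative entries; positive entries pass through
    # unchanged, so inactive units still propagate a small, alpha-scaled
    # signal instead of a constant zero.
    X[X < 0] *= alpha

X = np.array([[-2.0, 0.0, 3.0]])
inplace_leaky_relu(X)
# X is now [[-0.02, 0.0, 3.0]]
```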