asked 73.4k views
3 votes
Why was the logistic activation function a key ingredient in training the first MLPs?

1 Answer

2 votes

Answer:

its derivative is always nonzero, so Gradient Descent can always roll down the slope.

Step-by-step explanation:

The logistic activation function was a key ingredient in training the first MLPs because its derivative is always nonzero, so Gradient Descent can always roll down the slope. When the activation function is a step function, Gradient Descent cannot move, as there is no slope at all.

answered
User Minhee
by
8.0k points

Related questions