Why was the logistic activation function a key ingredient in training the first MLPs?

Question

asked Aug 1, 2024 73.4k views

1 Answer

Minhee · Answer 1 · 2024-08-06T06:46:10+0000

Answer:

Its derivative is nonzero everywhere, so Gradient Descent always has a slope to roll down.

Step-by-step explanation:

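The logistic activation function, σ(z) = 1 / (1 + exp(-z)), was a key ingredient in training the first MLPs because its derivative, σ(z)(1 - σ(z)), is strictly positive for every z. Backpropagation multiplies the error signal by this derivative, so the activation never blocks the gradient and Gradient Descent always has a slope to roll down. A step function, by contrast, is flat on both sides of its jump: its derivative is 0 wherever it is defined, so the gradient reaching the weights is always 0 and Gradient Descent cannot move.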
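To make the contrast concrete, here is a minimal sketch in plain Python that trains a single weight with each activation. The names (train, logistic_grad, step_grad), the squared-error loss, and the starting values are all assumptions chosen for illustration; they are not taken from the answer above.

```python
import math

def logistic(z):
    """Logistic (sigmoid) activation."""
    return 1.0 / (1.0 + math.exp(-z))

def logistic_grad(z):
    """Derivative of the logistic function: s * (1 - s), always > 0."""
    s = logistic(z)
    return s * (1.0 - s)

def step(z):
    """Heaviside step activation."""
    return 1.0 if z >= 0 else 0.0

def step_grad(z):
    """The step function is flat on both sides of 0. Its derivative is
    undefined at 0; treating it as 0 there is an assumption of this sketch."""
    return 0.0

def train(activation, activation_grad, w=-1.0, x=1.0, target=1.0,
          lr=1.0, epochs=50):
    """Gradient Descent on one weight with loss (y - target)^2 (hypothetical setup)."""
    for _ in range(epochs):
        z = w * x
        y = activation(z)
        # Chain rule: dLoss/dw = 2 * (y - target) * activation'(z) * x
        grad_w = 2.0 * (y - target) * activation_grad(z) * x
        w -= lr * grad_w
    return w

w_logistic = train(logistic, logistic_grad)
w_step = train(step, step_grad)

print("logistic:", w_logistic)  # has moved away from the starting weight
print("step:    ", w_step)      # still at the starting weight
```

With the step function, the gradient with respect to the weight is 0 at every iteration, so the weight never leaves its starting value. With the logistic function, each update moves the weight toward a value that produces the target output.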
