What is the main motivation for using activation functions in ANN?
Capturing complex non-linear patterns
Their ability to activate each neurons individually.
Help avoiding the vanishing/exploding gradient problem
Transforming continuous values into "ON" (1) or "OFF" (0) values