Convexity and NN

Hello,

I was wondering why we don't use a convex activation function (whose compositions would also be convex). Wouldn't the loss then have a unique minimizer, and then boom, profit?
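One quick way to probe this premise is a toy check. The sketch below is an illustration only: the one-hidden-unit network, the single data point (x = 1, y = 1), and the weight values are all made up. It uses ReLU, which is convex, and tests whether the squared loss is convex as a function of the weights by comparing the loss at the midpoint of two weight settings with the average of their losses.

```python
def relu(z):
    # ReLU is a convex activation function.
    return max(0.0, z)

def loss(w1, w2, x=1.0, y=1.0):
    # Hypothetical tiny network: prediction = w2 * relu(w1 * x).
    # Squared error on a single made-up data point (x, y).
    return (w2 * relu(w1 * x) - y) ** 2

# Two different weight settings that both fit the data point exactly.
a = (1.0, 1.0)
b = (2.0, 0.5)

# Midpoint of the two settings in weight space.
mid = tuple((p + q) / 2 for p, q in zip(a, b))

loss_a, loss_b, loss_mid = loss(*a), loss(*b), loss(*mid)

# A convex function would satisfy loss_mid <= (loss_a + loss_b) / 2.
print(loss_a, loss_b, loss_mid)
```

In this toy case both endpoints have zero loss while the midpoint does not, so the loss is not convex in the weights and the minimizer is not unique, even though the activation is convex. The activation being convex concerns the network as a function of its input; training minimizes over the weights, which multiply each other across layers. Separately, a composition of convex functions is only guaranteed to be convex when the outer function is also nondecreasing.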

Thanks for clarifying :)
