question 9

Could someone explain the answer to this question? I am a little confused about why w4 being missing leads to the result being undetermined.
Thanks a lot!


I think it's because without w4, the model still needs to learn how to produce a good enough output and therefore will change the remaining effective parameters, w1, b1, w3, b3.

In this same question, is it possible to determine ∂L/∂w2 or ∂L/∂w3 individually?
From the passage, I understand that the only thing we know is that ∂L/∂w2 + ∂L/∂w3 = 1.
