Question 9
Could someone explain the answer to this question? I am a little confused about why w4 being missing makes the answer undetermined.
Thanks a lot!
I think it's because, without w4, the model still needs to learn to produce a good enough output, so it will change the remaining effective parameters: w1, b1, w3, b3.
In this same question, would it be possible to determine ∂L/∂w2 or ∂L/∂w3?
From the passage, I understand that the only thing we know is that ∂L/∂w2 + ∂L/∂w3 = 1.