Vanishing Gradient
CS-433 Exercises · 91 · 2 · 9 Jan '21
Hello :) I'm struggling to understand the role of the weights on the vanishing gradient problem.…
mock 2014 : PCA for count data algorithm
CS-433 Exams · 63 · 1 · 9 Jan '21
hi, I don't understand how to come up with the algorithm shown for the PCA problem. In ALS we al…
invertibility and other matrix property
CS-433 Exams · 73 · 1 · 9 Jan '21
Hi, Which properties of matrices are we supposed to know for the exam? By that I mean for exampl…
Exam 2017 Problem 22
CS-433 Exams · 177 · 2 · 8 Jan '21
Hello, I was wondering if the intersection between the two Train Error lines had any meaning or …
conditional probability
CS-433 Exercises · 108 · 2 · 8 Jan '21
Hi, I was wondering if including pi in the probability expression (highlighted) is necessary, i…
Representation power
CS-433 Lectures · 142 · 3 · 8 Jan '21
Dear TAs, I'm struggling to understand why on a bounded domain, neural nets can **not** approxim…
Training risk
CS-433 Exams · 133 · 3 · 8 Jan '21
In final exam 2019, true/false question 15, it is said that training risk converges to true risk, w…
PCA in the exam 2019
CS-433 Exams · 204 · 5 · 8 Jan '21
Could someone help to explain to me why this statement is wrong? **In PCA, the first principal dir…
Gradient question - Exam 2018
CS-433 Exams · 217 · 5 · 8 Jan '21
Hi! I can't understand the logic of problem 10 in 2018 exam. The question is: for which x does th…
generalization error bounds
CS-433 Lectures · 62 · 1 · 8 Jan '21
Hi! I am having difficulty grasping the steps taken in 'a little thought' to obtain expression s…
exponential family- invertibility of link function
CS-433 Lectures · 62 · 1 · 8 Jan '21
Hello, in al the examples investigated we saw a one to one relationship between E[phi(y)] and et…
minibatch implementation
CS-433 Lectures · 72 · 3 · 8 Jan '21
Good evening, Looking online I find different ways to implement minibatch GD. Some select B rand…
bias-variance decomposition: what exactly is variance
CS-433 Lectures · 65 · 1 · 8 Jan '21
hi, In bias-variance decomposition derivation I understand that the bias term is a measure of th…
first eigenvalue of SVD Decomposition
CS-433 Exams · 180 · 3 · 8 Jan '21
Good evening, I question 6 of 2018's exam, the fact "In PCA, the first principale direction is t…
problem 15, exam 2018
CS-433 Exams · 282 · 3 · 8 Jan '21
Hello, I don't understand why there is an exact solution to the matrix factorization problem... …
Maximum log likelihood ?
CS-433 Lectures · 99 · 2 · 7 Jan '21
Dear TAs, I'm struggling to understand where the log comes from. Is this the maximum likeli…
Mixture of Linear Regression
CS-433 Exams · 105 · 2 · 7 Jan '21
Hi! I was doing the mock midterm exam of 2015, and I can't understand the solution of the last po…
problem9 exam 2018
CS-433 Exams · 132 · 2 · 7 Jan '21
Hi, I was wondering why the answer for problem 9 isn't dL/dw1=1. why would partial derivative w…
2 page cheatsheet
CS-433 Exams · 262 · 3 · 7 Jan '21
Hello, Would it be possible to have one more page as cheatsheet? The one page space restriction …
Order of complexity Newton Method (Mock exam 2014), question on Poisson regression
CS-433 Exams · 137 · 4 · 7 Jan '21
Hello, I am struggling to understand the order of complexity that is given in the answers here. …