Pb 22 exam 2017

This forum is inactive. Browsing/searching possible.

Connect your moderator Slack workspace to receive post notifications:

Hello,
For this problem, I agree with the test error for the 5k and 100k data sets. But I do not understand the following:

first: why the training error for the 5k and 100k data sets are below the true error? This should be a lower bound that cannot be reach
second: why is the 100k training error above the 5k training error?
Thanks for your help.

13 Jan '22 ·

Top comment

Hi,

The true error corresponds to the expected error achieved on fresh samples from the (infinitely large) data distribution. If your model is powerful enough, you can overfit on the training set, and get zero training loss (thus it can go below the true error as it is measured on different data)
The larger your training dataset, the harder it is to overfit on them.

13 Jan '22 ·

Thijs Vogels admin

Page 1 of 1

How to style: strictly use the or click here. E.g., \(\alpha + \beta\) gives (inline) \(\alpha + \beta\). No \(\LaTeX\) preview (yet).