The conclusion that a low-complexity statistical ensemble is almost as good as a (computationally) complex Deep Learning model should not come as a surprise, given the data.
The dataset[1] used here consists of 3003 time series from the M3 competition run by the International Journal of Forecasting. Almost all of these are sampled at a yearly, quarterly, or monthly frequency, each with typically 40 to 120 observations ("samples" in Machine Learning lingo), and the task is to forecast a few months/quarters/years out of sample. Most experienced Machine Learners will realize that there is probably limited value in fitting a high-complexity n-layer Deep Learning model to 120 data points to try to predict the next 12. If you have daily or intraday (hourly/minute/second) time series, more complex models might become more worthwhile, but such series are barely represented in the dataset.
To me the most surprising result was just how badly AutoARIMA performed. Seasonal ARIMA was one of the traditional go-to methods for this kind of data.
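For context, "AutoARIMA" just means an automated search over seasonal ARIMA orders. A rough sketch of what that looks like in practice, using pmdarima on synthetic monthly data (illustrative only; whichever implementation the benchmark actually uses may differ):

    # Illustrative only: automatic seasonal ARIMA on a synthetic monthly series.
    import numpy as np
    import pmdarima as pm

    rng = np.random.default_rng(0)
    t = np.arange(120)
    # trend + yearly seasonality + noise
    y = 10 + 0.1 * t + 5 * np.sin(2 * np.pi * t / 12) + rng.normal(0, 1, 120)

    # auto_arima searches over (p, d, q)(P, D, Q, m) orders by information criterion.
    model = pm.auto_arima(y, seasonal=True, m=12, suppress_warnings=True)
    print(model.order, model.seasonal_order)
    print(model.predict(n_periods=12))  # forecast the next 12 months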
If the task is to predict the next 12 values from a sample of 120 previous values, drawn from some computationally simple statistical process, it's much cheaper and easier to use old-fashioned, tried-and-true statistical methods.
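To make that concrete, a "low-complexity statistical ensemble" can be as simple as averaging a few classical forecasters. A sketch on synthetic data (the model choices here are mine, not necessarily the ones in the benchmark):

    # Sketch: average three classical forecasters on a 120-point monthly series.
    import numpy as np
    from statsmodels.tsa.holtwinters import ExponentialSmoothing
    from statsmodels.tsa.arima.model import ARIMA

    rng = np.random.default_rng(1)
    t = np.arange(120)
    y = 10 + 0.1 * t + 5 * np.sin(2 * np.pi * t / 12) + rng.normal(0, 1, 120)
    h = 12  # forecast horizon

    # Holt-Winters with additive trend and yearly seasonality.
    f_ets = ExponentialSmoothing(y, trend="add", seasonal="add",
                                 seasonal_periods=12).fit().forecast(h)

    # A small fixed-order seasonal ARIMA.
    f_arima = ARIMA(y, order=(1, 1, 1),
                    seasonal_order=(0, 1, 1, 12)).fit().forecast(h)

    # Seasonal naive: repeat the last observed year.
    f_snaive = y[-12:]

    # The "ensemble" is just the pointwise average of the three forecasts.
    print((f_ets + f_arima + f_snaive) / 3)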
If the task is to predict millions of pixel values that make up an original work of art, or the pixel values over time that make up a deep-fake video, or the next set of values encoding the next best possible play in a game of Go, or the set of values that encode the entire structure of a protein, and so on, then you have no choice: You must use a deep neural network. Simple methods cannot do any of that.
Sure, with more data and a high signal-to-noise ratio you might have a substantial advantage with DL. But lots of data has a low signal-to-noise ratio, which again favours parametric models.
[1] https://forecasters.org/resources/time-series-data/m3-compet...