The convolution kernels in the first layers of AlexNet and all its DL image proc...

data_maan · on Sept 13, 2022

Could you provide references for your statements

> the first layers of AlexNet and all its DL image processing descendants converge to the Gabor filters (or some variation of)

and

> 15 years before AlexNet there were works showing that such type of filter is kind of mathematically optimally encoding for the feature based image processing

?

trhway · on Sept 13, 2022

For the first - you can just look at the original AlexNet paper. The kernels are unmistakably strikingly Gabor-like. Some differences, like cross-color, are kind of giving rise to possibly interesting questions - is it improvement or deficiency(i.e. more training would correct) over biology? or may be it is just real-valued projection from the [plausible] fact that the optimal is complex-valued?

>?

I don't have that specific reference i had in mind that was published 15-20 years ago, yet you can trace that line of thought development through the works like these for example (there have been a bunch of them in the 199x and into 200x) :

1990 - https://opg.optica.org/josaa/abstract.cfm?uri=josaa-7-8-1362

1998 - https://pubmed.ncbi.nlm.nih.gov/12662821/