
That sounds very interesting, is there anywhere I can read more about this optimization? I didn't know any compiler could do optimizations like that.



there is no way a general purpose compiler will figure this out. op is probably talking about something like halide or tvm or torchscript jit.


https://polly.llvm.org is a research project that could possibly maybe do this.


a polyhedral compiler wouldn't find this either - polyhedral compilation is for finding optimal schedules for loop nests, i.e., the order in which independent (wrt dataflow) iterations run. as far as i know, a transpose can't be expressed in the polyhedral model.
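
To make the distinction in the comment above concrete, here is a minimal sketch in plain Python, not the output of any real compiler. The function names (`col_sums_ji`, `col_sums_ij`, `transpose`) are made up for illustration. The first two functions run the same set of independent iterations in different orders, which is the kind of rescheduling the comment describes; the third changes the data layout itself, which is a different kind of transformation.

```python
def col_sums_ji(a):
    """Column-major visit order: outer loop over columns, inner loop over rows."""
    rows, cols = len(a), len(a[0])
    out = [0] * cols
    for j in range(cols):
        for i in range(rows):
            out[j] += a[i][j]
    return out


def col_sums_ij(a):
    """Interchanged schedule: the same (i, j) iterations, visited row by row."""
    rows, cols = len(a), len(a[0])
    out = [0] * cols
    for i in range(rows):
        for j in range(cols):
            out[j] += a[i][j]
    return out


def transpose(a):
    """Data-layout change: builds a new matrix instead of reordering iterations."""
    return [list(col) for col in zip(*a)]


a = [[1, 2, 3],
     [4, 5, 6]]

# Both schedules compute the same result; only the visit order differs.
same_result = col_sums_ji(a) == col_sums_ij(a)
a_t = transpose(a)
```

The interchange is legal here because integer addition into `out[j]` can be reordered freely; with floating point the two orders may round differently.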


Hmm I thought GCC's polyhedral optimizations had a loop transposition, but it turned out I was remembering an old "-floop-transpose" flag that seems to be only in old Apple GCC to get a SPEC win…



