I'm not familiar with tensor contraction as practiced by a machine learning package, but summation convention is just that, it's not a fundamental property of tensors.
As a way of describing physics or geometry they have additional structure which I'm not seeing.