Hacker News new | past | comments | ask | show | jobs | submit login

I've been looking in to this for the java world. What's your use case? Deployment in to existing applications?



Yea exactly - Python for training, Java/.NET for inference at production. I looked at approaches like GRPC and things but my case is a bit more time-sensitive and the latency added by going over a network layer was too much.

For now I'm happy with Pytorch->ONNX and then running the ONNX model directly. But as I said, that means I can't easily train using JAX :-(



Ohh, I'll check that out!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: