
> Specific details of our network architecture will not be published at this time. DeepL Translator is based on a single, non-ensemble model.

Kinda sad to hear, but completely understandable. I'm curious whether the difference in performance is due to their model specifics or just better training data.

Does anyone have more information?




They have the perfect training data as this is a Linguee venture (https://www.linguee.com/). They have millions of translations of paragraphs from one language to another.

I have no information on the model, unfortunately.



