Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They cannot “figure” it, they could learn it but for that it would need to be in it's training data (which isn't because nobody is writing down the actual pairing in every byte pair encoding in plain text. Also the LLM has no clue about what encoding it uses unless you tell it somehow in the fine-tuning process or the prompt.)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: