The models benchmarked by RULER do worse in needle in a needlestack. It will be ... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

sftombu 7 months ago | parent | context | favorite | on: GPT-4o's Memory Breakthrough – Needle in a Needles...

The models benchmarked by RULER do worse in needle in a needlestack. It will be interested to see how 4o does with RULER.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact