Also probably not realted, but don't these LLMs only work with a relatively short buffer or else they start being completely incoherent?