Hacker News
A Minimal KV Cache Manager for Paged Attention in ~100 Lines of Python (github.com/tspeterkim)
2 points by tspeterkim 10 months ago | past
Show HN: Minimal Paged Attention (github.com/tspeterkim)
3 points by tspeterkim 10 months ago | past
Insta-chat: simplest Instagram chat automation tool made with Google Sheets (github.com/tspeterkim)
1 point by thunderbong 11 months ago | past
Show HN: DIY Instagram Automation for My Influencer Wife (github.com/tspeterkim)
3 points by tspeterkim 11 months ago | past | 3 comments
Show HN: Mixed Precision Training from Scratch (github.com/tspeterkim)
1 point by tspeterkim 11 months ago | past
Show HN: One Billion Rows in CUDA (github.com/tspeterkim)
3 points by tspeterkim on April 14, 2024 | past
Show HN: Flash Attention in ~100 lines of CUDA (github.com/tspeterkim)
230 points by tspeterkim on March 16, 2024 | past | 39 comments
