Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Laiho
6 months ago
|
parent
|
context
|
favorite
| on:
Branchless UTF-8 Encoding
fn validate_ascii(bytes: &[u8]) -> bool{ bytes.iter().fold(true, |acc, b| acc & (\*b <= 127)) }
This check will likely be the best for english text/code. You can check in varying size chunks depending on how common you think non-ascii will be. If its ascii you can move 128 bytes forward on avx2 in a couple of cycles.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: