This repository has been archived by the owner on Oct 17, 2022. It is now read-only.
Consider adopting lookup4 #3
Comments
First fastvalidate-utf-8, then lookup2, 3, and 4... you're writing new validators faster than I can port them!
I think that lookup4 is going to be hard to beat. It is really down to the metal. Have a look.
I just benchmarked lookup4 (as implemented in simdjson) against this implementation and against the standard library's. lookup4 comes out on top in all but a couple of cases. It's also much more consistent, since it doesn't rely on the branch predictor as much.
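For anyone wanting to repeat a comparison like this, here is a minimal timing sketch, not the harness used for the numbers above. It assumes Rust and takes `std::str::from_utf8` as the standard-library baseline; `time_validator` and `std_validate` are hypothetical helper names, and any other validator with the same `&[u8] -> bool` shape could be passed in instead. Running both an ASCII-only and a mixed input is one rough way to see how much a validator's speed depends on the data.

```rust
use std::hint::black_box;
use std::time::{Duration, Instant};

/// Hypothetical helper: runs `validate` over `input` `iters` times.
/// Returns whether every run reported valid UTF-8, plus the elapsed time.
fn time_validator<F>(validate: F, input: &[u8], iters: u32) -> (bool, Duration)
where
    F: Fn(&[u8]) -> bool,
{
    let start = Instant::now();
    let mut all_valid = true;
    for _ in 0..iters {
        // black_box discourages the optimizer from eliding repeated calls.
        all_valid &= validate(black_box(input));
    }
    (all_valid, start.elapsed())
}

/// Baseline: the standard library's validator.
fn std_validate(bytes: &[u8]) -> bool {
    std::str::from_utf8(bytes).is_ok()
}

fn main() {
    // Pure ASCII input versus input mixing 1- to 4-byte sequences.
    let ascii = vec![b'a'; 1 << 16];
    let mixed = "héllo wörld, 日本語, 🎉 ".repeat(2048).into_bytes();

    for (name, data) in [("ascii", &ascii), ("mixed", &mixed)] {
        let (ok, elapsed) = time_validator(std_validate, data, 100);
        println!("{name}: valid={ok}, elapsed={elapsed:?}");
    }
}
```

A wall-clock loop like this is only a rough guide; a proper benchmark would also control for warm-up, input size, and run-to-run variance.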
+1
This is done in https://github.com/rusticstuff/simdutf8
The simdjson library has a new UTF-8 validator called lookup4, which is simpler and faster than most alternatives.
See
https://github.com/simdjson/simdjson/blob/master/src/generic/stage1/utf8_lookup4_algorithm.h
It is really not a lot of code!
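When porting a validator, it may help to keep a small set of inputs that any correct implementation has to accept or reject. The sketch below is not lookup4 itself. It assumes Rust and uses `std::str::from_utf8` as the reference; `is_valid` is a hypothetical wrapper that a ported validator could replace.

```rust
/// Hypothetical wrapper around the reference validator; swap in a port here.
fn is_valid(bytes: &[u8]) -> bool {
    std::str::from_utf8(bytes).is_ok()
}

fn main() {
    // Well-formed: ASCII, then 2-, 3- and 4-byte sequences.
    assert!(is_valid(b"plain ascii"));
    assert!(is_valid("é€😀".as_bytes()));

    // Ill-formed inputs that a validator must reject.
    assert!(!is_valid(&[0xC0, 0x80])); // overlong encoding of U+0000
    assert!(!is_valid(&[0xED, 0xA0, 0x80])); // UTF-16 surrogate U+D800
    assert!(!is_valid(&[0xF4, 0x90, 0x80, 0x80])); // above U+10FFFF
    assert!(!is_valid(&[0xE2, 0x82])); // truncated 3-byte sequence
    assert!(!is_valid(&[0x80])); // stray continuation byte

    println!("all cases behaved as expected");
}
```

These cases cover the main classes of error (overlong forms, surrogates, out-of-range code points, truncated sequences, stray continuation bytes), but they are far from an exhaustive test suite.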