Comment by sylware

Comment by sylware 3 days ago

Dude, I cannot be more explicit than that.

That said, when you work on such code, use assembly, C code is for reference (have a look at dav1d av1 decoder).

ncruces 2 days ago

I can't either.

Having implemented this, I'll claim that this is already competitive for needles as small as 2 bytes, and if the needle doesn't show up in the first couple “vector sizes” of the haystack.

Also, look at burntsushi's comment. They use this algorithm for similarly small sizes. They do use Rabin-Karp for “supremely short haystacks” (just because that lib is awesome and uses every trick in the book), but that wording should give you an idea of how small the threshold will be.

And really, how common are “supremely small” haystacks?

Reply View 0 replies

burntsushi 2 days ago

What? You don't have to do that. Hell, the SIMD code that ripgrep uses is written in Rust.

Reply View 1 reply

sylware 2 days ago

Yes, I said C, since there are many real-life alternative compilers.

Reply View | 0 replies