social.kernel.org

Conversation

kdave

Interesting. The highly optimized unrolled implementation of CRC32C on intel is about the same speed as a tight loop around ‘crc32q’. Both do like 30G/s on a 4KiB block with many iterations. Also compared to an older code of the tight loop with one extra instruction (probably causing register dependency with crc32q) is about 11G/s instead.

45d350:  ┌─> 48 8b 08                     movq   (%rax),%rcx
45d353:  │   f2 48 0f 38 f1 f1            crc32q %rcx┌%rsi
45d359:  │   48 83 c0 08                  addq   $0x8┌%rax
45d35d:  │   48 39 d0                     cmpq   %rdx┌%rax
45d360:  └── 75 ee                        jne    45d350 <crc32c_sse42+0x30>

vs:

455000:  ┌─> 48 8b 08                     movq   (%rax),%rcx
455003:  │   89 fe                        movl   %edi┌%esi
455005:  │   f2 48 0f 38 f1 f1            crc32q %rcx┌%rsi
45500b:  │   89 f7                        movl   %esi┌%edi
45500d:  │   48 83 c0 08                  addq   $0x8┌%rax
455011:  │   48 39 d0                     cmpq   %rdx┌%rax
455014:  └── 75 ea                        jne    455000 <crc32c_intel+0x20>

The optimized linux starts at https://elixir.bootlin.com/linux/latest/source/lib/crc/x86/crc-pclmul-template.S . This also has the AVX versions that could be faster than plain ‘crc32’ instruction.

kdave

1 month ago

Reply to @kdave

Yep, no, too good be true. The implementation selection was wrong, always using PCLMUL. Interpretting CPU feature sets as linear "levels" somehow works in this case but the definition ordering was incorrect. Nothing to see here.

About social.kernel.org

Terms of service

Please do not use this service in violation of the Linux Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication on this platform.

Privacy notice

The admins of this service have access to all posted statuses. They aren't looking, but if it's something they shouldn't know about, then you should not post it on this platform.

Please see the Linux Foundation Privacy Policy, which applies to this platform as well.

Getting your own account

If you would like an account on this instance, please check that the following applies to you:

You are listed in MAINTAINERS or CREDITS
OR: You have a kernel.org account or email address
OR: You have a long and established history of involvement with the Linux Kernel

If the above is true and you agree with the Terms of Service and Privacy Notice listed above, please use these instructions to request an account:

How to request an account on social.kernel.org