"this program generates around 31GiB of FizzBuzz per second"

High throughput Fizz Buzz

I've spent months working on this program. I've long thought that "how fast can you make a FizzBuzz" would be a really interesting question for learning about high-performance programming, and when I subsequently saw this question posted on CGCC, I pretty much had to try.
This is so beautiful 🥹
edit: outjerked yet again https://codegolf.stackexchange.com/a/269772
Why did you post that implementation and not David Frank's with 1.7 Terabit/s?
I saw it before that one, and by that time I had already posted. I think deleting and reposting is a little weird, so I guess I'll have to make do X)
I'll mention it in the post body though, thanks lol
Didn't realise that it's yours! Kudos :)
Can we get a tldr for the approach used?
hand-optimized SIMD-instructions probably