Workgroup2 Reductions and Scans
Pre-release
What's Changed
Workgroup Scans
nbl::hlsl::workgroup2 reduce + scan by @keptsecret in #876
Highly performant: the subgroup-emulated variant (a Kogge-Stone adder built from subgroupShuffleUp) is up to 200% faster than the native one (subgroupInclusiveAdd) on Nvidia RTX GPUs.
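To illustrate the idea behind the emulated variant, here is a minimal CPU-side sketch of a Kogge-Stone inclusive add built from a shuffle-up primitive. It is not the code from the PR: the real implementation is HLSL running on the GPU, and everything here (`kSubgroupSize`, `Lanes`, `shuffleUp`, `inclusiveAddKoggeStone`) is a hypothetical name chosen for illustration, with a subgroup size of 32 assumed.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Assumed subgroup width; real hardware may use a different size.
constexpr std::size_t kSubgroupSize = 32;

// One value per invocation ("lane") of the simulated subgroup.
using Lanes = std::array<std::uint32_t, kSubgroupSize>;

// Hypothetical CPU stand-in for subgroupShuffleUp: lane i reads the value
// held by lane i - delta. On a GPU the result for lanes below delta is
// undefined, so here those lanes simply keep their own value and the
// caller is expected to ignore it.
Lanes shuffleUp(const Lanes& values, std::size_t delta) {
    Lanes out = values;
    for (std::size_t lane = delta; lane < kSubgroupSize; ++lane) {
        out[lane] = values[lane - delta];
    }
    return out;
}

// Kogge-Stone inclusive add: log2(kSubgroupSize) rounds, doubling the
// shuffle distance each round. After the round with distance `step`,
// every lane holds the sum of up to 2 * step trailing inputs.
Lanes inclusiveAddKoggeStone(Lanes values) {
    for (std::size_t step = 1; step < kSubgroupSize; step <<= 1) {
        const Lanes shifted = shuffleUp(values, step);
        // Only lanes with a valid source lane accumulate.
        for (std::size_t lane = step; lane < kSubgroupSize; ++lane) {
            values[lane] += shifted[lane];
        }
    }
    return values;
}
```

With 32 lanes the loop runs five rounds (distances 1, 2, 4, 8, 16). On a GPU each round would be one shuffle plus one conditional add per invocation, which is the structure the emulated variant is described as using.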
Blogpost incoming.
Full Changelog: v0.6.1-alpha1...v0.6.2-alpha1