https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95747
Bug ID: 95747 Summary: [OpenMP/Builtin] nontemporal store support Product: gcc Version: 10.1.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: jed at 59A2 dot org Target Milestone: --- My understanding is that GCC currently has no analog of Clang's __builtin_nontemporal_store (https://clang.llvm.org/docs/LanguageExtensions.html#non-temporal-load-store-builtins) and I don't see a ticket to add support for OpenMP-5's pragma omp simd nontemporal. This leaves developers stuck with needing intrinsics to generate vmovntpd and related instructions. Proper use of nontemporal stores can make circa 50% performance improvement in bandwidth-limited workloads.