Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Oct 14, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
A header only library implementing common mathematical functions using SIMD intrinsics
Optimized Recursive Bilateral Filter
The Dlang SIMD library
DR3 enables users to write vectorised code using generic lambdas and filters. Switch instruction set just by changing enclosing namespace
Template SIMD Library (+Generator)
C++ template for generating small sorting networks compatible with SIMD intrinsics
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
C++ interface for SIMD instruction sets
Vectroized String Helper Functions
Fast, modern, fully templated, single-file, header-only, parallel implementation of A* trajectory planner on NVIDIA GPUs in point cloud data.
Winning submission for StartHack 2024: HPC optimized multi-GPU/CPU inference
high-speed math functions based on AVX-512 intrinsics
Simple neural network microkernels in C accelerated with ARMv8.2-a Neon vector intrinsics.
Add a description, image, and links to the simd-intrinsics topic page so that developers can more easily learn about it.
To associate your repository with the simd-intrinsics topic, visit your repo's landing page and select "manage topics."