⚡️FFPA: Extend FlashAttention-2 with Split-D, achieve ~O(1) SRAM complexity for large headdim, 1.8x~3x↑ vs SDPA.🎉
Updated May 10, 2025 · CUDA
An open-source interface for using the multiple-precision semidefinite programming solver SDPA-GMP with YALMIP.
PyTorch implementation of YOLOv12 using Scaled Dot-Product Attention (SDPA) with a FlashAttention backend for fast, efficient object detection.
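For readers new to the topic, the SDPA that the repositories above accelerate is the standard attention formula softmax(QKᵀ/√d)V. A minimal, dependency-free reference sketch is below; the function name and list-of-rows layout are illustrative choices, and this naive version materializes the full score matrix, which is exactly the memory cost that FlashAttention-style fused kernels avoid.

```python
import math

def scaled_dot_product_attention(q, k, v):
    """Naive reference SDPA: softmax(Q K^T / sqrt(d)) V.

    q, k, v are lists of row vectors (seq_len x head_dim lists of floats).
    Illustrative only; fused kernels never build the full score matrix.
    """
    d = len(q[0])
    # Attention scores: Q K^T, scaled by 1/sqrt(head_dim)
    scores = [[sum(qi * ki for qi, ki in zip(qrow, krow)) / math.sqrt(d)
               for krow in k] for qrow in q]
    # Numerically stable row-wise softmax
    weights = []
    for row in scores:
        m = max(row)
        exps = [math.exp(s - m) for s in row]
        z = sum(exps)
        weights.append([e / z for e in exps])
    # Output: attention-weighted sum of the value rows
    return [[sum(w * vrow[j] for w, vrow in zip(wrow, v))
             for j in range(len(v[0]))] for wrow in weights]
```

In PyTorch this corresponds to `torch.nn.functional.scaled_dot_product_attention`, which dispatches to fused backends (including FlashAttention) when available.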