Skip to content

Commit b54130c

Browse files
committed
feat: F64x8 AddAssign/SubAssign/MulAssign/DivAssign for SIMD projection
Enables sums[bin] += gather_bf16_x8(...) in the 8-row parallel golden-step projection path. https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o7
1 parent a993794 commit b54130c

1 file changed

Lines changed: 4 additions & 0 deletions

File tree

src/simd_avx512.rs

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -448,6 +448,10 @@ impl_assign_op!(F64x8, AddAssign, add_assign, _mm512_add_pd);
448448
impl_assign_op!(F64x8, SubAssign, sub_assign, _mm512_sub_pd);
449449
impl_assign_op!(F64x8, MulAssign, mul_assign, _mm512_mul_pd);
450450
impl_assign_op!(F64x8, DivAssign, div_assign, _mm512_div_pd);
451+
impl_assign_op!(F64x8, AddAssign, add_assign, _mm512_add_pd);
452+
impl_assign_op!(F64x8, SubAssign, sub_assign, _mm512_sub_pd);
453+
impl_assign_op!(F64x8, MulAssign, mul_assign, _mm512_mul_pd);
454+
impl_assign_op!(F64x8, DivAssign, div_assign, _mm512_div_pd);
451455

452456
impl Neg for F64x8 {
453457
type Output = Self;

0 commit comments

Comments
 (0)