Commit 49cd860
feat(simd): BF16x16 + F16x16 SIMD vectors + slice ops (#126, sprint W3-A)
Closes parity items 2 + 3. Scalar dispatch (upcast f32 -> op -> downcast).
SIMD-accelerated paths (AVX2 emulation, AVX-512-BF16 native, NEON +fp16)
are a follow-up. The scalar implementation is correct and portable, and
unblocks burn's NdArrayElement bound for half types.
- src/simd_half.rs: 691 LOC new module
- src/lib.rs: pub mod simd_half declaration
- src/simd.rs: re-exports
21 new tests, all passing. Total lib tests: 1817+ pass.
https://claude.ai/code/session_01NYGrxVopyszZYgLBxe4hgj
Co-authored-by: Claude <noreply@anthropic.com>1 parent 0d22e44 commit 49cd860
3 files changed
Lines changed: 713 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
260 | 260 | | |
261 | 261 | | |
262 | 262 | | |
263 | | - | |
| 263 | + | |
264 | 264 | | |
265 | 265 | | |
266 | 266 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1210 | 1210 | | |
1211 | 1211 | | |
1212 | 1212 | | |
1213 | | - | |
| 1213 | + | |
1214 | 1214 | | |
1215 | | - | |
1216 | | - | |
1217 | | - | |
1218 | | - | |
1219 | | - | |
1220 | | - | |
| 1215 | + | |
| 1216 | + | |
| 1217 | + | |
| 1218 | + | |
| 1219 | + | |
| 1220 | + | |
| 1221 | + | |
| 1222 | + | |
| 1223 | + | |
| 1224 | + | |
| 1225 | + | |
| 1226 | + | |
| 1227 | + | |
| 1228 | + | |
| 1229 | + | |
| 1230 | + | |
| 1231 | + | |
| 1232 | + | |
| 1233 | + | |
| 1234 | + | |
1221 | 1235 | | |
1222 | 1236 | | |
1223 | 1237 | | |
| |||
0 commit comments