LATX, opt: Opt some SSE/AVX ops.#109
Conversation
MASKMOVDQU,VMASKMOVDQU VPMASKMOVD/Q, MASKMOVQ
|
Hello, thank you for your contribution to improving translation efficiency. In the translation function of vmovmskps/d, the instruction vpickve2gr cannot move the data from the third 32-bit element into the destination register; the xvpickve2gr instruction should be used instead. In the translation function of vpackusxx, when the destination operand is memory, mask of 0 does not clear the target location. I think a more correct way of writing it would be as follows: |
cc @rmjskhy |
MASKMOVDQU,VMASKMOVDQU
VPMASKMOVD/Q
MASKMOVQ
VMOVMSK{PD,PS}
VPACKUSWB
VPACKUSDW
PSHUF{LW,HW}
VPSHUF{LW,HW}
VPSHUFD
SHUFPS
VSHUFPS