Can XMM registers be used to do any 128-bit integer math?

Question

My impression is that they definitely cannot, but perhaps there is a clever trick? Thanks.

‑‑‑‑‑

Reference answers

Answer 1:

Not directly, but there are 64-bit arithmetic operations that can easily be combined to perform arithmetic at 128-bit (or greater) precision.
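As a minimal sketch of what "combining 64-bit operations" can look like, the C code below adds two 128-bit unsigned values stored as two 64-bit halves. The `u128` type and the `u128_add` and `u128_add_example` names are hypothetical, introduced only for illustration; the answer does not name a specific routine.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical 128-bit unsigned integer built from two 64-bit halves. */
typedef struct {
    uint64_t lo;  /* bits 0..63   */
    uint64_t hi;  /* bits 64..127 */
} u128;

/* Add two 128-bit values using only 64-bit arithmetic. */
static u128 u128_add(u128 a, u128 b)
{
    u128 r;
    r.lo = a.lo + b.lo;                /* wraps modulo 2^64 on overflow   */
    uint64_t carry = (r.lo < a.lo);    /* 1 if the low half wrapped       */
    r.hi = a.hi + b.hi + carry;        /* propagate carry to the high half */
    return r;
}

/* Usage sketch: (2^64 - 1) + 1 must carry into the high half. */
static void u128_add_example(void)
{
    u128 a = { .lo = UINT64_MAX, .hi = 0 };
    u128 b = { .lo = 1, .hi = 0 };
    u128 r = u128_add(a, b);
    assert(r.lo == 0 && r.hi == 1);
}
```

On x86-64, a compiler will typically turn this pattern into an `add` followed by an `adc` on general-purpose registers, which is the carry-flag mechanism the second answer refers to.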

Answer 2:

The XMM registers can do arithmetic on 8-, 16-, 32- and 64-bit integers. These instructions do not produce a carry flag, so there is no direct way to extend the precision beyond 64 bits. Extended-precision math libraries use the general-purpose registers instead, which are 32 or 64 bits wide depending on whether the code runs in 32-bit or 64-bit mode.
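To make the missing carry concrete, here is a hedged sketch using SSE2 intrinsics from `<emmintrin.h>`, assuming an x86 compiler with SSE2 enabled. It shows that `_mm_add_epi64` (the `paddq` instruction) adds the two 64-bit lanes independently, and that the carry can only be recovered with extra bitwise work. The `add128_sse2` and `add128_sse2_example` names are hypothetical and are not part of any library.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Add two 128-bit integers held in XMM registers
   (lane 0 = low 64 bits, lane 1 = high 64 bits). */
static __m128i add128_sse2(__m128i a, __m128i b)
{
    /* paddq: two independent 64-bit additions, no carry between lanes. */
    __m128i sum = _mm_add_epi64(a, b);

    /* No carry flag is produced, so rebuild the carry-out of each lane
       from bit 63: carry = (a & b) | ((a | b) & ~sum). */
    __m128i carry = _mm_or_si128(
        _mm_and_si128(a, b),
        _mm_andnot_si128(sum, _mm_or_si128(a, b)));
    carry = _mm_srli_epi64(carry, 63);  /* 0 or 1 in each lane            */
    carry = _mm_slli_si128(carry, 8);   /* move lane 0's carry to lane 1  */

    return _mm_add_epi64(sum, carry);
}

/* Usage sketch: (2^64 - 1) + 1 with and without the manual carry. */
static void add128_sse2_example(void)
{
    __m128i a = _mm_set_epi64x(0, -1);  /* high = 0, low = all ones */
    __m128i b = _mm_set_epi64x(0, 1);   /* high = 0, low = 1        */
    uint64_t out[2];

    /* Plain paddq: the low lane wraps and the carry is lost. */
    _mm_storeu_si128((__m128i *)out, _mm_add_epi64(a, b));
    assert(out[0] == 0 && out[1] == 0);

    /* With the carry rebuilt by hand, the high lane becomes 1. */
    _mm_storeu_si128((__m128i *)out, add128_sse2(a, b));
    assert(out[0] == 0 && out[1] == 1);
}
```

The manual carry costs several extra instructions per addition, whereas the general-purpose registers get it for free from `add`/`adc`, which is consistent with the answer's point that extended-precision libraries use those registers.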

(by Upper, Paul R, A Fog)

References

Can XMM registers be used to do any 128-bit integer math? (CC BY-SA 3.0/4.0)

Related questions

SSE: reciprocal if not zero

Simulating packusdw functionality with SSE2

What would cause _mm_setzero_si128() to SIGSEGV?

SSE _mm_movemask_epi8 equivalent method for ARM NEON

Is 32 bit image processing faster than 24 bit image processing when simd instructions are used?

cpu dispatcher for visual studio for AVX and SSE

How to load 96 bits from memory into an XMM register?

What is the meaning of "non temporal" memory accesses in x86

How do modern compilers use mmx/3dnow/sse instructions?

How do you get the ICC compiler to generate SSE instructions within an inner loop?

How do you get maximal speed out of SSE?