VDOT (by element) -- AArch32

BFloat16 floating-point indexed dot product (vector, by element). This instruction delimits the source vectors into pairs of 16-bit BF16 elements. Each pair of elements in the first source vector is multiplied by the indexed pair of elements in the second source vector. The resulting single-precision products are then summed and added destructively to the single-precision element in the destination vector which aligns with the pair of BFloat16 values in the first source vector. The instruction does not update the FPSCR exception status.

It has encodings from the following instruction sets: A32 ( A1 ) and T32 ( T1 ) .

A1
(FEAT_AA32BF16)

Decode for all variants of this encoding

if !IsFeatureImplemented(FEAT_AA32BF16) then UNDEFINED; if Q == '1' && (Vd<0> == '1' || Vn<0> == '1') then UNDEFINED; constant integer d = UInt(D:Vd); constant integer n = UInt(N:Vn); constant integer m = UInt(Vm); constant integer i = UInt(M); constant integer regs = if Q == '1' then 2 else 1;

T1
(FEAT_AA32BF16)

Decode for all variants of this encoding

if InITBlock() then UNPREDICTABLE; if !IsFeatureImplemented(FEAT_AA32BF16) then UNDEFINED; if Q == '1' && (Vd<0> == '1' || Vn<0> == '1') then UNDEFINED; constant integer d = UInt(D:Vd); constant integer n = UInt(N:Vn); constant integer m = UInt(Vm); constant integer i = UInt(M); constant integer regs = if Q == '1' then 2 else 1;

Assembler Symbols

<q>	See Standard assembler syntax fields.

<Qd>	Is the 128-bit name of the SIMD&FP destination register, encoded in the "D:Vd" field as <Qd>*2.

<Qn>	Is the 128-bit name of the first SIMD&FP source register, encoded in the "N:Vn" field as <Qn>*2.

<Dd>	Is the 64-bit name of the SIMD&FP destination register, encoded in the "D:Vd" field.

<Dn>	Is the 64-bit name of the first SIMD&FP source register, encoded in the "N:Vn" field.

<Dm>	Is the 64-bit name of the second SIMD&FP source register, encoded in the "Vm" field.

<index>

Is the element index in the range 0 to 1, encoded in the "M" field.

Operation

CheckAdvSIMDEnabled(); constant FPCR_Type fpcr = StandardFPCR(); bits(64) operand1; bits(64) operand2; bits(64) result; operand2 = Din[m]; for r = 0 to regs-1 operand1 = Din[n+r]; result = Din[d+r]; for e = 0 to 1 constant bits(16) elt1_a = Elem[operand1, 2 * e + 0, 16]; constant bits(16) elt1_b = Elem[operand1, 2 * e + 1, 16]; constant bits(16) elt2_a = Elem[operand2, 2 * i + 0, 16]; constant bits(16) elt2_b = Elem[operand2, 2 * i + 1, 16]; constant bits(32) sum = FPAdd_BF16(BFMulH(elt1_a, elt2_a, fpcr), BFMulH(elt1_b, elt2_b, fpcr), fpcr); Elem[result, e, 32] = FPAdd_BF16(Elem[result, e, 32], sum, fpcr); D[d+r] = result;

Internal version only: isa v01_32, pseudocode v2024-12_rel ; Build timestamp: 2024-12-16T10:54

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

VDOT (by element)

A1
(FEAT_AA32BF16)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

T1
(FEAT_AA32BF16)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

Assembler Symbols

Operation

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

VDOT (by element)

A1(FEAT_AA32BF16)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

T1(FEAT_AA32BF16)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

Assembler Symbols

Operation

A1
(FEAT_AA32BF16)

T1
(FEAT_AA32BF16)

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	1	0	0	D	0	0	Vn				Vd				1	1	0	1	N	Q	M	0	Vm