VCMLA -- AArch32

This instruction operates on complex numbers that are represented in SIMD&FP registers as pairs of elements, with the more significant element holding the imaginary part of the number and the less significant element holding the real part of the number. Each element holds a floating-point value. It performs the following computation on the corresponding complex number element pairs from the two source registers and the destination register:

The multiplication and addition operations are performed as a fused multiply-add, without any intermediate rounding.

Depending on settings in the CPACR, NSACR, and HCPTR registers, and the Security state and PE mode in which the instruction is executed, an attempt to execute the instruction might be undefined, or trapped to Hyp mode. For more information see Enabling Advanced SIMD and floating-point support.

It has encodings from the following instruction sets: A32 ( A1 ) and T32 ( T1 ) .

A1
(FEAT_FCMA)

Decode for all variants of this encoding

if !IsFeatureImplemented(FEAT_FCMA) then UNDEFINED; if Q == '1' && (Vd<0> == '1' || Vn<0> == '1' || Vm<0> == '1') then UNDEFINED; constant d = UInt(D:Vd); constant n = UInt(N:Vn); constant m = UInt(M:Vm); constant integer esize = 16 << UInt(S); if !IsFeatureImplemented(FEAT_FP16) && esize == 16 then UNDEFINED; constant elements = 64 DIV esize; constant regs = if Q == '0' then 1 else 2;

T1
(FEAT_FCMA)

Decode for all variants of this encoding

if InITBlock() then UNPREDICTABLE; if !IsFeatureImplemented(FEAT_FCMA) then UNDEFINED; if Q == '1' && (Vd<0> == '1' || Vn<0> == '1' || Vm<0> == '1') then UNDEFINED; constant d = UInt(D:Vd); constant n = UInt(N:Vn); constant m = UInt(M:Vm); constant integer esize = 16 << UInt(S); if !IsFeatureImplemented(FEAT_FP16) && esize == 16 then UNDEFINED; constant elements = 64 DIV esize; constant regs = if Q == '0' then 1 else 2;

Assembler Symbols

<q>	See Standard assembler syntax fields.

<dt>

Is the data type for the elements of the vectors, encoded in S:

S	<dt>
0	F16
1	F32

<Qd>	Is the 128-bit name of the SIMD&FP destination register, encoded in the "D:Vd" field as <Qd>*2.

<Qn>	Is the 128-bit name of the first SIMD&FP source register, encoded in the "N:Vn" field as <Qn>*2.

<Qm>	Is the 128-bit name of the second SIMD&FP source register, encoded in the "M:Vm" field as <Qm>*2.

<Dd>	Is the 64-bit name of the SIMD&FP destination register, encoded in the "D:Vd" field.

<Dn>	Is the 64-bit name of the first SIMD&FP source register, encoded in the "N:Vn" field.

<Dm>	Is the 64-bit name of the second SIMD&FP source register, encoded in the "M:Vm" field.

Is the rotation to be applied to elements in the second SIMD&FP source register, encoded in rot:

rot	<rotate>
00	0
01	90
10	180
11	270

Operation

EncodingSpecificOperations(); CheckAdvSIMDEnabled(); constant FPCR_Type fpcr = StandardFPCR(); for r = 0 to regs-1 constant operand1 = D[n+r]; constant operand2 = D[m+r]; constant operand3 = D[d+r]; for e = 0 to (elements DIV 2)-1 bits(esize) element1; bits(esize) element2; bits(esize) element3; bits(esize) element4; case rot of when '00' element1 = Elem[operand2,e*2,esize]; element2 = Elem[operand1,e*2,esize]; element3 = Elem[operand2,e*2+1,esize]; element4 = Elem[operand1,e*2,esize]; when '01' element1 = FPNeg(Elem[operand2,e*2+1,esize], fpcr); element2 = Elem[operand1,e*2+1,esize]; element3 = Elem[operand2,e*2,esize]; element4 = Elem[operand1,e*2+1,esize]; when '10' element1 = FPNeg(Elem[operand2,e*2,esize], fpcr); element2 = Elem[operand1,e*2,esize]; element3 = FPNeg(Elem[operand2,e*2+1,esize], fpcr); element4 = Elem[operand1,e*2,esize]; when '11' element1 = Elem[operand2,e*2+1,esize]; element2 = Elem[operand1,e*2+1,esize]; element3 = FPNeg(Elem[operand2,e*2,esize], fpcr); element4 = Elem[operand1,e*2+1,esize]; constant result1 = FPMulAdd(Elem[operand3,e*2,esize],element2,element1, fpcr); constant result2 = FPMulAdd(Elem[operand3,e*2+1,esize],element4,element3, fpcr); Elem[D[d+r],e*2,esize] = result1; Elem[D[d+r],e*2+1,esize] = result2;

Internal version only: isa v01_32, pseudocode v2024-12_rel ; Build timestamp: 2024-12-16T10:54

VCMLA

A1
(FEAT_FCMA)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

T1
(FEAT_FCMA)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

Assembler Symbols

Operation

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm

VCMLA

A1(FEAT_FCMA)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

T1(FEAT_FCMA)

Encoding for the 64-bit SIMD vector variant

Encoding for the 128-bit SIMD vector variant

Decode for all variants of this encoding

Assembler Symbols

Operation

A1
(FEAT_FCMA)

T1
(FEAT_FCMA)

31	30	29	28	27	26	25	24	23	22	21	20	19	18	17	16	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm

15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0	15	14	13	12	11	10	9	8	7	6	5	4	3	2	1	0
1	1	1	1	1	1	0	rot		D	1	S	Vn				Vd				1	0	0	0	N	Q	M	0	Vm