A64 -- SME Instructions (alphabetic order)

ADDHA: Add horizontally vector elements to ZA tile.

ADDVA: Add vertically vector elements to ZA tile.

BFMOPA: BFloat16 sum of outer products and accumulate.

BFMOPS: BFloat16 sum of outer products and subtract.

FMOPA (non-widening): Floating-point outer product and accumulate.

FMOPA (widening): Half-precision floating-point sum of outer products and accumulate.

FMOPS (non-widening): Floating-point outer product and subtract.

FMOPS (widening): Half-precision floating-point sum of outer products and subtract.

LD1B: Contiguous load of bytes to 8-bit element ZA tile slice.

LD1D: Contiguous load of doublewords to 64-bit element ZA tile slice.

LD1H: Contiguous load of halfwords to 16-bit element ZA tile slice.

LD1Q: Contiguous load of quadwords to 128-bit element ZA tile slice.

LD1W: Contiguous load of words to 32-bit element ZA tile slice.

LDR: Load vector to ZA array.

MOV (tile to vector): Move ZA tile slice to vector register: an alias of MOVA (tile to vector).

MOV (vector to tile): Move vector register to ZA tile slice: an alias of MOVA (vector to tile).

MOVA (tile to vector): Move ZA tile slice to vector register.

MOVA (vector to tile): Move vector register to ZA tile slice.

SMOPA: Signed integer sum of outer products and accumulate.

SMOPS: Signed integer sum of outer products and subtract.

ST1B: Contiguous store of bytes from 8-bit element ZA tile slice.

ST1D: Contiguous store of doublewords from 64-bit element ZA tile slice.

ST1H: Contiguous store of halfwords from 16-bit element ZA tile slice.

ST1Q: Contiguous store of quadwords from 128-bit element ZA tile slice.

ST1W: Contiguous store of words from 32-bit element ZA tile slice.

STR: Store vector from ZA array.

SUMOPA: Signed by unsigned integer sum of outer products and accumulate.

SUMOPS: Signed by unsigned integer sum of outer products and subtract.

UMOPA: Unsigned integer sum of outer products and accumulate.

UMOPS: Unsigned integer sum of outer products and subtract.

USMOPA: Unsigned by signed integer sum of outer products and accumulate.

USMOPS: Unsigned by signed integer sum of outer products and subtract.

ZERO: Zero a list of 64-bit element ZA tiles.
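
The outer product and tile-add summaries above can be illustrated with a small behavioural sketch. This is not the architectural pseudocode: the function names `outer_product_update` and `add_vector_to_tile`, the list-of-lists tile, and the boolean predicate lists `pn` and `pm` are all assumptions made for illustration. It ignores element widths, the widening "sum of outer products" forms, floating-point rounding behaviour, and the streaming vector length. It assumes that a tile element is updated only when both its row predicate and its column predicate are true.

```python
# Simplified, hypothetical model of SME tile updates (illustration only).
# A tile is a square list of lists; pn governs rows, pm governs columns.

def outer_product_update(tile, zn, zm, pn, pm, subtract=False):
    """Model FMOPA/FMOPS (non-widening) style behaviour:
    tile[i][j] += zn[i] * zm[j], or -= when subtract is True,
    for elements whose row and column predicates are both true."""
    sign = -1.0 if subtract else 1.0
    for i, a in enumerate(zn):
        for j, b in enumerate(zm):
            if pn[i] and pm[j]:
                tile[i][j] += sign * a * b
    return tile


def add_vector_to_tile(tile, zn, pn, pm, vertical=False):
    """Model ADDHA (vertical=False) and ADDVA (vertical=True) style behaviour:
    ADDHA adds zn[j] to column j of every horizontal slice (row),
    ADDVA adds zn[i] to row i of every vertical slice (column)."""
    for i in range(len(tile)):
        for j in range(len(tile[i])):
            if pn[i] and pm[j]:
                tile[i][j] += zn[i] if vertical else zn[j]
    return tile


if __name__ == "__main__":
    all_true = [True, True]
    za = [[0.0, 0.0], [0.0, 0.0]]
    outer_product_update(za, [1, 2], [3, 4], all_true, all_true)
    print(za)  # accumulated outer product of [1, 2] and [3, 4]
    outer_product_update(za, [1, 2], [3, 4], all_true, all_true, subtract=True)
    print(za)  # the subtract form undoes the accumulate in this example
```

In this model the accumulate and subtract forms differ only in the sign applied to each product, and ADDHA and ADDVA differ only in which index selects the source vector element.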


Internal version only: isa v33.11seprel, AdvSIMD v29.05, pseudocode v2021-09_rel, sve v2021-09_rc3d ; Build timestamp: 2021-10-06T11:41

Copyright © 2010-2021 Arm Limited or its affiliates. All rights reserved. This document is Non-Confidential.