AES instruction set
An Advanced Encryption Standard instruction set is a set of instructions that are specifically designed to perform AES encryption and decryption operations efficiently. These instructions are typically found in modern processors and can greatly accelerate AES operations compared to software implementations. An AES instruction set includes instructions for key expansion, encryption, and decryption using various key sizes.
The instruction set is often implemented as a set of instructions that can perform a single round of AES along with a special version for the last round which has a slightly different method.
When AES is implemented as an instruction set instead of as software, it can have improved security, as its side channel attack surface is reduced.
x86 architecture processors
AES-NI was the first major implementation. AES-NI is an extension to the x86 instruction set architecture for microprocessors from Intel and AMD proposed by Intel in March 2008.A wider version of AES-NI, AVX-512 Vector AES instructions (VAES), is found in AVX-512.
Intel
The following Intel processors support the AES-NI instruction set:- Westmere based processors, specifically:
- * Westmere-EP processors
- * Clarkdale processors
- * Arrandale processors
- Sandy Bridge processors:
- * Desktop: all except Pentium, Celeron, Core i3
- * Mobile: all Core i7 and Core i5. Several vendors have shipped BIOS configurations with the extension disabled; a BIOS update is required to enable them.
- Ivy Bridge processors
- * All i5, i7, Xeon and i3-2115C only
- Haswell processors
- Broadwell processors
- Silvermont/Airmont processors
- Goldmont processors
- Skylake processors
AMD
Several AMD processors support AES instructions:- "Heavy Equipment" processors
- * Bulldozer processors
- * Piledriver processors
- * Steamroller processors
- * Excavator processors and newer
- Jaguar processors and newer
- Puma processors and newer
- Zen based processors
Hardware acceleration in other architectures
AES support with unprivileged processor instructions is also available in the latest SPARC processors and in latest ARM processors. The SPARC T4 processor, introduced in 2011, has user-level instructions implementing AES rounds. These instructions are in addition to higher level encryption commands. The ARMv8-A processor architecture, announced in 2011, including the ARM Cortex-A53 and A57 also have user-level instructions which implement AES rounds.x86 CPUs offering non-AES-NI acceleration interfaces
VIA x86 CPUs and AMD Geode use driver-based accelerated AES handling instead.The following chips, while supporting AES hardware acceleration, do not support AES-NI:
- AMD Geode LX processors
- VIA, using VIA PadLock
- * VIA C3 Nehemiah C5P (Eden-N) processors
- * VIA C7 Esther C5J processors
ARM architecture
Programming information is available in ARM Architecture Reference Manual ARMv8, for ARMv8-A architecture profile .The Marvell Kirkwood was the embedded core of a range of SoC from Marvell Technology, these SoC CPUs use driver-based accelerated AES handling.
- ARMv8-A architecture
- * ARM cryptographic extensions are optionally supported on ARM Cortex-A30/50/70 cores
- Cryptographic hardware accelerators/engines
- * Allwinner
- ** A10, A20, A30, A31, A80, A83T, H3 and A64 using Security System
- * Broadcom
- ** BCM5801/BCM5805/BCM5820 using Security Processor
- * NXP Semiconductors
- ** i.MX6 onwards
- * Qualcomm
- ** Snapdragon 810 onwards
- * Rockchip
- ** RK30xx series onwards
- * Samsung
- ** Exynos 7 series onwards
RISC-V architecture
The scalar and vector cryptographic instruction set extensions for the RISC-V architecture were ratified respectively on 2022 and 2023, which allowed RISC-V processors to implement hardware acceleration for AES, GHASH, SHA-256, SHA-512, SM3, and SM4.Before the AES-specific instructions were available on RISC-V, a number of RISC-V chips included integrated AES co-processors. Examples include:
- Dual-core RISC-V 64 bits Sipeed-M1 support AES and SHA256.
- RISC-V architecture based ESP32-C, support AES, SHA, RSA, RNG, HMAC, digital signature and XTS 128 for flash.
- Bouffalo Labs BL602/604 32-bit RISC-V supports various AES and SHA variants.
POWER architecture
Since the Power ISA v.2.07, the instructionsvcipher and vcipherlast implement one round of AES directly.IBM z/Architecture
IBM z9 or later mainframe processors support AES as single-opcode AES ECB/CBC instructions via IBM's CryptoExpress hardware. These single-instruction AES versions are therefore easier to use than Intel NI ones, but may not be extended to implement other algorithms based on AES round functions.Other architectures
- Atmel XMEGA
- SPARC T3 and later processors have hardware support for several cryptographic algorithms, including AES.
- Cavium Octeon MIPS All Cavium Octeon MIPS-based processors have hardware support for several cryptographic algorithms, including AES using special coprocessor 3 instructions.
Performance
In AES-NI Performance Analyzed, Patrick Schmid and Achim Roos found "impressive results from a handful of applications already optimized to take advantage of Intel's AES-NI capability". A performance analysis using the Crypto++ security library showed an increase in throughput from approximately 28.0 cycles per byte to 3.5 cycles per byte with AES/GCM versus a Pentium 4 with no acceleration.Supporting software
Most modern compilers can emit AES instructions.A lot of security and cryptography software supports the AES instruction set, including the following notable core infrastructure:
- Apple's FileVault 2 full-disk encryption in macOS 10.10+
- NonStop SSH2, NonStop cF SSL Library and BackBox VTC Software in HPE Tandem NonStop OS L-series
- Cryptography API: Next Generation (CNG)
- Linux's Crypto API
- Java 7 HotSpot
- Network Security Services version 3.13 and above
- Solaris Cryptographic Framework on Solaris 10 onwards
- FreeBSD's OpenCrypto API
- OpenSSL 1.0.1 and above
- GnuTLS
- Libsodium
- VeraCrypt
- Go programming language
- BitLocker
- Bloombase
- Vormetric
Application beyond AES
A fringe use of the AES instruction set involves using it on block ciphers with a similarly-structured S-box, using affine transform to convert between the two. SM4, Camellia and ARIA have been accelerated using AES-NI. The AVX-512 Galois Field New Instructions allows implementing these S-boxes in a more direct way.New cryptographic algorithms have been constructed to specifically use parts of the AES algorithm, so that the AES instruction set can be used for speedups. The AEGIS family, which offers authenticated encryption, runs with at least twice the speed of AES. AEGIS is an "additional finalist for high-performance applications" in the CAESAR Competition.