This document is a specification snapshot built from github.com/riscv/riscv-cheri/commit/47b031e0d3853d55217bf039e3dc878ddf56bb93 and is not a versioned release. The latest versioned PDF release can be downloaded from github.com/riscv/riscv-cheri/releases.

This document is in the Stable state

Assume anything could still change, but limited change should be expected.

Copyright and license information

This specification is licensed under the Creative Commons Attribution 4.0 International License (CC-BY 4.0). The full license text is available at creativecommons.org/licenses/by/4.0/.

Contributors

This RISC-V specification has been contributed to directly or indirectly by:

Thomas Aird <thomas.aird@codasip.com>
Hesham Almatary <hesham.almatary@cl.cam.ac.uk>
Andres Amaya Garcia <andres.amaya@codasip.com>
John Baldwin <jhb61@cl.cam.ac.uk>
Paul Buxton <paul.buxton@codasip.com>
David Chisnall <david.chisnall@cl.cam.ac.uk>
Jessica Clarke <jessica.clarke@cl.cam.ac.uk>
Brooks Davis <brooks.davis@sri.com>
Lawrence Esswood <lesswood@google.com>
Nathaniel Wesley Filardo <nwf20@cam.ac.uk>
Franz A. Fuchs <franz.fuchs@cl.cam.ac.uk>
Timothy Hutt <timothy.hutt@codasip.com>
Alexandre Joannou <alexandre.joannou@cl.cam.ac.uk>
Martin Kaiser <martin.kaiser@codasip.com>
Tariq Kurd <tariq.kurd@codasip.com>
Ben Laurie <benl@google.com>
Marno van der Maas <mvdmaas@lowrisc.org>
Maja Malenko <maja.malenko@codasip.com>
A. Theodore Markettos <theo.markettos@cl.cam.ac.uk>
Alfredo Mazzinghi <alfredo.mazzinghi@cl.cam.ac.uk>
Jan Matyas <jan.matyas@codasip.com>
Dave McEwan <dave.mcewan@codasip.com>
David McKay <david.mckay@codasip.com>
Jamie Melling <jamie.melling@codasip.com>
Stuart Menefy <stuart.menefy@codasip.com>
Simon W. Moore <simon.moore@cl.cam.ac.uk>
Prashanth Mundkur <prashanth@riscv.org>
Peter G. Neumann <neumann@csl.sri.com>
Robert Norton <robert.norton@cl.cam.ac.uk>
Alexander Richardson <alexrichardson@google.com>
Robert Riglar <robert.riglar@codasip.com>
Michael Roe <mr101@cam.ac.uk>
Peter Rugg <peter.rugg@cl.cam.ac.uk>
Florian Schmaus <florian.schmaus@codasip.com>
Peter Sewell <peter.sewell@cl.cam.ac.uk>
Carl Shaw <carl.shaw@codasip.com>
Ricki Tura <ricki.tura@codasip.com>
Robert N. M. Watson <robert.watson@cl.cam.ac.uk>
Toby Wenman <toby.wenman@codasip.com>
Jay Williams <jay.williams@codasip.com>
Adrian Wise <adrian.wise@codasip.com>
Jonathan Woodruff <jonathan.woodruff@cl.cam.ac.uk>
Jason Zhijingcheng Yu <yu.zhi@comp.nus.edu.sg>

1. Introduction

This chapter is only included in the standalone CHERI spec and not part of the integrated document.

1.1. CHERI Concepts and Terminology

Current CPU architectures (including RISC-V) allow memory access solely by specifying and dereferencing a memory address stored as an integer value in a register or in memory. Any accidental or malicious action that modifies such an integer value can result in unrestricted access to the memory that it addresses. Unfortunately, this weak memory protection model is the root cause of the majority of security vulnerabilities in system software today.

CHERI provides strong hardware mechanisms that software uses to implement fine-grained memory protection and scalable software compartmentalization. These mechanisms enable software to prevent and mitigate security vulnerabilities.

Design goals include incremental adoption from current ISAs and software stacks, low performance overhead for memory protection, significant performance improvements for software compartmentalization, formal grounding, and programmer-friendly underpinnings. CHERI has been designed to provide strong, non-probabilistic protection rather than depending on short random numbers or truncated cryptographic hashes that can be leaked and reinjected, or that could be brute-forced.

1.2. CHERI for RISC-V

This specification is based on publicly available documentation including CHERI v9 (Watson et al., 2023) and CHERI Concentrate (Woodruff et al., 2019).

RISC-V CHERI introduces several new bases that are collectively referred to as RVY (e.g., RV32Y, RV64Y). Any CHERI RISC-V implementation must support one of these CHERI bases. CHERI will define profiles that provide equivalent functionality to the standard RVA, RVB, and RVM profiles. These RVY-based profiles (RVYA, RVYB, RVYM) will list the compatible extensions.

Most existing extensions are compatible with RVY with no modifications. Incompatible extensions will be listed in the profile documents.

Zicfiss is currently incompatible as the relevant push/pop instructions would need modifying for CHERI. An RVY version has not yet been developed.

This section lists new extensions available to RVY, and also extensions where the behavior is modified for RVY.

Almost all existing RISC-V extensions can be added to an implementation, but in most cases they will have some behavioral differences and/or new instructions operating on capabilities.

1.2.1. Stable Extensions and Specifications

Table 1. Unprivileged stable RVY base ISAs and extensions
Extension or Specification	Description
RV64Y	CHERI Base ISA and capability encoding format for RV64
RV32Y	Base ISA additions and capability encoding formats for RV32
Zybndsrdw	Extension for setting bounds round down to representable length
Zyhybrid	Hybrid extension for RVI compatibility
Zca (RVY added instructions)	New 16-bit encodings added to Zca
Zba (RVY added instructions)	New 32-bit encodings added to Zba
Zalrsc (RVY added instructions)	New 32-bit encodings added to Zalrsc
Zaamo (RVY added instructions)	New 32-bit encodings added to Zaamo

Table 2. Unprivileged stable extensions where RVY modifies the behavior
Extension or Specification	Description
RVI (RVY modified behavior)	RVI instructions modified by RVY
Zca (RVY modified behavior)	C instructions modified by RVY
V (RVY modified behavior)	V instructions modified by RVY
Zicbom (RVY modified behavior)	Zicbom instructions modified by RVY
Zicbop (RVY modified behavior)	Zicbop instructions modified by RVY
Zicboz (RVY modified behavior)	Zicboz instructions modified by RVY
Zicsr (RVY modified behavior)	Zicsr instructions modified by RVY

CHERI defines new state and behavior for the privileged modes:

Machine-Level ISA (RVY)
Supervisor-Level ISA (RVY)

Table 3. Privileged stable extensions and specifications
Extension or Specification	Description
Zyhybrid for Privileged Architectures	Privileged support for Zyhybrid
Supervisor-Level ISA for Virtual Memory (RV64Y)	Virtual Memory
Svyrg	MMU-based capability data flow control and measurement

1.2.2. Unratified RISC-V Standard Extensions for RVY

Future versions of this specification will include RVY versions of:

"H" extension
Pointer masking extensions (Ssnpm, Smnpm, Smmpm, Sspm, Supm)
Code size reduction: Zcmt, Zcmp

1.2.3. Experimental Extensions and Specifications

The extensions in this section have not been fully prototyped and so are not considered ratification ready.

Table 4. Unprivileged experimental extensions and specifications
Extension	Description
Zcmt (RV32Y)	Table Jump for RV32Y
Zcmp (RV32Y)	Push/pop and double move for RV32Y
Zyseal	Capability-mediated capability-(un)sealing instructions
Zycheriot	CHERIoT unprivileged extension

Table 5. Privileged experimental extensions and specifications
Extension	Description
Hypervisor "H" (RVY)	Hypervisor
Ssnpm, Smnpm, Smmpm, Sspm, Supm (RVY)	Pointer Masking
Smycheriot	CHERIoT privileged extension

Chapters for the unprivileged specification

2. RVY Base Capability Instruction Set, Version 1.0

This chapter introduces a new set of base ISAs called RVY that extend the RV32[IE]/RV64[IE] base ISAs with CHERI.

The RVY base ISA can be composed with other standard options to bases such as Zfinx.

2.1. CHERI Overview

CHERI enhances the base ISA by adding hardware memory access control. It has an additional memory access mechanism that protects references to code and data (pointers), rather than the location of code and data (integer addresses). This mechanism is implemented by providing a new primitive, called a capability, that software components can use to implement strongly protected pointers within an address space. Capabilities are unforgeable and delegatable tokens of authority that grant software the ability to perform a specific set of operations. In CHERI, integer-based pointers are replaced by capabilities to provide memory access control.

Every RVY base ISA requires a capability encoding format to complete the specification, such as RV64LYA. The capability encoding format gives full details of how CHERI capabilities are represented in memory and in registers, and specifies the exact properties such as bounds precision and available combinations of permissions.

For example, in RV64LYA:

XLEN=64
L denotes a parameterized base.
Y denotes a CHERI base.
A denotes the capability encoding format.

This naming scheme is based on the RISC-V "longer base name" proposal, which allows base ISA names longer than two characters.

Future RV64 CHERI capability encoding formats could be named RV64LYB, RV64LYC, and so on, or they may have more descriptive names after the RV64LY prefix.

Also see capability encoding format.

2.2. CHERI protection model

The CHERI model is motivated by the principle of least privilege, which argues that greater security can be obtained by minimizing the privileges accessible to running software. A second guiding principle is the principle of intentional use, which argues that, where many privileges are available to a piece of software, the privilege to use should be explicitly named rather than implicitly selected. While CHERI does not prevent the expression of vulnerable software designs, it provides strong vulnerability mitigation: attackers have a more limited vocabulary for attacks, and should a vulnerability be successfully exploited, they gain fewer rights, and have reduced access to further attack surfaces.

Protection properties for capabilities include the ISA ensuring that capabilities are always derived via valid manipulations of other capabilities (provenance), that corrupted¹ in-memory capabilities cannot be dereferenced (integrity), and that derived capabilities must not grant permissions or bounds that exceed those of their source capabilities (monotonicity). Tampering or modifying capabilities in an attempt to elevate their rights will yield an invalid capability. Attempting to dereference via an invalid capability will result in raising an exception.

¹ Not all possible corrupted states are detected, see Section 2.5, “Integrity of Capabilities”.

CHERI capabilities may be held in registers or memory, and are loaded, stored, and dereferenced using CHERI-aware instructions that expect capability operands rather than integer addresses. On system initialization, initial capabilities are made available to software by the execution environment via general purpose registers. All other capabilities will be derived from these initial valid capabilities through valid capability transformations.

Developers can use CHERI to build fine-grained spatial and temporal memory protection into their system software and applications and significantly improve their security.

2.3. Capability Registers and Format

RVY widens all registers that can hold addresses (including the general-purpose registers, pc, and address-holding CSRs) from XLEN to 2*XLEN bits (hereafter referred to as YLEN), adding metadata to protect their integrity, limit how they are manipulated, and control their use. In addition to widening to YLEN, each register also gains a one-bit capability tag which is defined below.

RVY specifies the minimum required fields that the capability encoding format must support, and their semantics. This version of RVY defines only little-endian capability memory layouts. The memory representation of capabilities is always 2*XLEN (YLEN) bits wide and holds the address in the lower XLEN bits of memory and the metadata in the upper XLEN bits.

RVY is designed to be highly extensible by allowing different capability encoding formats to exist. Specific capability encoding formats may support different ISA extensions and add new architectural permissions.

Figure 1. CHERI Capability structure

2.3.1. Address

The lower XLEN bits of a capability encode the address to which the capability points. This is also referred to as the integer part of the capability. When an instruction writes non-capability data to an extended register, the metadata fields and capability tag of the destination are set to zero unless that instruction’s definition explicitly states otherwise.

Future extensions may add 2*XLEN-bit operations to make use of the wide registers for efficient handling of 2*XLEN-bit non-capability data.

2.3.2. Capability Tag

The capability tag is a single additional bit associated with YLEN-bit memory locations and YLEN-bit registers. It is stored separately (i.e., is out-of-band or hidden), and is hardware managed. It indicates whether a YLEN-bit register or YLEN-aligned memory location contains a valid capability. If the capability tag is set, the capability is valid and may authorize operations, subject to type, permission, bounds, and integrity checks.

All registers or memory locations able to hold a valid capability are YLEN bits wide and have an associated capability tag. These are referred to as being YLEN-bit in this specification.

Not all memory in a real system will support capability tags. The Execution Environment Interface (EEI) will define what regions of the address space support them.

The capability tag cannot be directly set by software; it is not a conventionally accessible bit of state. If the capability tag is set then it shows that the capability has been derived correctly according to the principles listed above (provenance, integrity, monotonicity). If the rules are followed then the capability tag will propagate through the instructions that modify, load or store the capability.

Therefore, whenever an instruction writes a capability with the capability tag set to a register or to memory:

At least one capability tag must have been set in the input operands.
- This is the provenance check.
The requested operation must have been legal and not increase bounds or permissions.
- This is the monotonicity check.
The instruction may have checked that none of the input capabilities were corrupted.
- This is the integrity check. Not all instructions check all input operands for integrity.

Capability load/store require the provenance check:

Any store that wrote the capability to memory was correctly authorized.
Any load that read the capability from memory was correctly authorized.

When an operation fails a check, either due to software error or malicious intent, the operation raises an exception or sets the resulting capability tag to zero, as defined by each instruction.

For a capability to be valid, the capability tag must be set, otherwise a capability is invalid.

Using an invalid capability to dereference memory or authorize any operation raises an exception. All capabilities derived from invalid capabilities are themselves invalid, i.e., their capability tags are zero.

Writing non-capability data into a register or memory location causes the capability tag to be set to 0. When the capability tag associated with the memory location is zero, the location contains non-capability data.

2.3.3. Capability tags in registers

Every YLEN-bit register has a one-bit capability tag, indicating whether the capability in the register is valid to be dereferenced. This capability tag is set to 0 whenever an invalid capability operation is performed. Examples of such invalid operations include writing only the integer portion (the address field) of the register or attempting to increase bounds or permissions.

2.3.4. Capability tags in memory

Capability tags are tracked through the memory subsystem: every aligned YLEN-bit wide region has a non-addressable one-bit capability tag, which the hardware manages atomically with the data. The implementation must set the capability tag to zero if any byte in the YLEN/8 aligned memory region is ever written using an operation other than a store of a capability operand. For example, the SY instruction sets the capability tag to one if all the instruction’s prerequisites are met, including rs2 's capability tag being set to one, but SW will always set the stored capability tag to zero.

All system memory and caches that store capabilities must preserve this abstraction, handling the capability tags atomically with the data.

2.3.5. Capability Bounds

Capabilities encode memory bounds as a half-open byte range [base, top), which is the range that may be accessed when the capability authorizes data memory access or instruction execution.

Checking is on a byte-by-byte basis, so that it is possible for a memory access to be fully in-bounds, partially out-of-bounds or fully out-of-bounds.

A partially or fully out-of-bounds data access or instruction fetch raises an exception.

Every capability has two memory address bounds: base representing the lowest accessible byte, and top representing one byte above the highest accessible byte.

The base is XLEN bits and is inclusive.

The top is (XLEN+1) bits and is exclusive.

Inclusive top, with XLEN bits, was considered but rejected in favor of the exclusive top (with an additional bit to allow covering the entire address space). Using exclusive top allows representing a zero-length capability starting at address zero, which is not possible with inclusive top unless you introduce a special value for -1.

The length is (XLEN+1) bits and is defined to be top - base.

Therefore, a memory location A in the range base ≤ A < top is within bounds, and so valid to access.

Checking every byte of every executed instruction and every byte of every data memory access is fundamental to the memory safety which CHERI provides. In a typical load/store unit, the expansion of the bounds from rs1 and bounds checking is in parallel with the address calculation, the memory translation and/or the PMA/PMP checking.

A compressed format is used to encode the bounds with a scheme similar to floating-point using an exponent and a mantissa. Therefore, small exponents can allow byte granularity on the bounds, but larger exponents give coarser granularity. One bounds encoding format based upon (Woodruff et al., 2019) is defined in Section 3.1.5.

Future CHERI bases may use encoding formats with different bounds encoding schemes (see capability encoding format).

Software can query the bounds of a capability held in a general-purpose register using the following instructions:

The base is returned by the YBASER instruction.
The length is returned by the YLENR instruction.
The top is returned by the YTOPR instruction.

The length and top are (XLEN+1)-bit values, so YLENR and YTOPR saturate their result to the maximum XLEN-bit value (2^XLEN-1) when it overflows an XLEN-bit register.

2.3.6. Deriving New Bounds

On system initialization, one or more Root capabilities are available. All capabilities with smaller bounds are derived from these.

The bounds are reduced using the YBNDSW, YBNDSWI and YBNDSRW instructions.

They all copy rs1 to the output rd, set the base bound of rd to rs1.address, subject to encoding granularity constraints, and set the length of rd from rs2 or the immediate field depending on the instruction.

YBNDSW sets the base to rs1.address, and the length to rs2.
- The capability tag is set to zero if the bounds cannot be encoded exactly.
YBNDSWI sets the base to rs1.address, and the length to the immediate value.
- The capability tag is set to zero if the bounds cannot be encoded exactly.
YBNDSRW sets the base to rs1.address, and the length to rs2.
- If the requested bounds cannot be encoded exactly, the result uses the smallest representable bounds that contain the requested range and remain within the initial bounds.
YAMASK can be used to calculate the nearest precisely encodable length and base values for a given size.

The bounds are encoded relative to the address field, sharing some upper bits of the address. The number of shared bits depends on the exponent, see Section 3.1.5.

2.3.7. Representability and Updating the Address

Software sometimes uses out-of-bounds pointers, for example, to represent a one-past-the-end pointer to an array in C (arr + size). To support this pattern, CHERI must allow these out-of-bounds pointers to be held in capabilities while still protecting the bounds and integrity. Because the CHERI Concentrate (Woodruff et al., 2019) encoding scheme for memory bounds shares the upper bits of the address with the bounds, only a limited range of out-of-bounds pointers can be represented for any given set of bounds. This range is known as the representable range. The relationship between the bounds and the representable range is illustrated in Figure 2.

The C standard permits forming a one-past-the-end pointer (but no more than one), as long as it is not dereferenced. Real-world software has been observed to temporarily create pointers that point multiple bytes past the end or even before the start. To maximize software compatibility, the bounds representation was designed to allow for out-of-bounds pointers.

Figure 2. Memory address bounds and representable range encoded within a capability

E, MW and R in the figure are all introduced in Section 3.1.5.2 along with the bounds decoding.

The maximum range of address values that the pointer can take without changing the calculation of the bounds is defined by the representable range.

The historic, uncompressed 64-bit CHERI-MIPS CPU had a 256-bit capability encoding with separate 64-bit values for the base and top memory bounds, and another for metadata fields. This scheme could represent all out-of-bounds pointers with a very high hardware cost.

Since the upper bits of the address are used to calculate the top and base bounds, deriving a new capability with a different address could change the resulting bounds. All instructions that derive a capability with a new address must check that the new address does not change the bounds. If the bounds do change then the capability tag of the derived capability is set to zero, so that the capability becomes invalid.

Software can derive a capability with a new address using instructions such as YADDRW, YADD and YADDI.

YADDRW writes back a derived capability with a new address field and, if the capability tag was previously set, sets the capability tag of the derived capability to one if the resulting capability still has the same bounds.

RVY implementations that use a different encoding scheme to RV64LYA (e.g., for accelerators or specific micro-controllers) may specify an alternative to the representable range check, and may never allow the address to be out of bounds. Therefore, the ISA specification uses the phrase represented exactly for this check.

2.3.8. Memory space

A hart supporting RVY has a single byte-addressable address space of 2^XLEN bytes for all memory accesses. Each memory region capable of holding a capability also stores a capability tag for each naturally aligned YLEN bits (e.g., 16 bytes in RV64), so that capabilities with their capability tag set can only be stored in naturally aligned addresses. Capability tags must be atomically bound to the data they protect.

Address arithmetic is modulo 2^XLEN, so the byte at address 2^XLEN - 1 is adjacent to the byte at address zero. A capability’s Representable Range is also circular, so address 0 is within the Representable Range of a capability where address 2^XLEN - 1 is within the bounds.

However, the decoded top bound address of a capability is XLEN + 1 bits wide and does not wrap. For example, a hypothetical capability with base 2^XLEN - 1 and top 2^XLEN + 1 is not a subset of the infinite capability and authorizes access to the last byte of the address space only, and does not authorize access to the byte at address 0.

Like malformed bounds (see Section 3.1.5.6), it is impossible for a CHERI core to generate a valid capability with architectural top > 2^XLEN.

If such a capability exists then it must have been caused by a logic or memory fault. Unlike malformed bounds, the top overflowing is not treated as a special case in the architecture: normal bounds check rules should be followed.

2.3.9. Capability Type (CT)

This metadata value indicates the type of the capability and determines which operations the capability authorizes. For example, the CT-field can be used to seal capabilities against modification, to provide control flow integrity, and to provide immutable software tokens. Which capability types a given CHERI platform supports is a function of the extensions and capability encoding format in use. The capability encoding format specifies a mapping between the CT-field within the encoding and the architectural capability types.

The capability encoding must be able to encode type 0 (unsealed). If any other sealed type exists, then they can be used to provide different levels of control-flow integrity.

Unsealed capabilities: When CT=0, the capability authorizes access to a region of memory as defined by the permissions and bounds. All CHERI systems must support unsealed capabilities.

Sealed capabilities: Capabilities with CT≠0 are sealed against modification and cannot be dereferenced to access memory. Instructions that operate on capabilities will produce a result with the capability tag set to zero if the source capability is sealed and the operation would alter its address, bounds, or permissions. Extensions that extend the capability metadata must describe their interaction(s) with sealed capabilities.
Sealing: Given a capability with CT=0, deriving a capability with CT≠0 is termed sealing (or sealing with type x when a particular output CT=x is meant). This is achieved using YSENTRY.
Unsealing: Given a capability with CT≠0, deriving a capability with CT=0 is termed unsealing (or unsealing from type x when a particular input CT=x is meant). This is achieved using YSUNSEAL (or YUNSEAL if Zyseal is implemented).

Sealed entry point capability: Sealed entry point capabilities provide immutable function pointers within a CHERI software system. Sealed capabilities are a natural foundation, providing immutability.

Capabilities sealed with a specific CT-field which can be unsealed when passed as an argument to JALR (RVY) are dubbed "sentries" (a portmanteau of "sealed entries").

They can establish a form of control-flow integrity between mutually distrusting code. Because they are immutable, the jump target cannot be modified, ensuring that control flow can only enter at the specific address encoded in the capability and not elsewhere in a function.

A sealed entry point capability may be passed as the rs1 argument to JALR (RVY), and is unsealed on dereference. If rd is not x0, the link capability written to rd is also written as a Sealed entry point capability.

Sentry types may be restricted to forward-edge only and backward-edge only transitions to further improve control-flow integrity.

When JALR (RVY) is used as a function call only forward-edge sentries in rs1 are unsealed on dereference. If rd is not x0, the returned rd value is always sealed as a backward-edge sentry type.
When JALR (RVY) is used as a function return only backward-edge sentries in rs1 are unsealed on dereference.

In addition to using sealed capabilities for sealed entry points, sealed capabilities can also be useful to software as secure software tokens. YSUNSEAL can be used to convert such a token back to an unsealed capability.

Ambient sealing type: Some capability types are said to be ambiently available (or just ambient) if they do not require specific authority to seal a capability with that type. Therefore, sentry types can be declared ambient. For example, if YSENTRY is available on a given platform, the type with which it seals capabilities is considered ambiently available.

The standard capability type mapping is shown in Table 6, extensions and capability encoding formats may add more types.

Table 6. Standard capability type mapping
Type Name	Integer Value	Description
Unsealed	0	Unsealed capability granting access to memory
Unrestricted Sealed entry point capability	1	Immutable unrestricted sentry type for both forward-edge and backward-edge transitions.
Forward-edge Sealed entry point capability	Extension defined	Immutable sentry type for forward-edge transitions.
Backward-edge Sealed entry point capability	Extension defined	Immutable sentry type for backward-edge transitions.

The integer value in the table above defines the architectural capability type (i.e., the result of the YTYPER instruction) but does not need to be encoded that way in memory.

Other encoding formats may provide different type mappings, but the value 0 must always represent an unsealed capability.

In general, each of sealing and unsealing actions may require authority to operate on the non-zero type; capability encoding formats which define non-zero types will specify how software expresses this authority.

Capability encoding formats may also make the set of CT-field values that may be used to seal a particular capability depend on the permissions granted by that capability. For example, it can be a useful space optimization to differentiate the CT-field values for capabilities granting X-permission from those not granting X-permission; the X-permission in the capability encoding effectively adds an additional bit to the CT-field field.

Capability encoding formats will explicitly state whether these are relevant to them.

They are not used by RV64LYA, but are by the CHERIoT formats.

Future extensions and capability encoding formats may declare new CT-field values, examples of which are:

Sealed types which cannot be used as a Sealed entry point capability
Sealed entry point capability types which can be only used for function calls or for function returns for improved CFI.

2.3.10. Architectural Permissions (AP)

This metadata field encodes architecturally defined permissions of the capability. Permissions grant access subject to the capability tag being set, the capability being unsealed, and bounds checks passing.

Any memory operation is also contingent on requirements imposed by other RISC-V architectural features, such as virtual memory, PMP and PMAs, even if the capability grants sufficient permissions.

The permissions defined in RVY are listed in Table 7. All capability encoding formats must support at least this set of permissions. Extensions building on top of RVY may define additional permissions.

An example of such an extension is Zylevels1.

Permissions can be cleared when deriving a new capability value (using YPERMC) but they can never be added.

Table 7. AP-field summary
Permission	Type	Comment
R-permission	Data memory permission	Authorize data memory read access
W-permission	Data memory permission	Authorize data memory write access
X-permission	Instruction memory permission	Authorize instruction memory execute access
C-permission	Data memory permission	Authorize loading/storing of capability tags
LM-permission	Data memory permission	Used to restrict the permissions of loaded capabilities.
ASR-permission	Privileged state permission	Authorize privileged instructions and CSR accesses.

Read Permission (R): Allow reading data from memory.

Write Permission (W): Allow writing data to memory.

Execute Permission (X): Allow instruction execution.

Capability Permission (C): Allow reading capability tags from memory if R-permission is also granted.

Allow writing capability tags to memory if W-permission is also granted.

If C-permission is missing then the capability tags for capability loads and stores are read and written as zero.

Load Mutable Permission (LM): Allow preserving the W-permission of capabilities loaded from memory. If a capability grants R-permission and C-permission, but no LM-permission, then a capability loaded via this authorizing capability will have W-permission and LM-permission removed.

The permission stripping behavior only applies to loaded capabilities with their capability tag set and that are not sealed after all other checks. This ensures that capability loads of non-capability data do not modify the loaded value, and that sealed capabilities are not modified.

Clearing a capability’s LM-permission and W-permission allows sharing a read-only version of a data structure without making a copy. For example, if software removes LM-permission from a capability to a linked list, any pointers loaded through that capability will automatically lose their W-permission, preventing modification of the list content.

Access System Registers Permission (ASR, primarily used to authorize CSR accesses): Allow read and write access to all privileged CSRs and some unprivileged CSRs, and the execution of some privileged instructions. ASR-permission checks always use the permission in the pc.

In the unprivileged RVY ISA, the following are affected by ASR-permission:

Thread ID CSRs: The utidc CSR requires ASR-permission for writing but not for reading.
Instructions: CBO.INVAL (RVY) requires ASR-permission in the pc.

In the privileged RVY ISA, the following are affected by ASR-permission:

Instructions: MRET (RVY), SRET (RVY) require ASR-permission in the pc.
CSRs: The privileged xtidc CSRs require ASR-permission for writing, but not for reading (if access is permitted in the current privilege mode). All other privileged CSRs require ASR-permission in the pc for any access.

This permission is important in privileged execution environments. Removing this permission allows constraining privileged software to a sandbox that cannot be subverted by changing privileged state.

Extensions may add additional unprivileged CSRs that require ASR-permission.

2.3.10.1. Permission Transitions

Not all capability permissions are orthogonal as some permissions inherently depend on others. As such, using YPERMC to clear some permissions may have the effect of clearing others as well.

For the base set of permissions just defined, the rules in Table 8 apply.

Table 8. YPERMC base rules
YPERMC Rule	Permission	Valid only if
base-1	C-permission	R-permission or W-permission
base-2	LM-permission	C-permission and R-permission
base-3	ASR-permission	X-permission

These rules enforce sensible permission combinations.

Extensions that define new permission bits may also introduce new dependency constraints. Currently defined examples are:

Zyhybrid, see Table 68
Zylevels1, see Section 14.3.1
Zyseal, see Table 71

Capability encoding formats may impose additional constraints to reduce the number of bits necessary to represent permissions.

2.3.11. Software-Defined Permissions (SDP)

The metadata also contains an encoding-dependent number of software-defined permission (SDP) bits. They can be inspected by the kernel or application programs to enforce restrictions on API calls (e.g., permit/deny system calls, memory allocation, etc.). They can be cleared by YPERMC but are not interpreted by the CPU otherwise.

While these bits are not used by the hardware as architectural permissions, modification follows the same rules: SDP bits can only be cleared and never set on valid capabilities.

This property is required to ensure restricted programs cannot forge capabilities that would pass the software-enforced checks.

Software is completely free to define the usage of these bits.

2.3.12. Special Capabilities

Root Capabilities

Root capabilities are those provided by the system on initialization. In some capability encoding formats there is a single Infinite capability value, which grants all permissions and has bounds covering the whole 2^XLEN address space; in such systems, root capabilities are often Infinite.

Other capability encoding formats may have a set of root capabilities distinguishing between data and executable capabilities as shown below.

At system initialization the set of all root capabilities must be made available to software.

Implementations are not required to provide access to the entire address space to a single hart, the root capabilities may be restricted to only cover a subset of the entire address space, or to remove certain permissions.

Root Executable Capability: An unsealed capability that has bounds nominally covering all addresses and grants at least all of X-permission, R-permission, C-permission, LM-permission, and ASR-permission. Extensions introducing new permissions may require these to be provided by root executable capabilities.

Root Data Capability: An unsealed capability that has bounds nominally covering all addresses and grants at least all of R-permission, W-permission, C-permission, and LM-permission. Extensions introducing new permissions may require these to be provided by root data capabilities.

NULL Capability: A capability with all-zero metadata, a zero capability tag, and an address of zero is referred to as the NULL capability. This capability grants no permissions and any dereference results in raising an exception.

2.4. Capability encoding formats

CHERI implementations make trade-offs in their "capability encoding formats", such as the precision of bounds and the expressible set of combinations of permissions, that impact certain behaviors of the ISA. The definition of all CHERI base ISAs includes sufficient details of a capability encoding format to precisely define the execution behavior of all base ISA instructions.

Whilst RV64Y has enough bits in the encoding format to prevent such trade-offs in general, the concepts are introduced here and RV32Y formats will make extensive use of them.

The name RVY refers generically to CHERI base ISAs, which share common semantics but may have different metadata encodings. The RVY name on its own does not describe the complete base ISAs, but defines all the semantics of a RVY machine, where certain behavioral parameters must be filled in by the capability encoding. For example, the YBNDSRW instruction yields a capability with new bounds that are rounded differently depending on the number of bits allocated to the bounds in the capability encoding format. Systems with XLEN=64 can allocate more bits towards the mantissa of the bounds than XLEN=32 system and therefore have a larger Representable Range.

These observable differences in behavior are similar to floating point numbers where the supported operations and their semantics are the same for all formats, but the exact result depends on the representation (e.g., 16/32/64-bit IEEE-754 or other floating point formats).

It is possible to generate code for RVY that is compatible with all valid capability encoding formats. However, to allow compilers and/or software library authors to make assumptions about encoding-dependent properties such as bounds precisions, each capability encoding format comes with a set of standard parameters.

Each capability encoding format also declares a new base ISA.

The documentation for each capability encoding format specifies at least the parameters in Table 9 below where the default is not used.

The default settings all match RV64LYA.

Table 9. RVY capability encoding format parameters
Parameter	Legal values	Default Value	Comment
Parameters used for the bounds encoding (see Section 3.1.5)
MW	integer values	14	Mantissa width for the bounds encoding
EW	integer values	6	Exponent width for the bounds encoding
CAP_MAX_E	integer values	52	Maximum exponent value
enableL8	0/1	0	Whether the encoding format includes the L₈ bit
Parameters used for permissions encoding in the AP-field
AP_MAX	integer values	all ones	Value of the AP-field giving maximum permissions
AP_ENC	full<suffix>/ part<suffix>	full1	Encoding scheme for the AP-field
Other parameters
AUIPC_SHIFT	integer values up to 12	12	Shift distance of the immediate in AUIPC
REP_RANGE	rc1/r0	rc1	Whether out of bounds addresses are representable

Two parameters use non-numeric values:

AP_ENC: Whether permissions in the AP-field are encoded using a full (full) representation allowing all valid combinations of permissions, or a partial (part) representation which does not represent all possible combinations and so uses fewer encoding bits.
REP_RANGE: Whether the capability encoding format guarantees representability of out-of-bounds capability values: rc1, where the encoding uses one additional bit and a centered out-of-bounds region, or r0, where no bits are used for out-of-bounds capabilities and there is no guaranteed representable range for out-of-bounds addresses.

For RV64Y, the number of spare bits in the capability encoding ensures sufficient future extensibility and therefore few custom formats are expected to exist. RV32Y, however, has insufficient bits available to enable features for all target domains, so multiple RV32Y-extending capability encoding formats will exist.

2.5. Integrity of Capabilities

CHERI defines the following integrity check rules:

The bounds are not malformed.
The capability metadata does not have any bits, or fields, set to a reserved value.

An example of a reserved field value is an illegal set of permissions such as ASR-permission without X-permission.

If any of the above rules do not hold for a capability with the capability tag set, it indicates either:

State corruption due to memory or logic faults, or
The presence of incompatible or faulty CHERI IP within the system.

These checks are much less rigorous than parity or ECC protection, and are only used to detect simple problems with the capability metadata.

Instructions that are defined to accept arbitrary input bit patterns, and can write an output with the capability tag set, must perform an integrity check on the input capability if its capability tag is set to zero.

In the RVY ISA this only affects YBLD.

All other integrity checks are optional as a valid capability can always be assumed to satisfy these integrity rules, because it could only have been produced by a valid derivation. Consequently, an integrity check should only change the result of an operation when it is performed on an invalid input capability (i.e. whose capability tag is zero).

Therefore, unless stated otherwise, all integrity checks in this specification are optional.

Implementations may choose to enforce optional integrity checks for a number of reasons:

Instruction level verification is simpler as the behavior is precisely specified for all 2^(YLEN+1) input values.
To give more resilience in systems which mix CHERI IPs from different sources due to possible incompatibilities.
If the hart implements additional reliability features, it may wish to detect and report capabilities which lack integrity.

2.6. Extended State

As stated above, unless noted otherwise, state which can hold addresses is extended from XLEN to YLEN bits.

The privileged mconfigptr CSR is the only documented exception to the rule.

A reminder that all YLEN state also has a hardware managed capability tag.

2.6.1. General Purpose Registers

The XLEN-wide integer registers (e.g., x1, x2) are all widened to YLEN bits, each gaining an associated capability tag, as shown in Figure 3.

Figure 3. Extended registers in RVY

The zero register (x0) is extended with zero metadata and a zero capability tag: this is called the NULL capability.

2.6.2. The Program Counter Capability (`pc`)

The pc is widened to a capability. Widening the pc allows the range of branches, jumps and linear execution for currently executing code to be restricted. The pc address field is the pc in the base RISC-V ISA so that the hardware automatically updates it as instructions are executed.

The hardware performs the following checks on pc for each instruction executed in addition to the checks already required by the base RISC-V ISA. A failing check raises a CHERI exception.

The capability tag must be set to one.
The capability must not be sealed.
The capability must grant X-permission.
All bytes of the instruction must be in bounds.
All integrity checks must have passed.

On system initialization the pc bounds and permissions must be set such that the program can run successfully (e.g., by setting it to a Root Executable capability to ensure all instructions are in bounds).

Future ISA extensions should respect these rules so that the checked bits do not need to be stored in all copies of the pc in the implementation.

Figure 4. Program Counter Capability

2.6.3. Added CSRs

RVY adds the YLEN-bit CSR shown in Table 10.

Table 10. Capability CSRs added in RVY
YLEN CSR	Permissions	Description
utidc	RW, ASR-permission required for writes, not reads	User Thread ID Capability

2.6.3.1. User Thread Identifier Capability (utidc)

The utidc register is an unprivileged CSR used to identify the current software thread. Any operation that modifies utidc raises an exception unless the ASR-permission is set in the current pc.

The EEI sets the value of the capability tag of this CSR to zero, and the values of the metadata and address fields are UNSPECIFIED.

Figure 5. User thread identifier capability register

Each thread (as scheduled by the kernel) can have its own context stored in privileged data structures managed by the compartmentalization TCB, for example a stack of user-level compartment entries and exits. utidc allows compartment-switching code to determine the currently running thread without a call into the operating system. This is read-only without ASR-permission to prevent a malicious compartment from tricking the switching mechanism into accessing data corresponding to the wrong thread. While the RISC-V ABI includes a thread pointer (tp) register, it is not usable for the purpose of reliably identifying the current software thread because the tp register is a general purpose register and can be changed arbitrarily by untrusted code. Extending utidc to a capability allows a data structure to be shared (guarded by e.g., the subset sealing mechanism), which allows for example efficient recovery of a trusted stack, without requiring additional indirection.

2.6.4. CSR Classes

All CSRs that can hold addresses are widened to YLEN bits.

RVY has three classes of CSRs:

XLEN-bit CSRs

These do not contain pointers (e.g., fcsr from the "F" extension).

Extended CSRs

These are XLEN-bit CSRs widened to YLEN bits, which are able to contain pointers (e.g., jvt from the "Zcmt" extension).

Zcmt is not yet available for RVY.

YLEN-bit CSRs

These are added by RVY and contain pointers (e.g., utidc).

When accessing CSRs these rules are followed:

Accesses to XLEN-bit CSRs are as specified by Zicsr
Accesses to YLEN-bit CSRs and extended CSRs, using CSRRW will:
1. Read YLEN bits
2. Write YLEN bits, and will write the capability tag to zero if the written value fails any of the checks in Section 2.5, “Integrity of Capabilities”
Accesses to YLEN-bit CSRs and extended CSRs, using instructions other than CSRRW will:
1. Read YLEN bits
2. For CSRRS/CSRRC, compute the XLEN-bit address-field value using the standard Zicsr bitwise set or clear operation on the old CSR address field and the XLEN source operand.
3. For immediate forms, compute the XLEN-bit address-field value using the standard Zicsr operation with the zero-extended uimm operand.
4. Write the computed XLEN-bit value to the address field, and use the semantics of the YADDRW instruction to determine the final written value

Any YLEN-bit or extended CSR may have additional rules defined to determine the final written value of the metadata and/or to write zero to the capability tag.

The assembler pseudoinstruction to read a capability CSR csrr rd, csr, is encoded as csrrs rd, csr, x0.

Table 11. YLEN-bit CSR and Extended CSR access summary for RVY
Instruction	Read Width	Write Width
CSRRW rd==x0		YLEN
CSRRW rd!=x0	YLEN	YLEN
CSRR[C\|S] rs1==x0	YLEN
CSRR[C\|S] rs1!=x0	YLEN	XLEN
CSRRWI rd==x0		XLEN
CSRRWI rd!=x0	YLEN	XLEN
CSRR[C\|S]I uimm==0	YLEN
CSRR[C\|S]I uimm!=0	YLEN	XLEN

In Table 11, when there is no read or write width shown, the CSR access is not made and there are no side-effects following standard Zicsr rules.

2.7. Capability checks

Every memory access must be authorized by a valid capability.

Instruction fetches and data memory accesses raise an exception if the access is out of bounds, or if the authorizing capability is missing the required permissions:

All load instructions require R-permission and are authorized by the capability in rs1
All store instructions require W-permission and are authorized by the capability in rs1
All instruction fetches require X-permission in pc

Instruction fetches are authorized by the program counter capability (pc) which extends the pc.

This allows code fetch to be bounded, preventing a wide range of attacks that subvert control flow with non-capability data.

E.g., lw t0, 16(sp) loads a word from memory, getting the address, bounds, and permissions from the sp (capability stack pointer) register.

2.8. Added Instructions

RVY adds new instructions to operate on capabilities, and redefines part of the RVI/RVE custom encoding space for this purpose. For harts implementing RVY, these opcode spaces are not available for vendor custom instructions except where this specification explicitly returns subspaces to custom use.

The RVI custom-3 space is renamed to RVY-A and is used for standard RVY encodings.
The RVI custom-2 space is renamed to RVY-B and is used for standard RVY encodings.
The RVI custom-1 space is renamed to RVY-C and is used for standard RVY encodings.
The RVI custom-0 space is still available for custom encodings.

Table 12. RISC-V base opcode map for RVY, inst[1:0]=11
`inst[4:2]`	`000`	`001`	`010`	`011`	`100`	`101`	`110`	111 (>32b)
`inst[6:5]`	`000`	`001`	`010`	`011`	`100`	`101`	`110`	111 (>32b)
`00`	`LOAD`	`LOAD-FP`	`custom-0`	`MISC-MEM`	`OP-IMM`	`AUIPC`	`OP-IMM-32`	reserved
`01`	`STORE`	`STORE-FP`	`RVY-C`	`AMO`	`OP`	`LUI`	`OP-32`	reserved
`10`	`MADD`	`MSUB`	`NMSUB`	`NMADD`	`OP-FP`	`OP-V`	`RVY-B`	reserved
`11`	`BRANCH`	`JALR`	`reserved`	`JAL`	`SYSTEM`	`OP-VE`	`RVY-A`	reserved

Standard RVY encodings will never be placed in custom-0.

Parts of RVY-A/B/C may be returned to custom space in the future.

2.8.1. Instructions to Update The Capability Pointer

Creating a new capability with a different address (i.e., updating the pointer) requires specific instructions instead of integer ADD/ADDI. These instructions all include a check that the resulting address can be represented exactly within the new capability.

Table 13. Instructions which update the address field summary in RVY
Mnemonic	Description
YADDI	Capability pointer increment by immediate
YADD	Capability pointer increment
YADDRW	Write capability address

2.8.1.1. YADD, YADDI

Synopsis: Capability pointer increment
Mnemonics: YADD rd, rs1, rs2
YADDI rd, rs1, imm
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

YMV uses the reserved encoding of YADD where rs2=0.

Description

Copy the capability in register rs1 to register rd.

For YADD, increment rd.address by the value in rs2[XLEN-1:0].
For YADDI, increment rd.address by the sign-extended immediate value imm.

Set rd.tag=0 if rs1 is sealed.

Set rd.tag=0 if the resulting capability cannot be represented exactly.

Set rd.tag=0 if rs1 fails any integrity checks.

Operation for YADD

  let cs1_val = C(cs1);
  let rs2_val = X(rs2);

  let newCap = incCapAddrChecked(cs1_val, rs2_val);

  C(cd) = newCap;
  RETIRE_SUCCESS

Operation for YADDI

  let cs1_val = C(cs1);
  let immBits : xlenbits = sign_extend(imm);

  let newCap = incCapAddrChecked(cs1_val, immBits);

  C(cd) = newCap;
  RETIRE_SUCCESS

2.8.1.2. YADDRW

Synopsis: Write capability address
Mnemonic: YADDRW rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy the capability rs1 to rd.

Set rd.address to rs2[XLEN-1:0].

Set rd.tag=0 if rs1 is sealed.

Set rd.tag=0 if the resulting capability cannot be represented exactly.

Set rd.tag=0 if rs1 fails any integrity checks.

Operation

C(cd) = setCapAddrChecked(C(cs1), X(rs2));
RETIRE_SUCCESS

2.8.2. Instructions to Manipulate Capabilities

For security, capabilities can only be modified in restricted ways. Special instructions are provided to copy capabilities or derive a new capability using manipulations such as shrinking the bounds (YBNDSW), reducing the permissions (YPERMC) or authorizing a capability with another one which has a superset (or identical) bounds and permissions (YBLD).

Capability subset

Some instructions use the concept of cap2 being a capability subset of cap1. The definition is:

All permissions granted by the AP-field and the SDP-field of cap2 are also granted by those of cap1, and
cap2 's top bound is less than or equal to cap1 's top bound, and
cap2 's base bound is greater than or equal to cap1 's base bound, and
cap1 and cap2 pass all integrity checks

A capability whose capability tag is set is assumed to satisfy the integrity checks, so this rule can only affect the result for an operand whose capability tag is zero. Among the instructions that use the subset relation, this is only material for the rs2 operand of YBLD (which can set a capability tag on an arbitrary bit pattern) and for YSS (which can compare operands whose capability tags are zero).

Table 14. Summary of RVY instructions that create a modified capability
Mnemonic	Description
YPERMC	Clear capability permissions
YMV	Capability register copy
YHIW¹	Write capability metadata and clear capability tag (pseudo)
YBNDSWI	Write capability bounds by immediate
YBNDSW	Write capability bounds
YBNDSRW	Write capability bounds, rounding up if required
YSUNSEAL	Unseal by superset reconstruction
YBLD	Build capability
YSENTRY	Create a sealed entry point capability

¹ YHIW is a pseudoinstruction for PACKY

2.8.2.1. YPERMC

Synopsis: Clear capability permissions
Mnemonic: YPERMC rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

YPERMC performs the following operations:

Convert the AP-field and SDP-field fields of capability rs1 into a bit field with the format shown in Figure 6.
The initial value in register rs2[XLEN-1:0] is treated as a bit mask that specifies bit positions to be cleared in the bit field. Any bit that is high in rs2 will cause the corresponding bit to be cleared in the bit field.

Future extensions may include hardwired permission bits, in which case they are not cleared by set bits in rs2.
Encode the resulting architectural permissions as specified by the capability encoding format in use. This involves applying the rules in Section 2.3.10.1, as well as any rules added by extensions or the capability encoding format.
1. If the capability encoding format does not support the requested set of permissions then YPERMC will return a minimal set of permissions, which may be no permissions. Therefore, it is possible that requesting to clear a permission also clears others, but YPERMC will never add new permissions.
Copy rs1 to rd, and update the AP-field and SDP-fields with the newly calculated versions.
Set rd.tag=0 if rs1 is sealed and any bits in the AP-field or SDP-field were affected by YPERMC.
1. Extensions such as Zylevels1 add a field to the metadata which is reduced by YPERMC even when the capability is sealed.
Set rd.tag=0 if any integrity checks fail.

Figure 6. Capability permissions bit field

If a future extension adds a new permission that is dependent on an existing permission then clearing the original must also clear the new permission. This ensures software forward-compatibility: for example, a kernel that does not know about finer-grained ASR permissions must still be able to prevent all access to privileged instructions and CSRs simply by clearing ASR-permission. Such dependent permissions must be allocated to the bits that are currently reported as 1 to ensure forward-compatibility.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let cs1_val = C(cs1);
let rs2_val = X(rs2);

let cond = capIsSealed(cs1_val) | not(capReservedValid(cs1_val));
let inCap = clearTagIf(cs1_val, cond);

let old_perms = packPerms(getArchPermsLegalized(inCap), inCap.sd_perms).bits;

let new_perms = old_perms & not(rs2_val);

let (new_arch_perms, new_sd_perms) = unpackPerms(struct {bits = new_perms});
let newCap = { setArchPerms(inCap, new_arch_perms) with sd_perms = new_sd_perms };

C(cd) = newCap;
RETIRE_SUCCESS

2.8.2.2. YMV

Synopsis: Capability register copy
Mnemonic: YMV rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

YMV uses the reserved encoding of YADD where rs2=0.

Description

Copy rs1 to rd.

This instruction can propagate valid capabilities which fail integrity checks.

Operation

C(cd) = C(cs1);
RETIRE_SUCCESS

2.8.2.3. PACKY

Synopsis: Pack Y register
Mnemonic: PACKY rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

The PACKY instruction copies rs1[XLEN-1:0] into rd[XLEN-1:0], copies rs2[XLEN-1:0] into rd[YLEN-1:XLEN], and sets rd.tag=0.

Operation

let capVal = C(cs1);
let intVal = X(rs2);
let newCap = bitsToCap(false, intVal @ capVal.address);
C(cd) = newCap;
RETIRE_SUCCESS

2.8.2.4. YHIW

Synopsis

Write capability metadata and clear capability tag (pseudo)

Mnemonic

YHIW rd, rs1, rs2

Encoding

YHIW is a pseudoinstruction for PACKY

Description

Construct a YLEN-bit value in rd by setting the address from rs1 and the metadata from rs2. Set rd.tag to 0.

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
let intVal = X(rs2);
let newCap = bitsToCap(false, intVal @ capVal.address);
C(cd) = newCap;
RETIRE_SUCCESS

2.8.2.5. YBNDSW, YBNDSWI

Synopsis: Write capability bounds
Mnemonics: YBNDSW rd, rs1, rs2
YBNDSWI rd, rs1, imm
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy the capability from register rs1 to register rd. Set the base address of its bounds to the value of rs1.address and set the length of its bounds to rs2[XLEN-1:0] for YBNDSW, or imm for YBNDSWI.

Set rd.tag=0 if rs1.tag=0.

Set rd.tag=0 if rs1 is sealed.

Set rd.tag=0 if rd 's bounds exceed rs1 's bounds.

Set rd.tag=0 if the requested bounds cannot be encoded exactly.

Set rd.tag=0 if rs1 fails any integrity checks.

YBNDSWI decodes the 9-bit immediate value imm[8:0] to the requested length result as follows:

If imm[8:0] = 0, result is 4096.
If imm[8] = 0 and imm[7:0] != 0, result is imm[7:0].
If imm[8] = 1 and imm[7:5] = 0, result is 256 | (imm[3:0] << 4) | (imm[4] << 3).
Otherwise, result is imm[7:0] << 4.

The resulting logical encodable regions and step sizes are summarized in the table below:

Region Decoded range (bytes) Immediate decoding condition

Byte granular

1, 2, …, 255

imm[8] = 0 and imm[7:0] > 0

8-byte granular

256, 264, …, 504

imm[8:5] = 4’b1000

16-byte granular

512, 528, …, 4080

imm[8:5] > 4’b1000

Page-sized

4096

imm[8:0] = 0

This immediate encoding scheme achieves coverage of almost all observed immediate values while decoding the 9-bit immediate field without any adders or variable shifts. Mapping imm = 0 to 4096 reclaims the redundant zero encoding (already covered by YBNDSW dst, src, x0) to support highly common page-sized objects.

The hardware implementation is a simple bit-remapping:
result[12] = (imm[8:0] == 0);
result[11:9] = imm[8] ? imm[7:5] : 0;
result[8] = imm[8] ? (imm[7:5] == 0b000 || imm[4]) : 0;
result[7:4] = imm[8] ? imm[3:0] : imm[7:4];
result[3] = imm[8] ? (imm[7:4] == 0b0001) : imm[3];
result[2:0] = imm[8] ? 0 : imm[2:0];

Operation for YBNDSW

let cs1_val = C(cs1);
let length = X(rs2);
let newBase = cs1_val.address;
let newTop : CapLenBits = zero_extend(newBase) + zero_extend(length);
// inCapBoundsNoWrap returns false if the input bounds are malformed.
let inBounds = inCapBoundsNoWrap(cs1_val, newBase, unsigned(length));
let (exact, newCap) : (bool, Capability) = setCapBounds(cs1_val, newBase, newTop);
let cond = not(inBounds & exact) |
           boundsMalformed(newCap) |
           not(capReservedValid(newCap)) |
           capIsSealed(newCap);
C(cd) = clearTagIf(newCap, cond);
RETIRE_SUCCESS

Operation for YBNDSWI

This listing is manually maintained rather than extracted from the Sail model.

let cs1_val = C(cs1);
// Decode the requested length from the 9-bit immediate.
let length : xlenbits =
  if imm == 0b000000000 then to_bits(xlen, 4096)
  else if imm[8] == bitzero then zero_extend(imm[7 .. 0])
  else if imm[7 .. 5] == 0b000 then zero_extend(0b1 @ imm[3 .. 0] @ imm[4 .. 4] @ 0b000)
  else zero_extend(imm[7 .. 0] @ 0b0000);
let newBase = cs1_val.address;
let newTop : CapLenBits = zero_extend(newBase) + zero_extend(length);
// inCapBoundsNoWrap returns false if the input bounds are malformed.
let inBounds = inCapBoundsNoWrap(cs1_val, newBase, unsigned(length));
let (exact, newCap) : (bool, Capability) = setCapBounds(cs1_val, newBase, newTop);
let cond = not(inBounds & exact) |
           boundsMalformed(newCap) |
           not(capReservedValid(newCap)) |
           capIsSealed(newCap);
C(cd) = clearTagIf(newCap, cond);
RETIRE_SUCCESS

2.8.2.6. YBNDSRW

Synopsis: Write capability bounds, rounding up if required
Mnemonic: YBNDSRW rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy the capability from register rs1 to register rd. Set the base address of its bounds to the value of rs1.address and set the length of its bounds to rs2[XLEN-1:0].

The base is rounded down and the top is rounded up by the smallest amounts needed to form a capability covering the requested base and top.

Set rd.tag=0 if rs1.tag=0.

Set rd.tag=0 if rs1 is sealed.

Set rd.tag=0 if rd 's bounds exceed rs1 's bounds.

Set rd.tag=0 if rs1 fails any integrity checks.

Operation

let cs1_val = C(cs1);
let length = X(rs2);
let newBase = cs1_val.address;
let newTop : CapLenBits = zero_extend(newBase) + zero_extend(length);
// inCapBoundsNoWrap returns false if the input bounds are malformed.
let inBounds = inCapBoundsNoWrap(cs1_val, newBase, unsigned(length));
let (_, newCap) : (bool, Capability) = setCapBounds(cs1_val, newBase, newTop);
let cond = not(inBounds) |
           boundsMalformed(newCap) |
           not(capReservedValid(newCap)) |
           capIsSealed(newCap);
C(cd) = clearTagIf(newCap, cond);
RETIRE_SUCCESS

Checking only the requested region with inCapBoundsNoWrap is sufficient, since the rounding performed by setCapBounds can never grow the bounds beyond those of rs1.

2.8.2.7. YSUNSEAL

Synopsis: Unseal by superset reconstruction
Mnemonic: YSUNSEAL rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy rs2 to rd.

Set rd.ct=0. (That is, unseal rd.)

Set rd.tag=1 if:

rs1.tag=1, and
rs1 is not sealed, and
rs2.tag=1, and
rs2 is sealed, and

rs2 is a capability subset of rs1

Otherwise, set rd.tag=0

When rs1 is x0 YSUNSEAL will copy rs2 to rd and clear rd.tag and rd.ct. However future extensions may add additional behavior to update currently reserved fields, and so software should not assume rs1=0 to be a pseudo-instruction for capability tag and type clearing.

YSUNSEAL is intended to enable "superset unsealing" of opaque handles to software objects. Specifically, a software component can:

allocate the memory for these objects from a region of address space,
render capabilities to these objects opaque by sealing with YSEAL,
distribute these handles to other software components, and
later use its (unsealed) capability to the backing region as the authority (in rs1) of a YSUNSEAL instruction to recover an unsealed capability to the object backing a handle (in rs2) received from other components.

The result of YSUNSEAL will have its capability tag set to zero if the received capability is not a handle to an object in the recipient’s address space. This makes it easy for recipient software to ensure that received capabilities actually are handles to the recipient’s objects.

While YSUNSEAL requires that the capability in its rs2 is sealed, it imposes no requirements on which non-zero CT-field value has been used to seal rs2. If the capability encoding defines multiple non-zero CT-field values and software wishes to distinguish between them, it must use YTYPER on the sealed capability.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let cs1_val = C(cs1);
let cs2_val = C(cs2);

let tag = cs1_val.tag &
          not(capIsSealed(cs1_val)) &
          cs2_val.tag &
          capIsSealed(cs2_val) &
          /* subset checks for malformed bounds, perms, reserved bits */
          capIsSubset(cs2_val, cs1_val);

C(cd) = { unsealCap(cs2_val) with tag = tag };
RETIRE_SUCCESS

2.8.2.8. YBLD

Synopsis: Build capability
Mnemonic: YBLD rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy rs2 to rd.

If rd.ct (that is, its CT-field) is neither 0 nor an ambient type, then set rd.ct to 0.

Set rd.tag=1 if:

rs1.tag=1, and
rs1 is not sealed, and
rs2 is a capability subset of rs1, and
rs2 passes a mandatory integrity check

Otherwise, set rd.tag=0

The integrity check on rs2 is mandatory to prevent authorizing a capability with a lack of integrity.

The integrity check, from the capability subset definition, on rs1 is optional.

YBLD will construct a sealed capability only if its type is ambiently available.

YBLD is typically used alongside YHIW to build capabilities from integer values.

YBLD can be used to speed up operations such as paging in memory after swap.

When rs1 is x0 YBLD will copy rs2 to rd and clear rd.tag. However future extensions may add additional behavior to update currently reserved fields, and so software should not assume rs1=0 to be a pseudo-instruction for capability tag clearing.

Operation

  let cs1_val = C(cs1);
  let cs2_val = C(cs2);

  let tag = cs1_val.tag &
            not(capIsSealed(cs1_val)) &
            capIsSubset(cs2_val, cs1_val); /* Subset checks for malformed bounds,
                                              perms, and reserved bits */

  C(cd) = { cs2_val with tag = tag };
  RETIRE_SUCCESS

2.8.2.9. YSENTRY

Synopsis: Create a sealed entry point capability
Mnemonic: YSENTRY rd, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

rs1≠x0 is currently reserved to decode as a future YSEAL instruction.

Description

Copy rs2 to rd.

Seal the capability in rd by setting rd.ct=1.

Set rd.tag=0 if rs2 is sealed or fails any integrity checks.

A future extension will allow YSEAL, which has rs1 specifying an authorizing capability, to perform the sealing action and to specify the destination CT-field value.

If the capability encoding format does not support a CT-field of type 1, then this instruction is reserved. Future capability encoding formats may redefine the behavior for CT-field values greater than 1.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let cs2_val = C(cs2);
let inCap = clearTagIf(cs2_val, capIsSealed(cs2_val) | not(capReservedValid(cs2_val)));
C(cd) = sealCap(inCap);
RETIRE_SUCCESS

2.8.3. Instructions to Decode Capability Bounds

The bounds describing the range of addresses the capability gives access to are stored in a compressed format. These instructions query the bounds and related information.

Table 15. Instructions which decode capability bounds summary in RVY
Mnemonic	Description
YBASER	Read capability base address
YLENR	Read capability length
YTOPR	Read capability top address

2.8.3.1. YBASER

Synopsis: Read capability base address
Mnemonic: YBASER rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Decode the base integer address from rs1 's bounds and write the result to rd.

If rs1 's bounds can’t be decoded, or rs1 fails any integrity checks, then return zero.

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
X(rd) = match getCapBoundsBits(capVal) {
  None() => zeros(),
  Some(base, _) => base
};
RETIRE_SUCCESS

2.8.3.2. YLENR

Synopsis: Read capability length
Mnemonic: YLENR rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Calculate the length of rs1 's bounds and write the result in rd.

The length is defined as the difference between the decoded bounds' top and base addresses, i.e., top - base.

Return the maximum length, 2^XLEN-1, if the length of rs1 is 2^XLEN.

If rs1 's bounds can’t be decoded, or rs1 fails any integrity checks, then return zero.

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
// getCapLength returns 0 if the bounds are malformed
let len = getCapLength(capVal);
X(rd) = to_bits(xlen, if len > cap_max_addr then cap_max_addr else len);
RETIRE_SUCCESS

2.8.3.3. YTOPR

Synopsis: Read capability top address
Mnemonic: YTOPR rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Decode the top integer address from rs1 's bounds, saturated to 2^XLEN-1, and write the result to rd.

If rs1 's bounds can’t be decoded, or rs1 fails any integrity checks, then return zero.

The value of rs1.tag does not affect the result.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let capVal = C(cs1);
X(rd) = match getCapBoundsBits(capVal) {
  None() => zeros(),
  Some(_, top) => {
    let top_val = unsigned(top);
    to_bits(xlen, if top_val > cap_max_addr then cap_max_addr else top_val)
  }
};
RETIRE_SUCCESS

2.8.4. Instructions to Extract Capability Fields

These instructions either directly read bit fields from the metadata or capability tag, or only apply simple transformations on the metadata.

Table 16. Instructions which extract capability fields summary in RVY
Mnemonic	Description
YTAGR	Read capability tag
YPERMR	Read capability permissions
YTYPER	Read capability type
YHIR¹	Read capability metadata (pseudo)

2.8.4.1. YTAGR

Synopsis: Read capability tag
Mnemonic: YTAGR rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Zero extend the value of rs1.tag and write the result to rd.

Operation

let capVal = C(cs1);
X(rd) = zero_extend(bool_to_bits(capVal.tag));
RETIRE_SUCCESS

2.8.4.2. YPERMR

Synopsis: Read capability permissions
Mnemonic: YPERMR rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Convert the unpacked AP-field and SDP-fields of capability rs1 into a bit field, with the same format as used by YPERMC (see Figure 7), zero extend and write the result to rd.

All bits in the [23:0] range that are reserved or assigned to extensions that are not implemented by the current hart are hardwired to 1.

If rs1 fails any integrity checks, all currently allocated permission bits in rd report 0.

The hardwired bits described above are unaffected and still report 1.

Figure 7. Capability permissions bit field

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
X(rd) = packPerms(getArchPermsLegalized(capVal), capVal.sd_perms).bits;
RETIRE_SUCCESS

2.8.4.3. YTYPER

Synopsis: Read capability type
Mnemonic: YTYPER rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Decode the architectural capability type (CT-field) from rs1, zero extend and write the result to rd.

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
X(rd) = zero_extend(bool_to_bits(capVal.sealed));
RETIRE_SUCCESS

The Sail model stores the 1-bit CT-field of RV64LYA as the boolean sealed. Capability encoding formats with a wider CT-field must return the full field value.

2.8.4.4. SRLIY

Synopsis: Logical right shift of Y register
Mnemonic: SRLIY rd, rs1, shamt
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description: Logical right shift of Y register rs1 to rd, zero-filling the upper bits of the result. Set rd.tag=0.

The only valid shift amount is XLEN (as required by YHIR). Encodings with any other shift amount are reserved.

A future extension may add an arbitrary shift distance.

Operation

This listing is manually maintained rather than extracted from the Sail model.

X(rd) = truncate(capToBits(C(rs1)) >> shamt, xlen);
RETIRE_SUCCESS

2.8.4.5. YHIR

Synopsis: Read capability metadata (pseudo)
Mnemonic: YHIR rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

YHIR rd, rs1 is a pseudo instruction for a XLEN-wide right shift (SRLIY).

Description

Copy the metadata (bits [YLEN-1:XLEN]) of capability rs1 into rd.

The value of rs1.tag does not affect the result.

Operation

let capVal = C(cs1);
X(rd) = capToMetadataBits(capVal).bits;
RETIRE_SUCCESS

2.8.5. Miscellaneous Instructions to Handle Capability Data

Table 17. Miscellaneous capability instruction summary in RVY
Mnemonic	Description
YEQ	Capability equality comparison including capability tag
YSS	Capability is a subset
YAMASK	Capability alignment mask

2.8.5.1. YEQ

Synopsis: Capability equality comparison including capability tag
Mnemonic: YEQ rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Set rd to 1 if all bits (i.e., YLEN bits and the capability tag) of capabilities rs1 and rs2 are equal, otherwise set rd to 0.

Operation

let cs1_val = C(cs1);
let cs2_val = C(cs2);
X(rd) = zero_extend(bool_to_bits(cs1_val == cs2_val));
RETIRE_SUCCESS

2.8.5.2. YSS

Synopsis: Capability is a subset
Mnemonic: YSS rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

rd is set to 1 if:

the capability tags of capabilities rs1 and rs2 are equal, and
rs2 is a capability subset of rs1

Otherwise set rd to 0. Extensions may further impose constraints on when rd is set to 1.

The implementation of this instruction is similar to YBLD, although YSS does not include the sealed bit in the check.

Operation

  let cs1_val = C(cs1);
  let cs2_val = C(cs2);

  X(rd) = zero_extend(bool_bits(
    (cs1_val.tag == cs2_val.tag) &
    capIsSubset(cs2_val, cs1_val) /* capIsSubset returns false if either input
                                     has malformed bounds, perms, or non-zero
                                     reserved bits */
  ));
  RETIRE_SUCCESS

2.8.5.3. YAMASK

Synopsis: Capability alignment mask
Mnemonic: YAMASK rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description: rd[XLEN-1:0] is set to a mask that can be used to round addresses down to a value that is sufficiently aligned to set exact bounds for the nearest representable length of rs1[XLEN-1:0]. The upper bits of rd are zero extended. See Section 3.1.5 for the algorithm used to compute the next representable length.

This RVY instruction only handles XLEN operands and produces an XLEN result.

Operation

let len = X(rs1);
X(rd) = getRepresentableAlignmentMask(len);
RETIRE_SUCCESS

2.8.6. Instructions to Load and Store Capability Data

New loads and stores are introduced to handle capability data, LY and SY. They atomically access YLEN bits of data and the associated capability tag.

All capability memory accesses check for C-permission in the authorizing capability in rs1.

If C-permission is granted then:

LY reads YLEN bits of data from memory, and returns the associated capability tag.
SY writes YLEN bits of data to memory, and writes the associated capability tag.

If C-permission is not granted then:

LY reads YLEN bits from memory, but does not return the associated capability tag, instead zero is returned.
SY writes YLEN bits to memory, and writes zero to the associated capability tag.

All capability data memory access instructions require YLEN-aligned addresses, and will take an access fault exception if this requirement is not met. They cannot be emulated.

An access fault is raised instead of a misaligned exception since these instructions cannot be emulated since there is one hidden capability tag per YLEN-aligned memory region.

All memory accesses, of any type, require permission from the authorizing capability in rs1.

All loads require R-permission, otherwise they raise an exception.
All stores require W-permission, otherwise they raise an exception.

Under some circumstances LY will modify the data loaded from memory before writing it back to the destination register. See LY for details.

Table 18. Capability load/store instruction summary in RVY
Mnemonic	Description
LY	Load capability
SY	Store capability

2.8.6.1. LY

Synopsis: Load capability
Mnemonic: LY rd, offset(rs1)
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Any instance of this instruction with rs1=x0 will raise an exception, as x0 is defined to always hold a NULL capability. As such, the encodings with rs1=x0 are RESERVED for use by future extensions.

Description

Calculate the effective address of the memory access by adding rs1.address to the sign-extended 12-bit offset.

Authorize the memory access with the capability in rs1.

Load a naturally aligned YLEN-bit data value and associated capability tag from memory.

The loaded capability tag may be set to zero under platform specified conditions, such as by a hardware based use-after-free prevention scheme. Privileged mechanisms may also set the loaded capability tag to zero such as PMA settings or virtual memory system page table entry configuration.

Extensions may also specify additional conditions which set the capability tag to zero.

The final value of rd is determined as follows:

Write the loaded YLEN data to rd and the loaded capability tag into rd.tag.
If rs1 does not grant C-permission then set rd.tag=0.

If rd.tag=1, rs1 does not grant LM-permission and rd is not sealed, then an implicit YPERMC is performed to clear W-permission and LM-permission from rd.

The implicit YPERMC is not required to set rd.tag=0 if the loaded data fails any integrity checks. A suggested implementation is to only check the AP-field for legal values, and set the whole AP-field to zero if the check fails.

Metadata modifications on loaded capabilities are on naturally aligned data. Therefore on the read path from a data cache, the modification typically happens in parallel with data alignment multiplexers.

Zylevels1 adds to the YPERMC rules on loaded capability data.

When sending load data to a trace interface, implementations trace the final value written to rd which may not match the value in memory.

Exceptions

Raise a load access fault exception when the effective address is not aligned to YLEN/8.

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Load Access Fault

Authorizing capability tag is set to 0.

CHERI Load Access Fault

Authorizing capability is sealed.

CHERI Load Access Fault

Authorizing capability does not grant the necessary permissions. R-permission is required.

CHERI Load Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Load Access Fault

Authorizing capability failed any integrity check.

Operation

  let offset : xlenbits = sign_extend(imm);
  let (auth_val, vaddr) = get_cheri_mode_cap_addr(rs1_cs1, offset);
  let aq : bool = false;
  let rl : bool = false;

  match check_and_handle_load_vaddr_for_triggers(vaddr, get_arch_pc()) {
    Some (ret) => return ret,
    None () => ()
  };
  if not(capTaggedAndReservedValid(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_TagViolation);
    RETIRE_FAIL
  } else if capIsSealed(auth_val) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_SealViolation);
    RETIRE_FAIL
  } else if not(canR(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(validAddrRange(vaddr, cap_size) | capBoundsInfinite(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_InvalidAddressViolation);
    RETIRE_FAIL
  } else if not(inCapBounds(auth_val, vaddr, cap_size)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_LengthViolation);
    RETIRE_FAIL
  } else if not(is_aligned_addr(vaddr, cap_size)) then {
    handle_mem_exception(vaddr, E_Load_Addr_Align());
    RETIRE_FAIL
  } else match translateAddr(vaddr, Read(Cap)) {
    TR_Failure(E_Extension(_)) => { internal_error(__FILE__, __LINE__, "unexpected cheri exception for cap load") },
    TR_Failure(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
    TR_Address(addr, pbmt, ptw_info) => {
      let c = mem_read_cap(addr, pbmt, aq, aq & rl, false);
      match c {
        MemValue(v) => {
          let cr = clearTagIf(v, ptw_info.ptw_lc == PTW_LC_CLEAR | not(canC(auth_val)));
          C(cd) = legalizeLM(cr, auth_val);
          RETIRE_SUCCESS
        },
        MemException(e) => {handle_mem_exception(vaddr, e); RETIRE_FAIL }
      }
    }
  }

2.8.6.2. SY

Synopsis: Store capability
Mnemonic: SY rs2, offset(rs1)
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Calculate the effective address of the memory access by adding rs1.address to the sign-extended 12-bit offset.

Authorize the memory access with the capability in rs1.

Store the YLEN-bit value in rs2, together with its capability tag, to the naturally aligned memory location.

This instruction can propagate valid capabilities which fail integrity checks.

Stored Capability Tag Value

Set the stored capability tag to zero if:

rs2.tag=0, or
rs1 does not grant C-permission, or
Platform specified conditions or privileged configurations (e.g., PMAs or PTEs) specify that it must be set to zero.

Extensions may define further circumstances under which stored capabilities may have their capability tags set to zero. Zylevels1 is an example of this.

Exceptions

Raise a store/AMO access fault exception when the effective address is not aligned to YLEN/8.

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission is required.

CHERI Store/AMO Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

Operation

  let offset : xlenbits = sign_extend(imm);
  let (auth_val, vaddr) = get_cheri_mode_cap_addr(rs1_cs1, offset);
  let cs2_val = C(cs2);
  let aq : bool = false;
  let rl : bool = false;
  let cs2_val = clearTagIf(cs2_val, not(canC(auth_val)));

  match check_and_handle_store_vaddr_for_triggers(vaddr, get_arch_pc()) {
    Some (ret) => return ret,
    None () => ()
  };
  if not(capTaggedAndReservedValid(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_TagViolation);
    RETIRE_FAIL
  } else if capIsSealed(auth_val) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_SealViolation);
    RETIRE_FAIL
  } else if not(canW(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(validAddrRange(vaddr, cap_size) | capBoundsInfinite(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_InvalidAddressViolation);
    RETIRE_FAIL
  } else if not(inCapBounds(auth_val, vaddr, cap_size)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_LengthViolation);
    RETIRE_FAIL
  } else if not(is_aligned_addr(vaddr, cap_size)) then {
    handle_mem_exception(vaddr, E_SAMO_Addr_Align());
    RETIRE_FAIL
  } else match translateAddr(vaddr, Write(if cs2_val.tag then Cap else Data)) {
    TR_Failure(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
    TR_Address(addr, pbmt, _) => {
      let eares : MemoryOpResult(unit) = mem_write_ea_cap(addr, aq & rl, rl, false);
      match (eares) {
        MemException(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
        MemValue(_) => {
          let res : MemoryOpResult(bool) = mem_write_cap(addr, pbmt, cs2_val, aq & rl, rl, false);
          match (res) {
            MemValue(true)  => RETIRE_SUCCESS,
            MemValue(false) => internal_error(__FILE__, __LINE__, "store got false from mem_write_value"),
            MemException(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL }
          }
        }
      }
    }
  }

2.9. Changes to Existing RISC-V Base ISA Instructions

RVY extends existing instructions that are used for handling addresses so that they manipulate a whole capability.

Whenever an input operand is used as an address (e.g., the load/store base address), all capability bits are fed into the instruction instead of just XLEN bits.
Any instruction that writes back an address (e.g., AUIPC (RVY) or CSRRW (RVY)) to the destination register writes a full capability register instead of just XLEN bits. For all other results the high bits of the register and the capability tag are set to zero.
Whenever a capability with a new address is returned, the result is always created using the semantics of the YADDRW instruction.

ADD and ADDI are not affected by the rule above. Even though they are used for handling addresses, they also have other uses. New encodings are used for capability addition: YADD and YADDI. They must be used for all capability address incrementing.

Integer add (ADD) and capability add (YADD) have separate encodings. Using a single encoding for both is undesirable:

Integer ADD is most commonly used for purposes other than address calculations.
For high performance implementations which can issue multiple ADDs, it means that the integer ADD units do not need the upper halves of the operands, and do not need the capability check logic on the result.
The compiler and/or programmer would have to execute another metadata clearing instruction after each ADD to ensure that compartments do not leak capabilities.

The rules above apply to the base ISA instructions listed in the following subsections, but also apply to instructions added by other extensions. Any change to instruction semantics for RVY is called out in the chapter defining the extension.

2.9.1. Changes to load/stores

All load and store instructions behave as described in Load and Store Instructions with one fundamental difference:

Any memory instruction that has rs1 as a base address register reads the full capability register instead. The base address is unchanged, i.e., using the value from rs1. The metadata and capability tag are used to authorize the access.
For a load instruction, the lower XLEN bits of the result written to the destination register are the same as in the RV32[IE]/RV64[IE] specification.

All load and store instructions authorized by rs1 raise exceptions if any of these checks fail:

rs1 must not be x0¹
The capability tag (rs1.tag) must be set to one.
rs1 must be unsealed.
For loads, R-permission must be granted in rs1.
For stores, W-permission must be granted in rs1.
All integrity checks on rs1 must pass.

¹ All load/store encodings are reserved if rs1=x0 (since dereferencing NULL always faults).

All load instructions, except for LY, always set the capability tag and the metadata of the result register to zero.

All store instructions, except for SY, always write zero to the capability tag or capability tags associated with the memory locations that are written to. Therefore, misaligned stores may clear up to two associated capability tags.

The changed interpretation of the base register also applies to all loads, stores and all other memory operations defined in later chapters of this specification with a base operand of rs1 unless stated otherwise.

Under RVY all loads and stores are authorized by rs1.

These rules affect the following base ISA instructions listed in Table 19, and also apply to instructions added by other extensions, e.g.,:

Floating-point loads and stores.
Vector load and stores.
Atomic memory accesses, see Zaamo (RVY added instructions) and Zalrsc (RVY added instructions).

Table 19. Changed RISC-V base ISA load/store instructions summary in RVY
Mnemonic	Description
LD, LW[U], LH[U], LB[U]	Integer loads (authorized by the capability in `rs1`)
SD, SW, SH, SB	Integer stores (authorized by the capability in `rs1`)

2.9.2. Changes to PC

Whenever the address field of the pc is modified, it is always updated using the semantics of the YADDRW instruction. This includes adding an offset to the pc from direct jumps and branches for both the target address and the link register. In this case, e.g., new_pc = YADDRW(old_pc, offset)
JALR (RVY) copies rs1 into the pc, and increments the address field with the offset. In this case, e.g., new_pc = YADDRW(Unseal(rs1), offset)

Using YADDRW to update the pc address can cause the target pc capability tag to be set to zero, that raises a CHERI Instruction Access Fault on the target instruction.

A future extension may raise an exception on branch and jump instructions if fetching a minimum-sized instruction at the target pc will raise a CHERI Instruction Access Fault.

These rules affect the following base ISA instructions listed in Table 20, and also apply to instructions added by other extensions.

Table 20. Changed RISC-V base ISA PC relative instructions summary in RVY
Mnemonic	Description
AUIPC (RVY)	Add upper immediate to pc
JAL (RVY)	Immediate offset jump, and link to capability register
JALR (RVY)	Jump to capability register, and link to capability register
BEQ (RVY)	Immediate offset condition branch, taken if operands are equal
BNE (RVY)	Immediate offset condition branch, taken if operands are not equal

2.9.3. AUIPC (RVY)

Synopsis: Add upper immediate to pc
Mnemonic: auipc rd, imm
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description

Form a 32-bit offset from the 20-bit immediate filling the lowest bits with zero; the number of places to shift is determined by the capability encoding format’s AUIPC_SHIFT parameter (defaulting to 12). Take the value of the AUIPC instruction’s pc, increment its address by the 32-bit offset using the semantics of the YADDRW instruction and write the result to rd.

Set rd.tag=0 if the resulting capability cannot be represented exactly.

Operation

let off : xlenbits = sign_extend(imm @ 0x000);
let (representable, newCap) = setCapAddr(PCC, PC + off);
C(cd) = clearTagIf(newCap, not(representable));
RETIRE_SUCCESS

2.9.3.1. The AUIPC Shift

The RISC-V base integer ISA frequently splits signed 32-bit constants across instructions as the addition of a signed 12-bit constant and a 20-bit constant shifted left by 12 bits. For example, the I-type instruction ADDI, with its 12-bit immediate, is to be combined with the U-type LUI instruction and its 20-bit immediate when values beyond the reach of a signed 12-bit value are needed. To reach a given value in this way involve "overshooting" the desired value: for example, to materialize 0xf01 (3841) into a register, one uses LUI to materialize 0x1000 (4096) and ADDI to subtract 0xff (255). Similarly, the U-type AUIPC instruction, with its 20-bit immediate, is designed to compose well with the signed 12-bit immediate operands of load (I-type) and store (S-type) instructions.

When manipulating addresses within capabilities, there is a risk that such two-step sequences could take the address out of bounds before attempting to bring it back within bounds. Many capability encodings, including those of RV64LYA, have a representable range sufficient to ensure that any capability whose length is larger than 2 KiB (that is, those for which a signed 12-bit displacement might be insufficient) are able to represent at least 2 KiB on either side of their bounds. However, this is not an essential property of capability encodings, and so this specification allows the capability encoding to specify the shift used within address-manipulating instructions with shifted immediates. For AUIPC (RVY) specifically, we refer to this value as the AUIPC shift, and take it to be 12 unless the capability encoding sets it to another value. (Taking the shift to be 11 instead of 12 decreases the reach of AUIPC from ±2 GiB to ±1 GiB, but ensures that all values within that range can be obtained with the same sign bit in the AUIPC immediate and subsequent 12-bit immediate(s), thereby ensuring that in-bounds addresses can be reached without risk of the intermediate computation exceeding capability bounds.)

Future extensions that add instructions with similar semantics should make use of this same encoding-specified shift value or otherwise allow the capability encoding to set the shift amount.

It is possible for the compiler to generate code that is compatible with any AUIPC shift by emitting a AUIPC with a zero offset followed by a sequence of LUI and YADD/YADDI.

2.9.4. JAL (RVY)

Synopsis: Immediate offset jump, and link to capability register
Mnemonic: jal rd, offset
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description

Direct jump with an address offset.

The target address is obtained by adding the sign-extended 20-bit J-immediate to pc.address.
The target pc is obtained by applying the target address to the pc of the JAL instruction, using the semantics of the YADDRW instruction.
The pc of the next instruction is sealed as a Sealed entry point capability and written to rd.

Jump to the target pc.

A future extension may raise an exception on the branch instruction itself if fetching a minimum-sized instruction at the target pc will raise a CHERI Instruction Access Fault. Performing the pc bounds check at the branch source instead of on instruction fetch is helpful for debugging and can simplify the implementation of CPUs with very short pipelines.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let off : xlenbits = sign_extend(imm);
let newPC = PC + off;
/* nextPC accounts for compressed instructions */
let (_, linkCap) = setCapAddr(PCC, nextPC);
C(cd) = sealCap(linkCap);
set_next_pc(newPC);
RETIRE_SUCCESS

2.9.5. JALR (RVY)

Synopsis: Jump to capability register, and link to capability register
Mnemonic: jalr rd, rs1, offset
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description

Indirect jump to the target capability in rs1 with an address offset.

rs1 is written to the target pc.
The target address is obtained by adding the sign-extended 12-bit I-immediate to rs1.address, then setting the least-significant bit of the result to zero.
Unseal the target pc if it is a Sealed entry point capability, rs1.address[0] is zero, and the I-immediate is zero.
Set the address of the target pc to the target address using the semantics of the YADDRW instruction.
The pc of the next instruction is sealed as a Sealed entry point capability and written to rd.
Jump to the target pc.

A future extension may raise an exception on the JALR (RVY) instruction itself if the target pc will raise a CHERI Instruction Access Fault at the target.

Operation

This listing is manually maintained rather than extracted from the Sail model.

let cs1_val = C(cs1);
let off : xlenbits = sign_extend(imm);
/* clear bit zero as for RISC-V JALR */
let newPC = [cs1_val.address + off with 0 = bitzero];
/* nextPC accounts for compressed instructions */
let (_, linkCap) = setCapAddr(PCC, nextPC);
C(cd) = sealCap(linkCap);
set_next_pc(newPC);
let (_, newPCC) = setCapAddr(cs1_val, newPC);
set_next_pcc(unsealCap(newPCC));
RETIRE_SUCCESS

2.9.6. BEQ, BNE (RVY)

For beq and bne only, if rs1≤rs2 then the encoding is reserved. This includes all encodings with rs1=rs2.

The reserved encodings are redundant and may be used by future extensions, suggestions include branching on capability tag values only, or YLEN-bit compares.

For beq rs1, rs2, offset and bne rs1, rs2, offset, assemblers must emit the operand order whose register-number ordering is not reserved. For example, beq x1, x2, offset is emitted as beq x2, x1, offset. For the reserved rs1=rs2 case, j/jal can be used instead of an always-taken beq rs1, rs1, offset, and a never-taken bne rs1, rs1, offset is equivalent to a nop.

2.10. RVY ISA summary

2.10.1. RVY added instructions

Table 21. RVY added instructions
Mnemonic	Function
LY	Load capability
SY	Store capability
YADD	Capability pointer increment
YADDI	Capability pointer increment by immediate
YADDRW	Write capability address
YTAGR	Read capability tag
YPERMR	Read capability permissions
YMV	Capability register copy
YPERMC	Clear capability permissions
SRLIY	Logical right shift of Y register
YHIR	Read capability metadata (pseudo)
PACKY	Pack Y register
YHIW	Write capability metadata and clear capability tag (pseudo)
YEQ	Capability equality comparison including capability tag
YSS	Capability is a subset
YBLD	Build capability
YSUNSEAL	Unseal by superset reconstruction
YBNDSW	Write capability bounds
YBNDSWI	Write capability bounds by immediate
YBNDSRW	Write capability bounds, rounding up if required
YAMASK	Capability alignment mask
YBASER	Read capability base address
YLENR	Read capability length
YTYPER	Read capability type
YTOPR	Read capability top address

2.10.2. RVI (RVY modified behavior)

Table 22. RVI (RVY modified behavior) instructions
Mnemonic	Function
AUIPC (RVY)	Add upper immediate to pc
BEQ (RVY)	Immediate offset condition branch, taken if operands are not equal
BNE (RVY)	Immediate offset condition branch, taken if operands are not equal
JAL (RVY)	Immediate offset jump, and link to capability register
JALR (RVY)	Jump to capability register, and link to capability register

3. The RV64LYA Capability Base for RV64Y

This section describes the in-memory format and properties of the capability encoding intended for RV64Y.

The format is closely modeled upon features from CHERI v9 (Watson et al., 2023), and the bounds encoding scheme is based upon CHERI Concentrate (Woodruff et al., 2019).

3.1. Capability Encoding

Figure 8. Capability encoding for RV64LYA

Reserved bits must be 0 in valid capabilities and are available for future extensions to RVY.

The encoding diagram above of the capability encoding format includes some fields which depend upon the presence of extensions:

Zyhybrid

When Zyhybrid is implemented

The P-bit is implemented, otherwise reserved zero.

Zylevels1

When Zylevels1 is implemented:

The GL flag is implemented, otherwise reserved zero.
The AP-field includes LG-permission and SL-permission. Otherwise both are reserved one.

The bit ranges in Table 23 are relative to the XLEN-bit metadata, i.e., the upper half of the in-memory representation.

Table 23. RV64LYA capability encoding format metadata fields
Field	Bit range	Comment
SDP	63:60	Software Defined Permissions.
AP	52:45	Architectural Permissions, see Table 26.
P	44	Pointer Mode (Zyhybrid only).
GL	43	Global (GL) flag (Zylevels1 only).
CT	27	Capability Type field.
EF	26	Exponent format for the capability bounds decoding, see Table 27.
T[11:3]	25:17	Top address field for the capability bounds decoding, see Table 27.
TE	16:14	Top address or exponent field for the capability bounds decoding, see Table 27.
B[13:3]	13:3	Base address field for the capability bounds decoding, see Table 27.
BE	2:0	Base address or exponent field for the capability bounds decoding, see Table 27.
All other fields are reserved

T[13:12], T[2:0] and B[2:0] are not specified by the metadata fields and are calculated as shown in Section 3.1.5.2.

3.1.1. Capability Encoding Parameter Summary

Table 24. RV64LYA capability encoding format parameters
Parameter	Default Value	Comment
MW	14	Mantissa width for the bounds encoding
EW	6	Exponent width for the bounds encoding
CAP_MAX_E	52	Maximum exponent value
enableL8	0	Whether the encoding format includes the `L₈` bit
AP_MAX	All ones	Value of the AP-field giving maximum permissions
AP_ENC	full1_rv64ya_1	Encoding scheme for the AP-field - all combinations are representable
AUIPC_SHIFT	12	Shift distance of the immediate in AUIPC
REP_RANGE	rc1	Whether out of bounds addresses are representable

Table 25. RV64LYA extension summary
Extension	Comment
Zyhybrid	Compatible
Zylevels1	Compatible
Zyseal	Will be compatible once new permissions are encoded
All RV64Y versions of other standard extensions	Compatible

3.1.2. Architectural Permissions (AP) Encoding

The permissions field is 8 bits wide and is encoded using one bit per architectural permission as shown in Table 26. Some bit patterns are reserved because they encode illegal combinations of permissions (see Section 2.3.10.1).

Table 26. Encoding of architectural permissions for RV64LYA
Bit	Encoded permission
0	C-permission
1	W-permission
2	R-permission
3	X-permission
4	ASR-permission
5	LM-permission
6	LG-permission (Zylevels1)
7	SL-permission (Zylevels1)

Future extensions which define new permissions must augment Table 26.

There are 9 bits in total allocated to the AP-field and the P-bit.

The total valid combinations of C, W, R, LM, LG and SL are 18.
The total valid combinations of X, ASR and P are 5.
The two groups are fully orthogonal.

Therefore there are 90 valid combinations encoded in 9-bits.

Future extensions or capability encoding formats may leverage these reserved patterns to implement a more compact joint AP-field and P-bit encoding that is fully compatible for all software to increase the number of reserved bits.

3.1.3. Software-Defined Permissions (SDP) Encoding

The SDP-field is 4 bits wide. The value of the SDP-field bits of the YPERMR result maps 1:1 to the SDP-field in the capability.

3.1.4. Capability Type (CT) Encoding

The CT-field is as specified in Table 6.

The capabilities of this chapter define a 1-bit field for CT-field values; this field directly encodes the values 0 and 1:

The encoded value 0 indicates an unsealed type.
The encoded value 1 indicates a sealed type which is unsealed if passed to JALR (RVY) in rs1, and is used to seal the capability written to rd.

The capabilities of this chapter define a 1-bit field for CT-field values; this field directly encodes the values 0 and 1. The value 1 is…

considered ambiently available for YBLD,
used as the type for capabilities sealed by YSEAL instructions, not requiring a specific sealing permission.

Additionally, JALR (RVY) both

unseals input capabilities of type 1 and
seals its return capabilities with type 1.

JALR (RVY) places no constraints on the triple of input CT-field value, rd selector, and rs1 selector. That is, JALR (RVY) will, as directed, attempt to jump to any unsealed or sealed capability in any register regardless of which register comes to hold the sealed return pointer.

The permission encodings of RV64LYA do not provide mappings for the Zyseal extension’s SE-permission or US-permission. Thus, without further revision, the encodings of this chapter are incompatible with the Zyseal extension.

3.1.5. Bounds (EF, T, TE, B, BE, L8) Encoding

3.1.5.1. Concept

This bounds encoding scheme is based upon (Woodruff et al., 2019).

The bounds encode the base and top addresses that constrain memory accesses. The capability can be used to access any memory location A in the range base ≤ A < top. The bounds are encoded in a compressed format, so it is not possible to encode any arbitrary combination of base and top addresses. An invalid capability is produced when attempting to construct a capability that is not representable because its bounds cannot be correctly encoded.

The bounds field has the following components:

Table 27. RV64LYA capability bounds encoding fields
Variable	Description
T	Value substituted into the capability’s address to decode the top address.
TE	Value either substituted into the exponent or the top address depending on EF.
B	Value substituted into the capability’s address to decode the base address.
BE	Value either substituted into the exponent or the base address depending on EF.
E	Exponent that determines the position at which T and B are substituted into the capability’s address.
L₈¹	Value used to form either the MSB of the exponent or the MSB of the top bound.
EF	Exponent format flag indicating the encoding for T, B and E: EF=0: The exponent is calculated from TE and BE, and conditionally L8, so it is 'internal'. EF=1: The exponent is zero; TE, BE and L8 form part of the address bounds.

¹ Only valid if enableL8=1.

The variables in Table 27 are either taken directly from the capability metadata (see Table 23) or, in the case of the upper and lower bits of T and B, are calculated from it. They are used as shown in the pseudo-code below.

The bit widths of T and B are defined in terms of the mantissa width (MW) which is set depending on capability encoding as shown in Table 24.

The exponent E indicates the position of T and B within the capability’s address as described in Section 3.1.5.4. The bit width of the exponent (EW) is set depending on the encoding. The maximum value of the exponent is calculated as follows:

CAP_MAX_E = XLEN - MW + 2

The values of EW and CAP_MAX_E are shown in Table 24.

The address and bounds must be representable in valid capabilities i.e., when the capability tag is set (see Section 3.1.5.6).

3.1.5.2. Calculation of Variables

The metadata is encoded in a compressed format termed CHERI Concentrate (Woodruff et al., 2019). It uses a floating point representation to encode the bounds relative to the capability address. The following subsections describe how the base and top addresses are decoded from the metadata.

The pseudocode below is normative. In this notation, / means "integer division", [] are the bit-select operators, {} is bit concatenation, ?: is the conditional operator, and arithmetic is signed.

The pseudocode is intentionally simpler than the equivalent definitions in the SAIL model.

In the pseudo-code below:

EF, T, TE, B and BE are all encoded in the capability metadata, see Table 23
- L8 is also a capability metadata field but only encoded if enableL8=1.
EW, MW, CAP_MAX_E and enableL8 are all capability encoding format parameters, see Table 24.
LCout and LMSB are intermediate values used to reconstitute the top two bits of T, see below.

If EF = 1:
    E               = 0
    T[EW / 2 - 1:0] = TE
    B[EW / 2 - 1:0] = BE
    LCout           = (T[MW - 3:0] < B[MW - 3:0]) ? 1 : 0
    LMSB            = enableL8 ? L8 : 0
else:
    E               = CAP_MAX_E - ( enableL8 ? { L8, TE, BE } : { TE, BE } )
    T[EW / 2 - 1:0] = 0
    B[EW / 2 - 1:0] = 0
    LCout           = (T[MW - 3:EW / 2] < B[MW - 3:EW / 2]) ? 1 : 0
    LMSB            = 1

T and B are both MW bits wide, and are formed of different bit ranges:

T[MW - 1:MW - 2] is specified below.
T[MW - 3:EW / 2] is taken directly from the capability metadata.
T[EW / 2 - 1:0] is as specified in the pseudo-code above.
B[MW - 1:EW / 2] is taken directly from the capability metadata.
B[EW / 2 - 1:0] is as specified in the pseudo-code above.

The design rationale for the two EF cases is as follows:

EF = 1: The exponent is 0. When enableL8=1, L₈ encodes the MSB of the length, which can be used to derive T[MW-1:MW-2], forming a full MW-wide T field.
EF = 0: The exponent is internal with E stored in the lower bits of T and B, with L₈ used for the MSB of E when enableL8=1. E is chosen so that the most significant non-zero bit of the length of the region aligns with T[MW - 2] such that this bit is implied by E.

To save space in the encoding metadata, the top two bits of T are not stored directly but reconstituted as follows:

T[MW - 1:MW - 2] = B[MW - 1:MW - 2] + LCout + LMSB

This derivation relies on the equality T = B + L, where L is the encoded length of the capability. LMSB is the value of L[MW - 2], which is known from the values of EF and E (as well as L₈ when enableL8=1). LCout is the carry out from the lower bits, which is implied by T[MW - 3:0] < B[MW - 3:0] since it is guaranteed that the top is larger than the base.

3.1.5.3. Correction Factors

The correction factors used in the bounds calculation, c_t and c_b, are calculated as shown below using the definitions in Table 28 and Table 29.

R marks the lower boundary of the representable range (see Figure 2). If A, T or B is less than R, then it lies in the 2^E+MW aligned region above R, and the corrections c_t and c_b adjust the upper bits of the decoded bounds accordingly.

As shown in Section 3.1.5.4 below, the upper bits of the bound are computed by substituting B or T into the upper bits of the capability address. This substitution is only correct if the memory bound and the capability address lie within the same 2^E+MW-aligned region. When they do not (address 0x20000 in Figure 9), the correction factors c_t and c_b adjust the computed upper bits to select the correct region.

Figure 9. Example capability bounds that use the correction. The base is an s-aligned region below the address, meaning the upper bits (above the mantissa) of the base are smaller than the upper bits of the address. Therefore c_b=-1. The address and top are in the same region, so c_t=0.

The calculation of A and R is as follows.

A[MW-1:0] = a[E + MW - 1:E]; // capability address field
R[MW-1:0] = B - 2^MW-2

The comparisons in Table 28 and Table 29 are unsigned.

The correction factors c_b and c_t are calculated as follows.

Table 28. Calculation of top address correction
A < R	T < R	c_t
false	false	0
false	true	+1
true	false	-1
true	true	0

Table 29. Calculation of base address correction
A < R	B < R	c_b
false	false	0
false	true	+1
true	false	-1
true	true	0

Figure 10 shows examples of each of the possible cases that arise from this calculation.

Any cases not shown are unreachable.

Figure 10. Illustrative examples of the six possible cases for c_t and c_b

In summary, for the base bound:

If the base bound is below the alignment boundary and the capability address is above the alignment boundary, then c_b=-1 to correct the base bound downwards.
If the base bound and the capability address are within the same alignment boundary, then c_b=0, so the base bound is unchanged.
If the base bound is above the alignment boundary and the capability address is below the alignment boundary, then c_b=1 to correct the base bound upwards.

In summary, for the top bound:

If the top bound is below the alignment boundary and the capability address is above the alignment boundary, then c_t=-1 to correct the top bound downwards.
If the top bound and the capability address are within the same alignment boundary, then c_t=0, so the top bound is unchanged.
If the top bound is above the alignment boundary and the capability address is below the alignment boundary, then c_t=1 to correct the top bound upwards.

3.1.5.4. Format of the Bounds

The base, b, and top, t, addresses are derived from the address by substituting a[E + MW - 1:E] with B and T respectively and clearing the lower E bits. The most significant bits of a may be adjusted up or down by 1 using corrections c_b and c_t to allow encoding memory regions that span alignment boundaries.

The bounds are formed of three sections as shown in Figure 11 and Figure 12.

XLEN and MW are constant parameters, but E is decoded from the capability metadata, and so the bit positions of the boundaries between the sections vary.

The top bound (t) has an additional bit so that the bounds can include the topmost byte of memory.

Figure 11. Decoding of the XLEN+1 wide top (t) bound

Figure 12. Decoding of the XLEN wide base (b) bound

Figure 11 and Figure 12 include ranges which may not be present when the bounds are decoded:

If E = 0 the lower section does not exist.
If E+MW=XLEN then the top section is only the least significant bit of c_t for the top bound, and top section does not exist for the bottom bound.
If E+MW>XLEN then neither top section exists, and so the bounds are calculated with no dependency on the address field a.

The compressed bounds encoding allows the address to roam over a large representable region while maintaining the original bounds. This is enabled by defining a lower boundary R from the out-of-bounds values that allows us to disambiguate the location of the bounds with respect to an out-of-bounds address. R is calculated relative to the base by subtracting 2^MW-2 from B. If B, T or a[E + MW - 1:E] is less than R, it is inferred that they lie in the 2^E+MW aligned region above R labeled space_U in Figure 2 and the corrections c_t and c_b are computed accordingly. The overall effect is that the address can roam 2^E+MW/4 bytes below the base address and at least 2^E+MW/4 bytes above the top address while still allowing the bounds to be correctly decoded.

3.1.5.5. Top bound MSB correction

A capability has infinite bounds if its bounds cover the entire address space such that the base address b=0 and the top address t≥2^XLEN, i.e., t is an XLEN + 1 bit value. However, b is an XLEN-bit value and the size mismatch introduces additional complications when decoding, so the following condition is required to correct t for capabilities whose Representable Range wraps the edge of the address space:

if ( (E < (CAP_MAX_E - 1)) && (t[XLEN: XLEN - 1] - {0b0, b[XLEN - 1]}) >= 0b10)
    t[XLEN] = !t[XLEN]

The comparison is unsigned.

That is, invert the most significant bit of t if the decoded top and base are one address space (2^XLEN bytes) or more apart, and E is too small to allow this.

A capability has infinite bounds if E=CAP_MAX_E and it is not malformed (see Section 3.1.5.6); this check is equivalent to b=0 and t≥2^XLEN.

3.1.5.6. Malformed Capability Bounds

A capability is malformed if its bounds cannot be correctly decoded. The following check indicates whether a capability is malformed. If enableL8=1 the L₈ bit is available in the capability encoding format for extra precision when EF=1.

malformedMSB =  (E == CAP_MAX_E     && B         != 0)
             || (E == CAP_MAX_E - 1 && B[MW - 1] != 0)
malformedLSB =  (E  < 0) || (E == 0 && enableL8)
malformed    =  !EF && (malformedMSB || malformedLSB)

The above includes a special case for encodings where enableL8=1. In this case, an EF=0 encoding that decodes to E=0 (with its implied LMSB of one) expresses the same bounds as the EF=1 format with the L₈ bit set. The EF=0, E=0 encodings are therefore redundant, so they are declared malformed when enableL8=1, leaving EF=1 as the canonical encoding.

Capabilities with malformed bounds:

Return both base and top bounds as zero, which affects instructions like YBASER.
Cause certain manipulation instructions like YADDI to always set the capability tag of the result to zero.

3.2. Representable Range Check

The concept of the representability check was introduced in Section 2.3.7.

The definition of the check is:

A source capability with address a, metadata m that decodes to give the bounds b and t.
A derived capability with arbitrary address a' with the same metadata m that decodes to give the bounds b' and t'.

The address a' is within the source capability’s representable range if b == b' && t == t'.

If the address a' is outside the representable range, then the derived capability has the capability tag set to zero.

3.2.1. Practical Information

An artifact of the bounds encoding is that if the new address causes t != t', then it is also the case that b != b'.

The converse is also true, if b != b' then t != t'.

Therefore, for representable range checking, it is acceptable to either check t == t' or b == b'.

The top and bottom capability bounds are formed of two or three sections:

Upper bits from the address
- This is only if the other sections do not fill the available bits (E + MW < XLEN)
Middle bits from T and B decoded from the metadata
Lower bits are set to zero
- This is only if there is an internal exponent (EF=0)

Table 30. Composition of the decoded top address bound
Configuration	Upper Section (if E + MW < XLEN)	Middle Section	Lower Section
EF=0	address[XLEN-1:E + MW] + ct	T[MW - 1:0]	zeroes(E)
EF=1, i.e., E=0	address[XLEN-1:MW] + ct	T[MW - 1:0]

The representable range defines the range of addresses which do not corrupt the bounds encoding. The encoding was first introduced in Section 3.1.5, and is repeated in a different form in Table 30 to aid this description.

For the address to be valid for the current bounds encoding, the value in the Upper Section of Table 30 must not change as this will change the meaning of the bounds. This is because T, B and E will be unchanged for the source and destination capabilities. Therefore, the Middle and Lower sections of the bounds calculation are also unchanged for source and destination capabilities.

When E > CAP_MAX_E - 2, the calculation of the top bound is entirely derived from T and E, which will be identical for both the source and destination capabilities, thus guaranteeing that t == t'. Likewise, with such values of E, the base bound is entirely derived from B and E and therefore b == b'.

The calculation of the MSB of the top bound may be inverted as specified in Section 3.1.5.5. Assuming (E < (CAP_MAX_E - 1)), the truth-table for this inversion is as follows:

Table 31. Top bound MSB inversion truth table
input_t[XLEN:XLEN-1]	b[XLEN-1]	output_t[XLEN:XLEN-1]
00	0	00
01	0	01
10	0	00
11	0	01
00	1	10
01	1	01
10	1	10
11	1	01

Inspection of Table 31 shows that output_t[XLEN] does not depend on input_t[XLEN] as:

output_t[XLEN] = {input_t[XLEN-1], b[XLEN-1]} == 0b01.

This leads to the conclusions:

If t[XLEN-1] == t'[XLEN-1] and b[XLEN-1] == b'[XLEN-1], then it is guaranteed t[XLEN] == t'[XLEN].
If t[XLEN-1] != t'[XLEN-1] or b[XLEN-1] != b'[XLEN-1], then the representable check will fail regardless of checking t[XLEN] == t'[XLEN].

Therefore, for the purpose of representable range checking, it is not required to check that t[XLEN]==t'[XLEN].

Given that t[XLEN] is not part of the representable range check:

when E == CAP_MAX_E - 2, t[XLEN-1:E] == T[MW-1:0] and b[XLEN-1:E] == B[MW-1:0].

Therefore, T and B are both derived from the capabilities metadata and are therefore constant. Therefore, the representable range check always passes in this case too.

As a result:

If E > CAP_MAX_E - 3, then the representability check always passes, even though the bounds are only infinite if E = CAP_MAX_E

This gives a range of s=2^E+MW, as shown in Figure 2.

The gap between the object bounds and the bound of the representable range is always guaranteed to be at least 1/4 of s. This is represented by R = B - 2^MW-2 in Section 3.1.5. This gives useful guarantees, such that if an executed instruction is in pc bounds, then it is also guaranteed that the next linear instruction is representable.

3.3. Encoding of Special Capabilities

3.3.1. NULL Capability Encoding

The NULL capability is represented with 0 in all fields. This implies that it has no permissions and its exponent E is CAP_MAX_E (52). The exponent value causes its bounds to cover the entire address space, but no permissions are granted and so it cannot successfully be used to grant authorization.

Table 32. Field values of the NULL capability
Field	Value	Comment
Capability Tag	zero	Capability is not valid
SDP	zeros	Grants no permissions
AP	zeros	Grants no permissions
CT	zero	Unsealed
EF	zero	Internal exponent format
L₈¹	zero	Top address reconstruction bit
T	zeros	Top address bits
T_E	zeros	Exponent bits
B	zeros	Base address bits
B_E	zeros	Exponent bits
Address	zeros	Capability address
Reserved	zeros	All reserved fields

¹ Only present if enableL8=1 as defined by the capability encoding format.

Permissions added by extensions (such as those of Zylevels1) are presumed absent in NULL capabilities.

3.3.2. Infinite Capability Encoding

This encoding is for an Infinite capability value, which grants all permissions while its bounds also nominally cover the whole address space. This infinite capability is both a Root Executable and a Root Data capability.

Table 33. Field values of the Infinite capability
Field	Value	Comment
Capability Tag	one	Capability is valid
SDP	ones	Grants all permissions
AP	AP_MAX	Grants all permissions
CT	zero	Unsealed
EF	zero	Internal exponent format
L₈¹	zero	Top address reconstruction bit
T	zeros	Top address bits
T_E	zeros	Exponent bits
B	zeros	Base address bits
B_E	zeros	Exponent bits
Address	any²	Capability address
Reserved	zeros	All reserved fields

¹ Only present if enableL8=1 as defined by the capability encoding format.

²If an infinite capability is used as a constant in either hardware or software, then the address field will typically be set to zero. If the address field is non-zero then it is still referred to as an infinite capability, and it still has the authority to authorize all memory accesses.

Permissions added by extensions (such as those of Zylevels1) are presumed present in Infinite capabilities, which may involve redefining the AP_MAX value.

4. Zca (RVY added and modified instructions)

Zcf (RV32) and Zcd (RV64) are incompatible with the RVY base ISA, as their encodings are reused for the capability loads and stores below. On harts that also implement Zyhybrid, these encodings revert to Zcf/Zcd when executing in Integral Pointer Mode (see CHERI Pointer Mode).

Table 34. Zca (RVY added instructions) instruction extension
Mnemonic	Function
C.LYSP	Load capability stack pointer relative, 16-bit encoding
C.SYSP	Store capability stack pointer relative, 16-bit encoding
C.LY	Load capability, 16-bit encoding
C.SY	Store capability, 16-bit encoding

4.1. RV32 / RV32Y RVC load/store mapping summary

Table 35. 16-bit load/store instruction mapping in RV32I
Encoding		Supported Extensions
[15:13]	[1:0]	Zca	Zcf	Zcd	Zcmp/ Zcmt	Zclsd
111	00	N/A	C.FSW	N/A	N/A	C.SD
011	00	N/A	C.FLW	N/A	N/A	C.LD
111	10	N/A	C.FSWSP	N/A	N/A	C.SDSP
011	10	N/A	C.FLWSP	N/A	N/A	C.LDSP
101	00	N/A	N/A	C.FSD	reserved	N/A
001	00	N/A	N/A	C.FLD	reserved	N/A
101	10	N/A	N/A	C.FSDSP	Zcmp/ Zcmt	N/A
001	10	N/A	N/A	C.FLDSP	reserved	N/A

Table 36. 16-bit load/store instruction mapping in RV32Y
Encoding		Supported Extensions
[15:13]	[1:0]	Zca	Zcd	Zcmp/ Zcmt
111	00	C.SY
111	10	C.SYSP
011	10	C.LYSP
011	00	C.LY
101	00	N/A	C.FSD	reserved
001	00	N/A	C.FLD	reserved
101	10	N/A	C.FSDSP	Zcmp (RV32Y)/ Zcmt (RV32Y)
001	10	N/A	C.FLDSP	reserved

Zcf and Zclsd are incompatible with RV32Y.

4.2. RV64 / RV64Y RVC load/store mapping summary

Table 37. 16-bit load/store instruction mapping in RV64I
Encoding		Supported Extensions
[15:13]	[1:0]	Zca	Zcd	Zcmp/ Zcmt
111	00	C.SD	N/A	N/A
011	00	C.LD	N/A	N/A
111	10	C.SDSP	N/A	N/A
011	10	C.LDSP	N/A	N/A
101	00	N/A	C.FSD	reserved
001	00	N/A	C.FLD	reserved
101	10	N/A	C.FSDSP	Zcmp/ Zcmt
001	10	N/A	C.FLDSP	reserved

Table 38. 16-bit load/store instruction mapping in RV64Y
Encoding		Supported Extensions
[15:13]	[1:0]	Zca
111	00	C.SD
011	00	C.LD
111	10	C.SDSP
011	10	C.LDSP
101	00	C.SY
001	00	C.LY
101	10	C.SYSP
001	10	C.LYSP

Zcd, Zcmp and Zcmt are incompatible with RV64Y.

4.2.1. C.LY

see C.LYSP.

4.2.2. C.LYSP

Synopsis: Capability loads (C.LY, C.LYSP), 16-bit encodings

These instructions have different encodings for RV64Y and RV32Y.

Mnemonics: C.LY rd', offset(rs1')
C.LYSP rd, offset(sp)
Expansions: LY rd', offset(rs1')
LY rd, offset(sp)
Encoding

Description: For C.LY the authorizing capability is rs1.

For C.LYSP, the authorizing capability is sp.

The effective address of the memory access is obtained by adding the address of the authorizing capability to the zero-extended offset.

Load data from memory in the same manner as LY.
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encodings): See LY.

4.2.3. C.SY

see C.SYSP.

4.2.4. C.SYSP

Synopsis: Capability stores (C.SY, C.SYSP), 16-bit encodings

These instructions have different encodings for RV64Y and RV32Y.

Mnemonics: C.SY rs2', offset(rs1')
C.SYSP rs2, offset(sp)
Expansions: SY rs2', offset(rs1')
SY rs2, offset(sp)
Encoding

Description: For C.SY the authorizing capability is rs1.

For C.SYSP, the authorizing capability is sp.

The store data is in rs2'.

The effective address of the memory access is obtained by adding the address of the authorizing capability to the zero-extended offset.

Store data to the memory in the same manner as SY.
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encodings): See SY.

4.3. Zca (RVY modified behavior)

An RVY core that supports Zca also supports Zca (RVY modified behavior) which modifies the behavior of some instructions.

C.YMV is renamed from C.MV to avoid ambiguity in disassembly.

Table 39. Zca (RVY modified behavior) instruction extension
Mnemonic	Function
C.ADDI16SP (RVY)	Stack pointer increment in blocks of 16, 16-bit encoding
C.ADDI4SPN (RVY)	Stack pointer increment in blocks of 4, 16-bit encoding
C.YMV	Capability register copy, 16-bit encoding
C.JAL (RV32Y)	Immediate offset jump, and link to capability register, 16-bit encoding
C.JALR (RVY)	Jump to capability register, and link to capability register, 16-bit encoding
C.JR (RVY)	Jump to capability register, 16-bit encoding

4.3.1. C.ADDI16SP (RVY)

Synopsis: Stack pointer increment in blocks of 16, 16-bit encoding
Mnemonic: C.ADDI16SP nzimm
Expansion: YADDI sp, sp, nzimm
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description

Add the non-zero sign-extended 6-bit immediate to sp.address (sp=x2), where the immediate is scaled to represent multiples of 16 in the range (-512,496).

All other details, including the conditions for clearing sp.tag, are as defined by YADDI for the expansion shown above.

Prerequisites

Zca, RVY

Operation

execute(CADDI(sp, sp, sign_extend(nzimm)))

4.3.2. C.ADDI4SPN (RVY)

Synopsis: Stack pointer increment in blocks of 4, 16-bit encoding
Mnemonic: C.ADDI4SPN rd', nzuimm
Expansion: YADDI rd', sp, nzuimm
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description

Copy sp (sp=x2) to rd'. Add a zero-extended non-zero immediate, scaled by 4, to rd'.address.

All other details, including the conditions for clearing rd'.tag, are as defined by YADDI for the expansion shown above.

Prerequisites

Zca, RVY

Operation

let cd = creg2reg_idx(cdc) in
execute(CADDI(cd, sp, zero_extend(nzuimm)))

4.3.3. C.YMV

Synopsis: Capability register copy, 16-bit encoding
Mnemonic: C.YMV rd, rs2
Expansion: YMV rd, rs2

c.mv is remapped to copy YLEN-bit registers for RVY. The mnemonic is changed to avoid ambiguity about whether the copy is XLEN or YLEN bits.

Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description: Copy rs2 to rd.

This instruction can propagate valid capabilities which fail integrity checks.
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encoding): See YMV.

4.3.4. C.JR (RVY)

Synopsis: Jump to capability register, 16-bit encoding
Mnemonic: C.JR rs1
Expansion: JALR x0, 0(rs1)
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description: See JALR (RVY) for execution of the expanded instruction as shown above. Note that the offset is zero in the expansion.
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encodings): See JALR (RVY).

4.3.5. C.JAL (RV32Y)

Synopsis: Immediate offset jump, and link to capability register, 16-bit encoding
Mnemonic (RV32Y): c.jal x1, offset
Expansion (RV32Y): jal x1, offset
Encoding (RV32Y)

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description: Link the next linear pc to rd and seal. Jump to pc.address+offset.
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encodings): See JAL (RVY).

4.3.6. C.JALR (RVY)

Synopsis: Jump to capability register, and link to capability register, 16-bit encoding
Mnemonic: C.JALR x1, rs1
Expansion: JALR x1, 0(rs1)
Encoding

This instruction is modified from the version in RV32[IE]/RV64[IE].

Description: See JALR (RVY) for execution of the expanded instruction as shown above. Note that the offset is zero in the expansion.
Exceptions: See JALR (RVY).
Prerequisites: Zca, RVY
Operation (after expansion to 32-bit encodings): See JALR (RVY).

5. Zalrsc (RVY added instructions)

Specifying RVY and Zalrsc adds atomic capability load and store instructions.

Table 40. Zalrsc (RVY added instructions) instruction extension
Mnemonic	Function
LR.Y	Load Reserved capability
SC.Y	Store Conditional capability

5.1. LR.Y

Synopsis: Load Reserved capability
Mnemonic: LR.Y rd, 0(rs1)
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

rs1.address is the effective address of the memory access.

Authorize the memory access with the capability in rs1.

Load a naturally aligned YLEN-bit data value from memory.

Set the reservation as for LR.W/D.

Use the YLEN-bit data and the capability tag to determine the value of rd as specified by the LY instruction.

This instruction can propagate valid capabilities which fail integrity checks.

Exceptions

Raise a load access fault exception when the effective address is not aligned to YLEN/8.

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Load Access Fault

Authorizing capability tag is set to 0.

CHERI Load Access Fault

Authorizing capability is sealed.

CHERI Load Access Fault

Authorizing capability does not grant the necessary permissions. R-permission is required.

CHERI Load Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Load Access Fault

Authorizing capability failed any integrity check.

Prerequisites

Zalrsc, RVY

Operation

  let (auth_val, vaddr) = get_cheri_mode_cap_addr(rs1_cs1, zeros());

  match check_and_handle_load_vaddr_for_triggers(vaddr, get_arch_pc()) {
    Some (ret) => return ret,
    None () => ()
  };
  if not(capTaggedAndReservedValid(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_TagViolation);
    RETIRE_FAIL
  } else if capIsSealed(auth_val) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_SealViolation);
    RETIRE_FAIL
  } else if not(canR(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(validAddrRange(vaddr, cap_size) | capBoundsInfinite(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_InvalidAddressViolation);
    RETIRE_FAIL
  } else if not(inCapBounds(auth_val, vaddr, cap_size)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_LengthViolation);
    RETIRE_FAIL
  } else if not(is_aligned_addr(vaddr, cap_size)) then {
    handle_mem_exception(vaddr, E_Load_Addr_Align());
    RETIRE_FAIL
  } else match translateAddr(vaddr, Read(Cap)) {
    TR_Failure(E_Extension(_)) => { internal_error(__FILE__, __LINE__, "unexpected cheri exception for cap load") },
    TR_Failure(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
    TR_Address(addr, pbmt, ptw_info) => {
      let c = mem_read_cap(addr, pbmt, aq, aq & rl, false);
      match c {
        MemValue(v) => {
          let cr = if ptw_info.ptw_lc == PTW_LC_CLEAR
                   then clearTag(v) /* strip the tag */
                   else {
                     /* the Sail model currently reserves physical addresses */
                     load_reservation(addr);
                     clearTagIf(v, not(canC(auth_val)))
                   };
          C(cd) = legalizeLM(cr, auth_val);
          RETIRE_SUCCESS
        },
        MemException(e) => {handle_mem_exception(vaddr, e); RETIRE_FAIL }
      }
    }
  }

5.2. SC.Y

Synopsis: Store Conditional capability
Mnemonic: SC.Y rd, rs2, 0(rs1)
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

rs1.address is the effective address of the memory access.

Authorize the memory access with the capability in rs1.

Conditionally store the YLEN-bit value in rs2, together with its capability tag, to the naturally aligned memory location, following the same rules as SC.W.

Set rd to zero for success or non-zero for failure.

The written capability tag may be cleared following the same modification rules as SY.

This instruction can propagate valid capabilities which fail integrity checks.

Exceptions

Raise a store/AMO access fault exception when the effective address is not aligned to YLEN/8.

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission is required.

CHERI Store/AMO Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

Prerequisites

Zalrsc, RVY

Operation

  let (auth_val, vaddr) = get_cheri_mode_cap_addr(rs1_cs1, zeros());
  let cs2_val = C(cs2);
  let cs2_val = clearTagIf(cs2_val, not(canC(auth_val)));

  match check_and_handle_store_vaddr_for_triggers(vaddr, get_arch_pc()) {
    Some (ret) => return ret,
    None () => ()
  };
  if speculate_conditional () == false then {
    /* should only happen in rmem
     * rmem: allow SC to fail very early
     */
    X(rd) = zero_extend(0b1);
    return RETIRE_SUCCESS
  };
  if not(capTaggedAndReservedValid(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_TagViolation);
    RETIRE_FAIL
  } else if capIsSealed(auth_val) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_SealViolation);
    RETIRE_FAIL
  } else if not(canW(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(validAddrRange(vaddr, cap_size) | capBoundsInfinite(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_InvalidAddressViolation);
    RETIRE_FAIL
  } else if not(inCapBounds(auth_val, vaddr, cap_size)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_LengthViolation);
    RETIRE_FAIL
  } else if not(is_aligned_addr(vaddr, cap_size)) then {
    handle_mem_exception(vaddr, E_SAMO_Addr_Align());
    RETIRE_FAIL
  } else {
    match translateAddr(vaddr, Write(if cs2_val.tag then Cap else Data)) {
      TR_Failure(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
      TR_Address(addr, pbmt, _) => {
        if not(match_reservation(addr)) then {
          /* cannot happen in rmem */
          X(rd) = zero_extend(0b1);
          cancel_reservation();
          RETIRE_SUCCESS
        } else {
          let eares : MemoryOpResult(unit) = mem_write_ea_cap(addr, aq & rl, rl, false);
          match (eares) {
            MemException(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
            MemValue(_) => {
              let res : MemoryOpResult(bool) = mem_write_cap(addr, pbmt, cs2_val, aq & rl, rl, false);
              match (res) {
                MemValue(true)  => {
                  X(rd) = zero_extend(0b0);
                  cancel_reservation();
                  RETIRE_SUCCESS
                },
                MemValue(false) => {
                  X(rd) = zero_extend(0b1);
                  cancel_reservation();
                  RETIRE_SUCCESS
                },
                MemException(e) => {
                  handle_mem_exception(vaddr, e);
                  RETIRE_FAIL
                }
              }
            }
          }
        }
      }
    }
  }

6. Zaamo (RVY added instructions)

Specifying RVY and Zaamo gives Zaamo (RVY added instructions) functionality, which adds atomic capability swap.

Table 41. Zaamo (RVY added instructions) instruction extension
Mnemonic	Function
AMOSWAP.Y	Atomic swap of capabilities

6.1. AMOSWAP.Y

Synopsis: Atomic swap of capabilities
Mnemonic: AMOSWAP.Y rd, rs2, 0(rs1)
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Atomic swap of capabilities, authorized by the capability in rs1.

The operation is equivalent to an atomically executed sequence of:

LY rd, 0(rs1)

SY rs2, 0(rs1)

rd is only updated if no exceptions are raised. The value stored to memory is the original value of rs2, even if rd=rs1 or rd=rs2.

Permissions

Requires R-permission and W-permission in the authorizing capability. If C-permission isn’t granted then the capability tag will be stored as zero.

Exceptions

Raise a store/AMO access fault exception when the effective address is not aligned to YLEN/8.

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission and R-permission are both required.

CHERI Store/AMO Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

Prerequisites

Zaamo, RVY

Operation

  let (auth_val, vaddr) = get_cheri_mode_cap_addr(rs1_cs1, zeros());
  let cs2_val = C(cs2);
  let cs2_val = clearTagIf(cs2_val, not(canC(auth_val)));

  match check_and_handle_amo_vaddr_for_triggers(vaddr, get_arch_pc()) {
    Some (ret) => return ret,
    None () => ()
  };
  if not(capTaggedAndReservedValid(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_TagViolation);
    RETIRE_FAIL
  } else if capIsSealed(auth_val) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_SealViolation);
    RETIRE_FAIL
  } else if not(canR(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(canW(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_PermissionViolation);
    RETIRE_FAIL
  } else if not(validAddrRange(vaddr, cap_size) | capBoundsInfinite(auth_val)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_InvalidAddressViolation);
    RETIRE_FAIL
  } else if not(inCapBounds(auth_val, vaddr, cap_size)) then {
    handle_cheri_exception(CapCheckType_Data, CapEx_LengthViolation);
    RETIRE_FAIL
  } else if not(is_aligned_addr(vaddr, cap_size)) then {
    handle_mem_exception(vaddr, E_SAMO_Addr_Align());
    RETIRE_FAIL
  } else {
    match translateAddr(vaddr, ReadWrite(Cap, if cs2_val.tag then Cap else Data)) {
      TR_Failure(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
      TR_Address(addr, pbmt, ptw_info) => {
        let eares : MemoryOpResult(unit) = mem_write_ea_cap(addr, aq & rl, rl, false);
        match (eares) {
          MemException(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL },
          MemValue(_) => {
            let c = mem_read_cap(addr, pbmt, aq, aq & rl, false);
            match c {
              MemValue(v) => {
                let wres : MemoryOpResult(bool) = mem_write_cap(addr, pbmt, cs2_val, aq & rl, rl, false);
                match wres {
                  MemValue(_) => {
                    let cr = clearTagIf(v, ptw_info.ptw_lc == PTW_LC_CLEAR | not(canC(auth_val)));
                    C(cd) = legalizeLM(cr, auth_val);
                    RETIRE_SUCCESS
                  },
                  MemException(e) => {
                    handle_mem_exception(vaddr, e);
                    RETIRE_FAIL
                  }
                }
              },
              MemException(e) => { handle_mem_exception(vaddr, e); RETIRE_FAIL }
            }
          }
        }
      }
    }
  }

7. Zba (RVY added instructions)

Specifying RVY and Zba gives Zba (RVY added instructions) functionality, which adds more instructions.

Table 42. Zba (RVY added instructions) instruction extension
Mnemonic	Function
YSH1ADD	shift and add, representability check
YSH2ADD	shift and add, representability check
YSH3ADD	shift and add, representability check
YSH4ADD	shift and add, representability check
YSH1ADD.UW	shift and add unsigned word, representability check
YSH2ADD.UW	shift and add unsigned word, representability check
YSH3ADD.UW	shift and add unsigned word, representability check
YSH4ADD.UW	shift and add unsigned word, representability check

There is currently no RVY equivalent for add.uw. A future extension may add such an instruction.

7.1. YSH1ADD, YSH2ADD, YSH3ADD, YSH4ADD (RV64Y)

Synopsis: Shift by n and add for address generation (YSH1ADD, YSH2ADD, YSH3ADD, YSH4ADD)
Mnemonics (RVY): YSH1ADD rd, rs1, rs2
YSH2ADD rd, rs1, rs2
YSH3ADD rd, rs1, rs2
Mnemonics (RV64Y): YSH4ADD rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description: Copy the capability in rs2 to rd.

Increment rd.address by rs1 shifted left by n bit positions.

Set rd.tag=0 if rs2 is sealed.

Set rd.tag=0 if the resulting capability cannot be represented exactly.

Set rd.tag=0 if rs2 fails any integrity checks.
Operation

This listing is manually maintained rather than extracted from the Sail model.

let rs1_val = X(rs1);
let cs2_val = C(cs2);
let shamt : range(1,4) = match op {
  RISCV_SH1ADD => 1,
  RISCV_SH2ADD => 2,
  RISCV_SH3ADD => 3,
  RISCV_SH4ADD => 4,
};
let result = incCapAddrChecked(cs2_val, rs1_val << shamt);
C(cd) = result;
RETIRE_SUCCESS

7.2. YSH1ADD.UW, YSH2ADD.UW, YSH3ADD.UW, YSH4ADD.UW (RV64Y)

Synopsis: Shift by n and add unsigned words for address generation (YSH1ADD.UW, YSH2ADD.UW, YSH3ADD.UW, YSH4ADD.UW)
Mnemonics (RV64Y): YSH1ADD.UW rd, rs1, rs2
YSH2ADD.UW rd, rs1, rs2
YSH3ADD.UW rd, rs1, rs2
YSH4ADD.UW rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description: Copy the capability in rs2 to rd.

Increment rd.address by the unsigned word rs1 shifted left by n bit positions.

Set rd.tag=0 if rs2 is sealed.

Set rd.tag=0 if the resulting capability cannot be represented exactly.

Set rd.tag=0 if rs2 fails any integrity checks.
Operation

This listing is manually maintained rather than extracted from the Sail model.

let rs1_val = X(rs1);
let cs2_val = C(cs2);
let shamt : range(1,4) = match op {
  RISCV_SH1ADDUW => 1,
  RISCV_SH2ADDUW => 2,
  RISCV_SH3ADDUW => 3,
  RISCV_SH4ADDUW => 4,
};
let result = incCapAddrChecked(cs2_val, zero_extend(rs1_val[31..0]) << shamt);
C(cd) = result;
RETIRE_SUCCESS

8. Zicbom (RVY modified behavior)

Specifying RVY and Zicbom gives Zicbom (RVY modified behavior) functionality, which extends the checking.

Table 43. Zicbom (RVY modified behavior) instruction extension
Mnemonic	Function
CBO.INVAL (RVY)	Cache block invalidate (implemented as clean)
CBO.CLEAN (RVY)	Cache block clean
CBO.FLUSH (RVY)	Cache block flush

8.1. CBO.CLEAN (RVY)

Synopsis: Perform a clean operation on a cache block
Mnemonic: CBO.CLEAN 0(rs1)
Encoding

Description: A CBO.CLEAN instruction performs a clean operation on the cache block whose effective address is the base address specified in rs1. The authorizing capability for this operation is rs1.

Exceptions

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission and R-permission are both required.

CHERI Store/AMO Access Fault

None of the bytes accessed are within the bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

CBO.CLEAN and CBO.FLUSH only require one byte of the cache block to be within the authorizing capability’s bounds because these operations do not reveal or modify any data, so performing them on a partially covered cache block is harmless. CBO.INVAL and CBO.ZERO destroy or overwrite the contents of the entire cache block and therefore require every byte of the cache block to be within bounds.

Prerequisites: Zicbom, RVY
Operation

This listing is manually maintained rather than extracted from the Sail model.

let cache_block_size = 2 ^ plat_cache_block_size_exp();
let negative_offset = (X(cs1) & ~(zero_extend(ones(plat_cache_block_size_exp())))) - X(cs1);
match ext_data_get_addr(cs1, negative_offset, Cache(CleanFlush), cache_block_size) {
  Ext_DataAddr_Error(e) => { ext_handle_data_check_error(e); RETIRE_FAIL },
  Ext_DataAddr_OK(_)    => RETIRE_SUCCESS
}

8.2. CBO.FLUSH (RVY)

Synopsis: Perform a flush operation on a cache block
Mnemonic: CBO.FLUSH 0(rs1)
Encoding

Description: A CBO.FLUSH instruction performs a flush operation on the cache block whose effective address is the base address specified in rs1. The authorizing capability for this operation is rs1.

Exceptions

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission and R-permission are both required.

CHERI Store/AMO Access Fault

None of the bytes accessed are within the bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

Prerequisites: Zicbom, RVY
Operation

This listing is manually maintained rather than extracted from the Sail model.

let cache_block_size = 2 ^ plat_cache_block_size_exp();
let negative_offset = (X(cs1) & ~(zero_extend(ones(plat_cache_block_size_exp())))) - X(cs1);
match ext_data_get_addr(cs1, negative_offset, Cache(CleanFlush), cache_block_size) {
  Ext_DataAddr_Error(e) => { ext_handle_data_check_error(e); RETIRE_FAIL },
  Ext_DataAddr_OK(_)    => RETIRE_SUCCESS
}

8.3. CBO.INVAL (RVY)

Synopsis: Perform an invalidate operation on a cache block
Mnemonic: CBO.INVAL 0(rs1)
Encoding

Description

A CBO.INVAL instruction performs an invalidate operation on the cache block whose effective address is the base address specified in rs1. The authorizing capability for this instruction is rs1.

Executing CBO.INVAL requires ASR-permission in pc, even when CSR state causes it to perform a flush operation instead of an invalidation.

An illegal instruction exception is raised if ASR-permission is not granted, matching the behavior of xRET and CSR accesses that require ASR-permission.

Exceptions

Kind

Reason

Illegal instruction

pc does not grant ASR-permission.

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission, R-permission are both required.

CHERI Store/AMO Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

CSR state controls whether CBO.INVAL performs cache block flushes instead of invalidations for less privileged modes.

Invalidating a cache block can re-expose capabilities previously stored to it after the most recent flush, not just secret values. As such, CBO.INVAL has stricter checks on its use than CBO.FLUSH, and should only be made available to, and used by, sufficiently-trusted software.

Untrusted software should use CBO.FLUSH instead as a minimum, and a sensible implementation choice for CHERI systems is to always execute CBO.INVAL as CBO.FLUSH.

Prerequisites: Zicbom, RVY
Operation

This listing is manually maintained rather than extracted from the Sail model.

let cache_block_size = 2 ^ plat_cache_block_size_exp();
let negative_offset = (X(cs1) & ~(zero_extend(ones(plat_cache_block_size_exp())))) - X(cs1);
match ext_data_get_addr(cs1, negative_offset, Cache(Inval), cache_block_size) {
  Ext_DataAddr_Error(e) => { ext_handle_data_check_error(e); RETIRE_FAIL },
  Ext_DataAddr_OK(_)    => RETIRE_SUCCESS
}

9. Zicboz (RVY modified behavior)

Specifying RVY and Zicboz gives Zicboz (RVY modified behavior) functionality, which extends the checking.

Table 44. Zicboz (RVY modified behavior) instruction extension
Mnemonic	Function
CBO.ZERO (RVY)	Cache block zero

9.1. CBO.ZERO (RVY)

Synopsis: Store zeros to the full set of bytes corresponding to a cache block
Mnemonic: CBO.ZERO 0(rs1)
Encoding

Description

A cbo.zero instruction performs stores of zeros to the full set of bytes corresponding to the cache block whose effective address is the base address specified in rs1. An implementation may or may not update the entire set of bytes atomically although each individual write must atomically set the capability tag of the corresponding aligned YLEN-bit location to zero. The authorizing capability for this instruction is rs1.

Exceptions

Exceptions occur when the authorizing capability fails one of the checks listed below:

Kind

Reason

CHERI Store/AMO Access Fault

Authorizing capability tag is set to 0.

CHERI Store/AMO Access Fault

Authorizing capability is sealed.

CHERI Store/AMO Access Fault

Authorizing capability does not grant the necessary permissions. W-permission is required.

CHERI Store/AMO Access Fault

At least one byte accessed is outside the authorizing capability bounds, or the bounds could not be decoded.

CHERI Store/AMO Access Fault

Authorizing capability failed any integrity check.

Prerequisites

Zicboz, RVY

Operation

This listing is manually maintained rather than extracted from the Sail model.

let cache_block_size = 2 ^ plat_cache_block_size_exp();
let negative_offset = (X(cs1) & ~(zero_extend(ones(plat_cache_block_size_exp())))) - X(cs1);
match ext_data_get_addr(cs1, negative_offset, Cache(Zero), cache_block_size) {
  Ext_DataAddr_Error(e) => { ext_handle_data_check_error(e); RETIRE_FAIL },
  Ext_DataAddr_OK(vaddr) => match translateAddr(vaddr, Write(Data)) {
    TR_Failure(e, ext_ptw) => { handle_translate_exception(vaddr - negative_offset, e, ext_ptw); RETIRE_FAIL },
    TR_Address(paddr, pbmt, _) => {
      let ep = effectivePrivilege(Cache(Zero), mstatus, cur_privilege());
      let _ = mem_write_value_priv_meta(pbmt, paddr, cache_block_size, zeros(),
                                        Cache(Zero), ep, default_meta, false, false, false);
      RETIRE_SUCCESS
    }
  }
}

10. Zicbop (RVY modified behavior)

Specifying RVY and Zicbop gives Zicbop (RVY modified behavior) functionality, which extends the checking.

Table 45. Zicbop (RVY modified behavior) instruction extension
Mnemonic	Function
PREFETCH.I (RVY)	Cache Block Prefetch for Instruction Fetch
PREFETCH.R (RVY)	Cache Block Prefetch for Data Read
PREFETCH.W (RVY)	Cache block Prefetch for Data Write

10.1. PREFETCH.I (RVY)

Synopsis: Provide a HINT to hardware that a cache block is likely to be accessed by an instruction fetch in the near future
Mnemonic: PREFETCH.I offset(rs1)
Encoding

Description

A PREFETCH.I instruction indicates to hardware that the cache block whose effective address is the sum of the base address specified in rs1 and the sign-extended offset encoded in imm[11:0], where imm[4:0] equals 0b00000, is likely to be accessed by an instruction fetch in the near future.

The authorizing capability is rs1. This instruction does not raise any exceptions.

The following authorization checks are performed:

rs1.tag=1
rs1 is not sealed
At least one byte of the requested cache block is within rs1 's bounds
rs1 grants X-permission
rs1 passes all integrity checks

If any of these checks fail, the instruction executes as a nop and no prefetch is performed.

Only one byte of the cache block is required to be within bounds because a hardware prefetcher is permitted to prefetch the same cache block in the background.

If the checks above pass, an implementation may opt to cache a copy of the cache block in a cache accessed by an instruction fetch in order to improve memory access latency, but this behavior is not required.

If rs1=x0 this instruction executes as a nop.

Prerequisites: Zicbop, RVY
Operation

// PREFETCH.I is a hint: it performs no architectural operation
RETIRE_SUCCESS

10.2. PREFETCH.R (RVY)

Synopsis: Provide a HINT to hardware that a cache block is likely to be accessed by a data read in the near future
Mnemonic: PREFETCH.R offset(rs1)
Encoding

Description

A PREFETCH.R instruction indicates to hardware that the cache block whose effective address is the sum of the base address specified in rs1 and the sign-extended offset encoded in imm[11:0], where imm[4:0] equals 0b00000, is likely to be accessed by a data read (i.e., load) in the near future.

The authorizing capability is rs1. This instruction does not raise any exceptions.

The following authorization checks are performed:

rs1.tag=1
rs1 is not sealed
At least one byte of the requested cache block is within rs1 's bounds
rs1 grants R-permission
rs1 passes all integrity checks

If any of these checks fail, the instruction executes as a nop and no prefetch is performed.

Only one byte of the cache block is required to be within bounds because a hardware prefetcher is permitted to prefetch the same cache block in the background.

If the checks above pass, an implementation may opt to cache a copy of the cache block in a cache accessed by a data read in order to improve memory access latency, but this behavior is not required.

If rs1=x0 this instruction executes as a nop.

Prerequisites: Zicbop, RVY
Operation

// PREFETCH.R is a hint: it performs no architectural operation
RETIRE_SUCCESS

10.3. PREFETCH.W (RVY)

Synopsis: Provide a HINT to hardware that a cache block is likely to be accessed by a data write in the near future
Mnemonic: PREFETCH.W offset(rs1)
Encoding

Description

A PREFETCH.W instruction indicates to hardware that the cache block whose effective address is the sum of the base address specified in rs1 and the sign-extended offset encoded in imm[11:0], where imm[4:0] equals 0b00000, is likely to be accessed by a data write (i.e., store) in the near future.

The authorizing capability is rs1. This instruction does not raise any exceptions.

The following authorization checks are performed:

rs1.tag=1
rs1 is not sealed
At least one byte of the requested cache block is within rs1 's bounds
rs1 grants W-permission
rs1 passes all integrity checks

If any of these checks fail, the instruction executes as a nop and no prefetch is performed.

Only one byte of the cache block is required to be within bounds because a hardware prefetcher is permitted to prefetch the same cache block in the background.

If the checks above pass, an implementation may opt to cache a copy of the cache block in a cache accessed by a data write in order to improve memory access latency, but this behavior is not required.

If rs1=x0 this instruction executes as a nop.

Prerequisites: Zicbop, RVY
Operation

// PREFETCH.W is a hint: it performs no architectural operation
RETIRE_SUCCESS

11. User-mode YLEN CSR Summary

Table 46 shows all YLEN User-Mode CSRs added or modified by RVY.

Table 46. All RVY User-Mode CSRs.
RVY CSR	Extension	Width	Address	Permissions	Initial Value	Description
jvt (RVY)	Zcmt	YLEN	0x017	URW	tag=0, otherwise UNSPECIFIED	Jump Vector Table Capability
ddc	Zyhybrid	YLEN	0x416	URW	nominally Root Data	Default Data Capability
utidc	RVY	YLEN	0x480	Read: U, Write: U, ASR-permission	tag=0, otherwise UNSPECIFIED	User Thread ID

Where initial values are specified as a Root Executable or Root Data they are the maximum possible value. The EEI may initialize those CSRs to cover a smaller memory range, or to grant fewer permissions.

Table 47. Action taken on writing to YLEN-bit and extended User-Mode CSRs
RVY CSR	Extension	Width	Action on XLEN write	Action on YLEN write
jvt (RVY)	Zcmt	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change.	Apply Invalid address conversion and update the CSR with the result if the address changed, direct write if address didn’t change
ddc	Zyhybrid	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change.	Apply Invalid address conversion and update the CSR with the result if the address changed, direct write if address didn’t change
utidc	RVY	YLEN	Update the CSR using YADDRW.	Direct write

Code pointers and data pointers in Table 48 are WARL CSRs that do not need to store full 64-bit addresses on RV64, and so may be subject to Invalid address conversion on writing.

Table 48. YLEN-wide User-Mode CSRs storing code pointers or data pointers
RVY CSR	Extension	Code Pointer	Data Pointer	Unseal On Execution
jvt (RVY)	Zcmt	✔
ddc	Zyhybrid		✔

12. Zybndsrdw

Zybndsrdw adds the YBNDSRDW instruction.

Table 49. Zybndsrdw instruction extension
Mnemonic	Function
YBNDSRDW	Write capability bounds, rounding down if required

12.1. RV32Y Base Capability Instruction Set, Version 1.0

Chapter 2 introduced the CHERI base ISA and the RV64LYA encoding format, as the only defined RV64Y format.

RV32Y inherits all details from RV64Y, and, sets XLEN=32, YLEN=64.

Depending on the target domain different capability requirements exist, but for XLEN=32 there are insufficient bits available in the capability metadata to define one common format for all use cases. For example, a small embedded controller with limited address space of a few megabytes may need to make different tradeoffs than an auxiliary core within a larger SoC with many gigabytes of address space, and an accelerator using the RVY ISA may need yet another set of features. Therefore, multiple capability encodings with observable (but generally uninteresting) behavior differences need to be defined. While the in-memory bit pattern of capabilities across these systems as well as the precision of bounds may differ, software that does not depend on the raw bits in memory is source (and often also binary) compatible with these different capability encodings.

The RV32LYA follows the same rules as RV64LYA, and sets different parameter values. It also encodes the AP-field to use fewer bits.

The currently defined RV32 capability encoding formats are:

RV32LYA
RV32LYenccheriot1
RV32LYenccheriot2
RV32LYenccheriot3

Further domain specialized capability encoding formats are expected in the future.

12.2. The RV32LYA Capability Base for RV32

This section describes an in-memory format and properties of a capability encoding intended for RV32. This format is based upon the RV64LYA format with the following changes:

The width, or presence, of fields in the encoding are changed as defined in Table 51.
The architectural permissions (AP) field is compressed to save encoding space, and so rules are defined for removing permissions.

12.2.1. Capability Encoding

The encoding format of the RV32LYA capability is shown in Figure 13.

Figure 13. Capability encoding for RV32LYA

12.2.1.1. Capability Encoding Summary

The bit ranges in Table 50 are relative to the XLEN-bit metadata, i.e., the upper half of the in-memory representation.

Table 50. RV32LYA capability encoding format metadata fields
Field	Bit range	Comment
SDP	31:30	Software Defined Permissions.
AP	29:25	Architectural Permissions (includes fields for Zyhybrid and Zylevels1).
GL	24	Global (GL) flag (Zylevels1 only).
CT	20	Capability Type field.
EF	19	Exponent format for the capability bounds decoding, see Table 27.
L8	18	Exponent or top bound MSB for the capability bounds decoding, see Table 27.
T[7:2]	17:12	Top address field for the capability bounds decoding, see Table 27.
TE	11:10	Top address or exponent field for the capability bounds decoding, see Table 27.
B[9:2]	9:2	Base address field for the capability bounds decoding, see Table 27.
BE	1:0	Base address or exponent field for the capability bounds decoding, see Table 27.
All other fields are reserved

T[9:8], T[1:0] and B[1:0] are not specified by the metadata fields and are calculated as shown in Section 3.1.5.2.

Table 51. RV32LYA parameter summary
Parameter	Value	Comment
MW	10	Mantissa width
EW	5	Exponent width
CAP_MAX_E	24	Maximum exponent value
enableL8	1	Whether the encoding format includes the L₈ bit
AP_MAX	0x8	Value of the AP-field giving maximum permissions
AP_ENC	part1_rv32ya_1	Encoding scheme for the AP-field - not all combinations are representable
AUIPC_SHIFT	12	Shift distance of the immediate in AUIPC
REP_RANGE	rc1	At least 1/4 of the capability length

¹If Zyhybrid is supported, then the infinite capability must represent Integral Pointer Mode for compatibility with standard RISC-V code. See Table 60.

Table 52. RV32LYA extension summary
Extension	Comment
Zyhybrid	Compatible
Zylevels1	Compatible
Zyseal	Will be compatible once new permissions are encoded
All RV32Y versions of other standard extensions	Compatible

12.2.1.2. Architectural Permissions (AP) Encoding - `part1_rv32ya_1`

The AP-field is 5 bits wide and does not encode all possible permission combinations. It omits rarely used combinations of permissions to save encoding space.

The AP-field is split into four quadrants. The quadrant number is taken from AP[4:3]. The meanings of bits AP[2:0] for each quadrant are shown in Table 53.

Extensions which allocate new permissions need to specify modifications to the AP-fields and the YPERMC rules to reduce the permissions to maintain an encodable set. The recommendation is for new permissions added to existing entries in quadrant 0 must not be granted as that quadrant includes the NULL capability. New permissions added to quadrant 1 entry 0 must be granted as this represents both of the Root Executable and Root Data capabilities.

AP encoding and rules

All gaps in Table 53 are reserved for future extensions.

Table 53. Encoding of architectural permissions for RV32LYA
AP-encoding table
Quadrant 0: Non-capability data read/write - regular future entries have: W=AP[2], R=AP[0]
Field[2:0]	R	W	C	LM	X	ASR	Notes
0							No permissions¹
1	✔						Data RO
4		✔					Data WO
5	✔	✔					Data RW
Quadrant 1: Executable capabilities - regular future entries have: X=1
Field[2:0]	R	W	C	LM	X	ASR	Notes
0	✔	✔	✔	✔	✔	✔	Execute + Data & Cap RW + ASR²
2	✔		✔	✔	✔		Execute + Data & Cap RO
4	✔	✔	✔	✔	✔		Execute + Data & Cap RW
6	✔	✔			✔		Execute + Data RW
Quadrant 2: Restricted capability data read/write - regular future entries have: R=1, C=1, W=AP[2]
Field[2:0]	R	W	C	LM	X	ASR	Notes
3	✔		✔				Data & Cap RO (no LM)
Quadrant 3: Capability data read/write - regular future entries have: R=1, C=1, W=AP[2]
Field[2:0]	R	W	C	LM	X	ASR	Notes
3	✔		✔	✔			Data & Cap RO
7	✔	✔	✔	✔			Data & Cap RW

¹ This is used for the NULL capability encoding.

² This is used for the Root Executable and Root Data capability encoding.

The compressed permission format specifies a procedure for encoding architectural permissions. When encoding permissions the following rules are run once in order:

Table 54. Compressed AP-field YPERMC rules.
YPERMC Rule	Permission	Valid only if
`RV32-base-1`	C	R (supersedes base-1)
`RV32-base-2`	X	R
`RV32-base-3`	W	not(C) or LM
`RV32-base-4`	X	W or C
`RV32-base-5`	LM	C (supersedes base-2)
`RV32-base-6`	X	(C and LM) or not (C or LM)
`RV32-base-7`	ASR	W and C and X (supersedes base-3)

12.2.2. Zyhybrid Support

In addition to Table 51 Zyhybrid redefines the following parameter:

Table 55. RV32LYA parameter summary with Zyhybrid support.
Parameter	Value	Comment
AP_MAX	0x9	Value of the AP-field giving maximum permissions

The AP-field states shown in Table 56 are allocated, and the definition of the AP-field from both the Root Executable and Root Data capabilities are replaced.

Table 56. Encoding of architectural permissions for RV32LYA with Zyhybrid support
Quadrant 1: Executable capabilities - regular future entries have: X=1
Field[2:0]	R	W	C	LM	X	ASR	P-bit	Notes
1	✔	✔	✔	✔	✔	✔	1	Execute + Data & Cap RW + ASR + Integral Pointer Mode¹
3	✔		✔	✔	✔		1	Execute + Data & Cap RO + Integral Pointer Mode
5	✔	✔	✔	✔	✔		1	Execute + Data & Cap RW + Integral Pointer Mode
7	✔	✔			✔		1	Execute + Data RW + Integral Pointer Mode

¹ This replaces the Root Executable and Root Data capability encoding.

12.2.3. Zylevels1 Support

Zylevels1 adds LG-permission and SL-permission to the states in Table 53, including allocating some previously reserved entries. Table 57 and Table 58 show which entries have been allocated or modified to grant LG and/or SL.

Table 57. Newly allocated AP-field entries.
Quadrant	Entries	LG	SL	Comment
2	6	0	1	R, W, C, LM, SL granted
2	7	0	0	R, W, C, LM granted
3	6	1	1	R, W, C, LM, LG, SL granted

Table 58. Modified AP-field entries.
Quadrant	Entries	LG	SL	Comment
1	all	AP[2:0]<6	!AP[1]	LG conditionally granted, SL always granted
3	3	1	1	LG and SL granted
3	7	1	0	LG granted

The following YPERMC rules are defined:

Table 59. YPERMC rules if Zylevels1 and Zyhybrid are present.
YPERMC Rule	Permission	Valid only if
`RV32-l1-1`	LM	W or LG
`RV32-l1-2`	LG	LM
`RV32-l1-3`	SL	LM and W
`RV32-l1-4`	X	(C and LM and LG and SL) or (C and LM and LG and not W) or not (C or LM or LG or SL)

The following rules are run once in order:

RV32-base-1,2,3,4,5
RV32-l1-1,2,3,4
RV32-base-6,7

12.2.3.1. AP encoding and rules summary with Zylevels1 for RV32LYA

This section is non-normative - it summarizes the overall encoded permissions table and YPERMC rules when both Zyhybrid and Zylevels1 are implemented.

Table 60. Encoding of architectural permissions with Zyhybrid and Zylevels1
AP-encoding table
Quadrant 0: Non-capability data read/write
Field[2:0]	R	W	C	LM	LG	SL	X	ASR	Notes
0									No permissions
1	✔								Data RO
4		✔							Data WO
5	✔	✔							Data RW
Quadrant 1: Executable capabilities
Field[2:0]	R	W	C	LM	LG	SL	X	ASR	Notes
0	✔	✔	✔	✔	✔	✔	✔	✔	Execute + Data & Cap RW + ASR + Capability Pointer Mode¹
1	✔	✔	✔	✔	✔	✔	✔	✔	Execute + Data & Cap RW + ASR + Integral Pointer Mode¹
2	✔		✔	✔	✔		✔		Execute + Data & Cap RO + Capability Pointer Mode
3	✔		✔	✔	✔		✔		Execute + Data & Cap RO + Integral Pointer Mode
4	✔	✔	✔	✔	✔	✔	✔		Execute + Data & Cap RW + Capability Pointer Mode
5	✔	✔	✔	✔	✔	✔	✔		Execute + Data & Cap RW + Integral Pointer Mode
6	✔	✔					✔		Execute + Data RW + Capability Pointer Mode
7	✔	✔					✔		Execute + Data RW + Integral Pointer Mode
Quadrant 2: Restricted capability data read/write
Field[2:0]	R	W	C	LM	LG	SL	X	ASR	Notes
3	✔		✔						Data & Cap RO (without LM-permission)
6	✔	✔	✔	✔		✔			Data & Cap RW (with SL-permission, no LG-permission)
7	✔	✔	✔	✔					Data & Cap RW (no SL-permission, no LG-permission)
Quadrant 3: Capability data read/write
Field[2:0]	R	W	C	LM	LG	SL	X	ASR	Notes
3	✔		✔	✔	✔				Data & Cap RO
6	✔	✔	✔	✔	✔	✔			Data & Cap RW (with SL-permission)
7	✔	✔	✔	✔	✔				Data & Cap RW (no SL-permission)

¹ This replaces the Root Executable and Root Data capability encoding.

The following rules are run once in order:

Table 61. RV32LYA YPERMC rules if Zyhybrid and Zylevels1 are present.
YPERMC Rule	Permission	Valid only if
`RV32-base-1`	C	R (supersedes base-1)
`RV32-base-2`	X	R
`RV32-base-3`	W	not(C) or LM
`RV32-base-4`	X	W or C
`RV32-base-5`	LM	C (supersedes base-2)
`RV32-l1-1`	LM	W or LG
`RV32-l1-2`	LG	LM
`RV32-l1-3`	SL	LM and W
`RV32-l1-4`	X	(C and LM and LG and SL) or (C and LM and LG and not W) or not (C or LM or LG or SL)
`RV32-base-6`	X	(C and LM) or not (C or LM)
`RV32-base-7`	ASR	W and C and X (supersedes base-3)
`RV32-hybrid-1`	P-bit	X

13. Vector "V" Extension (RVY)

The Vector extension for RVY does not support capability tags in the vector registers. Consequently, vector stores will always clear capability tags in memory.

A future extension may allow capability tags to be stored in vector registers. Until that time, vector load and store instructions must not be used to implement generic memory copying in software, such as the memcpy() standard C library function, that requires capabilities to be preserved, because the vector registers do not hold capabilities, so the capability tags of any copied capabilities will be set to 0 in the destination memory.

Under RVY, vector loads and stores follow the standard rules for active elements:

Only active elements are subject to CHERI exception checks.
CHERI exceptions are only raised on fault-only-first loads if element 0 is both active and fails any exception checks.

Additionally, the standard RVY rule that all loads and stores where the base register is x0 are reserved applies to all vector memory access instructions.

The approach of using indexed loads with the base register set to the value zero and XLEN-wide offsets does not work well with CHERI as the authorizing capability must cover all of memory. If the authorizing capability is specified as x0 then the instruction encoding is reserved.

14. "Zylevels1" Extension for CHERI 2-Level Information Flow Control

Zylevels1 adds an Information Flow Control (IFC) mechanism to CHERI.

It adds:

One new "level" metadata bit (the GL flag) to classify capabilities as either global or local.
Checks to memory operations to control how capabilities can be shared.
Two new level-related permissions, Store Local (SL) and Load Global (LG).

The behavior is an extension as it may be replaced by a future scheme with more than two levels.

14.1. Added Metadata Bits (Permissions and Flags)

Bit

Type

Comment

GL flag

Information flow label

Used to classify capabilities as global or local.

SL-permission

Data memory permission

Used to filter the capability tags of stored capabilities.

LG-permission

Data memory permission

Used to filter the permissions of loaded capabilities.

14.1.1. The Capability Global (GL) Flag

The Global (GL) flag is a metadata bit used to track and control the propagation of capabilities. It works in conjunction with LG-permission and SL-permission to enforce information flow boundaries.

Global Capability: A global capability has GL flag=1.

Local Capability: A local capability has GL flag=0.

Like permissions, the GL flag can be cleared when creating a new capability from an existing one, but it can never be set unless derived from a superset capability which has the GL flag set.

14.1.2. Store Local Permission (SL)

Store Local Permission (SL) controls whether a local capability can be stored to memory.

If the authorizing capability lacks SL-permission then the stored capability has its capability tag set to zero.

SL-permission is dependent on both C-permission and W-permission as shown in Table 62.

14.1.3. Load Global Permission (LG)

Load Global Permission (LG) controls whether global capabilities can be loaded from memory. Loading a capability using an authorizing capability that lacks LG-permission clears the GL flag bit of the loaded capability, making it into a local capability. If the loaded capability is unsealed, its LG-permission is also cleared.

This behavior is similar to how LM-permission affects loaded capabilities (but note the difference in interaction with seals).

LG-permission is dependent on both C-permission and R-permission as shown in Table 62.

14.2. Interaction with Root Capabilities

The Root capabilities definitions are extended:

A Root Executable capability is required to grant LG-permission and to be a global capability.
A Root Data capability is required to grant both LG-permission and SL-permission and to be a global capability.

14.3. Interaction with YPERMC and YPERMR

Figure 14. Extended capability permissions bit field (see Figure 6)

The GL flag, SL-permission, and LG-permission fields are mapped into the capability permissions bitfield as used by YPERMC and YPERMR, as shown in Figure 14.

The GL flag is treated as a permission by YPERMC and YPERMR.

Unlike actual permissions it is not subject to the sealed check, and so YPERMC can produce a new capability with its GL flag cleared, even if the source capability is sealed.

14.3.1. Additional YPERMC permission rules

The following YPERMC permission rules are added:

Table 62. YPERMC permission rules for Zylevels1.
YPERMC Rule	Permission	Valid only if
Zylevels1-1	LG-permission	C-permission and R-permission
Zylevels1-2	SL-permission	C-permission and W-permission

14.4. Interaction with LY

Zylevels1 introduces two new constraints on capabilities loaded from memory. These constraints apply to LY and all other instructions that can load valid capabilities.

The permissions of the loaded capability may be reduced:

If the authorizing capability lacks LG-permission and the loaded capability is unsealed, both GL flag and LG-permission are cleared in the result.
If the authorizing capability lacks LG-permission and the loaded capability is sealed, GL flag is cleared in the result.

The absence of LG-permission affects the GL flag even of sealed capabilities because the latter is an information flow label rather than an architectural permission.

Table 63. LG-permission effects for loading capabilities
Auth cap field			Data cap field
R	C	LG	Tag	Sealed	Action
1	1	0	1	Yes	Load data capability with its GL flag cleared
		0	1	No	Load data capability with both its GL flag and LG-permission cleared
		All other cases			Load data capability with both its GL flag and LG-permission unmodified

14.5. Interaction with SY

Zylevels1 adds a constraint to SY and all instructions which can store valid capabilities.

The capability is stored with a capability tag set to one only if:

The capability being stored is a global capability, or
The authorizing capability grants SL-permission.

Otherwise, the capability is stored with the capability tag set to zero.

While a lack of LG-permission restricts flow by reducing permissions (labeling the loaded capability as local and stripping permissions), SL-permission restricts flow by invalidating capabilities (by setting the capability tag to zero).

Table 64. SL-permission effects for stored capabilities
Auth cap field			Data cap field
W	C	SL	GL	Notes
1	1	1	X	Store data capability unmodified
		0	1	Store data capability unmodified
		0	0	Store data capability with capability tag set to zero

14.6. Interaction with Capability Subsets

The definition of capability subset has an additional rule:

The GL flag of cap2 is zero, or the GL flag of cap1 is one.

This affects the behavior of YSS, YBLD and YSUNSEAL.

14.7. Rationale and Software Use Cases

The primary goal of Zylevels1 is to prevent storing pointers to specific memory regions in order to enforce security invariants. There are many uses of this mechanism, but two examples are listed below.

14.7.1. Restricting Capability Flow Between Regions or Domains (e.g., Stack-to-Heap or Kernel-to-User)

Software can use these features to prevent pointers from leaking between different memory regions or privilege domains that have different security properties. The general pattern is to mark sensitive capabilities as local (by clearing their GL flag) and ensuring that destination memory regions are accessed through authorities that lack the SL-permission.

Intra-program flow (e.g., Stack-to-Heap):

To prevent pointers to short-lived memory (like the stack) from leaking into long-lived memory (like the heap), software marks all stack pointers as local. If the authorities for the heap lack the SL-permission, any attempt to store a stack pointer into the heap results in the hardware invalidating the stored capability. This ensures stack pointers are strictly confined to registers and the stack itself.
Cross-domain flow (e.g., Kernel-to-User):

These features also apply across privilege boundaries. For example, a kernel can ensure that the capability authorizing a write to userspace (e.g., for a copy_to_user operation) lacks the SL-permission. Even if a kernel bug causes it to accidentally attempt to copy sensitive, local kernel capabilities into the user’s buffer, the hardware will invalidate those capabilities. This prevents unintentional leaks of usable capabilities to a less-privileged domain.

14.7.2. Enforcing Shallow vs. Deep No-Capture Guarantees

When passing a capability to a less-trusted function, a caller often wants to enforce "no-capture" guarantees, ensuring the callee cannot hold onto the capability after returning. Zylevels1 provides two levels of protection against capability capture.

Shallow no-capture (via GL flag and SL-permission):

The caller makes the argument capability local by clearing its GL flag. Because the callee’s global variables and persistent data structures are accessed through capabilities that lack the SL-permission, the callee cannot store the local argument capability into them. This enforces a shallow guarantee: the callee cannot capture the argument itself, but it might still load a global capability through the argument and capture that.
Deep no-capture (via LG-permission):

To solve the shallow limitation, the caller also removes the LG-permission from the argument capability. If the callee loads any capability through this argument, the hardware automatically removes the loaded capability’s GL flag and LG-permission. This enforces a deep no-capture guarantee: the callee is prevented from capturing the argument and anything reachable from it.

15. "Zyhybrid" Extension for CHERI Pointer Modes

Zyhybrid is an optional extension to RVY which adds the ability to execute standard RISC-V (RV32[IE]/RV64[IE]) code on an RVY hart, while still allowing access to RVY instructions.

The ability to choose between these two behaviors is referred to as switching between CHERI Pointer Modes. This extension introduces a new bit in the capability metadata, the P-bit, to control these modes. When installed in pc, the P-bit determines the current execution mode.

The P-bit determines whether pointers are interpreted as standard RISC-V Integer Addresses or as CHERI Capabilities.

Zyhybrid adds the instructions in Table 65 to query and update the current mode, and switch modes.

Table 65. Zyhybrid instruction extension
Mnemonic	Function
YMODEW	Set capability execution mode
YMODER	Read capability mode
YMODESWY	Switch execution to Capability Pointer Mode
YMODESWI	Switch execution to Integral Pointer Mode

This extension also adds the default data capability (ddc), an unprivileged CSR used to authorize data memory accesses when in Integral Pointer Mode.

Together with pc, ddc enables confining execution to a compartment (or sandbox) where all memory accesses are bounded. These compartments provide binary compatibility for existing RV32[IE]/RV64[IE] code. Provided that the execution environment has set up ddc and pc appropriately, non-CHERI-aware programs will execute unmodified within these bounds.

For detailed examples of how this extension facilitates co-existence and sandboxing, see Appendix B.

RVY implementations that support Zyhybrid are typically referred to as CHERI Hybrid. Implementations which do not support Zyhybrid are referred to as CHERI Purecap and always operate in Capability Pointer Mode.

Because the base architecture remains RVY, the custom encoding space re-allocated to CHERI remains re-allocated.

If an implementation needs to be able to access custom encodings in spaces which have been reallocated, then privileged software should disable CHERI entirely.

15.1. CHERI Pointer Modes

The two CHERI Pointer Modes are:

Capability Pointer Mode

All pointer operands to instructions are interpreted as capabilities. Instruction semantics are as defined in the RVY base ISA.

Integral Pointer Mode

All pointer operands to instructions are interpreted as for the RV32[IE]/RV64[IE] base ISA, i.e., as plain memory addresses without metadata. While the base ISA remains RVY this mode provides backwards compatibility with RV32[IE]/RV64[IE]. Operands are still checked against a capability, but instead of using the metadata from the full YLEN register, the bounds and permissions are obtained from ddc (for data accesses) or pc (for instruction accesses).

ARC-NOTE: Please confirm the names of the two CHERI Pointer Modes. The names Integral Pointer Mode and Capability Pointer Mode distinguish interpreting pointer operands as plain integer addresses vs. full capabilities with metadata, and match compiler terminology (integral vs. non-integral pointers).

If RVC encodings are supported, load/store encodings will revert to their non-CHERI encodings, such as C.LY reverting to C.FSD for RV64/RV64Y. This behavior is summarized in Table 35, Table 36, Table 37 and Table 38.

Instructions which are modified on an RVY architecture (see Table 22) revert to their standard RV32[IE]/RV64[IE] behavior.

All RVY instructions and CSRs are available in addition to all standard RV32[IE]/RV64[IE] instructions and CSRs.

Non-standard extensions with encodings in remapped custom space are not supported.

The authorizing capability for data access is ddc (as opposed to rs1). That is, all data memory accesses are implicitly authorized by ddc with only the memory address sourced from rs1.

ddc is also used to authorize memory instructions added by RVY (such as LY and SY).

The restriction of not allowing rs1=x0 is removed and all data memory accesses are permitted to use x0 as the base address.

Capability memory access instructions are also permitted to have rs1=x0.

All instruction fetches are authorized by pc.

Prefetches are also checked against the appropriate CSR (ddc for PREFETCH.W (RVY) and PREFETCH.R (RVY), pc for PREFETCH.I (RVY)) and execute as nops if authorization fails.

Authorization of prefetches is useful to mitigate against side channels, e.g., preventing userspace from fetching kernel memory into a cache.

Reads to extended CSRs are XLEN-wide, and writes use the semantics of the YADDRW instruction (see Zicsr (RVY modified behavior)).

The CHERI Pointer Mode is key in providing backwards compatibility with the base RV32[IE]/RV64[IE] ISAs.

The CHERI Pointer Mode is always Capability Pointer Mode on implementations that support RVY, but not Zyhybrid.

Software is Purecap if it uses CHERI capabilities for all memory accesses (including loads, stores, and instruction fetches). Purecap software requires RVY. Software is Hybrid if it uses both integer addresses and CHERI capabilities. Hybrid software requires both RVY and Zyhybrid.

15.1.1. CHERI Pointer Mode Encoding

The current CHERI Pointer Mode is controlled by the P-bit in the capability AP-field of pc. The P-bit (P) is only architecturally relevant for capabilities granting X-permission, as it only affects execution state when installed into pc.

While the P-bit can be read or written explicitly using YMODER and YMODEW, these instructions do not change the current execution mode. The mode changes when a capability is installed into pc (e.g., via an indirect jump, see Section 15.1.2).

The P-bit is only valid in capabilities that grant X-permission (see Section 15.1.3). Capabilities that do not grant X-permission therefore do not carry a meaningful P-bit, so YMODEW cannot modify their P-bit. Zyhybrid requires a capability encoding that supports transport of the P-bit (e.g., RV64LYA).

CHERI Pointer Mode (P)=0 indicates Capability Pointer Mode.
CHERI Pointer Mode (P)=1 indicates Integral Pointer Mode.

When executing YPERMC, if X-permission is removed while the P-bit is set to one, and the capability encoding still permits representing the P-bit, then the P-bit must be set to zero.

15.1.2. Changing CHERI Pointer Mode

The P-bit of pc can be updated by the instructions listed in Table 66:

Table 66. Zyhybrid instructions that can perform mode changes
Mnemonic	From mode	Description
JALR (RVY)	Capability Pointer Mode	Jump to capability register, and link to capability register
YMODESWI	Capability Pointer Mode	Switch execution to Integral Pointer Mode
YMODESWY	Integral Pointer Mode	Switch execution to Capability Pointer Mode

When JALR (RVY) copies rs1 into pc it includes copying the P-bit and so setting the CHERI Pointer Mode of the target instruction.

The P-bit of a X-permission-granting capability can be read and written by the instructions listed in Table 67:

Table 67. Zyhybrid instructions to observe and update the mode in a capability
Mnemonic	Description
YMODEW	Set capability execution mode
YMODER	Read capability mode

In addition to the mode switching instructions, the current mode can also be updated by setting the P-bit of a target capability using YMODEW followed by a JALR (RVY).

15.1.3. Additional YPERMC permission rule

The following YPERMC permission rule is added:

Table 68. YPERMC permission rules for Zyhybrid.
YPERMC Rule	Permission	Valid only if
Zyhybrid-1	P-bit	X-permission

15.1.4. Observing the CHERI Pointer Mode

The effective CHERI Pointer Mode cannot be determined just by reading the P-bit from the pc since it also depends on the execution environment. The following code sequence demonstrates how a program can observe the current, effective CHERI Pointer Mode. It will write, to x1, the value 1 for Capability Pointer Mode (where AUIPC (RVY) derives a valid capability from the pc) or 0 for Integral Pointer Mode (where auipc reverts to its standard behavior and writes an integer result with a clear capability tag):

auipc x1, 0
ytagr x1, x1

15.2. Added instructions

15.2.1. YMODEW

Synopsis: Set capability execution mode
Mnemonic: YMODEW rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Copy rs1 to rd.

If rs1 is sealed or if rs1 fails any integrity checks, then set rd.tag=0.

Otherwise, if rs1 grants X-permission then update the P-bit of rd to:

Capability Pointer Mode if the least significant bit of rs2 is 0, or,
Integral Pointer Mode if the least significant bit of rs2 is 1.

The value of rs1.tag is not checked and is propagated unchanged to rd.

Prerequisites

Zyhybrid

Operation

  let cap = C(cs1);
  let mode = execution_mode_encdec(X(rs2)[0 .. 0]);

  let cap = clearTagIf(cap, capIsSealed(cap));
  let hasMode = not(permsMalformed(cap)) & canX(cap);
  let newCap = if hasMode then setCapMode(cap, mode) else cap;

  C(cd) = newCap;
  RETIRE_SUCCESS

15.2.2. YMODER

Synopsis: Read capability mode
Mnemonic: YMODER rd, rs1
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Decode the CHERI Pointer Mode from the capability in rs1 and write the result to rd.

Set rd to 0 if rs1 does not grant X-permission

Set rd to 0 if any integrity checks fail.

Otherwise set rd according to rs1 's CHERI Pointer Mode (P-bit):

Set rd to 0 for Capability Pointer Mode, or,
Set rd to 1 for Integral Pointer Mode.

The value of rs1.tag does not affect the result.

Prerequisites

Zyhybrid

Operation

let capVal = C(cs1);
X(rd) = zero_extend(execution_mode_encdec(getCapMode(capVal)));
RETIRE_SUCCESS

15.2.3. YMODESWY, YMODESWI

Synopsis: Switch execution mode to Capability Pointer Mode (YMODESWY), or Integral Pointer Mode (YMODESWI), 32-bit encodings
Mnemonics: YMODESWY
YMODESWI
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description

Set the current CHERI Pointer Mode in pc.

YMODESWY: If the current mode in pc is Integral Pointer Mode (1), then the P-bit in pc is set to Capability Pointer Mode (0). Otherwise no effect.
YMODESWI: If the current mode in pc is Capability Pointer Mode (0), then the P-bit in pc is set to Integral Pointer Mode (1). Otherwise no effect.

Prerequisites

Zyhybrid

Operation

This listing is manually maintained rather than extracted from the Sail model.

let target : ExecutionMode = match op {
  MODESW_CAP => CapPtrMode,
  MODESW_INT => IntPtrMode,
};
set_next_pcc(setCapMode(PCC, target));
RETIRE_SUCCESS

15.3. Added State

Zyhybrid adds the YLEN-wide CSR shown in Table 69.

Table 69. Unprivileged YLEN-wide CSRs added in Zyhybrid
YLEN CSR	Permissions	Description
ddc	URW	Default Data Capability

15.3.1. Default Data Capability CSR (ddc)

ddc is a read-write, user mode accessible capability CSR. It does not require ASR-permission in pc for writes or reads. Similarly to pc authorizing all control flow and instruction fetches, this capability register is implicitly checked to authorize all data memory accesses when the current CHERI mode is Integral Pointer Mode.

The EEI initializes ddc such that the program can run successfully (e.g., by setting it to be a Root Data capability).

Figure 15. Unprivileged default data capability register

16. Zicsr (RVY modified behavior)

Table 70. Zicsr (RVY modified behavior) instructions
Mnemonic	Function
CSRRW (RVY)	CSR write
CSRRS (RVY)	CSR set
CSRRC (RVY)	CSR clear
CSRRWI (RVY)	CSR write (immediate form)
CSRRSI (RVY)	CSR set (immediate form)
CSRRCI (RVY)	CSR clear (immediate form)

Zyhybrid makes Integral Pointer Mode available.

In addition to the rules in Section 2.6.4, when executing in Integral Pointer Mode, extended CSRs are always accessed as XLEN-wide for backwards compatibility. The final written value is determined using the semantics of the YADDRW instruction. YLEN-bit CSRs (including ddc) continue to be accessed as YLEN-wide.

16.1. CSRRWI, CSRRS, CSRRSI, CSRRC, CSRRCI (RVY)

Synopsis: CSR access (CSRRWI, CSRRS, CSRRSI, CSRRC, CSRRCI) 32-bit encodings for RVY
Mnemonics: csrrs rd, csr, rs1
csrrc rd, csr, rs1
csrrwi rd, csr, imm
csrrsi rd, csr, imm
csrrci rd, csr, imm
Encoding

Description

These CSR instructions have extended functionality for accessing YLEN-bit CSRs and extended CSRs.

Access to XLEN-bit CSRs is as defined in Zicsr.

Zicsr rules are followed when determining whether to read or write the CSR. Suppressed read or write actions have no side-effects on the CSR.

For the instructions on this page, all writes are XLEN bits only and use the semantics of the YADDRW instruction to determine the final write data.

Read data from YLEN-bit CSRs is always YLEN bits.

Read data from extended CSRs is YLEN bits in Capability Pointer Mode or XLEN bits in Integral Pointer Mode.

Permissions

Accessing CSRs may require ASR-permission.

Prerequisites

RVY, Zicsr, optional Zyhybrid

Operation

  let isWrite = op == CSRRW | rs1 != zeros();
  // CSRRS[I] and CSRRC[I] always read. CSRRW[I] reads if the destination
  // register is not x0.
  let isRead = op != CSRRW | rd != zeros();

  if not(check_CSR(csr, cur_privilege(), isWrite))
  then { handle_illegal(); RETIRE_FAIL }
  else if not(ext_check_CSR(csr, cur_privilege(), isWrite))
  then { ext_check_CSR_fail(); RETIRE_FAIL }
  else handle_csr_op(csr, rs1, rd, is_imm, op, isWrite, isRead)

16.2. CSRRW (RVY)

Synopsis: CSR access (CSRRW) 32-bit encodings for RVY
Mnemonic: csrrw rd, csr, rs1

Encoding

Description

CSRRW has extended functionality for accessing YLEN-bit CSRs and extended CSRs.

Access to XLEN-bit CSRs is as defined in Zicsr.

CSRRW accesses to YLEN-bit CSRs read YLEN bits into rd and write YLEN bits of rs1 into the CSR.

CSRRW accesses to extended CSRs read YLEN bits into rd and write YLEN bits of rs1 into the CSR. However, if Zyhybrid is supported and the current mode is Integral Pointer Mode, accesses to extended CSRs are XLEN-wide. The final write data for extended CSRs is determined using the semantics of the YADDRW instruction.

In all cases, when writing YLEN bits of rs1, if any integrity checks fail then set the capability tag to zero before writing to the CSR.

Permissions

Accessing CSRs may require ASR-permission.

Prerequisites

RVY, Zicsr, optional Zyhybrid

Operation

  let isWrite = op == CSRRW | rs1 != zeros();
  // CSRRS[I] and CSRRC[I] always read. CSRRW[I] reads if the destination
  // register is not x0.
  let isRead = op != CSRRW | rd != zeros();

  if not(check_CSR(csr, cur_privilege(), isWrite))
  then { handle_illegal(); RETIRE_FAIL }
  else if not(ext_check_CSR(csr, cur_privilege(), isWrite))
  then { ext_check_CSR_fail(); RETIRE_FAIL }
  else handle_csr_op(csr, rs1, rd, is_imm, op, isWrite, isRead)

17. "Zyseal" Extension for CHERI Capability (Un)Sealing

This chapter is not part of the v1.0 ratification package and is ready to freeze.

17.1. Explicit Sealing and Unsealing Operations

The RVY base architecture defines sealed capabilities. The YBLD, JALR (RVY), and YSUNSEAL instructions and the YSEAL instruction with CT-field > 1 allow platforms to build and consume sealed capabilities in particular ways. This extension introduces a more general, intentional (that is, capability-mediated) mechanism for the introduction and elimination of sealed capability forms, in keeping with CHERI’s principle of intentional use.

This extension first introduces a fundamentally new kind of capabilities, "type capabilities", that do not authorize memory access. Instead, their address and bounds identify a range of CT-field values. As subsequently detailed, these capabilities, and their new permissions, will serve as authorizing capabilities to new instructions which perform transformations of other capabilities' CT-field fields:

Constructing a sealed capability with type T from an unsealed capability requires the authority to seal at type T, and
Constructing an unsealed capability from a sealed capability with type T requires the authority to unseal at type T.

This extension does not define "type conversion" transformations directly between sealed capability types.

17.2. Usable CT-field Values Are Encoding Specified

The capabilities used to mediate (un)sealing are, like memory capabilities, associated with an XLEN-bit address space. However, capability encodings have fewer than XLEN bits devoted to storing CT-field values. As such, encodings will specify what CT-field values can be used to seal capabilities (recall that encodings must support representing unsealed capabilities). The remainder of the address space described by type capabilities is available for software use.

17.3. Single Address Space Encodings

Capability encodings are permitted to conflate memory and type address spaces, such that one capability may authorize both memory access to a location and (un)sealing with a type of equal numeric value. Indeed, the encoding of RV32LYA/RV64LYA is one such encoding. Ideally, such encodings should permit separate manipulation of (un)sealing permission and memory access permissions, so that software can segregate the address spaces even when the encoding does not do so intrinsically.

17.4. Added Architectural Permissions (AP) Bits

Table 71. Zyseal YPERMC rules.
Permission	Type	Comment
SE-permission	CT-field permission	Grants sealing authority
US-permission	CT-field permission	Grants unsealing authority

Seal Permission (SE): Permit the bearer to YSEAL capabilities with type values that are within the bounds of this capability.

Unseal Permission (US): Permit the bearer to YUNSEAL capabilities with type values that are within the bounds of this capability.

17.5. Interaction with YPERMC and YPERMR

Figure 16. Extended capability permissions bit field (see Figure 6)

The SE-permission and US-permission fields are mapped into the capability permissions bitfield (Figure 6), used by YPERMC and YPERMR, as shown in Figure 16.

17.6. Extended and New Instructions

YSEAL: A yseal rd, rs1, rs2 instruction will use the provided sealing authority of rs1 to copy the unsealed capability in rs2 into rd and seal it with type rs1.address, assuming rs1 has a set capability tag, is in bounds, and grants SE-permission.
YUNSEAL: A yunseal rd, rs1, rs2 instruction will use the provided unsealing authority of rs1 to copy the sealed capability in rs2 into rd and unseal it, so long as rs2.ct = rs1.address.

17.6.1. YSEAL

The behavior of the YSEAL instruction is extended to allow rs1≠x0.

Extended Description if rs1≠x0

Construct, into rd, a sealed copy of the unsealed capability in rs2, using the type and authority from rs1.

Copy rs2 into rd, and then…

Set the capability tag of the capability in rd to zero if any of the following hold:
- rs2 is sealed (has a non-zero CT-field value)
- rs1 has a clear capability tag
- rs1 does not grant SE-permission
- The address of rs1 is out of bounds
- The address of rs1 is not a CT-field value that the capability encoding can encode on the capability in rs2
Set the CT-field of rd to the address of rs1.

YSEAL uses the (in-bounds) address of the authority in rs1 as the type in the resulting capability. If the authority has a nontrivial range, software can use YADDRW to select which type should be used.

If a capability encoding also entails the presence of Sealed entry point capability types, it will be possible for software (bearing suitably permissive capabilities) to seal and unseal the sentry types that that encoding defines. This is deliberate. Software should ensure that the capabilities requisite for such operations are attenuated, confined to sufficiently trusted components, and/or destroyed.

Some capability encodings correlate non-zero CT-field values with other aspects of capabilities, notably permissions. That is, some encodings may be able to represent a valid capability with a given non-zero CT-field only if other properties of that capability hold, such as it granting, or not granting, a particular permission. As such, YSEAL may set the capability tag of the result in rd to zero depending on these other aspects of its rs2 input, even if some capabilities can be sealed with the type called for by the address of rs1.

Operation

SAIL TO BE UPDATED

17.6.2. YUNSEAL

Synopsis

Unseal a capability authorized using an unsealing capability

Mnemonic

YUNSEAL rd, rs1, rs2

Encoding

TODO

Description

Construct, into rd, an unsealed copy of the capability in rs2, using the type and authority from rs1.

Copy rs2 into rd, and then…

Set the capability tag of the capability in rd to zero if any of the following hold:
- rs1 has a clear capability tag
- rs1 does not grant US-permission
- The address of rs1 is out of bounds
- The address of rs1 is not equal to the CT-field of the capability in rs2.
Propagate permissions from rs1 onto rd:
- If the Zylevels1 extension is implemented, and the capability in rs1 does not grant GL flag, use the semantics of the YPERMC instruction to clear the GL flag of the capability in rd.
  
  (That is, the resulting capability in rd will grant GL flag if and only if the capabilities in rs1 and rs2 both grant GL flag.)
- Other extensions may impose similar constraints.
Set the CT-field of the capability in rd to zero.

YUNSEAL requires exact equality of the authority’s type, rs1.address, and the to-be-unsealed capability’s type, rs2.ct. If it is desirable to unseal one of several capability types, using an authority with nontrivial range, software can use YTYPER and YADDRW to make these values match. Future extensions may specify a fused "copy type" operation, as was present in the CHERI v9 ISA.

Operation

SAIL TO BE UPDATED

18. "Zybndsrdw" Extension for Bounding to Representable Lengths

This chapter is not part of the v1.0 ratification package and is ready to freeze.

This extension adds a single instruction, YBNDSRDW, which writes capability bounds while rounding down to ensure precise representability of the result. In contrast to YBNDSRW, it does not alter the requested lower bound (capability base).

This instruction is useful when exposing (byte) buffers across trust domains. Given a buffer, an address therein, and a length after that address of elements to be revealed to a different trust domain, software in the originating trust domain wishes to compute the largest span, starting at the cursor and up to the length of elements to be shared, that can be precisely represented with a capability. Rounding the base down and/or the length up (as with YBNDSRW) risks exposing buffered data in the buffer not suitable for exposure to the different trust domain in question. While the originating domain could instead make a series of exposures, each sufficiently small to be guaranteed to be representable, it is a significant improvement to allow capability representation itself to determine the largest safe exposure size.

18.1. YBNDSRDW

Synopsis: Write capability bounds, rounding down if required
Mnemonic: YBNDSRDW rd, rs1, rs2
Encoding

The instruction encoding is not final and is highly likely to change prior to v1.0

Description: Copy the capability from register rs1 to register rd. Set the base address of its bounds to the value of rs1.address.

Round down the requested length, in rs2[XLEN-1:0], by the smallest amount necessary to guarantee that it is precisely representable given this base address.

Set rd.tag=0 if rs1.tag=0, rs1 is sealed or if rd 's bounds exceed rs1 's bounds.

Set rd.tag=0 if rs1 fails any integrity checks.

If the result in rd has its capability tag set to one, then its base is rs1.address, as with YBNDSW. The result in rd will be zero-length if and only if rs2[XLEN-1:0] is zero. In practice, capability encodings offer byte-granularity of bounds for capabilities of lengths up to some threshold between a few hundred and a few thousand bytes, depending on the encoding; for requested lengths below this threshold, no rounding is required, and otherwise this threshold serves to guarantee a minimum length return from YBNDSRDW.

YBNDSRDW finds the largest length l that is both less than or equal to the requested rs2[XLEN-1:0] and precisely representable given a particular base address, b. A length l is precisely representable given base b if a YBNDSW instruction…

whose rs2 register holds l and
whose rs1 register holds a capability…

— whose address is b and

— whose bounds cover the entire address space

produces a result in rd whose capability tag is set to one. This value may be efficiently found, for most capability encoding schemes, by counting trailing zeros in the desired base address and computing the length that is a maximal mantissa shifted left by that count (that is, using the number of trailing zeros in the base address as the value’s exponent).

Operation

SAIL TO BE UPDATED

19. RVY Specializations for Microcontroller Systems

This chapter is not part of the v1.0 ratification package and is ready to freeze.

19.1. The Zycheriot Unprivileged ISA Extension

This section defines a series of small changes to the RVY and RVYE unprivileged base architectures that serve to specialize it for microcontroller environments. These changes are based on, but are not exactly isomorphic to, the prior CHERIoT RV32E-based ISA.

Some of these changes are intended to work in concert with their privileged counterparts found in the Smycheriot extension.

19.1.1. Required Extensions

Zycheriot assumes the presence of both the Zylevels1 and Zyseal extensions. The present specification presumes the absence of both the Zyhybrid and no support for rs1=x0 passed to the YSEAL instruction, which bypasses the authority.

While Zycheriot is nominally compatible with Zyhybrid, and particular instantiations may opt to permit disabling CHERI, we have not yet found a compelling reason to formally specify this composition.

While Zycheriot is nominally compatible with CT-field=1, the operating system written for CHERIoT has a security model that presumes the absence of ambient sealing, and so this specification does not define any ambiently available sentry types.

19.1.2. Refining CHERI Capabilities

19.1.2.1. Software Defined Permissions

Zycheriot defines SDPLEN, the number of software-defined permissions, to be 1. We denote this one user permission as U0.

19.1.2.2. Root Permission Sets

Zycheriot defines three Root capability values, each of which has a set capability tag, is unsealed, has bounds that span its associated address space, and has GL flag set. That is, they differ only in their granted permissions, thus:

Its Root Executable capability value grants exactly all of X, R, C, LM, ASR, and LG.
Its Root Data capability value grants exactly all of R, W, C, LM, LG, and SL.
It defines a root sealing capability that grants exactly all of SE, US, and U0.

This set of root capabilities satisfies the following properties, by construction. Any further extension which adds or modifies capability permissions must ensure that its revised or additional root capabilities do so as well. Since all capabilities in the system must trace their provenance back to a root capability, these properties will necessarily remain true through any series of YPERMC transitions.

w-nand-x: At most one of X or W may be set.
mem-nand-ct: The SE, US, and U0 collectively conflict with either of the R or W permissions.

That is, a capability may grant permissions from at most one of these two sets; this serves to partition capabilities that refer to memory addresses from those that refer to CT-field values (those granting SE and/or US) or uninterpreted integers (those granting U0). Capabilities granting no permissions from the union of these sets are not distinguished.

19.1.2.3. Permission Transition Constraints

Zycheriot requires a RVY base that is using a compressed permission scheme (that is, one has its LY p parameter set to pc).

In addition to the constraints on permission transitions defined across the base RVY ISA and the Zylevels1 extension, Zycheriot adds two additional constraints, shown in Table 72.

Table 72. Zycheriot Permission Transition Rules
YPERMC Rule	Permission	Valid only if
Zycheriot-1	X-permission	C-permission and R-permission
Zycheriot-2	SL-permission	R-permission

These constraints enable more compression of capability permissions by disallowing the least useful permission combinations. In particular, ABIs usually require code to be readable to enable PC relative access to constant data, and write-only store-local capabilities are not required given that store-local is generally used only for thread stacks.

19.1.2.4. Capability Types

Zycheriot introduces several kinds of sentry capabilities, imposing a degree of control flow integrity, by giving architectural semantics to several CT-field values. The presence of, and handling constraints for, these sentry types are important to unprivileged software. In particular, Zycheriot uses five CT-field values (1 through 5, inclusive) to modify the behavior of JALR (RVY):

All of these values are defined to be Sealed entry point capability values. Thus, sealed capabilities with any of these CT-field values may be passed as inputs to JALR (RVY) and will be unsealed prior to installation into pc.
Two values, 4 and 5, are used by JALR (RVY) and JAL (RVY) to seal the return capability when rd is ra. The choice between the two depends on privileged machine state. If rd is not ra, then the return capability is not sealed.
JALR (RVY) is given conditional behavior based on the register selectors used, as detailed in Table 73. Prohibited combinations of register selectors and CT-field value will cause the target pc to have a clear capability tag and so raise a CHERI Instruction Access Fault.

In light of the last rule, we call capabilities granting X-permission and…

with a CT-field value of 1, 2, or 3 "forward" sentries.
with a CT-field value of 4 or 5 "backward" sentries.

The Smycheriot extension to privileged architecture draws further distinction between the various forward and backward types.

Table 73. JALR (RVY) Conditional Behavior
rs1	rd	Permitted rs1 CT-field-s	Comments
any	`ra`	0, 1, 2, 3	Function call
`ra`	`null`	4, 5	Function return
other operands		0, 1	Non-standard control flow (e.g., tail calls)

All of the Sealed entry point capability CT-field values defined herein need only be encodable if the capability grants X-permission, as required by JALR (RVY) of the capability to be installed into pc. Zycheriot imposes no architectural requirement for, or semantics upon, capabilities that do not grant X-permission and have one of these CT-field values.

The use of different Sealed entry point capability CT-field values for "forward" control flow arcs (in which JALR (RVY) writes a capability) and "backward" arcs (in which it does not) means that functions cannot return "backwards" to a (forward) function pointer and, dually, that a function pointer cannot be substituted for a return pointer.

19.2. An RV32LYmw9e14r0as11pc Common Base Architecture

It is intended that Zycheriot and Smycheriot be instantiated (in tandem) atop a RV32Y base architecture that is at least as expressive as LYmw9e14r0as11pc and in combination with one of the capability encoding schemes discussed below (or a future, possibly vendor-specific, capability encoding). Software written for this common subset architecture will be source compatible across encoding formats and with more expressive variants of RV32Y.

This parameter list needs to be updated.

The parameters set by this common architecture are:

mw9 and e14: A mantissa width of 9 and expressibility of capability exponents from 0 (inclusive) to 14 (inclusive). Together, these ensure that capabilities can precisely capture capabilities whose lengths are between 1 and 2⁹ - 1 times a power of 2 between 1 (inclusive) and 2¹⁴ (inclusive) or is 2²⁴.

The latter of these is sufficient to represent the entire 32-bit address space at a granularity (that is, bounds alignment) of 16MiB.
r0: Software must not presume that taking a capability out of bounds, other than the "one past the end" position required by C, will leave it tagged.
as11: The shift used by AUIPC (RVY) (and similar instructions) is 11.
pc: The set of capability permissions that can be expressed is subject to compression. For these bases, the only compression used is a consequence of the roots and permission transition rules found in the base architecture and present extensions (such as Zycheriot).

TODO: Can we refactor RV32Y’s compression scheme out to something like the Zycheriot ISA perspective, too?

As a reminder, these parameters constrain the behavior of RVY instructions such as YBNDSW and YPERMC.

19.3. The RV32LYenccheriot1 CHERI Capability Encoding Scheme

This section describes an in-memory format and properties of a capability encoding intended for RV32Y. It is specifically designed to be a suitable substrate for RV32Y_Zycheriot systems.

This is the encoding used for the existing CHERIoT ISA. Compared to RV32LYA it:

supports more precise bounds for a given capability length using slightly simpler hardware
adds features for building rich compartmentalization, such as extra permissions and sealing types.

The first of these is achieved by:

Using two extra bits for the bounds encoding
Giving the exponent its own field instead of embedded it in the T and B fields
Saving space by reducing the exponent in size by one and dropping support for exponents in the range 15 to 23. This limits the precision for the bounds of large capabilities (>= 8MiB).
dropping guaranteed out-of-bounds representability (except "one past the end")

These tradeoffs are aimed at microcontroller implementations where precise bounds on small capabilities are important to save memory on padding, large capabilities are unlikely and microarchitectural complexity should be minimised.

19.3.1. Capability Encoding

The components of a capability, except the capability tag, are encoded as shown in Figure 17.

Figure 17. Capability encoding for RV32LYenccheriot1

Field

Description

Reserved bit

Global bit, as in GL.

Permissions, see CHERIoT permission encoding.

Sealed object Type, see CHERIoT Capability Type (CT) Encoding.

Exponent field for bounds encoding, see CHERIoT Capability Bounds Encoding.

Top field for bounds encoding, see CHERIoT Capability Bounds Encoding.

Base field for bounds encoding, see CHERIoT Capability Bounds Encoding.

Address

The address field as per Figure 1.

The reserved bit (the R field) is defined to be 0 in the encoding of the root capabilities given below, and is not mutable by any instruction defined in RVY, and so will necessarily be 0 in all valid capabilities derived therefrom. It is therefore available for use by suitable extensions or derivative encodings.

The encoding impacts the following CHERI extensions:

Zyhybrid: This encoding does not support Zyhybrid: there is no mode bit so only purecap mode is supported.
Zylevels1: This encoding supports Zylevels1. Its GL flag is encoded in the G bit, and its GL and SL permissions are encoded in the P field.
Zyseal: This encoding supports Zyseal. The SE and US permissions are encoded in the P field.

19.3.1.1. Capability Encoding Parameter Summary

This encoding is suitable for use with RV32Y base architectures whose parametric requirements are no stronger than those given in Table 74. While not strictly required, it is recommended to use this encoding with RV32Y base architectures whose AUIPC shift (as) RVY parameter is 11 or smaller. Known extension (in)compatibilities are listed in Table 75.

Table 74. RV32LYenccheriot1 RVY Parameter Values
Parameter	Value	Comment
mw	9	Mantissa width
e	14	Exponent limit before jumping to maximum exponent
rc	0	Representable range between base and top only
p	pc	Compressed permission encoding
enc	Cheriot1	Encoding variant

Table 75. RV32LYenccheriot1 extension summary
Extension	Comment
Zyhybrid	Not supported (P-bit not defined)
Zylevels1	Compatible (recommended)
Zyseal	Compatible (recommended)
Zycheriot	Compatible (recommended)
Smycheriot	Compatible (recommended)
All RVY versions of other standard extensions	Compatible if the extension is compatible with RV32E

Table 76. RV32LYenccheriot1 Feature summary
Feature	Comment
Representable range	Between base and top only (inclusive)
Permission encodings	Not all combinations can be represented

19.3.1.2. Permissions Encoding

Similarly to RV32LYA, capability permissions are encoded in a compressed form. The encoding herein exactly reflects the Root Permission Sets and Permission Transition Constraints of Zycheriot.

The permission encoding space is split into quadrants using P[3:2]. Each quadrant may include some fixed permissions (indicated with ✔) and some dependent permissions encoded using P[2:0].

Quadrant 0 is used to encode permissions that authorize sealing (see Zyseal) and also the single software-defined permission, U0. It can also encode 'no permissions'.
Quadrant 1 encodes executable capabilities along with the dependent permission ASR and optional permissions LM and LG.
Quadrant 2 is subdivided into octants:
- Octant 4 encodes combinations of R and W without C with the redundant not-R and not-W used to encode write-only with C.
- Octant 5 encodes read-only capabilities with C and the dependent permissions LM and LG.
Quadrant 3 encodes permissions with R, W, C and the dependent permissions SL, LM and LG. The meaning for bits [2:0] are shown in CHERIoT permission encoding.

P[4:0]

Decoded Permissions

P[4:3]

P[2]

P[1]

P[0]

ASR

Notes

P[2]

P[1]

P[0]

Sealing

ASR

✔

P[1]

P[0]

✔

P[2]

Executable

✔

Cap WO

P[1]

P[0]

Data RW (R and/or W)

✔

P[1]

P[0]

Cap RO

✔

P[2]

P[1]

P[0]

Cap RW

For example, if P[4:3] = 01 and P[2:0] = 101 then the decoded permissions are R, C, X, LG, and ASR. Note that there is no encoding for an "Infinite" capability with all permissions. In particular there is no overlap between the software and sealing permissions (U0, SE, US) and any other permissions (recall mem-nand-ct from Root Permission Sets) and X is mutually exclusive with W (recall w-nand-x).

To encode a set of permissions resulting from YPERMC, the first of the following rules to apply is used:

If the permissions include X, R and C, then encode ASR, LM and LG using the executable format (P[4:3] = 01).
If the permissions include R, W and C then encode SL, LM and LG using the Cap RW format (P[4:3] = 11).
If the permissions include R and C then encode LM and LG using the Cap RO format (P[4:2] = 101).
If the permissions include W and C then encode using the Cap WO format (P[4:0] = 10000).
If the permissions include R or W then encode using the Data RW format (P[4:0] = 100RW).
Encode U0, SE and US using the sealing format (P[4:3] = 00).

This procedure will automatically apply the rules defined in Permission Transition Constraints to legalize permission sets. If any of the requested permissions cannot be represented using the chosen format, then the they are dropped. For example, if the requested permissions are R, LG and X, then rule 5 applies and the resulting permissions will be just R.

The three Root capabilities defined by Zycheriot have permissions encodings as shown in CHERIoT root capabilities table.

Root name P[4:0] Permissions

Sealing

00111

U0, SE, US

Root Executable

01111

X, R, C, LM, LG, ASR

Root Data

11111

W, R, C, LM, LG, SL

19.3.1.3. Capability Type (CT) Encoding

Capabilities in this encoding have a 3-bit field for CT-field values. This is used to encode 15 different sealing types by distinguishing between executable and non-executable sealed capabilities as follows:

X	CT Value	Decoded Type
1	000	0 (Unsealed)
1	1 .. 7	1 .. 7
0	000	0 (Unsealed)
0	1 .. 7	9 .. 15 (CT + 8)

Attempts to seal a capability with a type not compatible with its X-permission value will yield a result with a clear capability tag.

Recall that CT-field values 1 (inclusive) through 5 (inclusive) are given semantics if the Zycheriot extension is present.

19.3.1.4. Bounds (E, B, T) Encoding

The bounds are encoded in similar, but slightly simplified way to RV64LYA, with field B and T being substituted into the address at the offset given by the exponent to form the base and top. Rather than using an EF field to indicate whether the exponent is zero or contained in the low bits of B and T the exponent is stored in its own 4-bit field, E. Since this is not large enough to store the maximum exponent of 24 required to cover the full 32-bit address space, an E value of 15 decodes as 24 with exponents 15 to 23 being invalid. The decoded exponent, e, is given by:

e = (E == 15) ? 24 : E

This means that capabilities up to nearly 8 MiB are represented with alignment requirements increasing in powers of two for increasing exponents up to 14, and all larger capabilities must have bounds aligned to 16 MiB. RV32LYenccheriot2 describes an encoding variant that removes this limitation.

For a given exponent the base and top are then computed as:

Figure 18. Decoding of the XLEN+1 wide top (t) bound

Figure 19. Decoding of the XLEN wide base (b) bound

where the corrections, c_t and c_b, are given by:

A_hi = (A[e + 8 : e]) < B ? 1 : 0
T_hi = (T < B) ? 1 : 0
c_b = -A_hi
c_t = A_hi - T_hi

These corrections work by assuming that the base is in the lower of two 2^e+9 aligned regions, and that the top and address are always greater than or equal to the base but may be in the higher region provided they are within 2^e+9 of base. These assumptions lead to the representable range being given by:

base <= address < base + (1 << (e + 9))

This must be checked by all operations that change the capability address. If this check fails the resulting capability will have its capability tag cleared. Note that this means that it is not possible to represent a capability with an address less than the base. Depending on the size of the capability some addresses above top may be representable, but in the worst case the highest representable address is equal to top (one byte beyond the end of the dereferenceable region).

19.3.1.4.1. Encoding bounds

When YBNDSW is used to set the bounds of a capability the E, B and T fields are computed from the desired base and length as follows:

// compute candidate exponent
e = 23 - count_leading_zeros(length[31 : 9])
if e > 14 then {
  e = 24
}

// extend base and top to XLEN+1 bits
base33 = 1b0 @ base
top33 = base33 + (1b0 @ length)

// extract 10-bit from base and top
b = base33[e + 9 : e]
t = top33[e + 9 : e]

// round up top if low bits are truncated
if top33[e - 1 : 0] != 0 {
  t = t + 1
}

// in case this caused length overflow use the next exponent
if t - b >= 1 << (e + 9) {
  e += 1
  if e > 14 then {
    e = 24
  }
  b = base33[e + 9 : e]
  t = top33[e + 9 : e]
  if top33[e - 1 : 0] != 0 {
    t = t + 1
  }
}

// encode E
E = (e == 24) ? 15 : e
// truncate B and T to 9 bits
B = b[8 : 0]
T = t[8 : 0]

19.3.2. Encoding of Special Capabilities

19.3.2.1. NULL Capability Encoding

The NULL capability is represented with 0 in all fields. This implies that it is unsealed, has no permissions and its exponent, base and top are 0.

Table 77. Field values of the NULL capability
Field	Value	Comment
Capability Tag	zero	Capability is not valid
R	zero	Reserved bit
G	zero	Not global
P	zeros	Grants no permissions
CT	zeros	Unsealed
E	zeros	Exponent
T	zeros	Top address bits
B	zeros	Base address bits
Address	zeros	Capability address

19.3.2.2. Root Capability Encoding

The encoding for the Root Executable and Root Data capabilities defined by RVY and the sealing root defined by Zycheriot have bounds that cover the entire associated address space. Root capabilities necessarily have set capability tags and are necessarily unsealed, Zylevels1 requires root capabilities to be Global, and we define our root capabilities as having a clear reserved bit (that is, an R field of 0). The encoded fields of root capabilities are shown in Table 78, except the P field, which depends on which root capability is being represented.

Table 78. Bounds field values of root capabilities
Field	Value	Comment
Capability Tag	one	Capability is valid
R	zero	Reserved bit
G	1	Global
P	XXXXX	Varies as per CHERIoT root capabilities table
CT	zeros	Unsealed
E	0xf	Maximum exponent
T	0x100	top = 2^XLEN
B	0x000	base 0
Address	zeros	Capability address

19.4. The RV32LYenccheriot2 CHERI Capability Encoding Scheme

This chapter describes a variation on the RV32LYenccheriot1 encoding to support all exponents using the same number of bits. This change is backwards compatible for software but enables support for more precise bounds on capabilities larger than 8MiB. Due to increased microarchitectural complexity and the limited need for precise alignment on such large capabilities on small systems, implementations may choose to support either encoding.

The changes are limited to the bounds encoding; all other aspects of the capability encoding are identical to RV32LYenccheriot1.

19.4.1. Capability Encoding

The components of a capability, except the capability tag, are encoded as shown in Figure 20.

Figure 20. Capability encoding for RV32LYenccheriot2

The only difference from RV32LYenccheriot1 is that the exponent field, E, is grown to 5 bits and the T field is shrunk to 8 bits. This allows all exponents from 0 to 24 to be represented directly, with some spare exponents available for other uses. To enable the reduced T field to be decoded to a 9-bit value it is assumed that the top bit of the length, L = T - B, is one, similar to normalised floating point numbers. To encode small capabilities with exponent zero and lengths less than 256 bytes the special E value, 31, is used to mean L[8] is zero. T[8] can then be reconstructed as follows:

T[8] = B[8] XOR (E == 31 ? 0 : 1) XOR (T[7:0] < B[7:0])

The unused exponent values (25 .. 30) are reserved for future use. Other aspects of bounds decoding remain identical to CHERIoT Capability Bounds Encoding.

Encoding the bounds for a requested base and length is similar to before but without the special cases for exponents greater than 15.

// compute candidate exponent
e = 23 - count_leading_zeros(length[31:9])

// extend base and top to XLEN+1 bits
base33 = 1b0 @ base
top33 = base33 + (1b0 @ length)

// extract 10-bit from base and top
b = base33[e + 9 : e]
t = top33[e + 9 : e]

// round up top if low bits are truncated
if top33[e - 1 : 0] != 0 {
  t = t + 1
}

// in case this caused length overflow use the next exponent
if t - b >= 1 << (e + 9) {
  e += 1
  b = base33[e + 9 : e]
  t = top33[e + 9 : e]
  if top33[e - 1 : 0] != 0 {
     t = t + 1
  }
}

// truncate B and T to 9 bits
B = b[8 : 0]
T = t[8 : 0]
// encode E
E = (e == 0 && (T - B)[8] == 0) ? 31 : e

19.4.1.1. Capability Encoding Parameter Summary

This encoding is suitable for use with RV32Y base architectures whose parametric requirements are no stronger than those given in Table 79.

Table 79. RV32LYenccheriot2 parameter summary
Parameter	Value	Comment
mw	9	Mantissa width
e	a	All exponents are supported
rc	0	Representable range between base and top only
p	pc	Compressed permission encoding
enc	Cheriot2	Encoding variant

19.5. The RV32LYenccheriot3 CHERI Capability Encoding Scheme

This chapter describes a further variation on the RV32LYenccheriot2 encoding. It is equivalent in terms of representable bounds, supported features and the number of bits used, but has different microarchitectural properties. The only observable difference for software is the encoded metadata bits of capabilities in stored in memory, which is only relevant to specialised system software such as debuggers. As such, the decision of which to use should be based on microarchitectural requirements. It is included here in order to provide flexibility to implementations.

19.5.1. Capability Encoding

The components of a capability, except the capability tag, are encoded as shown in Figure 21.

Figure 21. Capability encoding for RV32LYenccheriot3

The only difference from RV32LYenccheriot2 is that the top field, T, is replaced by a length, L. The 8-bit L is expanded to 9 bits by adding a leading implicit one bit, except in the case of the special exponent 31 where L[8] is zero. Bounds decoding can then proceed as follows:

// Decode L8 and exponent
e = (E == 31) ? 0 : E
L8 = (e == 31) ? 0 : 1

// Decode base (same as <<rv32y_cheriot_encoding1_name>>)
c_b = (A[e + 8 : e]) < B ? -1 : 0
base = ((A[31 : e + 9] + c_b) @ B) << e

// Compute top from base and length
top = base + ((L8 @ L) << e)

All other aspects of the capability encoding are identical to RV32LYenccheriot2.

19.5.1.1. Capability Encoding Parameter Summary

This encoding is suitable for use with RV32Y base architectures whose parametric requirements are no stronger than those given in Table 79. These are identical to those of RV32LYenccheriot2 except for the encoding variant parameter.

Table 80. RV32LYenccheriot3 parameter summary
Parameter	Value	Comment
mw	9	Mantissa width
e	a	All exponents are supported
rc	0	Representable range between base and top only
p	pc	Compressed permission encoding
enc	Cheriot3	Encoding variant

19.6. "Zcmp", "Zcmt" (RVY)

This chapter is not part of the v1.0 ratification package and is stable.

19.7. "Zcmp" Standard Extension For Code-Size Reduction

The push (CM.PUSH (RV32Y)) and pop (CM.POP (RV32Y), CM.POPRET (RV32Y), CM.POPRETZ (RV32Y)) instructions are redefined in Capability Pointer Mode to save/restore capability data.

The double move instructions (CM.MVSA01 (RV32Y), CM.MVA01S (RV32Y)) are redefined in Capability Pointer Mode to move capability data between registers. The saved register mapping is as shown in Table 81.

Table 81. saved register mapping for Zcmp
saved register specifier	xreg	integer ABI	RV32Y ABI
0	x8	s0	s0
1	x9	s1	s1
2	x18	s2	s2
3	x19	s3	s3
4	x20	s4	s4
5	x21	s5	s5
6	x22	s6	s6
7	x23	s7	s7

19.7.1. CM.PUSH (RV32Y)

Synopsis: Create stack frame (CM.PUSH): store the return address register and 0 to 12 saved registers to the stack frame, optionally allocate additional stack space. 16-bit encoding.
Mnemonic: cm.push {creg_list}, -stack_adj
Encoding

Assembly Syntax:

cm.push {reg_list},  -stack_adj
cm.push {xreg_list}, -stack_adj

The variables used in the assembly syntax are defined below.

RV32Y:

switch (rlist){
  case  4: {reg_list="ra";         xreg_list="x1";}
  case  5: {reg_list="ra, s0";     xreg_list="x1, x8";}
  case  6: {reg_list="ra, s0-s1";  xreg_list="x1, x8-x9";}
  case  7: {reg_list="ra, s0-s2";  xreg_list="x1, x8-x9, x18";}
  case  8: {reg_list="ra, s0-s3";  xreg_list="x1, x8-x9, x18-x19";}
  case  9: {reg_list="ra, s0-s4";  xreg_list="x1, x8-x9, x18-x20";}
  case 10: {reg_list="ra, s0-s5";  xreg_list="x1, x8-x9, x18-x21";}
  case 11: {reg_list="ra, s0-s6";  xreg_list="x1, x8-x9, x18-x22";}
  case 12: {reg_list="ra, s0-s7";  xreg_list="x1, x8-x9, x18-x23";}
  case 13: {reg_list="ra, s0-s8";  xreg_list="x1, x8-x9, x18-x24";}
  case 14: {reg_list="ra, s0-s9";  xreg_list="x1, x8-x9, x18-x25";}
  //note - to include s10, s11 must also be included
  case 15: {reg_list="ra, s0-s11"; xreg_list="x1, x8-x9, x18-x27";}
  default: reserved();
}
stack_adj      = stack_adj_base + spimm * 16;

RV32Y:

switch (rlist) {
  case  4.. 5: stack_adj_base =  16;
  case  6.. 7: stack_adj_base =  32;
  case  8.. 9: stack_adj_base =  48;
  case 10..11: stack_adj_base =  64;
  case 12..13: stack_adj_base =  80;
  case     14: stack_adj_base =  96;
  case     15: stack_adj_base = 112;
}

Valid values:
switch (rlist) {
  case  4.. 5: stack_adj = [ 16| 32| 48| 64];
  case  6.. 7: stack_adj = [ 32| 48| 64| 80];
  case  8.. 9: stack_adj = [ 48| 64| 80| 96];
  case 10..11: stack_adj = [ 64| 80| 96|112];
  case 12..13: stack_adj = [ 80| 96|112|128];
  case     14: stack_adj = [ 96|112|128|144];
  case     15: stack_adj = [112|128|144|160];
}

rlist values 0 to 3 are reserved for a future EABI variant

Description: Create stack frame, store capability registers as specified in creg_list using SY semantics.

Optionally allocate additional multiples of 16-byte stack space in sp.

All accesses are authorized against sp.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.7.2. CM.POP (RV32Y)

Synopsis: Destroy stack frame (CM.POP): load the return address register and 0 to 12 saved registers from the stack frame, deallocate the stack frame. 16-bit encodings.
Mnemonic: cm.pop {creg_list}, -stack_adj
Encoding

Assembly Syntax:

cm.pop {reg_list},  stack_adj
cm.pop {xreg_list}, stack_adj

The variables used in the assembly syntax are defined below.

RV32Y:

switch (rlist){
  case  4: {reg_list="ra";         xreg_list="x1";}
  case  5: {reg_list="ra, s0";     xreg_list="x1, x8";}
  case  6: {reg_list="ra, s0-s1";  xreg_list="x1, x8-x9";}
  case  7: {reg_list="ra, s0-s2";  xreg_list="x1, x8-x9, x18";}
  case  8: {reg_list="ra, s0-s3";  xreg_list="x1, x8-x9, x18-x19";}
  case  9: {reg_list="ra, s0-s4";  xreg_list="x1, x8-x9, x18-x20";}
  case 10: {reg_list="ra, s0-s5";  xreg_list="x1, x8-x9, x18-x21";}
  case 11: {reg_list="ra, s0-s6";  xreg_list="x1, x8-x9, x18-x22";}
  case 12: {reg_list="ra, s0-s7";  xreg_list="x1, x8-x9, x18-x23";}
  case 13: {reg_list="ra, s0-s8";  xreg_list="x1, x8-x9, x18-x24";}
  case 14: {reg_list="ra, s0-s9";  xreg_list="x1, x8-x9, x18-x25";}
  //note - to include s10, s11 must also be included
  case 15: {reg_list="ra, s0-s11"; xreg_list="x1, x8-x9, x18-x27";}
  default: reserved();
}
stack_adj      = stack_adj_base + spimm * 16;

RV32Y:

switch (rlist) {
  case  4.. 5: stack_adj_base =  16;
  case  6.. 7: stack_adj_base =  32;
  case  8.. 9: stack_adj_base =  48;
  case 10..11: stack_adj_base =  64;
  case 12..13: stack_adj_base =  80;
  case     14: stack_adj_base =  96;
  case     15: stack_adj_base = 112;
}

Valid values:
switch (rlist) {
  case  4.. 5: stack_adj = [ 16| 32| 48| 64];
  case  6.. 7: stack_adj = [ 32| 48| 64| 80];
  case  8.. 9: stack_adj = [ 48| 64| 80| 96];
  case 10..11: stack_adj = [ 64| 80| 96|112];
  case 12..13: stack_adj = [ 80| 96|112|128];
  case     14: stack_adj = [ 96|112|128|144];
  case     15: stack_adj = [112|128|144|160];
}

rlist values 0 to 3 are reserved for a future EABI variant

Description: Load capability registers as specified in creg_list using LY semantics.

Deallocate stack frame.

All accesses are authorized by sp.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.7.3. CM.POPRET (RV32Y)

Synopsis: Destroy stack frame (CM.POPRET): load the return address register and 0 to 12 saved registers from the stack frame, deallocate the stack frame. Return through the return address register. 16-bit encodings.
Mnemonic: cm.popret {creg_list}, -stack_adj
Encoding

Assembly Syntax:

cm.popret {reg_list},  stack_adj
cm.popret {xreg_list}, stack_adj

The variables used in the assembly syntax are defined below.

RV32Y:

switch (rlist){
  case  4: {reg_list="ra";         xreg_list="x1";}
  case  5: {reg_list="ra, s0";     xreg_list="x1, x8";}
  case  6: {reg_list="ra, s0-s1";  xreg_list="x1, x8-x9";}
  case  7: {reg_list="ra, s0-s2";  xreg_list="x1, x8-x9, x18";}
  case  8: {reg_list="ra, s0-s3";  xreg_list="x1, x8-x9, x18-x19";}
  case  9: {reg_list="ra, s0-s4";  xreg_list="x1, x8-x9, x18-x20";}
  case 10: {reg_list="ra, s0-s5";  xreg_list="x1, x8-x9, x18-x21";}
  case 11: {reg_list="ra, s0-s6";  xreg_list="x1, x8-x9, x18-x22";}
  case 12: {reg_list="ra, s0-s7";  xreg_list="x1, x8-x9, x18-x23";}
  case 13: {reg_list="ra, s0-s8";  xreg_list="x1, x8-x9, x18-x24";}
  case 14: {reg_list="ra, s0-s9";  xreg_list="x1, x8-x9, x18-x25";}
  //note - to include s10, s11 must also be included
  case 15: {reg_list="ra, s0-s11"; xreg_list="x1, x8-x9, x18-x27";}
  default: reserved();
}
stack_adj      = stack_adj_base + spimm * 16;

RV32Y:

switch (rlist) {
  case  4.. 5: stack_adj_base =  16;
  case  6.. 7: stack_adj_base =  32;
  case  8.. 9: stack_adj_base =  48;
  case 10..11: stack_adj_base =  64;
  case 12..13: stack_adj_base =  80;
  case     14: stack_adj_base =  96;
  case     15: stack_adj_base = 112;
}

Valid values:
switch (rlist) {
  case  4.. 5: stack_adj = [ 16| 32| 48| 64];
  case  6.. 7: stack_adj = [ 32| 48| 64| 80];
  case  8.. 9: stack_adj = [ 48| 64| 80| 96];
  case 10..11: stack_adj = [ 64| 80| 96|112];
  case 12..13: stack_adj = [ 80| 96|112|128];
  case     14: stack_adj = [ 96|112|128|144];
  case     15: stack_adj = [112|128|144|160];
}

rlist values 0 to 3 are reserved for a future EABI variant

Description: Load capability registers as specified in creg_list using LY semantics.

Deallocate stack frame.

Return by calling JALR (RVY) to ra.

All data accesses are authorized by sp.

The return destination is authorized by ra.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.7.4. CM.POPRETZ (RV32Y)

Synopsis: Destroy stack frame (CM.POPRETZ): load the return address register and register 0 to 12 saved registers from the stack frame, deallocate the stack frame. Move zero into argument register zero. Return through the return address register. 16-bit encoding.
Mnemonic: cm.popretz {creg_list}, -stack_adj
Encoding

Assembly Syntax:

cm.popretz {reg_list},  stack_adj
cm.popretz {xreg_list}, stack_adj

The variables used in the assembly syntax are defined below.

RV32Y:

switch (rlist){
  case  4: {reg_list="ra";         xreg_list="x1";}
  case  5: {reg_list="ra, s0";     xreg_list="x1, x8";}
  case  6: {reg_list="ra, s0-s1";  xreg_list="x1, x8-x9";}
  case  7: {reg_list="ra, s0-s2";  xreg_list="x1, x8-x9, x18";}
  case  8: {reg_list="ra, s0-s3";  xreg_list="x1, x8-x9, x18-x19";}
  case  9: {reg_list="ra, s0-s4";  xreg_list="x1, x8-x9, x18-x20";}
  case 10: {reg_list="ra, s0-s5";  xreg_list="x1, x8-x9, x18-x21";}
  case 11: {reg_list="ra, s0-s6";  xreg_list="x1, x8-x9, x18-x22";}
  case 12: {reg_list="ra, s0-s7";  xreg_list="x1, x8-x9, x18-x23";}
  case 13: {reg_list="ra, s0-s8";  xreg_list="x1, x8-x9, x18-x24";}
  case 14: {reg_list="ra, s0-s9";  xreg_list="x1, x8-x9, x18-x25";}
  //note - to include s10, s11 must also be included
  case 15: {reg_list="ra, s0-s11"; xreg_list="x1, x8-x9, x18-x27";}
  default: reserved();
}
stack_adj      = stack_adj_base + spimm * 16;

RV32Y:

switch (rlist) {
  case  4.. 5: stack_adj_base =  16;
  case  6.. 7: stack_adj_base =  32;
  case  8.. 9: stack_adj_base =  48;
  case 10..11: stack_adj_base =  64;
  case 12..13: stack_adj_base =  80;
  case     14: stack_adj_base =  96;
  case     15: stack_adj_base = 112;
}

Valid values:
switch (rlist) {
  case  4.. 5: stack_adj = [ 16| 32| 48| 64];
  case  6.. 7: stack_adj = [ 32| 48| 64| 80];
  case  8.. 9: stack_adj = [ 48| 64| 80| 96];
  case 10..11: stack_adj = [ 64| 80| 96|112];
  case 12..13: stack_adj = [ 80| 96|112|128];
  case     14: stack_adj = [ 96|112|128|144];
  case     15: stack_adj = [112|128|144|160];
}

rlist values 0 to 3 are reserved for a future EABI variant

Description: Load capability registers as specified in creg_list using LY semantics.

Deallocate stack frame.

Move zero into a0.

Return by calling JALR (RVY) to ra.

All data accesses are authorized by sp.

The return destination is authorized by ra.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.7.5. CM.MVSA01 (RV32Y)

Synopsis: CM.MVSA01: Move argument registers 0 and 1 into two saved registers. 16-bit encoding.
Mnemonic: cm.mvsa01 c1s', c2s'
Encoding

The encoding uses sreg number specifiers instead of xreg number specifiers to save encoding space. The saved register encoding is shown in Table 81.

Description: Atomically move two saved capability registers s0-s7 into a0 and a1.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.7.6. CM.MVA01S (RV32Y)

Synopsis: Move two saved registers into argument registers 0 and 1. 16-bit encoding.
Mnemonic: cm.mva01s c1s', c2s'
Encoding

The encoding uses sreg number specifiers instead of xreg number specifiers to save encoding space. The saved register encoding is shown in Table 81.

Description: Atomically move two capability registers a0 and a1 into s0-s7.
Prerequisites: Zca, RVY, Zcmp
Operation

SAIL TO BE UPDATED

19.8. "Zcmt" Standard Extension For Code-Size Reduction

The table jump instructions (CM.JT (RV32Y), CM.JALT (RV32Y)) are not redefined in Capability Pointer Mode to have capabilities in the jump table. This is to prevent the code-size growth caused by doubling the size of the jump table.

In the future, new jump table modes or new encodings can be added to have capabilities in the jump table.

The jump vector table CSR jvt (RVY) is a full capability so that it can only be configured to point to accessible memory. All accesses to the jump table are checked against jvt (RVY) in Capability Pointer Mode, and against pc bounds in Integral Pointer Mode. This allows the jump table to be accessed when the pc bounds are set narrowly to the local function only in Capability Pointer Mode.

In Capability Pointer Mode the instruction fetch bounds check is authorized by two different capabilities - jvt (RVY) for the table access and pc for the CM.JALT (RV32Y)/CM.JT (RV32Y) instruction, and target instruction.

In Capability Pointer Mode the implementation does not need to expand and bounds check against jvt (RVY) on every access, it is sufficient to decode the valid accessible range of entries after every write to jvt (RVY), and then check that the accessed entry is in that range.

19.8.1. Jump Vector Table CSR (jvt)

The Zcmt jvt CSR is extended to be a full capability.

Figure 22. Jump Vector Table Capability register

All instruction fetches from the jump vector table are checked against jvt (RVY) in Capability Pointer Mode. In Integral Pointer Mode the address field gives the base address of the table, and the access is checked against pc bounds.

See CM.JALT (RV32Y), CM.JT (RV32Y).

If the access to the jump table succeeds, then the instructions execute as follows:

CM.JT (RV32Y) executes as J or AUIPC+JR
CM.JALT (RV32Y) executes as JAL or AUIPC+JALR

As a result the capability metadata is retained in pc during execution.

19.8.2. CM.JALT (RV32Y)

Synopsis: Jump via table with link (CM.JALT), 16-bit encodings
Mnemonic (RV32): cm.jalt index
Encoding

For this encoding to decode as CM.JALT (RV32Y), index≥32, otherwise it decodes as CM.JT (RV32Y).

Description (RV32Y): Redirect instruction fetch via the jump table defined by the indexing via jvt.address+ index*XLEN/8, checking every byte of the jump table access against jvt (RVY) bounds (not against pc) and requiring X-permission. Link to cra.

The target pc is calculated by replacing the current pc address with the value read from the jump table, and is updated using the semantics of the YADDRW instruction.

If the jvt (RVY) check fails, then set the capability tag of the target pc to zero.

If Zcherihybrid is implemented and the CHERI Pointer Mode is Integral Pointer Mode then the table access is checked against pc bounds.

Permissions (RV32Y): Requires jvt (RVY) to have its capability tag set, not be sealed, have X-permission and for the full XLEN-wide table access to be in jvt (RVY) bounds.
Prerequisites: Zca, RVY, Zcmt
Operation

SAIL TO BE UPDATED

19.8.3. CM.JT (RV32Y)

Synopsis: Jump via table with link (CM.JT), 16-bit encodings
Mnemonic (RV32): cm.jt index
Encoding

For this encoding to decode as CM.JT (RV32Y), index<32, otherwise it decodes as CM.JALT (RV32Y).

Description (RV32Y): Redirect instruction fetch via the jump table defined by the indexing via jvt.address+ index*XLEN/8, checking every byte of the jump table access against jvt (RVY) bounds (not against pc) and requiring X-permission.

The target pc is calculated by replacing the current pc address with the value read from the jump table, and is updated using the semantics of the YADDRW instruction.

If the jvt (RVY) check fails, then set the capability tag of the target pc to zero.

If Zcherihybrid is implemented and the CHERI Pointer Mode is Integral Pointer Mode then the table access is checked against pc bounds.

Permissions (RV32Y): Requires jvt (RVY) to have its capability tag set, not be sealed, have X-permission and for the full XLEN-wide table access to be in jvt (RVY) bounds.
Prerequisites: Zca, RVY, Zcmt
Operation

SAIL TO BE UPDATED

20. Placeholder references to the unprivileged spec

This chapter only exists for the standalone document to allow references to resolve.

RV32I: See Chapter RV32I Base Integer Instruction Set in (RISC-V, 2023).
General purpose registers: See Chapter RV32I Base Integer Instruction Set in (RISC-V, 2023).
Load and Store Instructions: See Chapter RV32I Base Integer Instruction Set in (RISC-V, 2023).
Zcmt: See Chapter "Zc*" Extension for Code Size Reduction in (RISC-V, 2023).
Zcmp: See Chapter "Zc*" Extension for Code Size Reduction in (RISC-V, 2023).
jvt: See Chapter "Zc*" Extension for Code Size Reduction in (RISC-V, 2023).

Chapters for the privileged specification

21. "Machine-Level ISA (RVY)" Extensions, Version 1.0

This chapter describes integration of RVY with the RISC-V privileged architecture.

21.1. Machine-Level CSRs widened by RVY

RVY widens some M-mode CSRs so that they can hold capabilities. Additionally, RVY allocates some previously reserved bits in existing CSRs.

mconfigptr is not widened, despite representing an address, as it is solely for use by low-level system software and is not interpreted by hardware. Such software can be expected to hold suitable Root capabilities from which it can derive a capability to the address in this register.

21.1.1. Machine Trap Vector Base Address Capability Register (mtvec)

The mtvec register is widened to a capability. Its reset value is nominally a Root Executable capability.

mtvec (RVY) exists in all CHERI implementations, and so may be used as a source of a Root Executable capability after reset.

Figure 23. Machine-mode trap-vector base-capability register

The fields in the metadata are WARL as many fields can be implemented as constants.

Examples of WARL behavior include always setting X-permission to 1 and setting the reserved fields to zero, otherwise the capability is unusable. Another example is to partially or fully restrict the bounds to constant values.

Care must be taken however that suitable root capabilities are available to software after reset if this CSR does not represent one.

When a trap is taken into machine mode, the capability in mtvec (RVY) is installed in the pc, with the address determined by the standard mtvec behavior.

As in the standard mtvec definition, the address observed through mtvec (RVY) is not simply the stored base address:

mtvec.address[1:0] holds the MODE field, which selects the trap-vector mode and does not form part of the trap-vector base address.
When MODE=Vectored, the effective trap-vector address is the base address plus four times the interrupt cause.
An explicit CSR read returns the base address with the MODE field in mtvec.address[1:0].

HICAUSE is defined to be the largest interrupt cause value that the implementation can write to xcause when an interrupt is taken. The set of addresses that can therefore be observed through mtvec (RVY) ranges from a minimum of mtvec.address & ~3 to a maximum of (mtvec.address & ~3) + 4 x HICAUSE.

Every read of mtvec (RVY), whether implicit (taking a trap) or explicit (a CSR read), forms the observed address using the semantics of the YADDRW instruction, either inserting the MODE bits or, in Vectored mode, adding the cause offset. This has two consequences:

All of the observable addresses must be represented exactly. Software must ensure this property holds when writing mtvec (RVY), and the hardware sets the capability tag to zero if any observable address cannot be represented exactly.
A sealed capability written to mtvec (RVY) is always read back with its capability tag cleared, because the YADDRW address update invalidates a sealed capability even when the address value is unchanged.

The capability in mtvec (RVY) is not unsealed when it is written to pc, unlike other CSRs such as mepc (RVY).

Following the standard mtvec rules, mtvec (RVY) need not be able to hold all possible invalid addresses.

21.1.2. Machine Environment Configuration (`menvcfg`) Register

Zyhybrid adds the Y bit to menvcfg. The Y bit can be used to disable or enable RVY for privilege modes below M-mode, as described in Chapter 24, "Zyhybrid for Privileged Architectures" Extension, Version 1.0.

21.1.3. Machine Scratch Capability Register (mscratch)

The mscratch register is widened to a capability.

The reset value of the capability tag of this CSR is zero, the reset values of the metadata and address fields are UNSPECIFIED.

Figure 24. Machine-mode scratch capability register

21.1.4. Machine Exception Program Counter Capability (mepc)

The mepc register is widened to a capability. Its reset value is nominally a Root Executable capability.

Figure 25. Machine exception program counter capability register

mepc.address is the mepc CSR, and so follows the standard rules, meaning that:

mepc.address[0]=0, and
mepc.address[1]=0 when IALIGN is fixed to 32
mepc.address[1] reads as zero when IALIGN is programmable and is set to 32

As listed above for mtvec (RVY), this means that mepc.address can represent multiple different values. Therefore, software must ensure that all possible values are represented exactly on writing, otherwise the hardware sets the written capability tag to zero.

Sealed capabilities may be written to mepc (RVY). Set the capability tag to zero on writing if:

mepc.address[0]=1, or
mepc.address[1]=1 when IALIGN=32

In the following case the value of the capability tag observable in the CSR depends on the value of IALIGN:

mepc.ct≠0 && mepc.tag=1, and
mepc.address[1]=1 and IALIGN=16 when writing the CSR

The observable capability tag is zero when IALIGN=32 and one when IALIGN=16.

When a trap is taken into M-mode, the pc is written to mepc.address following the standard behavior. The capability tag and metadata of the pc are also written to mepc (RVY).

On execution of an MRET (RVY) instruction, the capability value from mepc (RVY) is unsealed if it is a Sealed entry point capability and written to pc.

mepc (RVY) follows the rule from mepc about not needing to be able to hold all possible invalid addresses.

21.2. Machine-Level CSRs added by RVY

21.2.1. Machine Thread Identifier Capability (mtidc)

The mtidc register is used to identify the current software thread in machine mode, using the method defined in the section for the unprivileged utidc CSR.

The reset value of the capability tag of this CSR is zero, the reset values of the metadata and address fields are UNSPECIFIED.

Figure 26. Machine thread identifier capability register

21.3. Machine-Level CSRs modified by RVY

21.3.1. Machine ISA (`misa`) Register

When the base ISA is RVY, the Y bit is one.

misa.Y is a WARL field. If any of pc, xtvec, or xepc is not currently configured as a root capability then zero is not a legal value for misa.Y.

The list also includes ddc if Zyhybrid is implemented.

This restriction ensures M-mode software cannot drop CHERI checks by modifying misa once they have been configured. A simpler conforming implementation could prevent setting misa.Y to zero if the capability metadata of any of these registers has ever been written to, rather than dynamically checking the current metadata.

Either misa.I or misa.E is also set to indicate whether 32 or 16 general purpose registers are present.

Implementations which allow misa.C to be writable need to legalize xepc on reading if the misa.C value has changed since the value was written as this can cause the read value of bit [1] to change state.

21.3.2. Machine Status Registers (mstatus)

mstatus includes the Userspace Capability Read Generation (UYRG) bit, the Supervisor version (SYRG) and the enable bit (YRGE). These fields reside in mstatus; the other CSRs provide the standard restricted views of them.

The definitions of the following fields are furnished by the Svyrg extension:

Userspace Capability Read Generation (UYRG)
Supervisor Capability Read Generation (SYRG)
The Capability Read Generation Enable bit (YRGE).

Figure 27. Machine-mode status (mstatus) register for RV64Y

mstatus for RVY has additional restrictions on XLEN and endianness:

The MXL field, and the SXL and UXL fields that control the value of XLEN for S-mode and U-mode, must be read-only in RVY implementations, with SXL and UXL equal to MXL. Only 1 and 2 are supported values.
The MBE, SBE, and UBE fields that control the memory system endianness for M-mode, S-mode, and U-mode must be read-only in RVY implementations. SBE and UBE must be read only and equal to MBE, if S-mode or U-mode, respectively, is implemented, or read-only zero otherwise.

Changing XLEN or endianness would change the interpretation of all in-memory capabilities, so allowing these fields to change at runtime is prohibited.

These restrictions may be relaxed by a future extension. Such an extension is likely to enforce the constraint that any privilege level with XLEN less than MXLEN has CHERI disabled.

MXR has no effect on the CHERI permission checking.

CHERI does not need to make use of execute only memory for security reasons, and so MXR has no relevance.

21.3.3. Machine Status High Bits Register (mstatush)

Spec to be written for RV32

21.3.4. Machine Cause Register (mcause)

RVY adds new exception codes for CHERI exceptions that mcause must be able to represent. The new exception codes are listed in Table 82 and the priorities are listed in Table 83.

Table 82. Synchronous exception causes for RVY.
Exception code	Description
32	CHERI Instruction Access Fault
33	CHERI Load Access Fault
34	CHERI Store/AMO Access Fault
35	CHERI Load Capability Fault¹
36	CHERI Store/AMO Page Fault¹

¹ These exceptions are defined by the Svyrg extension which is available for RV64Y only.

Table 83. Synchronous exception priority in decreasing priority order for RVY.
Priority	Exc.Code	Description
Highest	3	Instruction address breakpoint
	32	Prior to instruction address translation: CHERI Instruction Access Fault due to pc checks (capability tag, sealed, execute permission, bounds¹)
	12, 1	During instruction address translation: First encountered page fault or access fault
	1	With physical address for instruction: Instruction access fault
	2 0 8,9,11 3 3	Illegal instruction Instruction address misaligned Environment call Environment break Load/store/AMO address breakpoint
	33,34	Prior to address translation for an explicit memory access: CHERI Load Access Fault, CHERI Store/AMO Access Fault due to capability checks (capability tag, sealed, permissions, bounds)
	5,7	Load/store/AMO capability address misaligned³
	4,6	Optionally: Load/store/AMO address misaligned
	36, 13, 15, 5, 7	During address translation for an explicit memory access: First encountered CHERI Store/AMO Page Fault, page fault or access fault
	5,7	With physical address for an explicit memory access: Load/store/AMO access fault
	4,6	If not higher priority: Load/store/AMO address misaligned
Lowest	35	CHERI Load Capability Fault²

¹ pc bounds are checked against all bytes of fetched instructions. If the instructions could not be decoded to determine the length, then the pc bounds check is made against the minimum-sized instruction supported by the implementation which can be executed, when prioritizing against Instruction Access Faults.

² CHERI Load Capability Fault is the lowest priority as determining whether to raise the exception may include checking the loaded capability tag.

³ Misaligned capability accesses raise access faults instead of misaligned faults since they cannot be emulated in software.

Full details of CHERI Instruction Access Fault, CHERI Load Access Fault and CHERI Store/AMO Access Fault are in Table 84.

21.3.5. Machine Trap Delegation Register (medeleg)

Bits 32,33,34,35,36 of medeleg refer to valid CHERI exceptions and so can be used to delegate CHERI exceptions to supervisor mode.

21.3.6. "Smstateen" Integration

The TIDC bit in mstateen0 controls access to the stidc, vstidc, and utidc CSRs.

Figure 28. Machine State Enable 0 Register (mstateen0)

21.4. CHERI Exception handling

CHERI faults are typically higher priority than standard RISC-V faults. E.g., CHERI faults on the pc are higher priority than any other fault affecting the program counter such as Instruction Access Fault.

The overall priority is shown in Table 83.

Table 84 shows the detailed exception checks added by CHERI.

In the table below auth_cap is rs1, unless Zyhybrid is implemented and executing in Integral Pointer Mode where it is ddc.

Table 84. Valid CHERI exception combination description
Instructions	Xcause	Description	Check
All instructions have these exception checks first
All	32	CHERI Instruction Access Fault	pc capability tag is zero
All	32	CHERI Instruction Access Fault	pc is sealed
All	32	CHERI Instruction Access Fault	pc does not grant X-permission
All	32	CHERI Instruction Access Fault	Any byte of the current instruction is out of pc bounds¹
All	32	CHERI Instruction Access Fault	pc fails any integrity checks.
CSR/Xret additional exception check
CSR*, MRET (RVY), SRET (RVY), CBO.INVAL (RVY)	2	Illegal instruction	pc does not grant ASR-permission when required for CSR access or execution of MRET (RVY), SRET (RVY) or CBO.INVAL (RVY)
Load additional exception checks
All loads	33	CHERI Load Access Fault	`auth_cap` capability tag is zero
All loads	33	CHERI Load Access Fault	`auth_cap` is sealed
All loads	33	CHERI Load Access Fault	`auth_cap` does not grant R-permission
All loads	33	CHERI Load Access Fault	Any byte of the load access is out of `auth_cap` bounds²
All loads	33	CHERI Load Access Fault	`auth_cap` fails any integrity checks.
Capability loads	5³	Load access fault	Misaligned capability load
Store/atomic/cache-block-operation additional exception checks
All stores, all atomics, all CBOs	34	CHERI Store/AMO Access Fault	`auth_cap` capability tag is zero
All stores, all atomics, all CBOs	34	CHERI Store/AMO Access Fault	`auth_cap` is sealed
All stores, CBO.ZERO	34	CHERI Store/AMO Access Fault	`auth_cap` does not grant W-permission
All atomics, CBO.CLEAN, CBO.FLUSH, CBO.INVAL	34	CHERI Store/AMO Access Fault	`auth_cap` does not grant both R-permission and W-permission
All stores, all atomics	34	CHERI Store/AMO Access Fault	Any byte of the access is out of `auth_cap` bounds²
CBO.ZERO, CBO.INVAL	34	CHERI Store/AMO Access Fault	Any byte of the cache block is out of `auth_cap` bounds²
CBO.CLEAN, CBO.FLUSH	34	CHERI Store/AMO Access Fault	All bytes of the cache block are out of `auth_cap` bounds²
All stores, all atomics, all CBOs	34	CHERI Store/AMO Access Fault	`auth_cap` fails any integrity checks.
Capability stores	7³	Store access fault	Misaligned capability store

¹ The pc bounds checks includes the case where the bounds could not be decoded and so a minimum-sized instruction is not fully in bounds.

² The load/store bounds check includes the case where the bounds could not be decoded as they are malformed.

³ Misaligned capability accesses raise access faults instead of misaligned faults since they cannot be emulated in software.

21.5. CHERI Exceptions and speculative execution

ARC-NOTE: Please advise whether this section should remain in this chapter, or whether it should be expanded and placed in a supporting document.

CHERI adds architectural guarantees that can prove to be micro-architecturally useful for reducing side-channels caused by speculative-execution.
Programmers should be able to rely on instructions that fail CHERI permission checks not to update microarchitectural state, e.g., not updating cache state for a load which fails a bound check.
When implementing an RVY hart, micro-architects need to carefully consider the interaction of raising exceptions late in the pipeline, speculation windows and the resulting side-channel attacks.

21.6. Physical Memory Attributes (PMA)

Typically, only parts of the entire memory space need to support CHERI capability tags. Therefore, Physical Memory Attributes are included indicating whether a memory region allows storing CHERI capability tags and to define the behavior when attempting to access them.

There are three levels of support:

Table 85. CHERI PMAs
PMA	Load Behavior	Store Behavior	Comment
CHERI Capability Tag	Load capability tag	Store capability tag	Tagged memory supporting capability tags
CHERI Capability Tag Strip	Load zero capability tag	Ignore stored capability tag	No support for capability tags, ignore them
CHERI Capability Tag Store Fault	Load zero capability tag	Store/AMO Access Fault on capability tag¹	No support for capability tags, trap on storing one

¹ The access fault is triggered on all capability stores or atomics such as SY or AMOSWAP.Y when C-permission and W-permission are granted and the to-be-stored capability tag is set to one. For SC.Y, the access fault is raised if the to-be-stored capability tag is set in rs2, independent of the reservation outcome. This matches the semantics of SC.* with regard to pte.d (see also Capability Dirty Tracking).

Only memory regions that have the CHERI Capability Tag PMA require storage for capability tags.

21.7. Modified MRET Behavior

MRET (RVY) reads the full YLEN bits of the mepc (RVY) register and unseals it prior to exception return if it is sealed as a Sealed entry point capability.

21.8. Machine-mode YLEN CSR Summary

Table 86 shows all YLEN Machine-mode CSRs added or modified by RVY.

Table 86. All RVY Machine-Mode CSRs.
RVY CSR	Extension	Width	Address	Permissions	Reset Value	Description
mscratch (RVY)	M-mode	YLEN	0x340	MRW, ASR-permission	tag=0, otherwise specified by the platform	Machine Scratch Capability
mtvec (RVY)	M-mode	YLEN	0x305	MRW, ASR-permission	Nominally Root Executable	Machine Trap-Vector Base-Address Capability
mepc (RVY)	M-mode	YLEN	0x341	MRW, ASR-permission	Nominally Root Executable	Machine Exception Program Counter Capability
mtidc	RVY	YLEN	0x780	Read: M, Write: M, ASR-permission	tag=0, otherwise specified by the platform	Machine Thread ID

Where reset values are specified as a Root Executable or Root Data they are the maximum possible values. The platform may reset those CSRs to a smaller memory range, or to have fewer permissions.

Table 87. Action taken on writing to YLEN-bit and extended Machine-Mode CSRs
RVY CSR	Extension	Width	Action on XLEN write	Action on YLEN write
mscratch (RVY)	M-mode	YLEN	Update the CSR using YADDRW.	Direct write
mtvec (RVY)	M-mode	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change, including the MODE field in the address for simplicity. Vector range check if vectored mode is programmed.	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change, including the MODE field in the address for simplicity. Vector range check if vectored mode is programmed.
mepc (RVY)	M-mode	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change.	Apply Invalid address conversion and update the CSR with the result if the address changed, direct write if address didn’t change
mtidc	RVY	YLEN	Update the CSR using YADDRW.	Direct write

Code pointers and data pointers in Table 88 are WARL CSRs that do not need to store full 64-bit addresses on RV64, and so may be subject to Invalid address conversion on writing.

Table 88. YLEN-wide Machine-Mode CSRs storing code pointers or data pointers
RVY CSR	Extension	Code Pointer	Data Pointer	Unseal On Execution
mtvec (RVY)	M-mode	✔
mepc (RVY)	M-mode	✔		✔

22. "Supervisor-Level ISA (RVY)" Extensions, Version 1.0

22.1. Supervisor-Level CSRs widened by RVY

22.1.1. Supervisor Trap Vector Base Address Capability Register (stvec)

The stvec register is widened to a capability.

When the S-mode execution environment starts, the value is nominally the Root Executable capability.

Figure 29. Supervisor trap-vector base-capability register

The handling of stvec (RVY) is identical to mtvec (RVY), but in supervisor mode.

22.1.2. Supervisor Status Registers (sstatus)

Figure 30. Supervisor-mode status (sstatus) register when SXLEN=64

sstatus makes visible the UYRG, SYRG and YRGE fields added to mstatus (RVY).

22.1.3. Status Status High Bits Register (sstatush)

Spec to be written for RV32

22.1.4. Supervisor Scratch Capability Register (sscratch)

The sscratch register is widened to a capability.

At the start of the S-mode execution environment, the value of the capability tag of this CSR is zero and the values of the metadata and address fields are UNSPECIFIED.

Figure 31. Supervisor scratch capability register

22.1.5. Supervisor Exception Program Counter Capability (sepc)

The sepc register is widened to a capability.

When the S-mode execution environment starts, the value is nominally the Root Executable capability.

The handling of sepc (RVY) is identical to mepc (RVY), but in supervisor mode (i.e. applying to SRET (RVY)).

Figure 32. Supervisor exception program counter capability register

22.2. Supervisor-Level CSRs added by RVY

22.2.1. Supervisor Thread Identifier Capability (stidc)

The stidc register is used to identify the current software thread in supervisor mode, using the method defined in the section for the unprivileged utidc CSR.

At the start of the S-mode execution environment, the value of the capability tag of this CSR is zero and the values of the metadata and address fields are UNSPECIFIED.

Figure 33. Supervisor thread identifier capability register

22.3. Supervisor-Level CSRs modified by RVY

22.3.1. Supervisor environment configuration register (senvcfg)

Zyhybrid adds the Y bit to senvcfg. The Y bit can be used to disable or enable RVY for U-mode, as described in Chapter 24, "Zyhybrid for Privileged Architectures" Extension, Version 1.0.

Figure 34. Supervisor environment configuration register (senvcfg)

22.3.2. Supervisor Cause Register (scause)

RVY adds new exception codes for CHERI exceptions as in mcause (RVY).

22.3.3. "Ssstateen" Integration

The TIDC bit in sstateen0 controls access to the utidc CSR. See utidc for a description of the usage.

Figure 35. Supervisor State Enable 0 Register (sstateen0)

22.4. Modified SRET Behavior

SRET (RVY) reads the full YLEN bits of the sepc (RVY) register and unseals it prior to exception return if it is sealed as a Sealed entry point capability.

22.4.1. SRET (RVY)

See MRET (RVY).

22.4.2. MRET (RVY)

Synopsis: Trap Return (MRET, SRET)
Mnemonics: mret
sret
Encoding

Description

Return from machine mode (MRET) or supervisor mode (SRET) trap handler.

MRET unseals mepc (RVY) and writes the result into pc.

SRET unseals sepc (RVY) and writes the result into pc.

Exceptions

An illegal instruction exception is raised when pc does not grant ASR-permission.

ASR-permission is required as MRET and SRET require access to privileged CSRs.

Prerequisites (MRET)

Machine-Level ISA, RVY

Prerequisites (SRET)

Supervisor-Level ISA, RVY

Operation for MRET

if   cur_privilege() != Machine
then { handle_illegal(); RETIRE_FAIL }
else if not(ext_check_xret_priv (Machine))
then { ext_fail_xret_priv(); RETIRE_FAIL }
else {
  set_next_pc(exception_handler(cur_privilege(), CTL_MRET(), PC));
  RETIRE_SUCCESS
}

Operation for SRET

let sret_illegal : bool = match cur_privilege() {
  User       => true,
  Supervisor => not(extensionEnabled(Ext_S)) | mstatus[TSR] == 0b1,
  Machine    => not(extensionEnabled(Ext_S))
};
if   sret_illegal
then { handle_illegal(); RETIRE_FAIL }
else if not(ext_check_xret_priv (Supervisor))
then { ext_fail_xret_priv(); RETIRE_FAIL }
else {
  set_next_pc(exception_handler(cur_privilege(), CTL_SRET(), PC));
  RETIRE_SUCCESS
}

22.5. Supervisor-mode YLEN CSR Summary

Table 89 shows all YLEN Supervisor-mode CSRs added or modified by RVY.

Table 89. All RVY Supervisor-Mode CSRs.
RVY CSR	Extension	Width	Address	Permissions	Reset Value	Description
sscratch (RVY)	S-mode	YLEN	0x140	SRW, ASR-permission	tag=0, otherwise specified by the platform	Supervisor Scratch Capability
stvec (RVY)	S-mode	YLEN	0x105	SRW, ASR-permission	Nominally Root Executable	Supervisor Trap-Vector Base-Address Capability
sepc (RVY)	S-mode	YLEN	0x141	SRW, ASR-permission	Nominally Root Executable	Supervisor Exception Program Counter Capability
stidc	RVY	YLEN	0x580	Read: S, Write: S, ASR-permission	tag=0, otherwise specified by the platform	Supervisor Thread ID

Table 90. Action taken on writing to YLEN-bit and extended Supervisor-Mode CSRs
RVY CSR	Extension	Width	Action on XLEN write	Action on YLEN write
sscratch (RVY)	S-mode	YLEN	Update the CSR using YADDRW.	Direct write
stvec (RVY)	S-mode	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change, including the MODE field in the address for simplicity. Vector range check if vectored mode is programmed.	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change, including the MODE field in the address for simplicity. Vector range check if vectored mode is programmed.
sepc (RVY)	S-mode	YLEN	Apply Invalid address conversion. Always update the CSR with YADDRW even if the address didn’t change.	Apply Invalid address conversion and update the CSR with the result if the address changed, direct write if address didn’t change
stidc	RVY	YLEN	Update the CSR using YADDRW.	Direct write

Code pointers and data pointers in Table 91 are WARL CSRs that do not need to store full 64-bit addresses on RV64, and so may be subject to Invalid address conversion on writing.

Table 91. YLEN-wide Supervisor-Mode CSRs storing code pointers or data pointers
RVY CSR	Extension	Code Pointer	Data Pointer	Unseal On Execution
stvec (RVY)	S-mode	✔
sepc (RVY)	S-mode	✔		✔

23. "Supervisor-Level ISA for Virtual Memory (RV64Y)" Extension, Version 1.0

CHERI checks are made on the effective address according to the current translation scheme. I.e., on the virtual address if translation is enabled or the physical address if translation is disabled.

Implicit memory accesses made by the page table walker are not subject to CHERI checks.

A future extension may add CHERI checks to the page table walker.

23.1. Limiting Capability Propagation

Page table enforcement can allow the operating system to limit the flow of capabilities between processes. It is highly desirable that a process should only possess capabilities that have been issued for that address space by the operating system. Unix processes may share memory for efficient communication, but capability pointers must not be shared across these channels into a foreign address space. An operating system might defend against this by only issuing a capability to the shared region that does not grant the load/store capability permission. However, there are circumstances where portions of general-purpose, mmapped^* memory become shared, and the operating system must prevent future capability communication through those pages. This is not possible without restructuring software, as the capability for the original allocation, which spans both shared memory and private memory, would need to be deleted and replaced with a list of distinct capabilities with appropriate permissions for each range. Such a change would not be transparent to the program. Such sharing through virtual memory is on the page granularity, so preventing capability writes with a PTE permission is a natural solution.

^* allocated using mmap

23.2. The pte.rvy field for capability flow control

As RV32Y has no field allocated in the PTE, capability loads and stores operate as normal. Therefore, the RVY field is reserved in the RV64 PTE formats only (e.g., Figure 36) for use by RV64Y extensions. The principle is that extensions, such as Svyrg may redefine the usage of the RVY field by adding an enable bit into Xstatus.

The definition of the RVY field is as follows:

Table 92. Definition of the *pte*.rvy field
Bit	Name	Comment
pte.rvy[3]	pte.y	Enable capability (Y) access
pte.rvy[2]	reserved	reserved
pte.rvy[1]	reserved	reserved
pte.rvy[0]	reserved	reserved

If pte.y=0 then:

All capability loads set the loaded capability tag to zero
All capability stores with the to-be-stored capability tag set raise a CHERI Store/AMO Page Fault.

If pte.y=1 then:

All capability loads and capability stores operate as normal.

The reserved bits pte.rvy[2:0] follow the standard reserved behavior for PTE bits. They are allocated by Svyrg, which redefines the entire pte.rvy field when enabled.

23.3. Invalid Virtual Address Handling

When address translation is in effect for RV64Y, the upper bits of virtual memory addresses must match for the address to be valid.

The CSRs shown in Table 88 and Table 91, as well as the pc, need not hold all possible invalid addresses.

However, the bounds encoding of capabilities depends on the address value if the bounds are not infinite.

Therefore, if the bounds are not infinite, implementations must not convert an invalid address to a different invalid address, except as permitted by the rules in the following subsections.

If the bounds could not be decoded due to the address being invalid, then a CHERI Instruction Access Fault, CHERI Load Access Fault or CHERI Store/AMO Access Fault exception is raised as appropriate.

Not requiring the implementation to decode the bounds for invalid addresses reduces the size of bounds comparators from 64-bits to the supported virtual address width.

23.3.1. Updating CSRs

A CSR may be updated to hold a capability with an invalid address, due to:

executing instructions, such as CSRRW (RVY)
hardware updates to CSRs such as storing the pc into mepc (RVY)/sepc (RVY) etc. when taking an exception.

To ensure that the bounds of a valid capability cannot be corrupted:

If the new address is invalid and the capability bounds are not infinite then set the capability tag to zero before writing to the CSR.

When the new address is invalid and happens to match an invalid address which the CSR can hold, then it is implementation-defined whether to set the capability tag to zero.

23.3.2. Branches and Jumps

If the effective target address of the jump or branch is invalid, and the authorizing capability’s bounds are not infinite, then set the capability tag of the target pc to zero.

A CHERI Instruction Access Fault exception will be raised at the target.

RISC-V harts that do not support RVY normally raise an instruction access fault or page fault after jumping or branching to an invalid address. Therefore, RVY preserves that behavior to ensure that harts supporting RVY and Zyhybrid are fully compatible with RISC-V harts provided that pc and ddc are set to Root Executable and Root Data capabilities, respectively.

23.3.3. Memory Accesses

If the effective address of the memory access is invalid, and the authorizing capability’s bounds are not infinite, then raise a CHERI Load Access Fault or CHERI Store/AMO Access Fault exception because the bounds cannot be reliably decoded.

24. "Zyhybrid for Privileged Architectures" Extension, Version 1.0

When using a system with Zyhybrid, it may be desirable to configure privilege levels below Machine-mode such that they operate as a standard RV32[IE]/RV64[IE] base architecture.

Zyhybrid includes the ability to disable CHERI by making the Y bit of menvcfg (RVY) and senvcfg (RVY) writable.

Zyhybrid adds ddc to the registers which must be configured as root capabilities for misa.Y to be programmed to zero.

The effective CHERI-enable for the current privilege is:

Machine-mode: always enabled
Supervisor-mode: menvcfg.Y
User-mode: menvcfg.Y & senvcfg.Y

The reset values are:

menvcfg.Y=0
senvcfg.Y=0
pc.P-bit=1 (for Integral Pointer Mode).
ddc, pc, xtvec, xepc are nominally root capabilities.

The following occurs when executing code in a privilege mode that has CHERI disabled:

Custom encoding spaces reallocated as standard encoding spaces for RVY revert to being custom encoding spaces
- Therefore, RVY instructions are no longer available.
No natively YLEN CSRs are available.
Only XLEN CSR access is permitted.
- This is identical to Integral Pointer Mode access.

Disabling CHERI has no effect on implicit accesses or security checks.

The last capability written to pc and ddc before disabling CHERI is used to authorize instruction execution and data memory accesses respectively.
Exceptions and interrupts use the full capability held in xtvec and xepc.

Disabling CHERI prevents low-privileged Integral Pointer Mode software from interfering with the correct operation of higher-privileged Integral Pointer Mode software that does not perform ddc switches on trap entry and return.

Table 93 summarizes the behavior of a hart for the different CHERI modes.

Table 93. Hart’s behavior depending on the effective CHERI enable and CHERI Pointer Mode
	Y¹=0	Y=1, P-bit=1	Y=1, P-bit=0
Authorizing capability for data memory accesses	ddc	ddc	capability in `rs1`
Natively YLEN CSRs available	✘	✔	✔
Extended YLEN CSR access width	XLEN	XLEN	YLEN
16-bit RVY instructions available	✘	✘	✔
Wider RVY instructions available	✘	✔	✔
custom-1/2/3 available for custom use	✔	✘	✘
Summary	*Fully RV32[IE]/RV64[IE] compatible*²	*Integral Pointer Mode*	*Capability Pointer Mode*

¹ Y represents the effective CHERI enable for the current privilege mode.

² The hart is fully compatible with standard RISC-V when Y=0 provided that pc, xtvec, xepc and ddc have not been changed from their reset state.

25. "Svyrg" Extension, Version 1.0 for RV64Y

This extension specifies use of the RVY field in the Sv39, Sv48 and Sv57 PTE formats (e.g., see Figure 36).

Page-Based 39-bit Virtual-Memory System (Sv39)

Figure 36. Sv39 page table entry.

Diagram included in the standalone specification for reference only.

Svyrg implies Sv39.

The Svyrg extension is enabled when the sstatus.YRGE bit is set.

When sstatus.YRGE=0, only pte.rvy[3] (pte.y) has meaning: it completely enables or disables storing capabilities to the page, as defined in pte.rvy field. When sstatus.YRGE=1, the entire pte.rvy field is redefined as described in this chapter.

The extension adds the ability for supervisor-mode software to quickly enable trapping loads of capabilities from all pages of memory, incrementally allow loads of capabilities from such pages, and track stores of capabilities to all pages of memory. Applied to userspace pages, this has been shown to allow the operating system to implement capability revocation schemes, which allow userspace memory allocators to deterministically guard against use-after-reallocation (Filardo et al., 2024). Applied to kernel pages, they can similarly be used against use-after-reallocation within the kernel itself.

The 4-bit pte.rvy field is subdivided into four 1-bit fields, two controlling capability loads (including the capability loaded during an AMO) and two controlling capability stores (again, including the capability stored as part of an AMO).

Table 94. Breakdown of the 4-bit *pte*.rvy field.
Bit	Name	Comment
Capability load/AMO fields
pte.rvy[0]	pte.yr	Capability readable
pte.rvy[1]	pte.yrg	Capability read generation
Capability store/AMO fields
pte.rvy[2]	pte.yw	Capability writable
pte.rvy[3]	pte.yd	Capability dirty

When all of the following are true, Svyrg adds two related architectural features, CHERI Load Capability Faults and Capability Dirty Tracking:

sstatus.YRGE is set.
The authorizing capability has C-permission.
The PMA is set to CHERI Capability Tag.

It may make sense to subdivide this into two extensions as Capability Dirty Tracking is more generally useful than CHERI Load Capability Faults.

25.1. CHERI Load Capability Faults

pte.yr and pte.yrg control whether capability loads or AMOs write the loaded capability tag to rd.

When pte.yr is clear, pte.yrg selects the behavior of capability loads: if pte.yrg is clear, the loaded capability tag is written to rd as zero; if pte.yrg is set, the load operates as normal.

When pte.yr is set, pte.yrg can be used to trap on capability loads or AMOs when it does not match the Capability Read Generation value that is represented by the value of sstatus.UYRG for userspace pages (pte.u=1) or sstatus.SYRG for kernel pages (pte.u=0).

The implementation raises a CHERI Load Capability Fault when, in addition to the rules above, all of the following are true:

A capability load or AMO is executed.
pte.yr is set.
if pte.u=1, pte.yrg does not equal sstatus.UYRG.
if pte.u=0, pte.yrg does not equal sstatus.SYRG.
The loaded capability tag is set, unless the implementation traps conservatively¹.
Any other platform specific rules have not forced the loaded capability tag to be clear.

¹ Implementations that trap conservatively may raise the fault regardless of the value of the loaded capability tag, as shown in Table 95.

Checking the value of the capability tag requires taking data dependent exceptions on loaded capabilities for loads or AMOs. Ideally all implementations would trap precisely (taking the capability tag into account in all cases) rather than conservatively (trapping more often, potentially on every loaded capability). However, the loaded capability tag may not be available in all implementations when determining whether to raise the exception, and therefore flexibility is permitted. As a result, the software is required to be tolerant of raising the trap when the capability tag is not set, potentially resulting in spurious traps from pages that have pte.yr=1, and so are likely to store valid capabilities.

Implementations that already take synchronous traps on loaded data, such as ECC faults, are recommended to check the loaded capability tag when determining whether to raise the fault.

Table 95. Summary of capability load *pte*.yr and *pte*.yrg behavior in the PTEs
pte.yr	pte.yrg	tag¹	Load Capability Behavior
0	0	X	Clear loaded capability tag
0	1	X	Normal operation
1	≠ `sstatus.xYRG`²	0	Implementation defined choice of CHERI Load Capability Fault or normal operation
1	≠ `sstatus.xYRG`²	1	CHERI Load Capability Fault
1	= `sstatus.xYRG`²	X	Normal operation

¹ The loaded capability tag.

² if pte.u=1, sstatus.UYRG, else sstatus.SYRG.

CHERI Load Capability Faults may be used to implement the load-barrier primitive from (Filardo et al., 2024).

In the case of AMOs that could trigger both a CHERI Store/AMO Page Fault, due to storing a valid capability tag, and a CHERI Load Capability Fault, the CHERI Store/AMO Page Fault takes priority as shown in Table 83.

25.2. Capability Dirty Tracking

When pte.yw is clear, capability stores or AMOs where the to-be-stored capability tag is set will raise a CHERI Store/AMO Page Fault.

When pte.yw is set, capability stores to the virtual page are permitted. In addition, the pte.yd bit indicates that a capability was stored to the virtual page since the last time the pte.yd bit was cleared.

This is equivalent to the behavior of the pte.d bit except that pte.yd only tracks stores of valid capabilities instead of all stores.

Capability dirty tracking behavior is enabled when, in addition to the rules above, all of the following are true:

A capability store or AMO instruction is executed.
The to-be-stored capability tag is set.
pte.yw is set.
pte.yd is clear.

Two schemes for capability dirty tracking are permitted, and the scheme in use is determined by whether the Svade or Svadu extensions are enabled.

For Svade, take a CHERI Store/AMO Page Fault.
For Svadu, do a hardware update that sets pte.yd=1, following the same rules as setting pte.d.
- When setting pte.yd, the hardware update also necessarily sets (or leaves set) pte.a and pte.d.

Table 96. Summary of capability store *pte*.yw and *pte*.yd behavior in the PTEs
pte.yw	pte.yd	tag¹	Store Capability Behavior
0	X	0	Normal operation
0	X	1	CHERI Store/AMO Page Fault
1	0	0	Normal operation
1	0	1	CHERI Store/AMO Page Fault (Svade) or hardware pte.yd update (Svadu)
1	1	X	Normal operation

¹ The to-be-stored capability tag.

The capability dirty tracking is resolved during memory translation, but there are cases where it is not known if there will be a capability tag stored to memory or not at this point. Capability dirty tracking must be triggered when pte.yw=1 and pte.yd=0 and the to-be-stored capability tag is set, and may be triggered in the following case where it is not known if there will be a stored capability tag during translation:

SC.Y triggers capability dirty tracking if the capability tag is set in rs2, even if the store fails. This matches the semantics of SC.* with regard to pte.d.

pte.yd must always be set when a capability with a valid capability tag is written to a virtual page so that software knows which pages have had capabilities stored to them. It may be set too often, which may cause software to examine the page to check for capabilities when none are present. This is a situation software is required to handle anyway, as it is always possible for all capabilities in a page to be overwritten by non-capability data. In this case, the pte.yd bit would still be set.

Future AMOs fall into this category:

For future AMOCAS.Y, it is not known whether the store will happen until the load has executed, and the compare has been done. Therefore, AMOCAS.Y is likely to trigger capability dirty tracking if the capability tag is set in rs2.
For future AMOADD.Y, the stored capability tag depends upon the loaded capability tag which is not known during translation and on the execution of the YADD. Therefore, AMOADD.Y, is likely to always trigger capability dirty tracking.

Checking the stored capability tag is less of a burden to the implementation than checking the loaded capability tag for CHERI Load Capability Fault, which is why checking the loaded capability tag is optional behavior. However, a future extension may reduce the burden further by removing the check on the to-be-stored capability tag.

Capability dirty tracking may be used to implement the store-barrier primitive from (Filardo et al., 2024).

The minimum level of PTE support is to set pte.yr=1, pte.yd=1, pte.yw=1 and pte.yrg=0 in all PTEs intended for storing capabilities (e.g., private anonymous mappings) and set sstatus.UYRG=0 and sstatus.SYRG=0 on all harts, which will enable capabilities to be loaded and stored successfully.

26. Pointer Masking (Ssnpm, Smnpm, Smmpm, Sspm, Supm) (RV64Y)

This chapter is not part of the v1.0 ratification package and is experimental.

Whenever pointer masking is enabled, all bounds decoding, representable range checks and bounds checks are affected.

The suggestion in this section is based on the pointer masking approach from Morello but with changes to sign extension and to address the dynamic nature of bit masking.

When bounds are encoded or decoded, a masked but not sign-extended address is used. Changing how many bits are masked can therefore change the interpretation of the bounds of a capability, both for the purpose of implicit accesses via bounds checks and any instructions that report the bounds of a capability. Apart from treating the PMLEN high address bits as zero, there are no other changes to bounds decoding, which is still based on XLEN, not the new effectively addressable space. That is, the maximum length of a capability does not change, and it is not invalid to have a capability that covers a longer range than could actually be addressed with pointer masking enabled (such as one covering the entire XLEN address space). For the representable range check, both the original and new addresses are masked. Bounds setting instructions also mask the address in the same way.

Because dynamically changing the number of masked bits changes the interpretation of a capability, software must take the same care when sharing capabilities between address spaces with differing pointer masking as it generally must when sharing capabilities between address spaces with different page mappings.

Any address that is checked against a capability is also first subject to the same masking as bounds decode (masking without extension). After any CHERI operations, the final access address is still subject to as much sign extension as the pointer masking extensions mandate.

In summary, for data accesses only:

When setting bounds (YBNDSW/YBNDSWI/YBNDSRW), bits [XLEN-1:XLEN-PMLEN] of the address are set to zero and therefore the resulting capability will have a base with the PMLEN high address bits set to zero.
When decoding bounds, the address used for decoding has bits [XLEN-1:XLEN-PMLEN] set to zero.
When checking the representable range for YADDI/YADD/YADDRW, the old address and new addresses both have bits [XLEN-1:XLEN-PMLEN] set to zero before the check.

Also note that:

pc bounds decoding isn’t affected by pointer masking.
Any representable range check triggered by a write to a CSR (e.g., mtvec (RVY)) is not affected by pointer masking.

27. Hypervisor "H" Extension (RVY)

This chapter is not part of the v1.0 ratification package and is stable.

The RISC-V hypervisor (H) extension virtualizes the supervisor-level architecture to support the efficient hosting of guest operating systems atop a type-1 or type-2 hypervisor (RISC-V, 2023).

The hypervisor extension is generally orthogonal to CHERI; the main requirements, when integrating with RVY and Zyhybrid, are that address CSRs added for hypervisors are widened so that they can hold capabilities. The remainder of this chapter describes these changes in detail.

27.1. Hypervisor Status Register (hstatus)

The hstatus register operates as described above except for the VSXL field that controls the value of XLEN for VS-mode (known as VSXLEN).

The encoding of the VSXL field is the same as the MXL field of misa. Only 1 and 2 are supported values for VSXL. When the implementation supports RVY (but not Zyhybrid), then hstatus's VSXL must be read-only. When the implementation supports both RVY and Zyhybrid, then VSXL behaves as mstatus.SXL.

The VSBE field controls the endianness of explicit memory accesses from VS-mode and implicit memory accesses to VS-level memory management data structures. VSBE=0 indicates little endian and VSBE=1 is big endian. VSBE must be read-only and equal to MBE when the implementation only supports RVY.

27.2. Hypervisor Environment Configuration Register (henvcfg)

The henvcfg(RVY) register operates as described in the RISC-V Privileged Specification. A new enable bit is added to henvcfg(RVY) when the implementation supports Zyhybrid as shown in Figure 37.

Figure 37. Hypervisor environment configuration register (henvcfg)

The Y bit controls whether explicit access to CHERI registers is permitted when V=1. When henvcfg(RVY).Y=1 and menvcfg (RVY).Y=1 and misa (RVY).Y=1, CHERI can be enabled by VS-mode and VU-mode. When henvcfg(RVY).Y=0, CHERI is disabled in VS-mode and VU-mode as described in Chapter 24.

The Y bit is reset to 0 for compatibility, so that non-CHERI aware code can run unmodified.

27.3. Hypervisor Exception Delegation Register (hedeleg)

Bits 32,33,34,35,36 of hedeleg refer to valid CHERI exceptions and so can be used to delegate CHERI exceptions to supervisor mode.

27.4. Virtual Supervisor Status Register (vsstatus)

Figure 38. Virtual Supervisor-mode status (vsstatus) register when VSXLEN=64

The vsstatus register operates as described above except for the UXL field that controls the value of XLEN for VU-mode.

The encoding of the UXL field is the same as the MXL field of misa. Only 1 and 2 are supported values for UXL. When the implementation supports RVY (but not Zyhybrid), then vsstatus.UXL must be read-only. When the implementation supports both RVY and Zyhybrid, then UXL behaves as described in mstatus (RVY) for mstatus.UXL.

27.5. Virtual Supervisor Trap Vector Base Address Capability Register (vstvec)

The vstvec register is widened to a capability.

Figure 39. Virtual supervisor trap vector base address capability register

The handling of vstvec (RVY) is otherwise identical to mtvec (RVY), but in virtual supervisor mode.

27.6. Virtual Supervisor Scratch Register (vsscratch)

The vsscratch register is widened to a capability.

It is not WARL, all capability fields must be implemented.

Figure 40. Virtual supervisor scratch capability register

27.7. Virtual Supervisor Exception Program Counter Capability (vsepc)

The vsepc register is widened to a capability.

vsepc (RVY) is an executable capability, so it does not need to be able to hold all possible invalid addresses (see Invalid address conversion). Additionally, the capability in vsepc (RVY) is unsealed if it is a Sealed entry point capability when it is written to pc on execution of an SRET (RVY) instruction when V=1. The handling of vsepc (RVY) is otherwise identical to mepc (RVY), but in VS-mode.

Figure 41. Virtual supervisor exception program counter capability

27.8. Virtual Supervisor Thread Identifier Capability (vstidc)

The vstidc register is used to identify the current software thread in virtual supervisor mode. As with other Virtual Supervisor registers when V=1, vstidc substitutes for stidc, so that instructions that normally read or modify stidc actually access vstidc instead. When V=0, vstidc does not directly affect the behavior of the machine.

Figure 42. Virtual supervisor thread identifier capability register

27.9. "Smstateen/Ssstateen" Integration

The TIDC bit controls access to the stidc (really vstidc) CSR.

Figure 43. Hypervisor State Enable 0 Register (hstateen0)

27.10. Hypervisor Load and Store Instructions For Capability Data

Hypervisor virtual-machine load (HLV.Y) and store (HSV.Y) instructions read or write YLEN bits from memory as though V=1. These instructions change behavior depending on the CHERI Pointer Mode although the instruction’s encoding remains unchanged.

When in Capability Pointer Mode, the hypervisor load and store capability instructions behave as described in Section 2.9. In Integral Pointer Mode, the instructions use the low XLEN bits of the base register as the effective address for the memory access and the capability authorizing the memory access is ddc.

27.10.1. HLV.Y

Synopsis: Hypervisor virtual-machine load capability
Mnemonic: HLV.Y rd, rs1
Encoding

Description: Execute LY as though V=1, following the same pattern as HLV.W but with capability data.
Prerequisites: RVY, H
Operation

SAIL TO BE UPDATED

27.10.2. HSV.Y

Synopsis: Hypervisor virtual-machine store capability
Mnemonic: HSV.Y rs2, rs1
Encoding

Description: Execute SY as though V=1; following the same pattern as HSV.W but with capability data.
Prerequisites: RVY, H
Operation

SAIL TO BE UPDATED

28. The "Smycheriot" Privileged ISA Extension

This section defines a series of small changes to the Machine-Level ISA (RVY) privileged architecture that serve to specialize it for microcontroller environments. These changes are based on, but are not exactly isomorphic to, the prior CHERIoT RV32E-based ISA.

28.1. Required Extensions

Smycheriot requires its unprivileged counterpart, Zycheriot, and builds on Machine-Level ISA (RVY).

The present specification presumes the absence of both

any execution mode other than M-mode (such as VS, HS, VU, S, or U), and
the Zyhybrid privileged extensions.

While Smycheriot is nominally compatible with Zyhybrid for Privileged Architectures, and particular instantiations may opt to permit disabling CHERI, we have not yet found a compelling reason to formally specify this composition.

28.2. CSR Reset States

The RVY base privileged ISA tends to define M-mode CSRs' reset values either as Root Executable capabilities or as otherwise unspecified values with clear capability tags. To make available Zycheriot’s multiple root capability values, we redefine two CSRs' reset values:

mtidc: The mtidc register’s reset value is changed to be Zycheriot’s Root Data capability value (with set capability tag), rather than the unspecified value with zero capability tag of RVY.
mscratch (RVY): The mscratch (RVY) register’s reset value is changed to be Zycheriot’s sealing capability value (with set capability tag), rather than the unspecified value with zero capability tag of RVY.

28.3. Additional CSR Legalization Requirements

All of pc, mtvec (RVY), and mepc (RVY) will clear their capability tag if the capability they would come to hold after an update would be sealed or would not grant X-permission.

The Machine-Level ISA (RVY) requires that mtvec (RVY) and mepc (RVY) nominally reset to Root Executable capabilities, and pc must be unsealed and grant X-permission for successful execution. Neither architecture nor software expects to be able to place capabilities thereby prohibited in these CSRs.

28.4. Capability Types

Zycheriot introduces several sentry CT-field values and discusses some aspects of their interaction with JALR and JAL instructions. Smycheriot further equips these CT-field values with privileged architectural side-effects when used with JALR. In particular, Smycheriot uses these CT-field values to enforce structured interrupt control, by atomically capturing and changing the value of xstatus.xIE in JALR. The purpose of this is to enable calls to short per-core atomic sequences without granting the right to modify xstatus.xIE arbitrarily. Functions that are called with a new interrupt stance can return to the caller’s stance. Non-standard control flow transfers such as tail calls and calls to compiler outlined functions using a non-standard link register are still possible but cannot change the interrupt status. Beyond the JALR behaviors specified by Zycheriot, Smycheriot further requires that…

Upon successfully retiring, a JALR instruction whose rs1 holds such a capability updates the hart’s xstatus.xIE bit as per Table 97.
A successfully retiring JALR or JAL whose rd is ra will seal its written-back return capability based on the value of the hart’s xstatus.xIE bit immediately before this instruction’s execution. If that bit is 0, the return capability is sealed with 4; otherwise, 5 is used.

Combining these semantics with those of Zycheriot lets us give convenient mnemonics for the particular CT-field values that these extensions use, as shown in Table 98.

Table 97. Additional `JALR` Architectural Semantics
CT-field	IRQs at Retirement (`xstatus.xIE`)
1	Unchanged (as is)
2, 4	Deferred (`0`)
3, 5	Enabled (`1`)

Table 98. Sentry type mnemonics
CT-field	Name
1	Forward interrupt-inheriting
2	Forward interrupt-deferring
3	Forward interrupt-enabling
4	Backward interrupt-deferring
5	Backward interrupt-enabling

The degree of control flow integrity provided by the rules of Zycheriot ensures that a leaf function entered via a forward interrupt-disabling sentry cannot be induced, by a malicious caller, to return to its own entry vector (indeed, to return via any forward sentry). This ensures that granting untrusted code such a sentry is not tantamount to handing it the ability to wedge the machine.

28.5. Stack High Watermark CSRs

Smycheriot introduces two new XLEN CSRs for each privilege level: xshwm and xshwmb. Both are freely read but require ASR-permission for explicit writes from software. Writes are WARL, with legal values being addresses with capability alignment (that is, addresses aligned to YLEN/8 bytes). Store instructions targeting addresses between the values held in xshwm and xshwmb cause xshwm to be updated to the lowest targeted address (rounded down to capability alignment).

The intended software use of these CSRs is, as the section title suggests, to track the "high watermark" of a thread’s C stack (that is, the lowest address written to, because "stacks grow down"). The thread context switching code should context switch these registers, having initialized xshwmb to the lower bound of the thread’s stack capability and xshwm to its upper bound. Privileged stack zeroing code can be used to lower xshwm, so that all bytes between the addresses held in xshwm and xshwmb are known to be zero.

28.6. Capability Load Filter and The Revocation Bitmap

This section is non-normative, as it introduces no new behavior not already permitted by the RVY unprivileged architecture. It is meant to be informative detail about how CHERIoT platforms avail themselves of a particular architectural affordance therein.

As part of providing heap temporal safety, CHERIoT platforms may, as permitted by RVY, clear tags being transported from memory into CPU registers by a ly instruction. CHERIoT platforms mediate capability loads with a capability load filter. Software dynamic memory allocators can use this load filter to ensure that additional copies of capabilities to deallocated objects cannot be constructed (into register files and, so, into memory, too). This facilitates particularly straightforward global revocation of pointers to freed heap objects.

In implementations to date, this capability load filter is instantiated by pairing each memory block(s) in which software’s dynamic allocation heap(s) are to reside with a per-memory-block revocation bitmap, a bit-vector wherein each bit corresponds to a capability-sized memory granule. (That is, with each capability-sized and -aligned region of the primal memory.) Such memory block(s) are said to be revocable. These revocation bitmaps are also exposed (as memory) to software. When ly transfers a valid capability (one with a set capability tag) from memory (revokable or not) into a hart’s register file, it checks whether the base (lower bound) of that capability is within a revokable memory block and, if so, fetches the corresponding bit in the associated revocation bitmap. If that bit is set, then the capability tag stored in the hart’s register file as part of this transfer is cleared.

It is up to each particular CHERIoT platform to define…

the number, location(s), and size(s) of revokable memory blocks,
the location(s) of their associated revocation bitmaps, and
the mapping function between primal memory address and revocation bitmap address and bit index.

Appendix A: RVY Instruction Encoding Overview

ARC-NOTE: The instruction encoding listings below are presented in three different formats (bytefield, wavedrom and AsciiDoc tables). Please choose which of these formats you would like to see in the final specification.

A.1. Bytefield Encoding Overview

A.1.1. 3OP Instruction Encodings

A.1.2. 2OP Instruction Encodings

A.1.3. YADDI/LY/SY Instruction Encodings

A.1.4. AMO Instruction Encodings

A.1.5. MISC Instruction Encodings

A.2. Wavedrom Encoding Overview

A.2.1. 3OP Instruction Encodings

A.2.2. 2OP Instruction Encodings

A.2.3. YADDI/LY/SY Instruction Encodings

A.2.4. AMO Instruction Encodings

A.2.5. MISC Instruction Encodings

A.3. AsciiDoc Table Encoding Overview

A.3.1. 3OP Instruction Encodings

3OP Instruction Encodings

31:25

24:20

19:15

14:12

11:7

6:0

Inst

0000001

rs2

rs1

000

rd

RVY-A=1111011

PACKY

0000001

rs2

rs1

000

rd

RVY-A=1111011

YHIW (pseudo)

0000011

rs2≠0

rs1

000

rd

RVY-A=1111011

YADD

0000011

00000

rs1

000

rd

RVY-A=1111011

YMV

0001011

rs2

rs1

000

rd

RVY-A=1111011

YADDRW

0010011

rs2

rs1

000

rd

RVY-A=1111011

YPERMC

0011011

rs2

rs1

000

rd

RVY-A=1111011

YBNDSW

0100011

rs2

rs1

000

rd

RVY-A=1111011

YBNDSRW

0101011

rs2

rs1

000

rd≠0

RVY-A=1111011

YMODEW

0101011

00000

rs1=0

000

rd=0

RVY-A=1111011

YMODESWY

0101011

00001

rs1=0

000

rd=0

RVY-A=1111011

YMODESWI

0110011

rs2

rs1

000

rd

RVY-A=1111011

YBNDSRDW

0000101

rs2

rs1

000

rd

RVY-A=1111011

YSH1ADD

0001101

rs2

rs1

000

rd

RVY-A=1111011

YSH2ADD

0010101

rs2

rs1

000

rd

RVY-A=1111011

YSH3ADD

0011101

rs2

rs1

000

rd

RVY-A=1111011

YSH4ADD

0100101

rs2

rs1

000

rd

RVY-A=1111011

YSH1ADD.UW

0101101

rs2

rs1

000

rd

RVY-A=1111011

YSH2ADD.UW

0110101

rs2

rs1

000

rd

RVY-A=1111011

YSH3ADD.UW

0111101

rs2

rs1

000

rd

RVY-A=1111011

YSH4ADD.UW

0000110

rs2

rs1

000

rd

RVY-A=1111011

YEQ

0001110

rs2

rs1

000

rd

RVY-A=1111011

YSS

0000111

rs2

rs1

000

rd

RVY-A=1111011

YSUNSEAL

0001111

rs2

rs1

000

rd

RVY-A=1111011

YBLD

0010111

rs2

rs1=0

000

rd

RVY-A=1111011

YSENTRY

0011111

rs2

rs1

000

rd

RVY-A=1111011

YUNSEAL

A.3.2. 2OP Instruction Encodings

2OP Instruction Encodings

31:25

24:20

19:15

14:12

11:7

6:0

Inst

1111000

00000

rs1

000

rd

RVY-A=1111011

YAMASK

1111010

00000

rs1

000

rd

RVY-A=1111011

YBASER

1111010

00001

rs1

000

rd

RVY-A=1111011

YPERMR

1111010

00010

rs1

000

rd

RVY-A=1111011

YTOPR

1111010

00011

rs1

000

rd

RVY-A=1111011

YLENR

1111010

00100

rs1

000

rd

RVY-A=1111011

YTAGR

1111010

00101

rs1

000

rd

RVY-A=1111011

YTYPER

1111010

00110

rs1

000

rd

RVY-A=1111011

YMODER

A.3.3. YADDI/LY/SY Instruction Encodings

YADDI/LY/SY Instruction Encodings

31:25

24:20

19:15

14:12

11:7

6:0

Inst

imm[11:0]

rs1

100

rd

RVY-A=1111011

YADDI

imm[11:0]

rs1≠0

001

rd

RVY-A=1111011

LY

offset[11:5]

rs2

rs1≠0

010

offset[4:0]

RVY-A=1111011

SY

A.3.4. AMO Instruction Encodings

AMO Instruction Encodings

31:27

26

25

24:20

19:15

14:12

11:7

6:0

Inst

00010

aq

rl

00000

rs1≠0

011

rd

RVY-A=1111011

LR.Y

00001

aq

rl

rs2

rs1≠0

011

rd

RVY-A=1111011

AMOSWAP.Y

00011

aq

rl

rs2

rs1≠0

011

rd

RVY-A=1111011

SC.Y

A.3.5. MISC Instruction Encodings

MISC Instruction Encodings

31:29

28:27

26:20

19:15

14:12

11:7

6:0

Inst

00000

shamt=XLEN

rs1

101

rd

RVY-A=1111011

YHIR (pseudo)

00000

shamt=XLEN

rs1

101

rd

RVY-A=1111011

SRLIY

111

ybndswimm[8:0]

rs1

101

rd

RVY-A=1111011

YBNDSWI

A.4. RVY-A Instruction Allocation

A.4.1. Funct3 Allocations

funct3

Instruction(s)

000

R-Type (3-op and 1/2-op)

001

010

011

AMO

100

YADDI

101

MISC IMM (SRLIY/YBNDSWI)

110

Unallocated

111

Unallocated

A.4.2. R-Type 3-Operand (funct3=000)

funct7[6:3]

funct7[2:0]

000

001

010

011

100

101

110

111

0000

0001

0010

0011

0100

0101

YMODEW/YMODESWY/YMODESWI

0110

0111

1000

1001

1010

1011

1100

1101

1110

1111

1OP/2OP

A.4.3. R-Type 1-Op/2-Op (funct3=000, funct7=1111000)

Note: The 5-bit rs2 field is split into columns rs2[4:3] and rows rs2[2:0].

rs2[2:0]

rs2[4:3]

000

YAMASK

001

010

011

100

101

110

111

A.4.4. R-Type 1-Op/2-Op (funct3=000, funct7=1111010)

Note: The 5-bit rs2 field is split into columns rs2[4:3] and rows rs2[2:0].

rs2[2:0]

rs2[4:3]

000

001

010

011

100

101

110

111

A.4.5. AMO Sub-opcode Allocations (funct3=011)

funct7[6:3]

funct7[2:0]

000

001

010

011

100

101

110

111

0000

AMOSWAP.Y

0001

LR.Y

SC.Y

0010

0011

0100

0101

0110

0111

1000

1001

1010

1011

1100

1101

1110

1111

A.4.6. MISC Sub-opcode Allocations (funct3=101)

funct7[6:3]

funct7[2:0]

000

001

010

011

100

101

110

111

0000

YHIR (pseudo)/SRLIY

0001

0010

0011

0100

0101

0110

0111

1000

1001

1010

1011

1100

1101

1110

YBNDSWI

1111

YBNDSWI

Appendix B: Hybrid Mode Usage Models

The Zyhybrid extension enables a hart to switch between Capability Pointer Mode and Integral Pointer Mode. This functionality supports various usage models that facilitate the co-existence of and transition between non-CHERI software and pure-capability CHERI software.

B.1. High-Level Overview of Compatibility

There are three operating modes on an RVY-enabled core:

RVI/RVE mode (misa.Y=0 or xenvcfg.Y=0): The base ISA is RVI or RVE. There is no access to extended YLEN registers. custom-1/2/3 major opcodes remain available for vendor extensions.
RVY pure-capability mode (misa.Y=1 and xenvcfg.Y=1, and pc.P-bit = 0): The base ISA is RVY, and custom-1/2/3 major opcodes are used for RVY instructions. All loads and stores use a YLEN base register for permissions and bounds checks.
RVY in RVI compatibility mode (misa.Y=1 and xenvcfg.Y=1, and pc.P-bit = 1): The base ISA is RVY, and custom-1/2/3 major opcodes are used for RVY instructions. However, all loads and stores use a XLEN base register and check ddc for permissions and bounds. This mode requires Zyhybrid support.

B.2. Example Usage Models

Purecap Kernel with 64-bit Plain RISC-V Programs

In this model, a CHERI-aware operating system (the kernel) executes in Capability Pointer Mode, benefiting from full memory safety for kernel operations. User applications may be standard 64-bit RISC-V programs executing in Integral Pointer Mode. The kernel can decide based on the ELF header whether to start the program with CHERI enabled and in pure-capability mode or completely disabled.

To set up this environment and handle transitions, the following sequence is typical:

Initialize ddc for User Space: The kernel creates a data capability with read and write permissions. Its bounds are restricted to the user program’s data segment. This capability is written to the ddc CSR.
Setup pc for User Space: The kernel creates a capability for the user program entry point. It sets the P-bit to Integral Pointer Mode so the processor starts in Integral Pointer Mode. This capability is written to sepc.
Enter User Mode: The kernel executes the sret instruction. This installs the capability from sepc into pc, switching the mode to Integral Pointer Mode.

Return to Kernel Mode: When the user program makes a system call (via ecall) or an error occurs, the hardware saves the current pc into sepc. It then loads the kernel’s exception vector capability from stvec into pc. Since stvec was set up with its P-bit set for pure-capability execution, this switches the execution mode back to Capability Pointer Mode.

The kernel can disable CHERI for user mode by clearing the appropriate enable bits (e.g., bit in senvcfg.Y). This ensures that the user application cannot execute any RVY instructions. It also guarantees that the program has access to the custom-1/2/3 opcode space, which CHERI would otherwise use for its own instructions in Integral Pointer Mode.

To allow mixing plain RISC-V and pure-capability RISC-V user applications, the kernel must either context-switch the state of senvcfg.Y, or run the plain RISC-V applications in RVI compatibility mode, in which case the custom opcode space reserved by RVY is not available to userspace.

Plain RISC-V Program with a Purecap Library Sandbox

A large non-CHERI application (executing in Integral Pointer Mode) may wish to use a specific library that has been recompiled for CHERI (executing in Capability Pointer Mode) to process untrusted data (e.g., an image or video codec).

To set up this environment and handle transitions, the following sequence is typical:

Prepare Library Capabilities: A dynamic linker or runtime creates a capability for the library’s entry point with the P-bit set to Capability Pointer Mode. This capability only covers the library’s code and read-only data section.
Setup Sandbox Environment: Before running the library, the system sets ddc to NULL. This stops the library from accessing memory unless it uses specific capabilities given to it.
Invoke the Library: The main program (in Integral Pointer Mode) calls the library using JALR (RVY). This installs the capability into pc and switches the mode to Capability Pointer Mode. It also saves a return capability in the link register.
Return from the Library: When finished, the library returns by jumping to the return capability in the link register. This switches the mode back to Integral Pointer Mode.

Pure-capability RISC-V Program Calling into a Non-CHERI Library

Conversely, a new pure-capability application may need to link against a non-CHERI binary-only library for which source code is unavailable.

To set up this environment and handle transitions, the following sequence is typical:

Prepare Non-CHERI Code Capability: The pure-capability program creates a capability for the non-CHERI library’s functions with the P-bit set to Integral Pointer Mode. This capability only covers the non-CHERI library’s code region.
Setup ddc for Non-CHERI Code: In pure-capability code, ddc is usually NULL. Before calling the non-CHERI library, the program must load a capability into ddc. This capability must cover all memory the non-CHERI library needs (heap, globals, and stack).
Invoke Non-CHERI Code: The pure-capability program calls the non-CHERI code using JALR (RVY). This switches the processor to Integral Pointer Mode and installs the restricted pc.
Restore State on Return: When the non-CHERI library returns (usually using a standard JALR instruction), the pure-capability program restores its state and sets ddc back to NULL.

Hybrid Mode: RVI Code with Explicit Capabilities

For example, a Just-In-Time (JIT) compiler running in a pure-capability environment might generate code that uses standard integer addresses (or 32-bit compressed pointers) to avoid the overhead of larger capability pointers. While described here in the context of a JIT compiler and language runtime, this model is generalizable to other scenarios requiring efficient execution of code using smaller pointers within a larger pure-capability application.

This model sets up and switches modes just like the sandboxing models above. The main difference is that the generated code (running in Integral Pointer Mode) uses RVY instructions to talk to the pure-capability environment:

Accessing C Runtime Data: The code can access data in the surrounding C++ runtime using explicit capability-based load/store operations. At the same time, it uses standard integer addresses for its own heap via ddc. This can be achieved by executing YMODESWY before the memory access and YMODESWI to return back to address interpretation.
Calling Runtime Helpers: Runtime helper functions are given to the code as a sealed entry point capability. When the code calls these helpers, the processor automatically switches back to Capability Pointer Mode. This lets the runtime run with full safety before returning to the JIT code.

B.3. CSRs and Registers Involved in Mode Transitions

When transitioning between modes, certain capability registers and CSRs must be configured to ensure correct execution and authorization:

ddc (Default Data Capability): This CSR must be written with a capability covering the data memory that the code in Integral Pointer Mode is authorized to access. Memory accesses in Integral Pointer Mode are implicitly authorized by ddc.
Exception Return Capability Registers (sepc or mepc): When a privileged environment prepares to transition to a lower privilege mode via an exception return instruction, it must write the target executable capability to sepc or mepc. This capability must have its P-bit set to the desired execution mode (Capability Pointer Mode or Integral Pointer Mode). The exception return instruction installs this capability into pc, thereby setting the mode.
Trap Vector Registers (stvec or mtvec): These registers hold the capability for the exception vector. When an exception or interrupt occurs, the hardware installs this capability into pc, thereby setting the mode for the trap handler. For example, if stvec was set up with its P-bit set to 0, the processor will switch to Capability Pointer Mode on entry to the trap handler.

B.4. CHERI Enable Bits in Setup (when changing privilege levels)

In addition to setting up capabilities and mode bits, the execution environment must manage the CHERI enable bits provided by the privileged architecture:

Per-Privilege Enable Bits: The privileged architecture defines enable bits (often denoted as Y) in CSRs such as misa, menvcfg, and senvcfg to control CHERI availability in each privilege mode (see Chapter 24).
Reset State: On reset, CHERI is disabled for privilege levels below machine-mode to ensure backwards compatibility with standard RISC-V software. This behavior applies to harts that support execution in both Capability Pointer Mode and Integral Pointer Mode.
Enabling CHERI: A higher privilege mode must set the appropriate enable bits to allow a lower privilege mode to use RVY instructions or switch between Capability Pointer Mode and Integral Pointer Mode.
Disabled CHERI with Bounds Enforcement: If CHERI is disabled for a specific privilege mode, that mode cannot execute RVY instructions or change modes. However, the hardware continues to enforce the bounds defined by the last installed pc and ddc. This allows a kernel to run non-CHERI-aware code in a strict integer environment while still confining it within capability bounds.

B.5. Transition Instructions and Frequency

Transitions between modes are performed using specific instructions:

YMODESWI and YMODESWY: These instructions are used to switch between modes without changing the control flow. They are typically used when entering or leaving a section of code that requires a different mode.
JALR (RVY): When jumping to a target capability with the P-bit set differently from the current mode, the jump performs a mode switch. This is commonly used for function calls between compartments or libraries in different modes.

The frequency of transitions depends on the model:

Kernel/User boundary: Transitions occur on every system call, exception, and interrupt, which are relatively infrequent compared to instruction execution.
Library calls: Transitions occur on every call to and return from the sandboxed library. If these calls are frequent, the overhead of mode switching (which may involve saving/restoring registers and clearing state) should be considered.

Appendix C: CHERI System Implications

ARC-NOTE: Please advise whether this appendix should remain in this specification or move to supporting documentation.

CHERI processors need memory systems that support the capability tags in memory.

CHERI systems range from tiny IoT devices up to server chips.

There are two types of bus connections used in SoCs which contain CHERI CPUs:

Tag-aware buses, where the bus protocol is extended to carry the capability tag along with the data. This is typically done using user-defined bits in the protocol.
1. These buses will read capability tags from memory (if capability tags are present in the target memory) and return them to the requestor.
2. These buses will write the capability tag to memory as an extension of the data write.
Non-capability tag-aware buses, i.e., current non-CHERI-aware buses.
1. Reads of tagged memory will not read the capability tag.
2. Writes to tagged memory will set the capability tag of any YLEN-aligned YLEN-wide memory location to zero where any byte is overwritten by the memory write.

The fundamental rule for any CHERI system is that the capability tag and data are always accessed atomically. For every naturally aligned YLEN-wide memory location, it must never be possible to:

Update any data bytes without also writing the capability tag
1. This implies setting the capability tag to zero if a non-CHERI aware bus master overwrites a capability in memory
Read a capability tag value with mismatched (stale or newer) data
Set a capability tag without also atomically writing the associated data.

Clearing a capability tag may be performed without changing the associated data, provided stale tagged data can never be observed.

C.1. Small CHERI system example

Figure 44. Example small CHERI system with local capability tag storage

This example shows a minimum-sized system where only the local memory is extended to support capability tags. The capability tag-aware region is highlighted. All capability tags are created by the CHERI CPU, and only stored locally. The memory is shared with the system, probably via a secure DMA, which is not capability tag-aware.

Therefore, the connection between CPU and memory is tag-aware, and the connection to the system is not capability tag-aware.

All writes from the system port to the memory must clear any memory capability tags to follow the rules from above.

C.2. Large CHERI system example

Figure 45. Example large CHERI system with capability tag cache

In the case of a large CHERI SoC with caches, all the cached memory visible to the CHERI CPUs must support capability tags. All memory is backed up by DRAM, and standard DRAM does not offer the extra bit required for CHERI capability tag storage and so a typical system will have a capability tag cache IP.

A region of DRAM is typically reserved for CHERI capability tag storage.

The capability tag cache sits on the boundary of the capability tag-aware and non-tag-aware memory domains, and it provides the bridge between the two. It stores capability tags locally in its cache, and if there is a miss, it will create an extra bus request to access the region of DRAM reserved for capability tag storage. Therefore, in the case of a miss a single access is split into two - one to access the data and one to access the capability tag.

The key property of the capability tag cache is to preserve the atomic access of data and capability tags in the memory system so that all CPUs have a consistent view of capability tags and data.

The region of DRAM reserved for capability tag storage must only be written by the capability tag cache, therefore no bus initiators should be able to write to the DRAM without the transactions passing through the capability tag cache.

Therefore, the GPUs and peripherals cannot write to the capability tag storage in the DRAM or the capability tag supporting memory data storage region. These constraints will be part of the design of the network-on-chip. It is possible for the GPU and peripherals to read the capability tag supporting memory data storage region of the DRAM, if required.

It would be possible to allow a DMA to access the capability tagged memory region of the DRAM directly to allow swap to/from DRAM and external devices such as flash. This will require the highest level of security in the SoC, as the CHERI protection model relies on the integrity of the capability tags, and so the root-of-trust will need to authenticate and encrypt the transfer, with anti-rollback protection.

For further information on the capability tag cache see (Joannou et al., 2017).

C.3. Large CHERI pure-capability system example

Figure 46. Example large CHERI system with only tag-aware bus masters

In this example every DRAM access passes through the capability tag cache, and so all bus masters are capability tag-aware and can access the capability tags associated with memory if permitted by the network-on-chip.

The system topology is simpler than in Figure 45.

There is likely to be a performance difference between the two systems. The main motivation for Figure 45 is to avoid the GPU DRAM traffic needing to look-up every capability tag in the capability tag cache, potentially adding overhead to every transaction.

Bibliography

Filardo, N. W., Gutstein, B. F., Woodruff, J., Clarke, J., Rugg, P., Davis, B., Johnston, M., Norton, R., Chisnall, D., Moore, S. W., Neumann, P. G., & Watson, R. N. M. (2024). Cornucopia Reloaded: Load Barriers for CHERI Heap Temporal Safety. doi.org/10.1145/3620665.3640416

Joannou, A., Woodruff, J., Kovacsics, R., Moore, S. W., Bradbury, A., Xia, H., Watson, R. N. M., Chisnall, D., Roe, M., Davis, B., Napierala, E., Baldwin, J., Gudka, K., Neumann, P. G., Mazzinghi, A., Richardson, A., Son, S., & Markettos, A. T. (2017). Efficient Tagged Memory. www.cl.cam.ac.uk/research/security/ctsrd/pdfs/201711-iccd2017-efficient-tags.pdf

RISC-V. (2023). RISC-V Unprivileged Specification. github.com/riscv/riscv-isa-manual/releases/download/riscv-isa-release-056b6ff-2023-10-02/unpriv-isa-asciidoc.pdf

RISC-V. (2023). RISC-V Privileged Specification. github.com/riscv/riscv-isa-manual/releases/download/riscv-isa-release-056b6ff-2023-10-02/priv-isa-asciidoc.pdf

Watson, R. N. M., Neumann, P. G., Woodruff, J., Roe, M., Almatary, H., Anderson, J., Baldwin, J., Barnes, G., Chisnall, D., Clarke, J., Davis, B., Eisen, L., Filardo, N. W., Fuchs, F. A., Grisenthwaite, R., Joannou, A., Laurie, B., Markettos, A. T., Moore, S. W., … Xia, H. (2023). Capability Hardware Enhanced RISC Instructions: CHERI Instruction-Set Architecture (Version 9) (UCAM-CL-TR-987; Issue UCAM-CL-TR-987). University of Cambridge, Computer Laboratory. doi.org/10.48456/tr-987

Woodruff, J., Joannou, A., Xia, H., Fox, A., Norton, R. M., Chisnall, D., Davis, B., Gudka, K., Filardo, N. W., Markettos, A. T., & others. (2019). Cheri Concentrate: Practical compressed capabilities. IEEE Transactions on Computers, 68(10), 1455–1469. doi.org/10.1109/TC.2019.2914037

saved register specifier	xreg	integer ABI	RV32Y ABI
0	x8	s0	s0
1	x9	s1	s1
2	x18	s2	s2
3	x19	s3	s3
4	x20	s4	s4
5	x21	s5	s5
6	x22	s6	s6
7	x23	s7	s7

saved register specifier	xreg	integer ABI	RV32Y ABI
0	x8	s0	s0
1	x9	s1	s1
2	x18	s2	s2
3	x19	s3	s3
4	x20	s4	s4
5	x21	s5	s5
6	x22	s6	s6
7	x23	s7	s7

RISC-V Specification for CHERI Extensions

14.2. Interaction with Root Capabilities

14.3. Interaction with YPERMC and YPERMR

14.3.1. Additional YPERMC permission rules

saved register specifier	xreg	integer ABI	RV32Y ABI
0	x8	s0	s0
1	x9	s1	s1
2	x18	s2	s2
3	x19	s3	s3
4	x20	s4	s4
5	x21	s5	s5
6	x22	s6	s6
7	x23	s7	s7