SPIR-V Specification

This specification is protected by copyright laws and contains material proprietary to the Khronos Group, Inc. It or any components may not be reproduced, republished, distributed, transmitted, displayed, broadcast, or otherwise exploited in any manner without the express prior written permission of Khronos Group. You may use this specification for implementing the functionality therein, without altering or removing any trademark, copyright or other notice from the specification, but the receipt or possession of this specification does not convey any rights to reproduce, disclose, or distribute its contents, or to manufacture, use, or sell anything that it may describe, in whole or in part.

Khronos Group grants express permission to any current Promoter, Contributor or Adopter member of Khronos to copy and redistribute UNMODIFIED versions of this specification in any fashion, provided that NO CHARGE is made for the specification and the latest available update of the specification for any version of the API is used whenever possible. Such distributed specification may be reformatted AS LONG AS the contents of the specification are not changed in any way. The specification may be incorporated into a product that is sold as long as such product includes significant independent work developed by the seller. A link to the current version of this specification on the Khronos Group website should be included whenever possible with specification distributions.

Khronos Group makes no, and expressly disclaims any, representations or warranties, express or implied, regarding this specification, including, without limitation, any implied warranties of merchantability or fitness for a particular purpose or noninfringement of any intellectual property. Khronos Group makes no, and expressly disclaims any, warranties, express or implied, regarding the correctness, accuracy, completeness, timeliness, and reliability of the specification. Under no circumstances will the Khronos Group, or any of its Promoters, Contributors or Members or their respective partners, officers, directors, employees, agents, or representatives be liable for any damages, whether direct, indirect, special or consequential damages for lost revenues, lost profits, or otherwise, arising from or in connection with these materials.

Khronos, SYCL, SPIR, WebGL, EGL, COLLADA, StreamInput, OpenVX, OpenKCam, glTF, OpenKODE, OpenVG, OpenWF, OpenSL ES, OpenMAX, OpenMAX AL, OpenMAX IL and OpenMAX DL are trademarks and WebCL is a certification mark of the Khronos Group Inc. OpenCL is a trademark of Apple Inc. and OpenGL and OpenML are registered trademarks and the OpenGL ES and OpenGL SC logos are trademarks of Silicon Graphics International used under license by Khronos. All other product names, trademarks, and/or company names are used solely for identification and belong to their respective owners.

Contributors and Acknowledgments

Connor Abbott, Intel

Alexey Bader, Intel

Dan Baker, Oxide Games

Kenneth Benzie, Codeplay

Gordon Brown, Codeplay

Pat Brown, NVIDIA

Diana Po-Yu Chen, MediaTek

Stephen Clarke, Imagination

Patrick Doane, Blizzard Entertainment

Stefanus Du Toit, Google

Tim Foley, Intel

Ben Gaster, Qualcomm

Alexander Galazin, ARM

Christopher Gautier, ARM

Neil Henning, Codeplay

Kerch Holt, NVIDIA

Lee Howes, Qualcomm

Roy Ju, MediaTek

Daniel Koch, NVIDIA

Ashwin Kolhe, NVIDIA

Raun Krisch, Intel

Graeme Leese, Broadcom

Yuan Lin, NVIDIA

Yaxun Liu, AMD

Timothy Lottes, Epic Games

John McDonald, Valve

David Neto, Google

Christophe Riccio, Unity

Andrew Richards, Codeplay

Ian Romanick, Intel

Graham Sellers, AMD

Robert Simpson, Qualcomm

Brian Sumner, AMD

Andrew Woloszyn, Google

Weifeng Zhang, Qualcomm

Note: Up-to-date HTML and PDF versions of this specification may be found at the Khronos SPIR-V Registry (https://www.khronos.org/registry/spir-v/).

1. Introduction

Abstract

SPIR-V is a simple binary intermediate language for graphical shaders and compute kernels. A SPIR-V module contains multiple entry points with potentially shared functions in the entry point’s call trees. Each function contains a control-flow graph (CFG) of basic blocks, with optional instructions to express structured control flow. Load/store instructions are used to access declared variables, which includes all input/output (IO). Intermediate results bypassing load/store use static single-assignment (SSA) representation. Data objects are represented logically, with hierarchical type information: There is no flattening of aggregates or assignment to physical register banks, etc. Selectable addressing models establish whether general pointer operations may be used, or if memory access is purely logical.

This document fully defines SPIR-V, a Khronos-standard binary intermediate language for representing graphical-shader stages and compute kernels for multiple Khronos APIs.

1.1. Goals

SPIR-V has the following goals:

Provide a simple binary intermediate language for all functionality appearing in Khronos shaders/kernels.
Have a concise, transparent, self-contained specification (sections Specification and Binary Form).
Map easily to other intermediate languages.
Be the form passed by an API into a driver to set shaders/kernels.
Be targetable by new front ends for novel high-level languages.
Allow the first steps of compilation and reflection to be done offline.
Be low-level enough to require a reverse-engineering step to reconstruct source code.
Improve portability by enabling shared tools to generate or operate on it.
Allow separation of core specification from source-language-specific sets of built-in functions.
Reduce compile time during application run time. (Eliminating most of the compile time during application run time is not a goal of this intermediate language. Target-specific register allocation and scheduling are still expected to take significant time.)
Allow some optimizations to be done offline.

1.2. About this document

This document aims to:

Include everything needed to fully understand, create, and consume SPIR-V. However:
- Imported sets of instructions (which implement source-specific built-in functions) will need their own specification.
- Many validation rules are client-API specific, and hence documented with the client API and not in this specification.
Separate expository and specification language. The specification proper is in Specification and Binary Form.

1.3. Extendability

SPIR-V can be extended by multiple vendors or parties simultaneously:

Using the OpExtension instruction to require new semantics that must be supported. Such new semantics would come from an extension document.
Reserving (registering) ranges of the token values, as described further below.
Aided by instruction skipping, also further described below.

Enumeration Token Values. It is easy to extend all the types, storage classes, opcodes, decorations, etc. by adding to the token values.

Registration. Ranges of token values in the Binary Form section can be pre-allocated to numerous vendors/parties. This allows combining multiple independent extensions without conflict. To register ranges, see https://www.khronos.org/registry/spir-v/api/spir-v.xml.

Extended Instructions. Sets of extended instructions can be provided and specified in separate specifications. These help personalize SPIR-V for different source languages or execution environments (client APIs). Multiple sets of extended instructions can be imported without conflict, as the extended instructions are selected by {set id, instruction number} pairs.

Instruction Skipping. Tools are encouraged to skip opcodes for features they are not required to process. This is trivially enabled by the word count in an instruction, which makes it easier to add new instructions without breaking existing tools.
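As a rough illustration of instruction skipping, the following Python sketch walks an instruction stream, hands known opcodes to a handler, and steps over everything else using only the word count. It relies on the first-word layout given in Physical Layout of a SPIR-V Module and Instruction (word count in the 16 high-order bits, opcode in the 16 low-order bits). The names walk_instructions and handlers, and the opcode value 0x7FFF standing in for an unknown instruction, are assumptions made for this example; the numeric values used for OpCapability, Shader, and Matrix should be checked against the Binary Form section.

```python
def walk_instructions(words, handlers):
    """Call handlers[opcode](operand_words) for known opcodes; skip all others.

    `words` is the instruction stream only (the module header already removed).
    Returns the list of opcodes that were skipped.
    """
    skipped = []
    pos = 0
    while pos < len(words):
        first = words[pos]
        word_count = first >> 16      # 16 high-order bits
        opcode = first & 0xFFFF       # 16 low-order bits
        if word_count == 0 or pos + word_count > len(words):
            raise ValueError(f"malformed instruction at word {pos}")
        handler = handlers.get(opcode)
        if handler is not None:
            handler(words[pos + 1 : pos + word_count])
        else:
            skipped.append(opcode)    # unknown: step over it by word count alone
        pos += word_count
    return skipped


OP_CAPABILITY = 17                    # assumed opcode value for OpCapability
seen = []
stream = [
    (2 << 16) | OP_CAPABILITY, 1,     # OpCapability Shader (assumed enumerant 1)
    (3 << 16) | 0x7FFF, 111, 222,     # made-up opcode this tool does not know
    (2 << 16) | OP_CAPABILITY, 0,     # OpCapability Matrix (assumed enumerant 0)
]
skipped = walk_instructions(stream, {OP_CAPABILITY: seen.append})
print(seen)     # [[1], [0]]
print(skipped)  # [32767]
```

The walker never needs to know the operand layout of the instruction it skips, which is why a tool written before an instruction existed can still step past it.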

1.4. Debuggability

SPIR-V can decorate, with a text string, virtually anything created in the shader: types, variables, functions, etc. This is required for externally visible symbols, and also allowed for naming the result of any instruction. This can be used to aid in understandability when disassembling or debugging lowered versions of SPIR-V.

Location information (file names, lines, and columns) can be interleaved with the instruction stream to track the origin of each instruction.

1.5. Design Principles

Regularity. All instructions start with a word count. This allows walking a SPIR-V module without decoding each opcode. All instructions have an opcode that dictates for all operands what kind of operand they are. For instructions with a variable number of operands, the number of variable operands is known by subtracting the number of non-variable words from the instruction’s word count.
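The subtraction described above can be worked through on one instruction from the example disassembly in this document. The sketch below is only an illustration: the helper name variable_operand_count is hypothetical, and the count of three fixed words for OpConstantComposite (first word, type <id>, Result <id>) is read off the instruction layout rather than taken from an authoritative table.

```python
def variable_operand_count(word_count, fixed_words):
    """Number of variable operand words in an instruction.

    fixed_words counts every non-variable word, including the first word
    (word count + opcode) and any type <id> / Result <id> words.
    """
    if word_count < fixed_words:
        raise ValueError("word count is smaller than the fixed part")
    return word_count - fixed_words


# "%12 = OpConstantComposite %7 %10 %10 %11 %10" occupies 7 words:
# first word, type <id> %7, Result <id> %12, then the constituents.
constituents = variable_operand_count(word_count=7, fixed_words=3)
print(constituents)  # 4
```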

Non Combinatorial. There is no combinatorial type explosion or need for large encode/decode tables for types. Rather, types are parameterized. Image types declare their dimensionality, arrayness, etc. all orthogonally, which greatly simplifies code. This is done similarly for other types. It also applies to opcodes. Operations are orthogonal to scalar/vector size, but not to integer vs. floating-point differences.

Modeless. After a given execution model (e.g., pipeline stage) is specified, internal operation is essentially modeless: Generally, it will follow the rule: "same spelling, same semantics", and does not have mode bits that modify semantics. If a change to SPIR-V modifies semantics, it should use a different spelling. This makes consumers of SPIR-V much more robust. There are execution modes declared, but these are generally to affect the way the module interacts with the environment around it, not the internal semantics. Capabilities are also declared, but this is to declare the subset of functionality that is used, not to change any semantics of what is used.

Declarative. SPIR-V declares externally-visible modes like "writes depth", rather than having rules that require deduction from full shader inspection. It also explicitly declares what addressing modes, execution model, extended instruction sets, etc. will be used. See Language Capabilities for more information.

SSA. All results of intermediate operations are strictly SSA. However, declared variables reside in memory and use load/store for access, and such variables can be stored to multiple times.

IO. Some storage classes are for input/output (IO) and, fundamentally, IO will be done through load/store of variables declared in these storage classes.

1.6. Static Single Assignment (SSA)

SPIR-V includes a phi instruction to allow the merging together of intermediate results from split control flow. This allows split control flow without load/store to memory. SPIR-V is flexible in the degree to which load/store is used; it is possible to use control flow with no phi-instructions, while still staying in SSA form, by using memory load/store.

Some storage classes are for IO and, fundamentally, IO will be done through load/store, and initial load and final store can never be eliminated. Other storage classes are shader local and can have their load/store eliminated. It can be considered an optimization to largely eliminate such loads/stores by moving them into intermediate results in SSA form.

1.7. Built-In Variables

SPIR-V identifies built-in variables from a high-level language with an enumerant decoration. This assigns any unusual semantics to the variable. Built-in variables must otherwise be declared with their correct SPIR-V type and treated the same as any other variable.

1.8. Specialization

Specialization enables creating a portable SPIR-V module outside the target execution environment, based on constant values that won’t be known until inside the execution environment. For example, it can be used to size a fixed array with a constant that is not known when the module is created, but is known when the module is lowered to the target architecture.

See Specialization in the next section for more details.

1.9. Example

The SPIR-V form is binary, not human readable, and fully described in Binary Form. This is an example disassembly to give a basic idea of what SPIR-V looks like:

GLSL fragment shader:

#version 450

in vec4 color1;
in vec4 multiplier;
noperspective in vec4 color2;
out vec4 color;

struct S {
    bool b;
    vec4 v[5];
    int i;
};

uniform blockName {
    S s;
    bool cond;
};

void main()
{
    vec4 scale = vec4(1.0, 1.0, 2.0, 1.0);

    if (cond)
        color = color1 + s.v[2];
    else
        color = sqrt(color2) * scale;

    for (int i = 0; i < 4; ++i)
        color *= multiplier;
}

Corresponding SPIR-V:

; Magic:     0x07230203 (SPIR-V)
; Version:   0x00010000 (Version: 1.0.0)
; Generator: 0x00080001 (Khronos Glslang Reference Front End; 1)
; Bound:     63
; Schema:    0

               OpCapability Shader
          %1 = OpExtInstImport "GLSL.std.450"
               OpMemoryModel Logical GLSL450
               OpEntryPoint Fragment %4 "main" %31 %33 %42 %57
               OpExecutionMode %4 OriginLowerLeft

; Debug information
               OpSource GLSL 450
               OpName %4 "main"
               OpName %9 "scale"
               OpName %17 "S"
               OpMemberName %17 0 "b"
               OpMemberName %17 1 "v"
               OpMemberName %17 2 "i"
               OpName %18 "blockName"
               OpMemberName %18 0 "s"
               OpMemberName %18 1 "cond"
               OpName %20 ""
               OpName %31 "color"
               OpName %33 "color1"
               OpName %42 "color2"
               OpName %48 "i"
               OpName %57 "multiplier"

; Annotations (non-debug)
               OpDecorate %15 ArrayStride 16
               OpMemberDecorate %17 0 Offset 0
               OpMemberDecorate %17 1 Offset 16
               OpMemberDecorate %17 2 Offset 96
               OpMemberDecorate %18 0 Offset 0
               OpMemberDecorate %18 1 Offset 112
               OpDecorate %18 Block
               OpDecorate %20 DescriptorSet 0
               OpDecorate %42 NoPerspective

; All types, variables, and constants
          %2 = OpTypeVoid
          %3 = OpTypeFunction %2                      ; void ()
          %6 = OpTypeFloat 32                         ; 32-bit float
          %7 = OpTypeVector %6 4                      ; vec4
          %8 = OpTypePointer Function %7              ; function-local vec4*
         %10 = OpConstant %6 1
         %11 = OpConstant %6 2
         %12 = OpConstantComposite %7 %10 %10 %11 %10 ; vec4(1.0, 1.0, 2.0, 1.0)
         %13 = OpTypeInt 32 0                         ; 32-bit int, sign-less
         %14 = OpConstant %13 5
         %15 = OpTypeArray %7 %14
         %16 = OpTypeInt 32 1
         %17 = OpTypeStruct %13 %15 %16
         %18 = OpTypeStruct %17 %13
         %19 = OpTypePointer Uniform %18
         %20 = OpVariable %19 Uniform
         %21 = OpConstant %16 1
         %22 = OpTypePointer Uniform %13
         %25 = OpTypeBool
         %26 = OpConstant %13 0
         %30 = OpTypePointer Output %7
         %31 = OpVariable %30 Output
         %32 = OpTypePointer Input %7
         %33 = OpVariable %32 Input
         %35 = OpConstant %16 0
         %36 = OpConstant %16 2
         %37 = OpTypePointer Uniform %7
         %42 = OpVariable %32 Input
         %47 = OpTypePointer Function %16
         %55 = OpConstant %16 4
         %57 = OpVariable %32 Input

; All functions
          %4 = OpFunction %2 None %3                  ; main()
          %5 = OpLabel
          %9 = OpVariable %8 Function
         %48 = OpVariable %47 Function
               OpStore %9 %12
         %23 = OpAccessChain %22 %20 %21              ; location of cond
         %24 = OpLoad %13 %23                         ; load 32-bit int from cond
         %27 = OpINotEqual %25 %24 %26                ; convert to bool
               OpSelectionMerge %29 None              ; structured if
               OpBranchConditional %27 %28 %41        ; if cond
         %28 = OpLabel                                ; then
         %34 = OpLoad %7 %33
         %38 = OpAccessChain %37 %20 %35 %21 %36      ; s.v[2]
         %39 = OpLoad %7 %38
         %40 = OpFAdd %7 %34 %39
               OpStore %31 %40
               OpBranch %29
         %41 = OpLabel                                ; else
         %43 = OpLoad %7 %42
         %44 = OpExtInst %7 %1 Sqrt %43               ; extended instruction sqrt
         %45 = OpLoad %7 %9
         %46 = OpFMul %7 %44 %45
               OpStore %31 %46
               OpBranch %29
         %29 = OpLabel                                ; endif
               OpStore %48 %35
               OpBranch %49
         %49 = OpLabel
               OpLoopMerge %51 %52 None               ; structured loop
               OpBranch %53
         %53 = OpLabel
         %54 = OpLoad %16 %48
         %56 = OpSLessThan %25 %54 %55                ; i < 4 ?
               OpBranchConditional %56 %50 %51        ; body or break
         %50 = OpLabel                                ; body
         %58 = OpLoad %7 %57
         %59 = OpLoad %7 %31
         %60 = OpFMul %7 %59 %58
               OpStore %31 %60
               OpBranch %52
         %52 = OpLabel                                ; continue target
         %61 = OpLoad %16 %48
         %62 = OpIAdd %16 %61 %21                     ; ++i
               OpStore %48 %62
               OpBranch %49                           ; loop back
         %51 = OpLabel                                ; loop merge point
               OpReturn
               OpFunctionEnd

2. Specification

2.1. Language Capabilities

A SPIR-V module is consumed by an execution environment, specified by a client API, that needs to support the features used by that SPIR-V module. Features are classified through capabilities. Capabilities used by a particular SPIR-V module must be declared early in that module with the OpCapability instruction. Then:

A validator can validate that the module uses only its declared capabilities.
An execution environment is allowed to reject modules declaring capabilities it does not support. (See client API specifications for environment-specific rules.)

All available capabilities and their dependencies form a capability hierarchy, fully listed in the capability section. Only top-level capabilities need to be explicitly declared; their dependencies are implicitly declared.

When an instruction, enumerant, or other feature specifies multiple enabling capabilities, only one such capability needs to be declared to use the feature. This declaration does not itself imply anything about the presence of the other enabling capabilities: The execution environment needs to support only the declared capability.
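The two rules above (implicit declaration of dependencies, and "any one enabling capability is enough") can be sketched as follows. This is a minimal illustration, not a validator: the IMPLICIT table is a small assumed excerpt of the hierarchy, the function names are hypothetical, and the capability section remains the authoritative listing.

```python
# Assumed excerpt of the capability hierarchy:
# capability -> capabilities it implicitly declares.
IMPLICIT = {
    "Matrix": [],
    "Shader": ["Matrix"],
    "Geometry": ["Shader"],
    "Tessellation": ["Shader"],
}


def effective_capabilities(declared):
    """Expand explicitly declared capabilities with their implicit dependencies."""
    result = set()
    pending = list(declared)
    while pending:
        cap = pending.pop()
        if cap not in result:
            result.add(cap)
            pending.extend(IMPLICIT.get(cap, []))
    return result


def feature_enabled(enabling, declared):
    """A feature is usable when at least one of its enabling capabilities is declared."""
    return bool(set(enabling) & effective_capabilities(declared))


caps = effective_capabilities(["Geometry"])
print(sorted(caps))                                            # ['Geometry', 'Matrix', 'Shader']
print(feature_enabled(["Geometry", "Tessellation"], ["Geometry"]))  # True
print(feature_enabled(["Tessellation"], ["Geometry"]))         # False
```

Note that declaring Geometry enables a feature listing Geometry or Tessellation, yet it does not make Tessellation itself part of the declared set.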

This (SPIR-V) specification provides capability-specific validation rules, in the validation section. To ensure portability, each client API needs to include the following:

Which capabilities in the capability section it requires environments to support, and hence allows in SPIR-V modules.
Required limits, if they are beyond the Universal Limits.
Any validation requirements specific to the environment that are not tied to specific capabilities, and hence not covered in the SPIR-V specification.

2.2. Terms

2.2.1. Instructions

Word: 32 bits.

<id>: A numerical name; the name used to refer to an object, a type, a function, a label, etc. An <id> always consumes one word. The <id>s defined by a module obey SSA.

Result <id>: Most instructions define a result, named by an <id> explicitly provided in the instruction. The Result <id> is used as an operand in other instructions to refer to the instruction that defined it.

Literal String: A nul-terminated stream of characters consuming an integral number of words. The character set is Unicode in the UTF-8 encoding scheme. The UTF-8 octets (8-bit bytes) are packed four per word, following the little-endian convention (i.e., the first octet is in the lowest-order 8 bits of the word). The final word contains the string’s nul-termination character (0), and all contents past the end of the string in the final word are padded with 0.
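A small Python sketch of this packing may help; it assumes only what the definition above states (UTF-8 octets, four per word, first octet in the lowest-order 8 bits, nul terminator, zero padding). The function names are hypothetical.

```python
import struct


def encode_literal_string(text):
    """Pack a string into 32-bit words as described for Literal String."""
    data = text.encode("utf-8") + b"\x00"   # nul terminator is always present
    data += b"\x00" * (-len(data) % 4)      # pad the final word with zeros
    # "<I" places the first octet in the lowest-order 8 bits of each word.
    return list(struct.unpack(f"<{len(data) // 4}I", data))


def decode_literal_string(words):
    """Recover the string from its words, stopping at the first nul octet."""
    data = struct.pack(f"<{len(words)}I", *words)
    return data.split(b"\x00", 1)[0].decode("utf-8")


main_words = encode_literal_string("main")
print([hex(w) for w in main_words])                 # ['0x6e69616d', '0x0']
print(len(encode_literal_string("GLSL.std.450")))   # 4
print(decode_literal_string(main_words))            # main
```

A four-character name such as "main" therefore takes two words, because the terminator does not fit in the first one.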

Literal Number: A numeric value consuming one or more words. An instruction will determine what type a literal will be interpreted as. When the type’s bit width is larger than one word, the literal’s low-order words appear first. When the type’s bit width is less than 32 bits, the literal’s value appears in the low-order bits of the word, and the high-order bits must be 0 for a floating-point type or for an integer type with Signedness of 0, and must be sign extended for an integer type with Signedness of 1. (Similarly for the remaining bits of widths larger than 32 bits but not a multiple of 32 bits.)
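For integer literals, the word order and the fill of unused high-order bits can be sketched as below. This is an illustration under the rules just stated and covers integers only (floating-point literals are not handled); the function name and its parameters are hypothetical.

```python
def encode_int_literal(value, width, signed):
    """Encode an integer literal of `width` bits as 32-bit words, low-order word first."""
    lo = -(1 << (width - 1)) if signed else 0
    hi = (1 << (width - 1)) - 1 if signed else (1 << width) - 1
    if not lo <= value <= hi:
        raise ValueError("value does not fit the type")
    n_words = (width + 31) // 32
    # Python's & treats negative ints as infinite two's complement, so masking
    # to n_words * 32 bits sign-extends into any unused high-order bits.
    raw = value & ((1 << (32 * n_words)) - 1)
    return [(raw >> (32 * i)) & 0xFFFFFFFF for i in range(n_words)]


print([hex(w) for w in encode_int_literal(-1, 16, True)])           # ['0xffffffff']
print([hex(w) for w in encode_int_literal(0xFFFF, 16, False)])      # ['0xffff']
print(encode_int_literal(0x100000002, 64, False))                   # [2, 1]
```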

Literal: A Literal String or a Literal Number.

Operand: A one-word argument to an instruction. E.g., it could be an <id>, or a (part of a) literal. Which form it holds is always explicitly known from the opcode.

Immediate: Operand(s) directly holding a literal value rather than an <id>. Immediate values larger than one word will consume multiple operands, one per word. That is, operand counting is always done per word, not per immediate.

WordCount: The complete number of words taken by an instruction, including the word holding the word count and opcode, and any optional operands. An instruction’s word count is the total space taken by the instruction.

Instruction: After a header, a module is simply a linear list of instructions. An instruction contains a word count, an opcode, an optional Result <id>, an optional <id> of the instruction’s type, and a variable list of operands. All instruction opcodes and semantics are listed in Instructions.

Decoration: Auxiliary information such as built-in variable, stream numbers, invariance, interpolation type, relaxed precision, etc., added to <id>s or structure-type members through Decorations. Decorations are enumerated in Decoration in the Binary Form section.

Object: An instantiation of a non-void type, either as the Result <id> of an operation, or created through OpVariable.

Memory Object: An object created through OpVariable. Such an object can die on function exit, if it was a function variable, or exist for the duration of an entry point.

Intermediate Object or Intermediate Value or Intermediate Result: An object created by an operation (not memory allocated by OpVariable) and dying on its last consumption.

Constant Instruction: Either a specialization-constant instruction or a fixed constant instruction: Instructions that start "OpConstant" or "OpSpec".

[a, b]: This square-bracket notation means the range from a to b, inclusive of a and b. Parentheses exclude their end points, so, for example, (a, b] means a to b excluding a but including b.

2.2.2. Types

Boolean type: The type returned by OpTypeBool.

Integer type: Any width signed or unsigned type from OpTypeInt. By convention, the lowest-order bit will be referred to as bit-number 0, and the highest-order bit as bit-number Width - 1.

Floating-point type: Any width type from OpTypeFloat.

Numerical type: An integer type or a floating-point type.

Scalar: A single instance of a numerical type or Boolean type. Scalars will also be called components when being discussed either by themselves or in the context of the contents of a vector.

Vector: An ordered homogeneous collection of two or more scalars. Vector sizes are quite restrictive and dependent on the execution model.

Matrix: An ordered homogeneous collection of vectors. When vectors are part of a matrix, they will also be called columns. Matrix sizes are quite restrictive and dependent on the execution model.

Array: An ordered homogeneous collection of any non-void-type objects. When an object is part of an array, it will also be called an element. Array sizes are generally not restricted.

Structure: An ordered heterogeneous collection of any non-void types. When an object is part of a structure, it will also be called a member.

Aggregate: A structure or an array.

Composite: An aggregate, a matrix, or a vector.

Image: A traditional texture or image; SPIR-V has this single name for these. An image type is declared with OpTypeImage. An image does not include any information about how to access, filter, or sample it.

Sampler: Settings that describe how to access, filter, or sample an image. Can come either from literal declarations of settings or be an opaque reference to externally bound settings. A sampler does not include an image.

Sampled Image: An image combined with a sampler, enabling filtered accesses of the image’s contents.

Concrete Type: A numerical scalar, vector, or matrix type, or OpTypePointer when using a Physical addressing model, or any aggregate containing only these types.

Abstract Type: An OpTypeVoid or OpTypeBool, or OpTypePointer when using the Logical addressing model, or any aggregate type containing any of these.

Opaque Type: A type that is, or contains, or points to, or contains pointers to, any of the following types:

OpTypeImage
OpTypeSampler
OpTypeSampledImage
OpTypeOpaque
OpTypeEvent
OpTypeDeviceEvent
OpTypeReserveId
OpTypeQueue
OpTypePipe
OpTypeForwardPointer

2.2.3. Module

Module: A single unit of SPIR-V. It can contain multiple entry points, but only one set of capabilities.

Entry Point: A function in a module where execution begins. A single entry point is limited to a single execution model. An entry point is declared using OpEntryPoint.

Execution Model: A graphical-pipeline stage or OpenCL kernel. These are enumerated in Execution Model.

Execution Mode: Modes of operation relating to the interface or execution environment of the module. These are enumerated in Execution Mode. Generally, modes do not change the semantics of instructions within a SPIR-V module.

Vertex Processor: Any stage or execution model that processes vertices: Vertex, tessellation control, tessellation evaluation, and geometry. Explicitly excludes fragment and compute execution models.

2.2.4. Control Flow

Block: A contiguous sequence of instructions starting with an OpLabel, ending with a termination instruction. A block has no additional label or termination instructions.

Branch Instruction: One of the following, used as a termination instruction:

OpBranch
OpBranchConditional
OpSwitch
OpReturn
OpReturnValue

Termination Instruction: One of the following, used to terminate blocks:

any branch instruction
OpKill
OpUnreachable

Dominate: A block A dominates a block B, where A and B are in the same function, if every path from the function’s entry point to block B includes block A. A strictly dominates B only if A dominates B and A and B are different blocks.
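The definition of dominance can be checked directly on a small CFG. The sketch below is deliberately naive and follows the wording of the definition rather than an efficient algorithm: A dominates B when removing A makes B unreachable from the entry block. The function names are hypothetical, and the cfg dictionary is transcribed by hand from the branches of main() in the example disassembly.

```python
def reachable(cfg, entry, removed=None):
    """Blocks reachable from `entry` without passing through `removed`."""
    if entry == removed:
        return set()
    seen = {entry}
    stack = [entry]
    while stack:
        for succ in cfg[stack.pop()]:
            if succ != removed and succ not in seen:
                seen.add(succ)
                stack.append(succ)
    return seen


def dominators(cfg, entry):
    """Map each reachable block B to the set of blocks that dominate B."""
    all_blocks = reachable(cfg, entry)
    dom = {b: {b} for b in all_blocks}      # every block dominates itself
    for a in all_blocks:
        # B is dominated by A when removing A cuts every path from entry to B.
        for b in all_blocks - reachable(cfg, entry, removed=a):
            dom[b].add(a)
    return dom


# Block label -> branch targets, from main() in the example disassembly.
cfg = {5: [28, 41], 28: [29], 41: [29], 29: [49], 49: [53],
       53: [50, 51], 50: [52], 52: [49], 51: []}
dom = dominators(cfg, entry=5)
print(sorted(dom[29]))  # [5, 29]
print(sorted(dom[51]))  # [5, 29, 49, 51, 53]
```

Neither the "then" block %28 nor the "else" block %41 dominates the merge block %29, since each can be bypassed through the other. The blocks that strictly dominate B are dom[B] with B itself removed.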

Post Dominate: A block B post dominates a block A, where A and B are in the same function, if every path from A to a function-return instruction goes through block B.

Control-Flow Graph: The graph formed by a function’s blocks and branches. The blocks are the graph’s nodes, and the branches the graph’s edges.

CFG: Control-flow graph.

Back Edge: If a depth-first traversal is done on a function’s CFG, starting from the first block of the function, a back edge is a branch to a previously visited block. A back-edge block is the block containing such a branch.
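A depth-first search for back edges might look like the sketch below. It makes one interpretive assumption that should be stated plainly: "previously visited block" is read here as a block that is still on the current depth-first path (the conventional meaning of a back edge), so the second branch into an if/else merge block is not reported. The function name and the cfg dictionary, transcribed from main() in the example disassembly, are illustrative.

```python
def back_edges(cfg, entry):
    """Return (source, target) branches whose target is on the current DFS path."""
    state = {}   # block -> "active" while on the DFS path, "done" afterwards
    found = []

    def visit(block):
        state[block] = "active"
        for succ in cfg[block]:
            if succ not in state:
                visit(succ)
            elif state[succ] == "active":
                found.append((block, succ))   # branch back into the current path
        state[block] = "done"

    visit(entry)
    return found


# Block label -> branch targets, from main() in the example disassembly.
cfg = {5: [28, 41], 28: [29], 41: [29], 29: [49], 49: [53],
       53: [50, 51], 50: [52], 52: [49], 51: []}
edges = back_edges(cfg, entry=5)
print(edges)  # [(52, 49)]
```

Under this reading, the only back edge is the branch from the continue target %52 to the loop header %49, making %52 the back-edge block.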

Merge Instruction: One of the following, used before a branch instruction to declare structured control flow:

OpSelectionMerge
OpLoopMerge

Header Block: A block containing a merge instruction.

Loop Header: A header block whose merge instruction is an OpLoopMerge.

Merge Block: A block declared by the Merge Block operand of a merge instruction.

Break Block: A block containing a branch to the Merge Block of a loop header’s merge instruction.

Continue Block: A block containing a branch to an OpLoopMerge instruction’s Continue Target.

Return Block: A block containing an OpReturn or OpReturnValue branch.

Invocation: A single execution of an entry point in a SPIR-V module, operating only on the amount of data explicitly exposed by the semantics of the instructions. (Any implicit operation on additional instances of data would comprise additional invocations.) For example, in compute execution models, a single invocation operates only on a single work item, or, in a vertex execution model, a single invocation operates only on a single vertex.

Subgroup: The set of invocations exposed as running concurrently with the current invocation. In compute models, the current workgroup is a superset of the subgroup.

Invocation Group: The complete set of invocations collectively processing a particular compute workgroup or graphical operation, where the scope of a "graphical operation" is implementation dependent, but at least as large as a single point, line, triangle, or patch, and at most as large as a single rendering command, as defined by the client API.

Derivative Group: Defined only for the Fragment Execution Model: The set of invocations collectively processing a single point, line, or triangle, including any helper invocations.

Dynamic Instance: Within a single invocation, a single static instruction can be executed multiple times, giving multiple dynamic instances of that instruction. This can happen when the instruction is executed in a loop, or in a function called from multiple call sites, or combinations of multiple of these. Different loop iterations and different dynamic function-call-site chains yield different dynamic instances of such an instruction. Dynamic instances are distinguished by the control-flow path within an invocation, not by which invocation executed it. That is, different invocations of an entry point execute the same dynamic instances of an instruction when they follow the same control-flow path, starting from that entry point.

Dynamically Uniform: An <id> is dynamically uniform for a dynamic instance consuming it when its value is the same for all invocations (in the invocation group) that execute that dynamic instance.

Uniform Control Flow: Uniform control flow (or converged control flow) occurs when all invocations in the invocation group or derivative group execute the same control-flow path (and hence the same sequence of dynamic instances of instructions). Uniform control flow is the initial state at the entry point, and lasts until a conditional branch takes different control paths for different invocations (non-uniform or divergent control flow). Such divergence can reconverge, with all the invocations once again executing the same control-flow path, and this re-establishes the existence of uniform control flow. If control flow is uniform upon entry into a header block, and all invocations leave that dynamic instance of the header block’s control-flow construct via the header block’s declared merge block, then control flow reconverges to be uniform at that merge block.

2.3. Physical Layout of a SPIR-V Module and Instruction

A SPIR-V module is a single linear stream of words. The first words are shown in the following table:

Table 1. First Words of Physical Layout
Word Number	Contents
0	Magic Number.
1	Version number. The bytes are, high-order to low-order: 0 | Major Number | Minor Number | 0. Hence, version 1.00 is the value 0x00010000.
2	Generator’s magic number. It is associated with the tool that generated the module. Its value does not affect any semantics, and is allowed to be 0. Using a non-0 value is encouraged, and can be registered with Khronos at https://www.khronos.org/registry/spir-v/api/spir-v.xml.
3	Bound; where all <id>s in this module are guaranteed to satisfy 0 < id < Bound. Bound should be small (smaller is better), with all <id>s in a module being densely packed and near 0.
4	0 (Reserved for instruction schema, if needed.)
5	First word of instruction stream, see below.

All remaining words are a linear sequence of instructions.
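As an informal illustration of Table 1, the following Python sketch splits a module, given as a list of 32-bit words, into its header fields and instruction stream. The function name parse_header and the returned dictionary keys are assumptions of this example; the magic number and the sample header values are those shown in the example disassembly.

```python
SPIRV_MAGIC = 0x07230203  # magic number shown in the example disassembly


def parse_header(words):
    """Split a module (list of 32-bit words) into header fields and instructions."""
    if len(words) < 5:
        raise ValueError("module is shorter than the 5-word header")
    magic, version, generator, bound, schema = words[:5]
    if magic != SPIRV_MAGIC:
        raise ValueError(f"unexpected magic number 0x{magic:08x}")
    return {
        # Version bytes, high-order to low-order: 0 | Major | Minor | 0
        "major": (version >> 16) & 0xFF,
        "minor": (version >> 8) & 0xFF,
        "generator": generator,
        "bound": bound,            # every <id> satisfies 0 < id < bound
        "schema": schema,
        "instructions": words[5:],
    }


header = parse_header([0x07230203, 0x00010000, 0x00080001, 63, 0])
print(header["major"], header["minor"], header["bound"])  # 1 0 63
```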

Each instruction is a stream of words:

Table 2. Instruction Physical Layout
Instruction Word Number	Contents
0	Opcode: The 16 high-order bits are the WordCount of the instruction. The 16 low-order bits are the opcode enumerant.
1	Optional instruction type <id> (presence determined by opcode).
.	Optional instruction Result <id> (presence determined by opcode).
.	Operand 1 (if needed)
.	Operand 2 (if needed)
…	…
WordCount - 1	Operand N (N is determined by WordCount minus the 1 to 3 words used for the opcode, instruction type <id>, and instruction Result <id>).

Instructions are variable length due both to having optional instruction type <id> and Result <id> words as well as a variable number of operands. The details for each specific instruction are given in the Binary Form section.
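The layout in Table 2 makes it possible to step through a module without knowing every opcode, because each instruction carries its own WordCount. The sketch below assumes the module is already available as a list of 32-bit integers; the name iter_instructions is hypothetical, and the opcode numbers in the example (17 for OpCapability, 14 for OpMemoryModel) are assumed from the Binary Form section.

```python
def iter_instructions(words):
    """Yield (opcode, operand_words) for each instruction after the 5-word header."""
    i = 5  # words 0..4 are the module header
    while i < len(words):
        word_count = words[i] >> 16   # 16 high-order bits
        opcode = words[i] & 0xFFFF    # 16 low-order bits
        if word_count == 0 or i + word_count > len(words):
            raise ValueError(f"bad WordCount at word {i}")
        # Everything after the first word: optional type <id>, Result <id>, operands.
        yield opcode, words[i + 1 : i + word_count]
        i += word_count

# Header followed by two instructions (opcode values assumed, see above).
example_words = [
    0x07230203, 0x00010000, 0, 1, 0,
    (2 << 16) | 17, 1,       # OpCapability with one operand
    (3 << 16) | 14, 0, 1,    # OpMemoryModel with two operands
]
decoded = list(iter_instructions(example_words))
print(decoded)
```

Whether the first words after the opcode word are a type <id> and a Result <id> depends on the opcode, so this sketch returns them undifferentiated.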

2.4. Logical Layout of a Module

The instructions of a SPIR-V module must be in the following order. For sections earlier than function definitions, it is invalid to use instructions other than those indicated.

All OpCapability instructions.
Optional OpExtension instructions (extensions to SPIR-V).
Optional OpExtInstImport instructions.
The single required OpMemoryModel instruction.
All entry point declarations, using OpEntryPoint.
All execution mode declarations, using OpExecutionMode.
These debug instructions, which must be in the following order:
1. all OpString, OpSourceExtension, OpSource, and OpSourceContinued, without forward references.
2. all OpName and all OpMemberName
All annotation instructions:
1. all decoration instructions (OpDecorate, OpMemberDecorate, OpGroupDecorate, OpGroupMemberDecorate, and OpDecorationGroup).
All type declarations (OpTypeXXX instructions), all constant instructions, and all global variable declarations (all OpVariable instructions whose Storage Class is not Function). This is the preferred location for OpUndef instructions, though they can also appear in function bodies. All operands in all these instructions must be declared before being used. Otherwise, they can be in any order. This section is the first section to allow use of OpLine debug information.
All function declarations ("declarations" are functions without a body; there is no forward declaration to a function with a body). A function declaration is as follows.
1. Function declaration, using OpFunction.
2. Function parameter declarations, using OpFunctionParameter.
3. Function end, using OpFunctionEnd.
All function definitions (functions with a body). A function definition is as follows.
1. Function definition, using OpFunction.
2. Function parameter declarations, using OpFunctionParameter.
3. Block
4. Block
5. …
6. Function end, using OpFunctionEnd.
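The section ordering above can be checked in a single pass over the instruction names. The following is only a rough sketch under simplifying assumptions: instructions are given as opcode-name strings, checking stops at the first OpFunction, and OpLine is treated as belonging to the types/constants/globals section (the first section that allows it). The names SECTION_OF, section_of, and first_out_of_order are hypothetical.

```python
SECTION_OF = {
    "OpCapability": 0, "OpExtension": 1, "OpExtInstImport": 2, "OpMemoryModel": 3,
    "OpEntryPoint": 4, "OpExecutionMode": 5,
    "OpString": 6, "OpSourceExtension": 6, "OpSource": 6, "OpSourceContinued": 6,
    "OpName": 7, "OpMemberName": 7,
    "OpDecorate": 8, "OpMemberDecorate": 8, "OpGroupDecorate": 8,
    "OpGroupMemberDecorate": 8, "OpDecorationGroup": 8,
    "OpVariable": 9, "OpUndef": 9, "OpLine": 9,
}

def section_of(name):
    """Return the section index for an opcode name, or None if it fits no section."""
    if name in SECTION_OF:
        return SECTION_OF[name]
    if name.startswith(("OpType", "OpConstant", "OpSpecConstant")):
        return 9
    return None

def first_out_of_order(names):
    """Return the index of the first misplaced instruction, or None if the order is fine."""
    current = 0
    for i, name in enumerate(names):
        if name == "OpFunction":
            return None  # function sections are not checked in this sketch
        section = section_of(name)
        if section is None or section < current:
            return i
        current = section
    return None

good = ["OpCapability", "OpMemoryModel", "OpEntryPoint", "OpName",
        "OpDecorate", "OpTypeVoid", "OpFunction"]
bad = ["OpCapability", "OpMemoryModel", "OpTypeInt", "OpDecorate"]
print(first_out_of_order(good), first_out_of_order(bad))
```

The sketch does not verify that exactly one OpMemoryModel is present, nor the rules inside function bodies.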

Within a function definition:

A block always starts with an OpLabel instruction. This may be immediately preceded by an OpLine instruction, but the OpLabel is considered as the beginning of the block.
A block always ends with a termination instruction (see validation rules for more detail).
All OpVariable instructions in a function must have a Storage Class of Function.
All OpVariable instructions in a function must be in the first block in the function. These instructions, together with any immediately preceding OpLine instructions, must be the first instructions in that block. (Note the validation rules prevent OpPhi instructions in the first block of a function.)
A function definition (starts with OpFunction) can be immediately preceded by an OpLine instruction.

Forward references (an operand <id> that appears before the Result <id> defining it) are allowed for:

Operands that are an OpFunction. This allows for recursion and early declaration of entry points.
Annotation-instruction operands. This is required to fully know everything about a type or variable once it is declared.
Labels.
Loops can have forward references to a phi function.
An OpTypeForwardPointer has a forward reference to an OpTypePointer.
An OpTypeStruct operand that’s a forward reference to the Pointer Type operand to an OpTypeForwardPointer.
The list of <id> provided in the OpEntryPoint instruction.

In all cases, there is enough type information to enable a single simple pass through a module to transform it. For example, function calls have all the type information in the call, phi-functions don’t change type, and labels don’t have type. The pointer forward reference allows structures to contain pointers to themselves or to be mutually recursive (through pointers), without needing additional type information.

The Validation Rules section lists additional rules that must be satisfied.

2.5. Instructions

Most instructions create a Result <id>, as provided in the Result <id> field of the instruction. These Result <id>s are then referred to by other instructions through their <id> operands. All instruction operands are specified in the Binary Form section.

Instructions are explicit about whether they require immediates, rather than an <id> referring to some other result. This is strictly known just from the opcode.

An immediate 32-bit (or smaller) integer is always one operand directly holding a 32-bit two’s-complement value.
An immediate 32-bit float is always one operand, directly holding a 32-bit IEEE 754 floating-point representation.
An immediate 64-bit float is always two operands, directly holding a 64-bit IEEE 754 representation. The low-order 32 bits appear in the first operand.
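The three immediate encodings above can be sketched as follows. The helper names are hypothetical; each returns the list of 32-bit operand words that would appear in the instruction.

```python
import struct

def encode_i32(value):
    """One operand holding a 32-bit two's-complement value."""
    return [value & 0xFFFFFFFF]

def encode_f32(value):
    """One operand holding the 32-bit IEEE 754 bit pattern."""
    return [struct.unpack("<I", struct.pack("<f", value))[0]]

def encode_f64(value):
    """Two operands; the low-order 32 bits come first."""
    bits = struct.unpack("<Q", struct.pack("<d", value))[0]
    return [bits & 0xFFFFFFFF, bits >> 32]

print([hex(w) for w in encode_i32(-1)])
print([hex(w) for w in encode_f32(1.0)])
print([hex(w) for w in encode_f64(1.0)])
```

For 1.0 as a 64-bit float, the bit pattern 0x3FF0000000000000 is split so that the first operand is 0x00000000 and the second is 0x3FF00000.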

2.5.1. SSA Form

A module is always in static single assignment (SSA) form. That is, there is always exactly one instruction resulting in any particular Result <id>. Storing into variables declared in memory is not subject to this; such stores do not create Result <id>s. Accessing declared variables is done through:

OpVariable to allocate an object in memory and create a Result <id> that is the name of a pointer to it.
OpAccessChain or OpInBoundsAccessChain to create a pointer to a subpart of a composite object in memory.
OpLoad through a pointer, giving the loaded object a Result <id> that can then be used as an operand in other instructions.
OpStore through a pointer, to write a value. There is no Result <id> for an OpStore.

OpLoad and OpStore instructions can often be eliminated, using intermediate results instead. When this happens in multiple control-flow paths, these values need to be merged again at the path’s merge point. Use OpPhi to merge such values together.

2.6. Entry Point and Execution Model

The OpEntryPoint instruction identifies an entry point with two key things: an execution model and a function definition. Execution models include Vertex, GLCompute, etc. (one for each graphical stage), as well as Kernel for OpenCL kernels. For the complete list, see Execution Model. An OpEntryPoint also supplies a name that can be used externally to identify the entry point, and a declaration of all the Input and Output variables that form its input/output interface.

The static function call graphs rooted at two entry points are allowed to overlap, so that function definitions and global variable definitions can be shared. The execution model and any execution modes associated with an entry point apply to the entire static function call graph rooted at that entry point. This rule implies that a function appearing in both call graphs of two distinct entry points may behave differently in each case. Similarly, variables whose semantics depend on properties of an entry point, e.g. those using the Input Storage Class, may behave differently when used in call graphs rooted in two different entry points.

2.7. Execution Modes

OpExecutionMode instructions declare information such as the following:

number of invocations (Invocations)
vertex-order CCW (VertexOrderCcw)
triangle strip generation (OutputTriangleStrip)
number of output vertices (OutputVertices)
etc.

For a complete list, see Execution Mode.

2.8. Types and Variables

Types are built up hierarchically, using OpTypeXXX instructions. The Result <id> of an OpTypeXXX instruction becomes a type <id> for future use where type <id>s are needed (therefore, OpTypeXXX instructions do not have a type <id>, like most other instructions do).

The "leaves" to start building with are types like OpTypeFloat, OpTypeInt, OpTypeImage, OpTypeEvent, etc. Other types are built up from the Result <id> of these. The numerical types are parameterized to specify bit width and signed vs. unsigned.

Higher-level types are then constructed using opcodes like OpTypeVector, OpTypeMatrix, OpTypeImage, OpTypeArray, OpTypeRuntimeArray, OpTypeStruct, and OpTypePointer. These are parameterized by number of components, array size, member lists, etc. The image types are parameterized by the return type, dimensionality, arrayness, etc. To do sampling or filtering operations, a type from OpTypeSampledImage is used that contains both an image and a sampler. Such a sampled image can be set directly by the API, or combined in a SPIR-V module from an independent image and an independent sampler.

Types are built bottom up: A parameterizing operand in a type must be defined before being used.

Some additional information about the type of an <id> can be provided using the decoration instructions (OpDecorate, OpMemberDecorate, OpGroupDecorate, OpGroupMemberDecorate, and OpDecorationGroup). These can add, for example, Invariant to an <id> created by another instruction. See the full list of Decorations in the Binary Form section.

Two different type <id>s form, by definition, two different types. It is valid to declare multiple aggregate type <id>s having the same opcode and operands. This is to allow multiple instances of aggregate types with the same structure to be decorated differently. (Different decorations are not required; two different aggregate type <id>s are allowed to have identical declarations and decorations, and will still be two different types.) Non-aggregate types are different: It is invalid to declare multiple type <id>s for the same scalar, vector, or matrix type. That is, non-aggregate type declarations must all have different opcodes or operands. (Note that non-aggregate types cannot be decorated in ways that affect their type.)

Variables are declared to be of an already built type, and placed in a Storage Class. Storage classes include UniformConstant, Input, Workgroup, etc. and are fully specified in Storage Class. Variables declared with the Function Storage Class can have their lifetimes specified within their function using the OpLifetimeStart and OpLifetimeStop instructions.

Intermediate results are typed by the instruction’s type <id>, which must validate with respect to the operation being done.

Built-in variables needing special driver handling (having unique semantics) are declared using OpDecorate or OpMemberDecorate with the BuiltIn Decoration, followed by a BuiltIn enumerant. This decoration is applied to a variable or a structure-type member.

2.9. Function Calling

To call a function defined in the current module or a function declared to be imported from another module, use OpFunctionCall with an operand that is the <id> of the OpFunction to call, and the <id>s of the arguments to pass. All arguments are passed by value into the called function. This includes pointers, through which a callee object could be modified.

2.10. Extended Instruction Sets

Many operations and/or built-in function calls from high-level languages are represented through extended instruction sets. Extended instruction sets will include things like

trigonometric functions: sin(), cos(), …
exponentiation functions: exp(), pow(), …
geometry functions: reflect(), smoothstep(), …
functions having rich performance/accuracy trade-offs
etc.

Non-extended instructions, those that are core SPIR-V instructions, are listed in the Binary Form section. Native operations include:

Basic arithmetic: +, -, *, min(), scalar * vector, etc.
Texturing, to help with back-end decoding and support special code-motion rules.
Derivatives, due to special code-motion rules.

Extended instruction sets are specified in independent specifications. They can be referenced (but not specified) in this specification. The separate extended instruction set specification will specify instruction opcodes, semantics, and instruction names.

To use an extended instruction set, first import it by name string using OpExtInstImport and giving it a Result <id>:

<extinst-id> OpExtInstImport "name-of-extended-instruction-set"

The "name-of-extended-instruction-set" is a literal string. The standard convention for this string is

"<source language name>.<package name>.<version>"

For example "GLSL.std.450" could be the name of the core built-in functions for GLSL versions 450 and earlier.

Note	Nothing precludes having two "mirror" sets of instructions with different names but the same opcode values. This could, for example, allow a performance/accuracy trade-off to be changed by modifying just the import statement.

Then, to call a specific extended instruction, use OpExtInst:

OpExtInst <extinst-id> instruction-number operand0, operand1, ...

Extended instruction-set specifications will provide semantics for each "instruction-number". It is up to the specific specification what the overloading rules are on operand type. The specification must be clear on its semantics, and producers/consumers of it must follow those semantics.

By convention, it is recommended that all external specifications include an enum {…} listing all the "instruction-numbers", and a mapping between these numbers and a string representing the instruction name. However, there are no requirements that instruction name strings are provided or mangled.

Note	Producing and consuming extended instructions can be done entirely through numbers (no string parsing). An extended instruction set specification provides opcode enumerant values for the instructions, and these will be produced by the front end and consumed by the back end.
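The import-then-call sequence of this section can be sketched at the word level. Several details below are assumptions taken from outside this section: the opcode values (11 for OpExtInstImport, 12 for OpExtInst), the operand order of OpExtInst (result type, Result <id>, set <id>, instruction number, operands), and the packing of literal strings as nul-terminated UTF-8 in little-endian words. The <id> values and the instruction number 13 are purely illustrative.

```python
OP_EXT_INST_IMPORT = 11  # assumed from the Binary Form section
OP_EXT_INST = 12         # assumed from the Binary Form section

def pack_string(text):
    """Pack a nul-terminated UTF-8 string into little-endian 32-bit words (assumed encoding)."""
    raw = text.encode("utf-8") + b"\x00"
    raw += b"\x00" * (-len(raw) % 4)  # pad to a whole number of words
    return [int.from_bytes(raw[i:i + 4], "little") for i in range(0, len(raw), 4)]

def instruction(opcode, operands):
    """Build an instruction: WordCount in the high 16 bits, opcode in the low 16 bits."""
    return [((1 + len(operands)) << 16) | opcode] + list(operands)

def ext_inst_import(result_id, name):
    return instruction(OP_EXT_INST_IMPORT, [result_id] + pack_string(name))

def ext_inst(result_type, result_id, set_id, number, operands):
    return instruction(OP_EXT_INST, [result_type, result_id, set_id, number] + list(operands))

# Import the set as <id> 1, then call instruction number 13 with one operand, <id> 8.
imp = ext_inst_import(1, "GLSL.std.450")
call = ext_inst(6, 9, 1, 13, [8])
print([hex(w) for w in imp])
print([hex(w) for w in call])
```

Note that only numbers appear in the call; the set's name string is used once, in the import.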

2.11. Structured Control Flow

SPIR-V can explicitly declare structured control-flow constructs using merge instructions. These explicitly declare a header block before the control flow diverges and a merge block where control flow subsequently converges. These blocks delimit constructs that must nest, and can only be entered and exited in structured ways, as per the following.

Structured control-flow declarations must satisfy the following rules:

the merge block declared by a header block cannot be a merge block declared by any other header block
each header block must strictly dominate its merge block, unless the merge block is unreachable in the CFG
all CFG back edges must branch to a loop header, with each loop header having exactly one back edge branching to it
for a given loop header, its OpLoopMerge Continue Target, and corresponding back-edge block:
- the loop header must dominate the Continue Target, unless the Continue Target is unreachable in the CFG
- the Continue Target must dominate the back-edge block
- the back-edge block must post dominate the Continue Target

A structured control-flow construct is then defined as one of:

a selection construct: the set of blocks dominated by a selection header, minus the set of blocks dominated by the header’s merge block
a continue construct: the set of blocks dominated by an OpLoopMerge’s Continue Target and post dominated by the corresponding back-edge block
a loop construct: the set of blocks dominated by a loop header, minus the set of blocks dominated by the loop’s merge block, minus the loop’s corresponding continue construct
a case construct: the set of blocks dominated by an OpSwitch Target or Default, minus the set of blocks dominated by the OpSwitch’s merge block (this construct is only defined for those OpSwitch Target or Default that are not equal to the OpSwitch’s corresponding merge block)
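The set definitions above are expressed in terms of dominance, so they can be computed once dominators are known. The following sketch uses a simple iterative data-flow computation (chosen for brevity, not efficiency) on a CFG given as a dictionary from block name to successor list. The function names and the block names in the example are hypothetical.

```python
def dominators(cfg, entry):
    """Map each block to the set of blocks that dominate it."""
    blocks = set(cfg)
    preds = {b: set() for b in blocks}
    for b, succs in cfg.items():
        for s in succs:
            preds[s].add(b)
    dom = {b: set(blocks) for b in blocks}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for b in blocks - {entry}:
            incoming = [dom[p] for p in preds[b]]
            new = {b} | (set.intersection(*incoming) if incoming else set())
            if new != dom[b]:
                dom[b] = new
                changed = True
    return dom

def dominated_by(dom, block):
    """All blocks that `block` dominates (including itself)."""
    return {b for b, ds in dom.items() if block in ds}

def selection_construct(dom, header, merge):
    """Blocks dominated by the header, minus blocks dominated by its merge block."""
    return dominated_by(dom, header) - dominated_by(dom, merge)

# if/else diamond: header branches to then/else, both branch to merge.
cfg = {
    "entry": ["header"],
    "header": ["then", "else"],
    "then": ["merge"],
    "else": ["merge"],
    "merge": ["exit"],
    "exit": [],
}
dom = dominators(cfg, "entry")
construct = selection_construct(dom, "header", "merge")
strictly_dominates = "header" in dom["merge"] and "header" != "merge"
print(sorted(construct), strictly_dominates)
```

The loop and case constructs follow the same pattern of set differences; the continue construct additionally needs post-dominance, which can be obtained by running the same computation on the reversed CFG.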

The above structured control-flow constructs must satisfy the following rules:

if a construct contains another header block, then it also contains that header’s corresponding merge block
the only blocks in a construct that can branch outside the construct are
- a block branching to the construct’s merge block
- a block branching from one case construct to another, for the same OpSwitch
- a continue block for the innermost loop it is nested inside of
- a break block for the innermost loop it is nested inside of
- a return block
additionally for switches:
- an OpSwitch block dominates all its defined case constructs
- each case construct has at most one branch to another case construct
- each case construct is branched to by at most one other case construct
- if Target T1 branches to Target T2, or if Target T1 branches to the Default and the Default branches to Target T2, then T1 must immediately precede T2 in the list of the OpSwitch Target operands

2.12. Specialization

Specialization is intended for constant objects that will not have known constant values until after initial generation of a SPIR-V module. Such objects are called specialization constants.

A SPIR-V module containing specialization constants can consume one or more externally provided specializations: A set of final constant values for some subset of the module’s specialization constants. Applying these final constant values yields a new module having fewer remaining specialization constants. A module also contains default values for any specialization constants that never get externally specialized.

Note	No optimizing transforms are required to make a specialized module functionally correct. The specializing transform is straightforward and explicitly defined below.

Note	Ad hoc specializing should not be done through constants (OpConstant or OpConstantComposite) that get overwritten: A SPIR-V → SPIR-V transform might want to do something irreversible with the value of such a constant, unconstrained by the possibility that its value could later be changed.

Within a module, a Specialization Constant is declared with one of these instructions:

OpSpecConstantTrue
OpSpecConstantFalse
OpSpecConstant
OpSpecConstantComposite
OpSpecConstantOp

The literal operands to OpSpecConstant are the default numerical specialization constants. Similarly, the "True" and "False" parts of OpSpecConstantTrue and OpSpecConstantFalse provide the default Boolean specialization constants. These default values make an external specialization optional. However, such a default constant is applied only after all external specializations are complete, and none contained a specialization for it.

An external specialization is provided as a logical list of pairs. Each pair is a SpecId Decoration of a scalar specialization instruction along with its specialization constant. The numeric values are exactly what the operands would be to a corresponding OpConstant instruction. Boolean values are true if non-zero and false if zero.

Specializing a module is straightforward. The following specialization-constant instructions can be updated with specialization constants, and replaced in place, leaving everything else in the module exactly the same:

           OpSpecConstantTrue -> OpConstantTrue or OpConstantFalse
          OpSpecConstantFalse -> OpConstantTrue or OpConstantFalse
               OpSpecConstant -> OpConstant
      OpSpecConstantComposite -> OpConstantComposite

The OpSpecConstantOp instruction is specialized by executing the operation and replacing the instruction with the result. The result can be expressed in terms of a constant instruction that is not a specialization-constant instruction. (Note, however, this resulting instruction might not have the same size as the original instruction, so is not a "replaced in place" operation.)

When applying an external specialization, the following (and only the following) must be modified to be non-specialization-constant instructions:

specialization-constant instructions with values provided by the specialization
specialization-constant instructions that consume nothing but non-specialization constant instructions (including those that the partial specialization transformed from specialization-constant instructions; these are in order, so it is a single pass to do so)

A full specialization can also be done, when requested or required, in which all specialization-constant instructions will be modified to non-specialization-constant instructions, using the default values where required.
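The partial and full specialization rules above can be sketched as a single pass in module order. The representation is hypothetical: each instruction is a dictionary with an "id" and an "op", scalar specialization constants carry a "spec_id" and a default "value", and an OpSpecConstantOp carries a Python callable "fn" in place of a real opcode. OpSpecConstantComposite is omitted to keep the sketch short.

```python
import operator

def specialize(insts, values, full=False):
    """Return a new instruction list with `values` ({SpecId: value}) applied."""
    by_id = {}
    result = []
    for inst in insts:  # module order, so operands are already processed
        inst = dict(inst)
        op = inst["op"]
        provided = inst.get("spec_id") in values
        if op == "OpSpecConstant" and (provided or full):
            inst["op"] = "OpConstant"
            inst["value"] = values.get(inst.get("spec_id"), inst["value"])
        elif op in ("OpSpecConstantTrue", "OpSpecConstantFalse") and (provided or full):
            default = op == "OpSpecConstantTrue"
            flag = values.get(inst.get("spec_id"), default)
            # Boolean values are true if non-zero and false if zero.
            inst["op"] = "OpConstantTrue" if flag else "OpConstantFalse"
        elif op == "OpSpecConstantOp":
            args = [by_id[i] for i in inst["operands"]]
            # Fold only when every operand is already a non-specialization constant.
            if all(a["op"] == "OpConstant" for a in args):
                folded = inst["fn"](*(a["value"] for a in args))
                inst = {"id": inst["id"], "op": "OpConstant", "value": folded}
        by_id[inst["id"]] = inst
        result.append(inst)
    return result

module = [
    {"id": 1, "op": "OpSpecConstant", "spec_id": 0, "value": 4},
    {"id": 2, "op": "OpConstant", "value": 2},
    {"id": 3, "op": "OpSpecConstantOp", "fn": operator.mul, "operands": [1, 2]},
    {"id": 4, "op": "OpSpecConstantTrue", "spec_id": 1},
]
partial = specialize(module, {0: 10})
complete = specialize(module, {}, full=True)
print([(i["id"], i["op"]) for i in partial])
```

In the partial case, <id> 4 stays a specialization constant because no value was supplied for its SpecId; in the full case it falls back to its default.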

2.13. Linkage

The ability to have partially linked modules and libraries is provided as part of the Linkage capability.

By default, functions and global variables are private to a module and cannot be accessed by other modules. However, a module may be written to export or import functions and global (module scope) variables. Imported functions and global variable definitions are resolved at linkage time. A module is considered to be partially linked if it depends on imported values.

Within a module, imported or exported values are decorated using the Linkage Attributes Decoration. This decoration assigns the following linkage attributes to decorated values:

A Linkage Type.
A name, which is a Literal String, and is used to uniquely identify exported values.

Note	When resolving imported functions, the Function Control and all Function Parameter Attributes are taken from the function definition, and not from the function declaration.

2.14. Relaxed Precision

The RelaxedPrecision Decoration allows 32-bit integer and 32-bit floating-point operations to execute with a relaxed precision of somewhere between 16 and 32 bits.

For a floating-point operation, operating at relaxed precision means that the minimum requirements for range and precision are as follows:

the floating-point range may be as small as (-2¹⁴, 2¹⁴)
the floating-point magnitude range may be as small as (2⁻¹⁴, 2¹⁴)
the relative floating-point precision may be as small as 2⁻¹⁰

Relative floating-point precision is defined as the worst case (i.e. largest) ratio of the smallest step in relation to the value for all non-zero values:

Precision_relative = (abs(v₁ - v₂)_min / abs(v₁))_max for v₁ ≠ 0, v₂ ≠ 0, v₁ ≠ v₂
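As a worked check of this definition, the sketch below evaluates the ratio for every positive normal value of the IEEE 754 16-bit (binary16) format, taking the nearest neighbor on either side as v₂. Using binary16 is an assumption made for illustration; the text above only gives minimum requirements and does not mandate a particular format. The helper names are hypothetical.

```python
import struct

def half_from_bits(bits):
    """Interpret a 16-bit pattern as an IEEE 754 binary16 value."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]

def relative_step(bits):
    """Smallest step to a neighboring value, relative to the value itself."""
    v1 = half_from_bits(bits)
    below = half_from_bits(bits - 1)
    above = half_from_bits(bits + 1)
    return min(abs(v1 - below), abs(v1 - above)) / abs(v1)

# 0x0400 is the smallest positive normal value, 0x7BFF the largest finite one.
smallest_normal = half_from_bits(0x0400)
largest_finite = half_from_bits(0x7BFF)
worst = max(relative_step(b) for b in range(0x0400, 0x7BFF))

print(smallest_normal == 2 ** -14, largest_finite >= 2 ** 14, worst <= 2 ** -10)
```

Under these assumptions, binary16 meets all three minimum requirements listed above: its normal magnitudes start at 2⁻¹⁴, extend beyond 2¹⁴, and its worst-case relative step does not exceed 2⁻¹⁰.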

For integer operations, operating at relaxed precision means that the operation will be evaluated by an operation in which, for some N, 16 ≤ N ≤ 32:

the operation is executed as though its type were N bits in size, and
the result is zero or sign extended to 32 bits as determined by the signedness of the result type of the operation.
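For wrap-around operations such as addition, the two integer rules above amount to truncating the result to N bits and then extending it back to 32 bits. The following is a minimal sketch of that final step; the function name relaxed_int_result is hypothetical, and the choice of N is left to the caller because the text allows any N from 16 to 32.

```python
def relaxed_int_result(value, n, signed):
    """Truncate a result to n bits, then zero- or sign-extend it to 32 bits."""
    if not 16 <= n <= 32:
        raise ValueError("N must satisfy 16 <= N <= 32")
    low = value & ((1 << n) - 1)           # behave as though the type were n bits
    if signed and low & (1 << (n - 1)):    # sign bit of the n-bit value is set
        low -= 1 << n                      # sign extend
    return low & 0xFFFFFFFF                # 32-bit pattern of the extended result

total = 40000 + 30000  # 70000 does not fit in 16 bits
print(relaxed_int_result(total, 16, signed=False))
print(hex(relaxed_int_result(0x12348000, 16, signed=True)))
print(hex(relaxed_int_result(0x12348000, 32, signed=True)))
```

With N = 32 the value is unchanged, which is why a RelaxedPrecision operation executed at full precision is always acceptable.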

The RelaxedPrecision Decoration can be applied to:

The <id> of a variable, where the variable’s type is a scalar, vector, or matrix, or an array of scalar, vector, or matrix. In all cases, the components in the type must be a 32-bit numerical type.
The Result <id> of an instruction that operates on numerical types, meaning the instruction is to operate at relaxed precision.
The Result <id> of an instruction that reads or filters from an image. E.g. OpImageSampleExplicitLod, meaning the instruction is to operate at relaxed precision.
The Result <id> of an OpFunction meaning the function’s returned result is at relaxed precision. It cannot be applied to OpTypeFunction or to an OpFunction whose return type is OpTypeVoid.
A structure-type member (through OpMemberDecorate).

When applied to a variable or structure member, all loads and stores from the decorated object may be treated as though they were decorated with RelaxedPrecision. Loads may also be decorated with RelaxedPrecision, in which case they are treated as operating at relaxed precision.

All loads and stores involving relaxed precision still read and write 32 bits of data, respectively. Floating-point data read or written in such a manner is written in full 32-bit floating-point format. However, a load or store might reduce the precision (as allowed by RelaxedPrecision) of the destination value.

For debugging portability of floating-point operations, OpQuantizeToF16 may be used to explicitly reduce the precision of a relaxed-precision result to 16-bit precision. (Integer-result precision can be reduced, for example, using left- and right-shift opcodes.)

For image-sampling operations, decorations can appear on both the sampling instruction and the image variable being sampled. If either is decorated, they both should be decorated, and when both are decorated their decorations must match. If only one is decorated, the sampling instruction can behave either as if both were decorated or neither were decorated.

2.15. Debug Information

Debug information is supplied with:

Source-code text through OpString, OpSource, and OpSourceContinued.
Object names through OpName and OpMemberName.
Line numbers through OpLine.

A module will not lose any semantics when all such instructions are removed.

2.15.1. Function-Name Mangling

There is no functional dependency on how functions are named. Signature-typing information is explicitly provided, without any need for name "unmangling". (Valid modules can be created without inclusion of mangled names.)

By convention, for debugging purposes, modules with OpSource Source Language of OpenCL use the Itanium name-mangling standard.

2.16. Validation Rules

2.16.1. Universal Validation Rules

All modules must obey the following, or it is an invalid module:

The stream of instructions must be ordered as described in the Logical Layout section.
Any use of a feature described by a capability in the capability section requires that capability to be declared, either directly, or as a "depends on" capability on a capability that is declared.
Non-structure types (scalars, vectors, arrays, etc.) with the same operand parameterization cannot be type aliases. For non-structures, two type <id>s match if-and-only-if the types match.
If the Logical addressing model is selected:
- OpVariable cannot allocate an object whose type is a pointer type (that is, it cannot create an object in memory that is itself a pointer and whose result would thus be a pointer to a pointer)
- A pointer can only be an operand to the following instructions:
  - OpLoad
  - OpStore
  - OpAccessChain
  - OpInBoundsAccessChain
  - OpFunctionCall
  - OpImageTexelPointer
  - OpCopyMemory
  - OpCopyObject
  - all OpAtomic instructions
  - extended instruction-set instructions that are explicitly identified as taking pointer operands
- A pointer can be the Result <id> of only the following instructions:
- All indexes in OpAccessChain and OpInBoundsAccessChain that are OpConstant with type of OpTypeInt with a signedness of 1 must not have their sign bit set.
SSA
- Each <id> must appear exactly once as the Result <id> of an instruction.
- The definition of an SSA <id> should dominate all uses of it, with the following exceptions:
  - Function calls may call functions not yet defined. However, note that the function’s argument and return types will already be known at the call site.
  - Uses in a phi-function in a loop may consume definitions in the loop that don’t dominate the use.
Entry point and execution model
- There is at least one OpEntryPoint instruction, unless the Linkage capability is being used.
- No function can be targeted by both an OpEntryPoint instruction and an OpFunctionCall instruction.
Functions
- A function declaration (an OpFunction with no basic blocks), must have a Linkage Attributes Decoration with the Import Linkage Type.
- A function definition (an OpFunction with basic blocks) cannot be decorated with the Import Linkage Type.
- A function cannot have both a declaration and a definition (no forward declarations).
Global (Module Scope) Variables
- It is illegal to initialize an imported variable. This means that a module-scope OpVariable with initialization value cannot be marked with the Import Linkage Type.
Control-Flow Graph (CFG)
- Blocks exist only within a function.
- The first block in a function definition is the entry point of that function and cannot be the target of any branch. (Note this means it will have no OpPhi instructions.)
- The order of blocks in a function must satisfy the rule that blocks appear before all blocks they dominate.
- Each block starts with a label.
  - A label is made by OpLabel.
  - This includes the first block of a function (OpFunction is not a label).
  - Labels are used only to form blocks.
- The last instruction of each block is a termination instruction.
- Termination instructions can only appear as the last instruction in a block.
- OpLabel instructions can only appear within a function.
- All branches within a function must be to labels in that function.
All OpFunctionCall Function operands are an <id> of an OpFunction in the same module.
Data rules
- Scalar floating-point types can be parameterized only as 32 bit, plus any additional sizes enabled by capabilities.
- Scalar integer types can be parameterized only as 32 bit, plus any additional sizes enabled by capabilities.
- Vector types can only be parameterized with numerical types or the OpTypeBool type.
- Vector types can only be parameterized as having 2, 3, or 4 components, plus any additional sizes enabled by capabilities.
- Matrix types can only be parameterized with floating-point types.
- Matrix types can only be parameterized as having 2, 3, or 4 columns.
- Specialization constants (see Specialization) are limited to integers, Booleans, floating-point numbers, and vectors of these.
- Forward reference operands in an OpTypeStruct
  - must be later declared with OpTypePointer
  - the type pointed to must be an OpTypeStruct
  - had an earlier OpTypeForwardPointer forward reference to the same <id>
- All OpSampledImage instructions must be in the same block in which their Result <id> are consumed. Result <id> from OpSampledImage instructions must not appear as operands to OpPhi instructions or OpSelect instructions, or any instructions other than the image lookup and query instructions specified to take an operand whose type is OpTypeSampledImage.
- Instructions for extracting a scalar image or scalar sampler out of a composite must only use dynamically-uniform indexes. They must be in the same block in which their Result <id> are consumed. Such Result <id> must not appear as operands to OpPhi instructions or OpSelect instructions, or any instructions other than the image instructions specified to operate on them.
Decoration rules
- The Aliased Decoration can only be applied to intermediate objects that are pointers to non-void types.
- The Linkage Attributes Decoration cannot be applied to functions targeted by an OpEntryPoint instruction.
- A BuiltIn Decoration can only be applied as follows:
  - When applied to a structure-type member, all members of that structure type must also be decorated with BuiltIn. (No allowed mixing of built-in variables and non-built-in variables within a single structure.)
  - When applied to a structure-type member, that structure type cannot be contained as a member of another structure type.
  - There is at most one object per Storage Class that can contain a structure type containing members decorated with BuiltIn, consumed per entry-point.
OpLoad and OpStore can only consume objects whose type is a pointer.
A Result <id> resulting from an instruction within a function can only be used in that function.
A function call must have the same number of arguments as the function definition (or declaration) has parameters, and their respective types must match.
An instruction requiring a specific number of operands must have that many operands. The word count must agree.
Each opcode specifies its own requirements for number and type of operands, and these must be followed.
Atomic access rules
- The pointers taken by atomic operation instructions must be a pointer into one of the following Storage Classes:
  - Uniform when used with the BufferBlock Decoration
  - Workgroup
  - CrossWorkgroup
  - Generic
  - AtomicCounter
  - Image
- All pointers used in atomic operation instructions must be pointers to one of the following:
  - 32-bit scalar integer
  - 64-bit scalar integer

2.16.2. Validation Rules for Shader Capabilities

CFG:
- Loops must be structured, having an OpLoopMerge instruction in their header.
- Selections must be structured, having an OpSelectionMerge instruction in their header.
Entry point and execution model
- Each entry point in a module, along with its corresponding static call tree within that module, forms a complete pipeline stage.
- Each OpEntryPoint with the Fragment Execution Model must have an OpExecutionMode for either the OriginLowerLeft or the OriginUpperLeft Execution Mode. (Exactly one of these is required.)
- An OpEntryPoint with the Fragment Execution Model can set at most one of the DepthGreater, DepthLess, or DepthUnchanged Execution Modes.
- An OpEntryPoint with one of the Tessellation Execution Models can set at most one of the SpacingEqual, SpacingFractionalEven, or SpacingFractionalOdd Execution Modes.
- An OpEntryPoint with one of the Tessellation Execution Models can set at most one of the Triangles, Quads, or Isolines Execution Modes.
- An OpEntryPoint with one of the Tessellation Execution Models can set at most one of the VertexOrderCw or VertexOrderCcw Execution Modes.
- An OpEntryPoint with the Geometry Execution Model must set exactly one of the InputPoints, InputLines, InputLinesAdjacency, Triangles, or InputTrianglesAdjacency Execution Modes.
- An OpEntryPoint with the Geometry Execution Model must set exactly one of the OutputPoints, OutputLineStrip, or OutputTriangleStrip Execution Modes.
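
The "exactly one" and "at most one" rules above lend themselves to a table-driven check. The following Python sketch is one possible way a validator might express them; the rule table layout and the function name are hypothetical, and the mode names are those of the Execution Mode table in section 3.6.

```python
TESSELLATION_MODELS = {"TessellationControl", "TessellationEvaluation"}

# Each rule: (Execution Models it applies to, "exactly" or "at_most", group of Execution Modes).
RULES = [
    ({"Fragment"}, "exactly", {"OriginLowerLeft", "OriginUpperLeft"}),
    ({"Fragment"}, "at_most", {"DepthGreater", "DepthLess", "DepthUnchanged"}),
    (TESSELLATION_MODELS, "at_most",
     {"SpacingEqual", "SpacingFractionalEven", "SpacingFractionalOdd"}),
    (TESSELLATION_MODELS, "at_most", {"Triangles", "Quads", "Isolines"}),
    (TESSELLATION_MODELS, "at_most", {"VertexOrderCw", "VertexOrderCcw"}),
    ({"Geometry"}, "exactly",
     {"InputPoints", "InputLines", "InputLinesAdjacency", "Triangles",
      "InputTrianglesAdjacency"}),
    ({"Geometry"}, "exactly", {"OutputPoints", "OutputLineStrip", "OutputTriangleStrip"}),
]


def check_entry_point_modes(execution_model, modes):
    """Return a list of human-readable violations for one entry point."""
    present = set(modes)
    errors = []
    for models, kind, group in RULES:
        if execution_model not in models:
            continue
        count = len(present & group)
        if kind == "exactly" and count != 1:
            errors.append(f"need exactly one of {sorted(group)}, found {count}")
        elif kind == "at_most" and count > 1:
            errors.append(f"at most one of {sorted(group)} allowed, found {count}")
    return errors
```

For example, a Geometry entry point that sets only Triangles and OutputTriangleStrip yields an empty list, while one that sets both InputPoints and InputLines yields one violation.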
Composite objects in the UniformConstant, Uniform, and PushConstant Storage Classes must be explicitly laid out. The following apply to all the aggregate and matrix types describing such an object, recursively through their nested types:
- Each structure-type member must have an Offset Decoration.
- Each array type must have an ArrayStride Decoration.
- Each structure-type member that is a matrix or array-of-matrices must be decorated with
  - a MatrixStride Decoration, and
  - one of the RowMajor or ColMajor Decorations.
- The ArrayStride, MatrixStride, and Offset Decorations must be large enough to hold the size of the objects they affect (that is, specifying overlap is invalid). Each ArrayStride and MatrixStride must be greater than zero, and no two members of a given structure can be assigned to the same Offset.
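
A minimal sketch of the overlap and stride checks described in the last rule follows. It assumes the caller already knows each member's size in bytes; the function names and the (offset, size) tuple representation are hypothetical.

```python
def check_struct_layout(members):
    """Check Offset Decorations of one structure.

    members is a list of (offset, size_in_bytes) pairs, one per member.
    Returns a list of problems; an empty list means no overlap was found.
    """
    errors = []
    ordered = sorted(members)
    # After sorting by offset, only neighbouring members need to be compared.
    for (off_a, size_a), (off_b, _) in zip(ordered, ordered[1:]):
        if off_a == off_b:
            errors.append(f"two members share Offset {off_a}")
        elif off_a + size_a > off_b:
            errors.append(f"member at Offset {off_a} overlaps member at Offset {off_b}")
    return errors


def check_stride(stride, element_size):
    """An ArrayStride or MatrixStride must be positive and cover one element."""
    return stride > 0 and stride >= element_size
```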
For structure objects in the Input and Output Storage Classes, the following apply:
- When applied to structure-type members, the Decorations NoPerspective, Flat, Patch, Centroid, and Sample can only be applied to the top-level members of the structure type. (Nested objects' types cannot be structures whose members are decorated with these decorations.)
Decorations
- At most one of NoPerspective or Flat Decorations can be applied to the same object or member.
- At most one of Patch, Centroid, or Sample Decorations can be applied to the same object or member.
- At most one of RowMajor and ColMajor Decorations can be applied to a structure type.
- At most one of Block and BufferBlock Decorations can be applied to a structure type.
All <id> used for Scope and Memory Semantics must be of an OpConstant.

2.16.3. Validation Rules for Kernel Capabilities

The Signedness in OpTypeInt must always be 0.

2.17. Universal Limits

These quantities are minimum limits for all implementations and validators. Implementations are allowed to support larger quantities. Specific APIs may impose larger minimums. See Language Capabilities.

Validators must either

- inform when these limits are crossed, or
- be explicitly parameterized with larger limits.

Table 3. Limits
Limited Entity	Minimum Limit (Decimal)	Minimum Limit (Hexadecimal)
Characters in a literal string	65,535	FFFF
Instruction word count	65,535	FFFF
Result <id> bound (see Physical Layout for the shader-specific bound)	4,194,303	3FFFFF
Control-flow nesting depth (measured per function, in program order, counting the maximum number of OpBranch, OpBranchConditional, or OpSwitch that are seen without yet seeing their corresponding Merge Block, as declared by OpSelectionMerge or OpLoopMerge)	1,023	3FF
Global variables (Storage Class other than Function)	65,535	FFFF
Local variables (Function Storage Class)	524,287	7FFFF
Decorations per target <id>	Number of entries in the Decoration table.
Execution modes per entry point	255	FF
Indexes for OpAccessChain, OpInBoundsAccessChain, OpPtrAccessChain, OpInBoundsPtrAccessChain, OpCompositeExtract, and OpCompositeInsert	255	FF
Number of function parameters, per function declaration	255	FF
OpFunctionCall actual arguments	255	FF
OpExtInst actual arguments	255	FF
OpSwitch (literal, label) pairs	16,383	3FFF
OpTypeStruct members	16,383	3FFF
Structure nesting depth	255	FF
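
As an illustration of how a validator might use Table 3, the sketch below stores the numeric limits in a dictionary and reports which measured quantities cross them. The key names and the function are hypothetical. Passing a different limits dictionary corresponds to being "explicitly parameterized with larger limits".

```python
# Numeric minimum limits from Table 3, written in hexadecimal as in the table.
UNIVERSAL_LIMITS = {
    "literal_string_chars": 0xFFFF,
    "instruction_word_count": 0xFFFF,
    "result_id_bound": 0x3FFFFF,
    "control_flow_nesting_depth": 0x3FF,
    "global_variables": 0xFFFF,
    "local_variables": 0x7FFFF,
    "execution_modes_per_entry_point": 0xFF,
    "access_chain_indexes": 0xFF,
    "function_parameters": 0xFF,
    "function_call_arguments": 0xFF,
    "ext_inst_arguments": 0xFF,
    "switch_pairs": 0x3FFF,
    "struct_members": 0x3FFF,
    "struct_nesting_depth": 0xFF,
}


def limits_crossed(counts, limits=UNIVERSAL_LIMITS):
    """Return the sorted names whose measured count exceeds the configured limit."""
    return sorted(
        name for name, value in counts.items()
        if name in limits and value > limits[name]
    )
```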

2.18. Memory Model

A memory model is chosen using a single OpMemoryModel instruction near the beginning of the module. This selects both an addressing model and a memory model.

The Logical addressing model means pointers are abstract, having no physical size or numeric value. In this mode, pointers can only be created from existing objects, and they cannot be stored into an object.

The non-Logical addressing models allow physical pointers to be formed. OpVariable can be used to create objects that hold pointers. These are declared for a specific Storage Class. Pointers for one Storage Class cannot be used to access objects in another Storage Class. However, they can be converted with conversion opcodes. Any particular addressing model must describe the bit width of pointers for each of the storage classes.

2.18.1. Memory Layout

When memory is shared between a SPIR-V module and an API, its contents are transparent, and must be agreed on. For example, the Offset, MatrixStride, and ArrayStride Decorations applied to members of a struct object can partially define how the memory is laid out. In addition, the following are always true, applied recursively as needed, of the offsets within the memory buffer:

- a vector consumes contiguous memory with lower-numbered components appearing at smaller offsets than higher-numbered components, and with component 0 starting at the vector’s Offset Decoration, if present
- in an array, lower-numbered elements appear at smaller offsets than higher-numbered elements, with element 0 starting at the Offset Decoration for the array, if present
- a structure has lower-numbered members appearing at smaller offsets than higher-numbered members, with member 0 starting at the Offset Decoration for the structure, if present
- in a matrix, lower-numbered columns appear at smaller offsets than higher-numbered columns, and lower-numbered components within the matrix’s vectors appear at smaller offsets than higher-numbered components, with component 0 of column 0 starting at the Offset Decoration, if present (the RowMajor and ColMajor Decorations dictate what is contiguous)
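
The ordering rules above can be made concrete with a small offset calculation. The sketch below is an assumption-laden illustration, not normative: it assumes that ArrayStride is the byte distance between consecutive array elements, that with ColMajor each column is contiguous and MatrixStride separates columns, and that with RowMajor each row is contiguous and MatrixStride separates rows. The function names and parameters are hypothetical.

```python
def array_element_offset(base, array_stride, index):
    """Byte offset of element `index`, with element 0 at `base`."""
    return base + index * array_stride


def matrix_component_offset(base, matrix_stride, component_size,
                            column, row, row_major=False):
    """Byte offset of one matrix component under the assumptions stated above."""
    if row_major:
        # Rows are contiguous: stepping to the next row skips one MatrixStride.
        return base + row * matrix_stride + column * component_size
    # Columns are contiguous: stepping to the next column skips one MatrixStride.
    return base + column * matrix_stride + row * component_size
```

In both layouts component 0 of column 0 lands at the base offset, and a higher-numbered column or component always lands at a larger offset than a lower-numbered one, which matches the ordering stated in the list.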

2.18.2. Aliasing

Here, aliasing means one of:

- Two or more pointers that point into overlapping parts of the same underlying object. That is, two intermediates, both of which are typed pointers, that can be dereferenced (in bounds) such that both dereferences access the same memory.
- Images, buffers, or other externally allocated objects where a function might access the same underlying memory via accesses to two different objects.

How aliasing is managed depends on the Memory Model:

- The simple and GLSL memory models can assume that aliasing is generally not present. Specifically, the compiler is free to compile as if aliasing is not present, unless a pointer is explicitly indicated to be an alias. This is indicated by applying the Aliased Decoration to an intermediate object’s <id>. Applying Restrict is allowed, but has no effect.
- The OpenCL memory models must assume that aliasing is generally present. Specifically, the compiler must compile as if aliasing is present, unless a pointer is explicitly indicated to not alias. This is done by applying the Restrict Decoration to an intermediate object’s <id>. Applying Aliased is allowed, but has no effect.

It is invalid to apply both Restrict and Aliased to the same <id>.
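
The aliasing defaults per Memory Model can be summarized as a small decision function. This is only a sketch of the rules stated in this section; the function name and the use of strings for memory models and decorations are hypothetical.

```python
def may_assume_no_alias(memory_model, decorations):
    """Return True if a compiler may treat the decorated <id> as not aliased."""
    decorations = set(decorations)
    if {"Aliased", "Restrict"} <= decorations:
        raise ValueError("Restrict and Aliased cannot both be applied to the same <id>")
    if memory_model in ("Simple", "GLSL450"):
        # Default is "no aliasing"; only Aliased changes that. Restrict has no effect.
        return "Aliased" not in decorations
    if memory_model == "OpenCL":
        # Default is "aliasing present"; only Restrict changes that. Aliased has no effect.
        return "Restrict" in decorations
    raise ValueError(f"unknown memory model: {memory_model}")
```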

2.19. Derivatives

Derivatives appear only in the Fragment Execution Model. They can be implicit or explicit. Some image instructions consume implicit derivatives, while the derivative instructions compute explicit derivatives. In all cases, derivatives are well defined only if the derivative group has uniform control flow.

2.20. Code Motion

Texturing instructions in the Fragment Execution Model that rely on an implicit derivative cannot be moved into control flow that is not known to be uniform control flow within each derivative group.

3. Binary Form

This section contains the exact form for all instructions, starting with the numerical values for all fields. See Physical Layout for the order words appear in.

3.1. Magic Number

Magic number for a SPIR-V module.

Tip	Endianness: A module is defined as a stream of words, not a stream of bytes. However, if stored as a stream of bytes (e.g., in a file), the magic number can be used to deduce what endianness to apply to convert the byte stream back to a word stream.

Magic Number
0x07230203
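
The endianness tip can be sketched as follows: read the first four bytes under each byte order and keep the one that reproduces the magic number. The function names are hypothetical, and a real loader would likely perform further checks on the header.

```python
import struct

SPIRV_MAGIC = 0x07230203


def detect_byte_order(data):
    """Return the struct prefix '<' (little-endian) or '>' (big-endian), or None."""
    if len(data) < 4:
        return None
    for prefix in ("<", ">"):
        (word,) = struct.unpack(prefix + "I", data[:4])
        if word == SPIRV_MAGIC:
            return prefix
    return None


def bytes_to_words(data):
    """Convert a byte stream back into a list of 32-bit words."""
    prefix = detect_byte_order(data)
    if prefix is None or len(data) % 4 != 0:
        raise ValueError("not a SPIR-V word stream")
    return list(struct.unpack(f"{prefix}{len(data) // 4}I", data))
```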

3.2. Source Language

The source language is for debug purposes only, with no semantics that affect the meaning of other parts of the module. Used by OpSource.

Source Language
0	Unknown
1	ESSL
2	GLSL
3	OpenCL_C
4	OpenCL_CPP
5	HLSL

3.3. Execution Model

Used by OpEntryPoint.

Execution Model	Enabling Capabilities
0	Vertex Vertex shading stage.	Shader
1	TessellationControl Tessellation control (or hull) shading stage.	Tessellation
2	TessellationEvaluation Tessellation evaluation (or domain) shading stage.	Tessellation
3	Geometry Geometry shading stage.	Geometry
4	Fragment Fragment shading stage.	Shader
5	GLCompute Graphical compute shading stage.	Shader
6	Kernel Compute kernel.	Kernel

3.4. Addressing Model

Used by OpMemoryModel.

Addressing Model	Enabling Capabilities
0	Logical
1	Physical32 Indicates a 32-bit module, where the address width is equal to 32 bits.	Addresses
2	Physical64 Indicates a 64-bit module, where the address width is equal to 64 bits.	Addresses

3.5. Memory Model

Used by OpMemoryModel.

Memory Model	Enabling Capabilities
0	Simple No shared memory consistency issues.	Shader
1	GLSL450 Memory model needed by later versions of GLSL and ESSL. Works across multiple versions.	Shader
2	OpenCL OpenCL memory model.	Kernel
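
Since OpMemoryModel selects both an addressing model and a memory model, the capabilities it needs follow from the two tables above. The sketch below encodes those tables and combines them; the dictionary and function names are hypothetical.

```python
# name -> (numeric value, enabling capability or None), from the tables above.
ADDRESSING_MODELS = {
    "Logical": (0, None),
    "Physical32": (1, "Addresses"),
    "Physical64": (2, "Addresses"),
}
MEMORY_MODELS = {
    "Simple": (0, "Shader"),
    "GLSL450": (1, "Shader"),
    "OpenCL": (2, "Kernel"),
}


def memory_model_requirements(addressing, memory):
    """Return ((addressing value, memory value), set of enabling capabilities)."""
    a_value, a_cap = ADDRESSING_MODELS[addressing]
    m_value, m_cap = MEMORY_MODELS[memory]
    capabilities = {cap for cap in (a_cap, m_cap) if cap is not None}
    return (a_value, m_value), capabilities
```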

3.6. Execution Mode

Declare the modes an entry point will execute in. Used by OpExecutionMode.

Execution Mode	Enabling Capabilities	Extra Operands
0	Invocations Number of times to invoke the geometry stage for each input primitive received. The default is to run once for each input primitive. It is invalid to specify a value greater than the target-dependent maximum. Only valid with the Geometry Execution Model.	Geometry	Literal Number Number of invocations
1	SpacingEqual Requests the tessellation primitive generator to divide edges into a collection of equal-sized segments. Only valid with one of the tessellation Execution Models.	Tessellation
2	SpacingFractionalEven Requests the tessellation primitive generator to divide edges into an even number of equal-length segments plus two additional shorter fractional segments. Only valid with one of the tessellation Execution Models.	Tessellation
3	SpacingFractionalOdd Requests the tessellation primitive generator to divide edges into an odd number of equal-length segments plus two additional shorter fractional segments. Only valid with one of the tessellation Execution Models.	Tessellation
4	VertexOrderCw Requests the tessellation primitive generator to generate triangles in clockwise order. Only valid with one of the tessellation Execution Models.	Tessellation
5	VertexOrderCcw Requests the tessellation primitive generator to generate triangles in counter-clockwise order. Only valid with one of the tessellation Execution Models.	Tessellation
6	PixelCenterInteger Pixels appear centered on whole-number pixel offsets. E.g., the coordinate (0.5, 0.5) appears to move to (0.0, 0.0). Only valid with the Fragment Execution Model. If a Fragment entry point does not have this set, pixels appear centered at offsets of (0.5, 0.5) from whole numbers.	Shader
7	OriginUpperLeft Pixel coordinates appear to originate in the upper left, and increase toward the right and downward. Only valid with the Fragment Execution Model.	Shader
8	OriginLowerLeft Pixel coordinates appear to originate in the lower left, and increase toward the right and upward. Only valid with the Fragment Execution Model.	Shader
9	EarlyFragmentTests Fragment tests are to be performed before fragment shader execution. Only valid with the Fragment Execution Model.	Shader
10	PointMode Requests the tessellation primitive generator to generate a point for each distinct vertex in the subdivided primitive, rather than to generate lines or triangles. Only valid with one of the tessellation Execution Models.	Tessellation
11	Xfb This stage will run in transform feedback-capturing mode and this module is responsible for describing the transform-feedback setup. See the XfbBuffer, Offset, and XfbStride Decorations.	TransformFeedback
12	DepthReplacing This mode must be declared if this module potentially changes the fragment’s depth. Only valid with the Fragment Execution Model.	Shader
14	DepthGreater External optimizations may assume depth modifications will leave the fragment’s depth as greater than or equal to the fragment’s interpolated depth value (given by the z component of the FragCoord BuiltIn decorated variable). Only valid with the Fragment Execution Model.	Shader
15	DepthLess External optimizations may assume depth modifications leave the fragment’s depth less than the fragment’s interpolated depth value (given by the z component of the FragCoord BuiltIn decorated variable). Only valid with the Fragment Execution Model.	Shader
16	DepthUnchanged External optimizations may assume this stage did not modify the fragment’s depth. However, DepthReplacing mode must accurately represent depth modification. Only valid with the Fragment Execution Model.	Shader
17	LocalSize Indicates the work-group size in the x, y, and z dimensions. Only valid with the GLCompute or Kernel Execution Models.		Literal Number x size	Literal Number y size	Literal Number z size
18	LocalSizeHint A hint to the compiler, which indicates the most likely to be used work-group size in the x, y, and z dimensions. Only valid with the Kernel Execution Model.	Kernel	Literal Number x size	Literal Number y size	Literal Number z size
19	InputPoints Stage input primitive is points. Only valid with the Geometry Execution Model.	Geometry
20	InputLines Stage input primitive is lines. Only valid with the Geometry Execution Model.	Geometry
21	InputLinesAdjacency Stage input primitive is lines adjacency. Only valid with the Geometry Execution Model.	Geometry
22	Triangles For a geometry stage, input primitive is triangles. For a tessellation stage, requests the tessellation primitive generator to generate triangles. Only valid with the Geometry or one of the tessellation Execution Models.	Geometry, Tessellation
23	InputTrianglesAdjacency Geometry stage input primitive is triangles adjacency. Only valid with the Geometry Execution Model.	Geometry
24	Quads Requests the tessellation primitive generator to generate quads. Only valid with one of the tessellation Execution Models.	Tessellation
25	Isolines Requests the tessellation primitive generator to generate isolines. Only valid with one of the tessellation Execution Models.	Tessellation
26	OutputVertices For a geometry stage, the maximum number of vertices the shader will ever emit in a single invocation. For a tessellation-control stage, the number of vertices in the output patch produced by the tessellation control shader, which also specifies the number of times the tessellation control shader is invoked. Only valid with the Geometry or one of the tessellation Execution Models.	Geometry, Tessellation	Literal Number Vertex count
27	OutputPoints Stage output primitive is points. Only valid with the Geometry Execution Model.	Geometry
28	OutputLineStrip Stage output primitive is line strip. Only valid with the Geometry Execution Model.	Geometry
29	OutputTriangleStrip Stage output primitive is triangle strip. Only valid with the Geometry Execution Model.	Geometry
30	VecTypeHint A hint to the compiler, which indicates that most operations used in the entry point are explicitly vectorized using a particular vector type. The 16 high-order bits of Vector Type operand specify the number of components of the vector. The 16 low-order bits of Vector Type operand specify the data type of the vector. These are the legal data type values: 0 represents an 8-bit integer value. 1 represents a 16-bit integer value. 2 represents a 32-bit integer value. 3 represents a 64-bit integer value. 4 represents a 16-bit float value. 5 represents a 32-bit float value. 6 represents a 64-bit float value. Only valid with the Kernel Execution Model.	Kernel	Literal Number Vector type
31	ContractionOff Indicates that floating-point-expressions contraction is disallowed. Only valid with the Kernel Execution Model.	Kernel
4446	PostDepthCoverage	SampleMaskPostDepthCoverage
5027	StencilRefReplacingEXT	StencilExportEXT
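
The Vector Type operand of VecTypeHint packs two fields into one 32-bit literal, as described in its table entry. A minimal sketch of packing and unpacking it is shown below; the function names and the short type labels (such as "float32") are hypothetical conveniences, not SPIR-V names.

```python
# Legal data type values for the 16 low-order bits, per the VecTypeHint description.
VEC_HINT_DATA_TYPES = {
    0: "int8", 1: "int16", 2: "int32", 3: "int64",
    4: "float16", 5: "float32", 6: "float64",
}


def pack_vec_type_hint(component_count, data_type):
    """Build the Vector Type literal: component count in the high 16 bits, type in the low 16."""
    if data_type not in VEC_HINT_DATA_TYPES:
        raise ValueError(f"illegal data type value: {data_type}")
    if not 0 <= component_count <= 0xFFFF:
        raise ValueError("component count must fit in 16 bits")
    return (component_count << 16) | data_type


def unpack_vec_type_hint(operand):
    """Split the Vector Type literal into (component count, data type value)."""
    return operand >> 16, operand & 0xFFFF
```

For instance, a hint for a 4-component vector of 32-bit floats packs to 0x00040005.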

3.7. Storage Class

Class of storage for declared variables (does not include intermediate values). Used by:

- OpTypePointer
- OpTypeForwardPointer
- OpVariable
- OpGenericCastToPtrExplicit

Storage Class	Enabling Capabilities	Enabled by Extension
0	UniformConstant Shared externally, visible across all functions in all invocations in all work groups. Graphics uniform memory. OpenCL constant memory. Variables declared with this storage class are read-only. They may have initializers, as allowed by the client API.
1	Input Input from pipeline. Visible across all functions in the current invocation. Variables declared with this storage class are read-only, and cannot have initializers.
2	Uniform Shared externally, visible across all functions in all invocations in all work groups. Graphics uniform blocks and buffer blocks.	Shader
3	Output Output to pipeline. Visible across all functions in the current invocation.	Shader
4	Workgroup Shared across all invocations within a work group. Visible across all functions. The OpenGL "shared" storage qualifier. OpenCL local memory.
5	CrossWorkgroup Visible across all functions of all invocations of all work groups. OpenCL global memory.
6	Private Visible to all functions in the current invocation. Regular global memory.	Shader
7	Function Visible only within the declaring function of the current invocation. Regular function memory.
8	Generic For generic pointers, which overload the Function, Workgroup, and CrossWorkgroup Storage Classes.	GenericPointer
9	PushConstant For holding push-constant memory, visible across all functions in all invocations in all work groups. Intended to contain a small bank of values pushed from the API. Variables declared with this storage class are read-only, and cannot have initializers.	Shader
10	AtomicCounter For holding atomic counters. Visible across all functions of the current invocation. Atomic counter-specific memory.	AtomicStorage
11	Image For holding image memory.
12	StorageBuffer	Shader	SPV_KHR_storage_buffer_storage_class, SPV_KHR_variable_pointers

3.8. Dim

Dimensionality of an image. Used by OpTypeImage.

Dim	Enabling Capabilities
0	1D	Sampled1D
1	2D
2	3D
3	Cube	Shader
4	Rect	SampledRect
5	Buffer	SampledBuffer
6	SubpassData	InputAttachment

3.9. Sampler Addressing Mode

Addressing mode for creating constant samplers. Used by OpConstantSampler.

Sampler Addressing Mode	Enabling Capabilities
0	None The image coordinates used to sample elements of the image refer to a location inside the image, otherwise the results are undefined.	Kernel
1	ClampToEdge Out-of-range image coordinates are clamped to the extent.	Kernel
2	Clamp Out-of-range image coordinates will return a border color.	Kernel
3	Repeat Out-of-range image coordinates are wrapped to the valid range. Can only be used with normalized coordinates.	Kernel
4	RepeatMirrored Flip the image coordinate at every integer junction. Can only be used with normalized coordinates.	Kernel
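
The restriction on Repeat and RepeatMirrored can be expressed as a one-line check. The sketch below assumes the caller knows whether the sampler uses normalized coordinates; the function name and boolean parameter are hypothetical.

```python
# Addressing modes that the table above restricts to normalized coordinates.
NORMALIZED_ONLY = {"Repeat", "RepeatMirrored"}


def sampler_addressing_ok(mode, normalized_coords):
    """Return True if the addressing mode is usable with the given coordinate kind."""
    return normalized_coords or mode not in NORMALIZED_ONLY
```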

3.10. Sampler Filter Mode

Filter mode for creating constant samplers. Used by OpConstantSampler.

Sampler Filter Mode	Enabling Capabilities
0	Nearest Use filter nearest mode when performing a read image operation.	Kernel
1	Linear Use filter linear mode when performing a read image operation.	Kernel

3.11. Image Format

Declarative image format. Used by OpTypeImage.

Image Format	Enabling Capabilities
0	Unknown
1	Rgba32f	Shader
2	Rgba16f	Shader
3	R32f	Shader
4	Rgba8	Shader
5	Rgba8Snorm	Shader
6	Rg32f	StorageImageExtendedFormats
7	Rg16f	StorageImageExtendedFormats
8	R11fG11fB10f	StorageImageExtendedFormats
9	R16f	StorageImageExtendedFormats
10	Rgba16	StorageImageExtendedFormats
11	Rgb10A2	StorageImageExtendedFormats
12	Rg16	StorageImageExtendedFormats
13	Rg8	StorageImageExtendedFormats
14	R16	StorageImageExtendedFormats
15	R8	StorageImageExtendedFormats
16	Rgba16Snorm	StorageImageExtendedFormats
17	Rg16Snorm	StorageImageExtendedFormats
18	Rg8Snorm	StorageImageExtendedFormats
19	R16Snorm	StorageImageExtendedFormats
20	R8Snorm	StorageImageExtendedFormats
21	Rgba32i	Shader
22	Rgba16i	Shader
23	Rgba8i	Shader
24	R32i	Shader
25	Rg32i	StorageImageExtendedFormats
26	Rg16i	StorageImageExtendedFormats
27	Rg8i	StorageImageExtendedFormats
28	R16i	StorageImageExtendedFormats
29	R8i	StorageImageExtendedFormats
30	Rgba32ui	Shader
31	Rgba16ui	Shader
32	Rgba8ui	Shader
33	R32ui	Shader
34	Rgb10a2ui	StorageImageExtendedFormats
35	Rg32ui	StorageImageExtendedFormats
36	Rg16ui	StorageImageExtendedFormats
37	Rg8ui	StorageImageExtendedFormats
38	R16ui	StorageImageExtendedFormats
39	R8ui	StorageImageExtendedFormats
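As an illustration of how the Enabling Capabilities column might be consumed, the sketch below collects the capabilities implied by a set of image formats. Only a handful of rows from the table are reproduced, and the names `FORMAT_CAPABILITY` and `required_capabilities` are hypothetical helpers, not part of SPIR-V.

```python
# A few rows copied from the Image Format table; None means the row lists
# no enabling capability (the Unknown format).
FORMAT_CAPABILITY = {
    "Unknown": None,
    "Rgba32f": "Shader",
    "Rgba8": "Shader",
    "R32ui": "Shader",
    "Rg32f": "StorageImageExtendedFormats",
    "R8ui": "StorageImageExtendedFormats",
}


def required_capabilities(formats):
    """Return the sorted set of capabilities implied by the given formats."""
    caps = {FORMAT_CAPABILITY[name] for name in formats}
    caps.discard(None)  # Unknown contributes nothing
    return sorted(caps)
```

For example, `required_capabilities(["Unknown", "Rgba8", "Rg32f"])` yields both `Shader` and `StorageImageExtendedFormats`.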

3.12. Image Channel Order

Image channel order returned by OpImageQueryOrder.

Image Channel Order	Enabling Capabilities
0	R	Kernel
1	A	Kernel
2	RG	Kernel
3	RA	Kernel
4	RGB	Kernel
5	RGBA	Kernel
6	BGRA	Kernel
7	ARGB	Kernel
8	Intensity	Kernel
9	Luminance	Kernel
10	Rx	Kernel
11	RGx	Kernel
12	RGBx	Kernel
13	Depth	Kernel
14	DepthStencil	Kernel
15	sRGB	Kernel
16	sRGBx	Kernel
17	sRGBA	Kernel
18	sBGRA	Kernel
19	ABGR	Kernel

3.13. Image Channel Data Type

Image channel data type returned by OpImageQueryFormat.

Image Channel Data Type	Enabling Capabilities
0	SnormInt8	Kernel
1	SnormInt16	Kernel
2	UnormInt8	Kernel
3	UnormInt16	Kernel
4	UnormShort565	Kernel
5	UnormShort555	Kernel
6	UnormInt101010	Kernel
7	SignedInt8	Kernel
8	SignedInt16	Kernel
9	SignedInt32	Kernel
10	UnsignedInt8	Kernel
11	UnsignedInt16	Kernel
12	UnsignedInt32	Kernel
13	HalfFloat	Kernel
14	Float	Kernel
15	UnormInt24	Kernel
16	UnormInt101010_2	Kernel

3.14. Image Operands

Additional operands to sampling, or getting texels from, an image. Bits that are set can indicate that another operand follows. If multiple following operands are indicated, they are ordered so that those indicated by smaller-numbered bits appear first. At least one bit must be set (None is invalid).

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

Used by:

Image Operands	Enabling Capabilities
0x0	None
0x1	Bias A following operand is the bias added to the implicit level of detail. Only valid with implicit-lod instructions. It must be a floating-point type scalar. This can only be used with an OpTypeImage that has a Dim operand of 1D, 2D, 3D, or Cube, and the MS operand must be 0.	Shader
0x2	Lod A following operand is the explicit level-of-detail to use. Only valid with explicit-lod instructions. For sampling operations, it must be a floating-point type scalar. For fetch operations, it must be an integer type scalar. This can only be used with an OpTypeImage that has a Dim operand of 1D, 2D, 3D, or Cube, and the MS operand must be 0.
0x4	Grad Two following operands are dx followed by dy. These are explicit derivatives in the x and y direction to use in computing level of detail. Each is a scalar or vector containing (du/dx[, dv/dx] [, dw/dx]) and (du/dy[, dv/dy] [, dw/dy]). The number of components of each must equal the number of components in Coordinate, minus the array layer component, if present. Only valid with explicit-lod instructions. They must be a scalar or vector of floating-point type. This can only be used with an OpTypeImage that has an MS operand of 0. It is invalid to set both the Lod and Grad bits.
0x8	ConstOffset A following operand is added to (u, v, w) before texel lookup. It must be an <id> of an integer-based constant instruction of scalar or vector type. It is invalid for these to be outside a target-dependent allowed range. The number of components must equal the number of components in Coordinate, minus the array layer component, if present. Not valid with the Cube dimension.
0x10	Offset A following operand is added to (u, v, w) before texel lookup. It must be a scalar or vector of integer type. It is invalid for these to be outside a target-dependent allowed range. The number of components must equal the number of components in Coordinate, minus the array layer component, if present. Not valid with the Cube dimension.	ImageGatherExtended
0x20	ConstOffsets A following operand is Offsets. Offsets must be an <id> of a constant instruction making an array of size four of vectors of two integer components. Each gathered texel is identified by adding one of these array elements to the (u, v) sampled location. It is a compile-time error if this falls outside a target-dependent allowed range. Only valid with OpImageGather or OpImageDrefGather. Not valid with the Cube dimension.
0x40	Sample A following operand is the sample number of the sample to use. Only valid with OpImageFetch, OpImageRead, and OpImageWrite. It is invalid to have a Sample operand if the underlying OpTypeImage has MS of 0. It must be an integer type scalar.
0x80	MinLod A following operand is the minimum level-of-detail to use when accessing the image. Only valid with Implicit instructions and Grad instructions. It must be a floating-point type scalar. This can only be used with an OpTypeImage that has a Dim operand of 1D, 2D, 3D, or Cube, and the MS operand must be 0.	MinLod
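The ordering rule and the Lod/Grad exclusion can be sketched as a small mask decoder. This is a minimal illustration, assuming only the bits listed in the table above; the function name `following_operands` and the `Grad.dx`/`Grad.dy` labels are hypothetical, and the per-instruction and per-dimension restrictions from the table are not checked.

```python
# (bit, name, number of following operands), copied from the table above.
IMAGE_OPERANDS = [
    (0x01, "Bias", 1),
    (0x02, "Lod", 1),
    (0x04, "Grad", 2),  # dx followed by dy
    (0x08, "ConstOffset", 1),
    (0x10, "Offset", 1),
    (0x20, "ConstOffsets", 1),
    (0x40, "Sample", 1),
    (0x80, "MinLod", 1),
]
KNOWN_BITS = sum(bit for bit, _, _ in IMAGE_OPERANDS)


def following_operands(mask):
    """List the operands that follow the mask, in the order they appear."""
    if mask == 0:
        raise ValueError("at least one bit must be set (None is invalid)")
    if mask & ~KNOWN_BITS:
        raise ValueError("mask contains bits not listed in the table")
    if (mask & 0x02) and (mask & 0x04):
        raise ValueError("it is invalid to set both the Lod and Grad bits")
    names = []
    # The table is already sorted by bit value, so iterating it in order
    # places operands for smaller-numbered bits first.
    for bit, name, count in IMAGE_OPERANDS:
        if mask & bit:
            if count == 1:
                names.append(name)
            else:
                names.extend([name + ".dx", name + ".dy"])
    return names
```

With this sketch, a mask of `0x4 | 0x8` decodes to the Grad pair followed by ConstOffset, regardless of how the mask was assembled.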

3.15. FP Fast Math Mode

Enables fast math operations which are otherwise unsafe.

Only valid on OpFAdd, OpFSub, OpFMul, OpFDiv, OpFRem, and OpFMod instructions.

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

FP Fast Math Mode	Enabling Capabilities
0x0	None
0x1	NotNaN Assume parameters and result are not NaN.	Kernel
0x2	NotInf Assume parameters and result are not +/- Inf.	Kernel
0x4	NSZ Treat the sign of a zero parameter or result as insignificant.	Kernel
0x8	AllowRecip Allow the usage of reciprocal rather than perform a division.	Kernel
0x10	Fast Allow algebraic transformations according to real-number associative and distributive algebra. This flag implies all the others.	Kernel
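Because this value is a mask, several rows can be combined with bitwise OR. The sketch below shows one way a consumer might expand the Fast bit into the flags it implies; the constant names and `effective_flags` are hypothetical, and the bit values are copied from the table above.

```python
NOT_NAN = 0x1
NOT_INF = 0x2
NSZ = 0x4
ALLOW_RECIP = 0x8
FAST = 0x10

ALL_FLAGS = NOT_NAN | NOT_INF | NSZ | ALLOW_RECIP | FAST


def effective_flags(mask):
    """Return the mask with every flag implied by Fast made explicit."""
    if mask & ~ALL_FLAGS:
        raise ValueError("mask contains bits not listed in the table")
    if mask & FAST:
        # "This flag implies all the others."
        return ALL_FLAGS
    return mask
```

A mask of `NOT_NAN | NSZ` is returned unchanged as `0x5`, while any mask that includes `FAST` expands to `0x1F`.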

3.16. FP Rounding Mode

Associate a rounding mode to a floating-point conversion instruction.

FP Rounding Mode	Enabling Capabilities
0	RTE Round to nearest even.	Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16
1	RTZ Round towards zero.	Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16
2	RTP Round towards positive infinity.	Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16
3	RTN Round towards negative infinity.	Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16
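The four modes differ only in how they resolve a value that is not exactly representable. The sketch below is an analogy rather than SPIR-V behavior: it rounds decimal strings to integers using Python's `decimal` module, whereas SPIR-V applies these modes to floating-point conversion instructions. The mapping table and `round_to_integer` are hypothetical names.

```python
from decimal import (Decimal, ROUND_CEILING, ROUND_DOWN, ROUND_FLOOR,
                     ROUND_HALF_EVEN)

# Assumed correspondence between the modes above and decimal's rounding names.
ROUNDING = {
    "RTE": ROUND_HALF_EVEN,  # round to nearest, ties to even
    "RTZ": ROUND_DOWN,       # round towards zero
    "RTP": ROUND_CEILING,    # round towards positive infinity
    "RTN": ROUND_FLOOR,      # round towards negative infinity
}


def round_to_integer(text, mode):
    """Round a decimal string to an integer under the named mode."""
    return int(Decimal(text).quantize(Decimal("1"), rounding=ROUNDING[mode]))
```

For the input `-2.5`, RTE, RTZ, and RTP all give -2, while RTN gives -3.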

3.17. Linkage Type

Associate a linkage type to functions or global variables. See linkage.

Linkage Type	Enabling Capabilities
0	Export Accessible by other modules as well.	Linkage
1	Import A declaration of a global variable or a function that exists in another module.	Linkage

3.18. Access Qualifier

Defines the access permissions.

Used by OpTypeImage and OpTypePipe.

Access Qualifier	Enabling Capabilities
0	ReadOnly A read-only object.	Kernel
1	WriteOnly A write-only object.	Kernel
2	ReadWrite A readable and writable object.	Kernel

3.19. Function Parameter Attribute

Adds additional information to the return type and to each parameter of a function.

Function Parameter Attribute	Enabling Capabilities
0	Zext Value should be zero extended if needed.	Kernel
1	Sext Value should be sign extended if needed.	Kernel
2	ByVal This indicates that the pointer parameter should really be passed by value to the function. Only valid for pointer parameters (not for the return value).	Kernel
3	Sret Indicates that the pointer parameter specifies the address of a structure that is the return value of the function in the source program. Only applicable to the first parameter, which must be a pointer parameter.	Kernel
4	NoAlias Indicates that the memory pointed to by a pointer parameter is not accessed via pointer values which are not derived from this pointer parameter. Only valid for pointer parameters. Not valid on return values.	Kernel
5	NoCapture The callee does not make a copy of the pointer parameter into a location that is accessible after returning from the callee. Only valid for pointer parameters. Not valid on return values.	Kernel
6	NoWrite Can only read the memory pointed to by a pointer parameter. Only valid for pointer parameters. Not valid on return values.	Kernel
7	NoReadWrite Cannot dereference the memory pointed to by a pointer parameter. Only valid for pointer parameters. Not valid on return values.	Kernel
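The difference between Zext and Sext can be shown on raw bit patterns. This is a minimal sketch assuming two's-complement values held as non-negative Python integers; `zext` and `sext` are hypothetical helper names.

```python
def zext(value, from_bits, to_bits):
    """Zero-extend: keep the low from_bits bits; the new high bits are 0."""
    if from_bits > to_bits:
        raise ValueError("cannot extend to a narrower width")
    return value & ((1 << from_bits) - 1)


def sext(value, from_bits, to_bits):
    """Sign-extend: copy the sign bit of the narrow value into the new bits."""
    if from_bits > to_bits:
        raise ValueError("cannot extend to a narrower width")
    low = value & ((1 << from_bits) - 1)
    sign_bit = 1 << (from_bits - 1)
    signed = (low ^ sign_bit) - sign_bit  # reinterpret as a signed integer
    return signed & ((1 << to_bits) - 1)  # back to a to_bits-wide bit pattern
```

Extending the 8-bit pattern `0xF0` to 32 bits gives `0x000000F0` under `zext` and `0xFFFFFFF0` under `sext`; a pattern with a clear sign bit, such as `0x7F`, is the same under both.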

3.20. Decoration

Used by OpDecorate and OpMemberDecorate.

Decoration	Enabling Capabilities	Extra Operands
0	RelaxedPrecision Allow reduced precision operations. To be used as described in Relaxed Precision.	Shader
1	SpecId Apply to a scalar specialization constant. Forms the API linkage for setting a specialized value. See specialization.	Shader	Literal Number Specialization Constant ID
2	Block Apply to a structure type to establish it is a non-SSBO-like shader-interface block.	Shader
3	BufferBlock Apply to a structure type to establish it is an SSBO-like shader-interface block.	Shader
4	RowMajor Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Indicates that components within a row are contiguous in memory.	Matrix
5	ColMajor Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Indicates that components within a column are contiguous in memory.	Matrix
6	ArrayStride Apply to an array type to specify the stride, in bytes, of the array’s elements. Must not be applied to anything other than an array type.	Shader	Literal Number Array Stride
7	MatrixStride Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Specifies the stride of rows in a RowMajor-decorated matrix, or columns in a ColMajor-decorated matrix.	Matrix	Literal Number Matrix Stride
8	GLSLShared Apply to a structure type to get GLSL shared memory layout.	Shader
9	GLSLPacked Apply to a structure type to get GLSL packed memory layout.	Shader
10	CPacked Apply to a structure type to mark it as "packed", indicating that the alignment of the structure is one and that there is no padding between structure members.	Kernel
11	BuiltIn Apply to an object or a member of a structure type. Indicates which built-in variable the entity represents. See BuiltIn for more information.		BuiltIn
13	NoPerspective Apply to an object or a member of a structure type. Indicates that linear, non-perspective correct, interpolation must be used. Only valid for the Input and Output Storage Classes.	Shader
14	Flat Apply to an object or a member of a structure type. Indicates no interpolation will be done. The non-interpolated value will come from a vertex, as described in the API specification. Only valid for the Input and Output Storage Classes.	Shader
15	Patch Apply to an object or a member of a structure type. Indicates a tessellation patch. Only valid for the Input and Output Storage Classes. Invalid to use on objects or types referenced by non-tessellation Execution Models.	Tessellation
16	Centroid Apply to an object or a member of a structure type. When used with multi-sampling rasterization, allows a single interpolation location for an entire pixel. The interpolation location must lie in both the pixel and in the primitive being rasterized. Only valid for the Input and Output Storage Classes.	Shader
17	Sample Apply to an object or a member of a structure type. When used with multi-sampling rasterization, requires per-sample interpolation. The interpolation locations must be the locations of the samples lying in both the pixel and in the primitive being rasterized. Only valid for the Input and Output Storage Classes.	SampleRateShading
18	Invariant Apply to a variable to indicate that expressions computing its value must be evaluated invariantly with respect to other modules computing the same expressions.	Shader
19	Restrict Apply to a variable, to indicate the compiler may compile as if there is no aliasing. See the Aliasing section for more detail.
20	Aliased Apply to a variable, to indicate the compiler is to generate accesses to the variable that work correctly in the presence of aliasing. See the Aliasing section for more detail.
21	Volatile Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is volatile memory. Accesses to volatile memory cannot be eliminated, duplicated, or combined with other accesses. The variable cannot be in the Function Storage Class.
22	Constant Indicates that a global variable is constant and will never be modified. Only allowed on global variables.	Kernel
23	Coherent Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory backing the object is coherent.
24	NonWritable Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is not writable, and that this module does not write to it.
25	NonReadable Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is not readable, and that this module does not read from it.
26	Uniform Apply to an object or a member of a structure type. Asserts that the value backing the decorated <id> is dynamically uniform, hence the consumer is allowed to assume this is the case.	Shader
28	SaturatedConversion Indicates that a conversion to an integer type which is outside the representable range of Result Type will be clamped to the nearest representable value of Result Type. NaN will be converted to 0. This decoration can only be applied to conversion instructions to integer types, not including the OpSatConvertUToS and OpSatConvertSToU instructions.	Kernel
29	Stream Apply to an object or a member of a structure type. Indicates the stream number to put an output on. Only valid for the Output Storage Class and the Geometry Execution Model.	GeometryStreams	Literal Number Stream Number
30	Location Apply to a variable or a structure-type member. Forms the main linkage for Storage Class Input and Output variables: - between the API and vertex-stage inputs, - between consecutive programmable stages, or - between fragment-stage outputs and the API. Also can tag variables or structure-type members in the UniformConstant Storage Class for linkage with the API. Only valid for the Input, Output, and UniformConstant Storage Classes.	Shader	Literal Number Location
31	Component Apply to an object or a member of a structure type. Indicates which component within a Location will be taken by the decorated entity. Only valid for the Input and Output Storage Classes.	Shader	Literal Number Component
32	Index Apply to a variable to identify a blend equation input index, used as described in the API specification. Only valid for the Output Storage Class and the Fragment Execution Model.	Shader	Literal Number Index
33	Binding Apply to a variable. Part of the main linkage between the API and SPIR-V modules for memory buffers, images, etc. See the API specification for more information.	Shader	Literal Number Binding Point
34	DescriptorSet Apply to a variable. Part of the main linkage between the API and SPIR-V modules for memory buffers, images, etc. See the API specification for more information.	Shader	Literal Number Descriptor Set
35	Offset Apply to a structure-type member. This gives the byte offset of the member relative to the beginning of the structure. Can be used, for example, by both uniform and transform-feedback buffers. It must not cause any overlap of the structure’s members, or overflow of a transform-feedback buffer’s XfbStride.	Shader	Literal Number Byte Offset
36	XfbBuffer Apply to an object or a member of a structure type. Indicates which transform-feedback buffer an output is written to. Only valid for the Output Storage Classes of vertex processing Execution Models.	TransformFeedback	Literal Number XFB Buffer Number
37	XfbStride Apply to anything XfbBuffer is applied to. Specifies the stride, in bytes, of transform-feedback buffer vertices. If the transform-feedback buffer is capturing any double-precision components, the stride must be a multiple of 8, otherwise it must be a multiple of 4.	TransformFeedback	Literal Number XFB Stride
38	FuncParamAttr Indicates a function return value or parameter attribute.	Kernel	Function Parameter Attribute Function Parameter Attribute
39	FPRoundingMode Indicates a floating-point rounding mode.	Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16	FP Rounding Mode Floating-Point Rounding Mode
40	FPFastMathMode Indicates a floating-point fast math flag.	Kernel	FP Fast Math Mode Fast-Math Mode
41	LinkageAttributes Associate linkage attributes to values. Only valid on OpFunction or global (module scope) OpVariable. See linkage.	Linkage	Literal String Name	Linkage Type Linkage Type
42	NoContraction Apply to an arithmetic instruction to indicate the operation cannot be combined with another instruction to form a single operation. For example, if applied to an OpFMul, that multiply can’t be combined with an addition to yield a fused multiply-add operation. Furthermore, such operations are not allowed to reassociate; e.g., add(a + add(b+c)) cannot be transformed to add(add(a+b) + c).	Shader
43	InputAttachmentIndex Apply to a variable to provide an input-target index (as described in the API specification). Only valid in the Fragment Execution Model and for variables of type OpTypeImage with a Dim operand of SubpassData.	InputAttachment	Literal Number Attachment Index
44	Alignment Apply to a pointer. This declares a known minimum alignment the pointer has.	Kernel	Literal Number Alignment
4999	ExplicitInterpAMD
5248	OverrideCoverageNV	SampleMaskOverrideCoverageNV
5250	PassthroughNV	GeometryShaderPassthroughNV
5252	ViewportRelativeNV	ShaderViewportMaskNV
5256	SecondaryViewportRelativeNV	ShaderStereoViewNV	Literal Number Offset
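To show how a decoration and its extra operands travel together, the sketch below assembles the words of OpDecorate and OpMemberDecorate instructions. It relies on facts defined elsewhere in the SPIR-V specification rather than in this section: opcode 71 for OpDecorate, opcode 72 for OpMemberDecorate, and a first word holding the word count in its high 16 bits and the opcode in its low 16 bits. The helper names are hypothetical, only a few decorations from the table are included, and the <id> values in the usage note are arbitrary.

```python
OP_DECORATE = 71         # opcode, from the instruction listing of the spec
OP_MEMBER_DECORATE = 72  # opcode, from the instruction listing of the spec

# A few enumerant values copied from the Decoration table above.
DECORATION = {
    "ArrayStride": 6,
    "Location": 30,
    "Binding": 33,
    "DescriptorSet": 34,
    "Offset": 35,
}


def _instruction(opcode, operands):
    """Prefix operands with the word holding the word count and opcode."""
    word_count = len(operands) + 1
    return [(word_count << 16) | opcode] + list(operands)


def op_decorate(target_id, decoration, *extra):
    """Words for OpDecorate: target <id>, decoration, extra operands."""
    return _instruction(OP_DECORATE, [target_id, DECORATION[decoration], *extra])


def op_member_decorate(struct_id, member, decoration, *extra):
    """Words for OpMemberDecorate: struct <id>, member index, decoration, extras."""
    return _instruction(
        OP_MEMBER_DECORATE, [struct_id, member, DECORATION[decoration], *extra]
    )
```

Under these assumptions, `op_decorate(5, "Location", 0)` produces four words starting with `0x00040047`, and `op_member_decorate(7, 1, "Offset", 16)` produces five words starting with `0x00050048`.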

Decoration

Enabling Capabilities

Extra Operands

RelaxedPrecision
Allow reduced precision operations. To be used as described in Relaxed Precision.

Shader

SpecId
Apply to a scalar specialization constant. Forms the API linkage for setting a specialized value. See specialization.

Shader

Literal Number
Specialization Constant ID

Block
Apply to a structure type to establish it is a non-SSBO-like shader-interface block.

Shader

BufferBlock
Apply to a structure type to establish it is an SSBO-like shader-interface block.

Shader

RowMajor
Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Indicates that components within a row are contiguous in memory.

Matrix

ColMajor
Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Indicates that components within a column are contiguous in memory.

Matrix

ArrayStride
Apply to an array type to specify the stride, in bytes, of the array’s elements. Must not be applied to anything other than an array type.

Shader

Literal Number
Array Stride

MatrixStride
Applies only to a member of a structure type. Only valid on a matrix or array whose most basic element is a matrix. Specifies the stride of rows in a RowMajor-decorated matrix, or columns in a ColMajor-decorated matrix.

Matrix

Literal Number
Matrix Stride

GLSLShared
Apply to a structure type to get GLSL shared memory layout.

Shader

GLSLPacked
Apply to a structure type to get GLSL packed memory layout.

Shader

CPacked
Apply to a structure type, to marks it as "packed", indicating that the alignment of the structure is one and that there is no padding between structure members.

Kernel

BuiltIn
Apply to an object or a member of a structure type. Indicates which built-in variable the entity represents. See BuiltIn for more information.

BuiltIn

NoPerspective
Apply to an object or a member of a structure type. Indicates that linear, non-perspective correct, interpolation must be used. Only valid for the Input and Output Storage Classes.

Shader

Flat
Apply to an object or a member of a structure type. Indicates no interpolation will be done. The non-interpolated value will come from a vertex, as described in the API specification. Only valid for the Input and Output Storage Classes.

Shader

Patch
Apply to an object or a member of a structure type. Indicates a tessellation patch. Only valid for the Input and Output Storage Classes. Invalid to use on objects or types referenced by non-tessellation Execution Models.

Tessellation

Centroid
Apply to an object or a member of a structure type. When used with multi-sampling rasterization, allows a single interpolation location for an entire pixel. The interpolation location must lie in both the pixel and in the primitive being rasterized. Only valid for the Input and Output Storage Classes.

Shader

Sample
Apply to an object or a member of a structure type. When used with multi-sampling rasterization, requires per-sample interpolation. The interpolation locations must be the locations of the samples lying in both the pixel and in the primitive being rasterized. Only valid for the Input and Output Storage Classes.

SampleRateShading

Invariant
Apply to a variable, to indicate expressions computing its value be done invariant with respect to other modules computing the same expressions.

Shader

Restrict
Apply to a variable, to indicate the compiler may compile as if there is no aliasing. See the Aliasing section for more detail.

Aliased
Apply to a variable, to indicate the compiler is to generate accesses to the variable that work correctly in the presence of aliasing. See the Aliasing section for more detail.

Volatile
Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is volatile memory. Accesses to volatile memory cannot be eliminated, duplicated, or combined with other accesses. The variable cannot be in the Function Storage Class.

Constant
Indicates that a global variable is constant and will never be modified. Only allowed on global variables.

Kernel

Coherent
Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory backing the object is coherent.

NonWritable
Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is not writable, and that this module does not write to it.

NonReadable
Apply to an object or a member of a structure type. Can only be used for objects declared as storage images (see OpTypeImage) or in the Uniform Storage Class with the BufferBlock Decoration. This indicates the memory holding the variable is not readable, and that this module does not read from it.

Uniform
Apply to an object or a member of a structure type. Asserts that the value backing the decorated <id> is dynamically uniform, hence the consumer is allowed to assume this is the case.

Shader

SaturatedConversion
Indicates that a conversion to an integer type which is outside the representable range of Result Type will be clamped to the nearest representable value of Result Type. NaN will be converted to 0.

This decoration can only be applied to conversion instructions to integer types, not including the OpSatConvertUToS and OpSatConvertSToU instructions.

Kernel

Stream
Apply to an object or a member of a structure type. Indicates the stream number to put an output on. Only valid for the Output Storage Class and the Geometry Execution Model.

GeometryStreams

Literal Number
Stream Number

Location
Apply to a variable or a structure-type member. Forms the main linkage for Storage Class Input and Output variables:
- between the API and vertex-stage inputs,
- between consecutive programmable stages, or
- between fragment-stage outputs and the API.
Also can tag variables or structure-type members in the UniformConstant Storage Class for linkage with the API.
Only valid for the Input, Output, and UniformConstant Storage Classes.

Shader

Literal Number
Location

Component
Apply to an object or a member of a structure type. Indicates which component within a Location will be taken by the decorated entity. Only valid for the Input and Output Storage Classes.

Shader

Literal Number
Component

Index
Apply to a variable to identify a blend equation input index, used as described in the API specification. Only valid for the Output Storage Class and the Fragment Execution Model.

Shader

Literal Number
Index

Binding
Apply to a variable. Part of the main linkage between the API and SPIR-V modules for memory buffers, images, etc. See the API specification for more information.

Shader

Literal Number
Binding Point

DescriptorSet
Apply to a variable. Part of the main linkage between the API and SPIR-V modules for memory buffers, images, etc. See the API specification for more information.

Shader

Literal Number
Descriptor Set

Offset
Apply to a structure-type member. This gives the byte offset of the member relative to the beginning of the structure. Can be used, for example, by both uniform and transform-feedback buffers. It must not cause any overlap of the structure’s members, or overflow of a transform-feedback buffer’s XfbStride.

Shader

Literal Number
Byte Offset

XfbBuffer
Apply to an object or a member of a structure type. Indicates which transform-feedback buffer an output is written to. Only valid for the Output Storage Class of vertex processing Execution Models.

TransformFeedback

Literal Number
XFB Buffer Number

XfbStride
Apply to anything XfbBuffer is applied to. Specifies the stride, in bytes, of transform-feedback buffer vertices. If the transform-feedback buffer is capturing any double-precision components, the stride must be a multiple of 8, otherwise it must be a multiple of 4.

TransformFeedback

Literal Number
XFB Stride

FuncParamAttr
Indicates a function return value or parameter attribute.

Kernel

Function Parameter Attribute
Function Parameter Attribute

FPRoundingMode
Indicates a floating-point rounding mode.

Kernel, StorageUniformBufferBlock16, StorageUniform16, StoragePushConstant16, StorageInputOutput16

FP Rounding Mode
Floating-Point Rounding Mode

FPFastMathMode
Indicates a floating-point fast math flag.

Kernel

FP Fast Math Mode
Fast-Math Mode

LinkageAttributes
Associate linkage attributes to values. Only valid on OpFunction or global (module scope) OpVariable. See linkage.

Linkage

Literal String
Name

Linkage Type
Linkage Type

NoContraction
Apply to an arithmetic instruction to indicate the operation cannot be combined with another instruction to form a single operation. For example, if applied to an OpFMul, that multiply can’t be combined with an addition to yield a fused multiply-add operation. Furthermore, such operations are not allowed to reassociate; e.g., add(a + add(b+c)) cannot be transformed to add(add(a+b) + c).

Shader

InputAttachmentIndex
Apply to a variable to provide an input-target index (as described in the API specification). Only valid in the Fragment Execution Model and for variables of type OpTypeImage with a Dim operand of SubpassData.

InputAttachment

Literal Number
Attachment Index

Alignment
Apply to a pointer. This declares a known minimum alignment the pointer has.

Kernel

Literal Number
Alignment

4999

ExplicitInterpAMD

5248

OverrideCoverageNV

SampleMaskOverrideCoverageNV

5250

PassthroughNV

GeometryShaderPassthroughNV

5252

ViewportRelativeNV

ShaderViewportMaskNV

5256

SecondaryViewportRelativeNV

ShaderStereoViewNV

Literal Number
Offset

3.21. BuiltIn

Used when Decoration is BuiltIn. Apply to either:
- the result <id> of the variable declaration of the built-in variable, or
- a structure-type member, if the built-in is a member of a structure.

As stated per entry below, these have additional semantics and constraints described by the client API.

BuiltIn	Enabling Capabilities
0	Position Output vertex position from a vertex processing Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
1	PointSize Output point size from a vertex processing Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
3	ClipDistance Array of clip distances. See Vulkan or OpenGL API specifications for more detail.	ClipDistance
4	CullDistance Array of cull distances. See Vulkan or OpenGL API specifications for more detail.	CullDistance
5	VertexId Input vertex ID to a Vertex Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
6	InstanceId Input instance ID to a Vertex Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
7	PrimitiveId Primitive ID in a Geometry Execution Model. See Vulkan or OpenGL API specifications for more detail.	Geometry, Tessellation
8	InvocationId Invocation ID, input to Geometry and TessellationControl Execution Model. See Vulkan or OpenGL API specifications for more detail.	Geometry, Tessellation
9	Layer Layer output by a Geometry Execution Model, input to a Fragment Execution Model, for multi-layer framebuffer. See Vulkan or OpenGL API specifications for more detail.	Geometry
10	ViewportIndex Viewport Index output by a Geometry stage, input to a Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	MultiViewport
11	TessLevelOuter Output patch outer levels in a TessellationControl Execution Model. See Vulkan or OpenGL API specifications for more detail.	Tessellation
12	TessLevelInner Output patch inner levels in a TessellationControl Execution Model. See Vulkan or OpenGL API specifications for more detail.	Tessellation
13	TessCoord Input vertex position in TessellationEvaluation Execution Model. See Vulkan or OpenGL API specifications for more detail.	Tessellation
14	PatchVertices Input patch vertex count in a tessellation Execution Model. See Vulkan or OpenGL API specifications for more detail.	Tessellation
15	FragCoord Coordinates (x, y, z, 1/w) of the current fragment, input to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
16	PointCoord Coordinates within a point, input to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
17	FrontFacing Face direction, input to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
18	SampleId Input sample number to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	SampleRateShading
19	SamplePosition Input sample position to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	SampleRateShading
20	SampleMask Input or output sample mask to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
22	FragDepth Output fragment depth from the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
23	HelperInvocation Input whether a helper invocation, to the Fragment Execution Model. See Vulkan or OpenGL API specifications for more detail.	Shader
24	NumWorkgroups Number of workgroups in GLCompute or Kernel Execution Models. See OpenCL, Vulkan, or OpenGL API specifications for more detail.
25	WorkgroupSize Work-group size in GLCompute or Kernel Execution Models. See OpenCL, Vulkan, or OpenGL API specifications for more detail.
26	WorkgroupId Work-group ID in GLCompute or Kernel Execution Models. See OpenCL, Vulkan, or OpenGL API specifications for more detail.
27	LocalInvocationId Local invocation ID in GLCompute or Kernel Execution Models. See OpenCL, Vulkan, or OpenGL API specifications for more detail.
28	GlobalInvocationId Global invocation ID in GLCompute or Kernel Execution Models. See OpenCL, Vulkan, or OpenGL API specifications for more detail.
29	LocalInvocationIndex Local invocation index in GLCompute Execution Models. See Vulkan or OpenGL API specifications for more detail. Work-group Linear ID in Kernel Execution Models. See OpenCL API specification for more detail.
30	WorkDim Work dimensions in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
31	GlobalSize Global size in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
32	EnqueuedWorkgroupSize Enqueued work-group size in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
33	GlobalOffset Global offset in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
34	GlobalLinearId Global linear ID in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
36	SubgroupSize Subgroup size in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
37	SubgroupMaxSize Subgroup maximum size in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
38	NumSubgroups Number of subgroups in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
39	NumEnqueuedSubgroups Number of enqueued subgroups in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
40	SubgroupId Subgroup ID in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
41	SubgroupLocalInvocationId Subgroup local invocation ID in Kernel Execution Models. See OpenCL API specification for more detail.	Kernel
42	VertexIndex Vertex index. See Vulkan or OpenGL API specifications for more detail.	Shader
43	InstanceIndex Instance index. See Vulkan or OpenGL API specifications for more detail.	Shader
4416	SubgroupEqMaskKHR	SubgroupBallotKHR
4417	SubgroupGeMaskKHR	SubgroupBallotKHR
4418	SubgroupGtMaskKHR	SubgroupBallotKHR
4419	SubgroupLeMaskKHR	SubgroupBallotKHR
4420	SubgroupLtMaskKHR	SubgroupBallotKHR
4424	BaseVertex	DrawParameters
4425	BaseInstance	DrawParameters
4426	DrawIndex	DrawParameters
4438	DeviceIndex	DeviceGroup
4440	ViewIndex	MultiView
4992	BaryCoordNoPerspAMD
4993	BaryCoordNoPerspCentroidAMD
4994	BaryCoordNoPerspSampleAMD
4995	BaryCoordSmoothAMD
4996	BaryCoordSmoothCentroidAMD
4997	BaryCoordSmoothSampleAMD
4998	BaryCoordPullModelAMD
5014	FragStencilRefEXT	StencilExportEXT
5253	ViewportMaskNV	ShaderViewportMaskNV
5257	SecondaryPositionNV	ShaderStereoViewNV
5258	SecondaryViewportMaskNV	ShaderStereoViewNV
5261	PositionPerViewNV	PerViewAttributesNV
5262	ViewportMaskPerViewNV	PerViewAttributesNV
5264	FullyCoveredEXT	FragmentFullyCoveredEXT

3.22. Selection Control

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

Used by OpSelectionMerge.

Selection Control
0x0	None
0x1	Flatten Strong request, to the extent possible, to remove the control flow for this selection.
0x2	DontFlatten Strong request, to the extent possible, to keep this selection as control flow.

3.23. Loop Control

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

Used by OpLoopMerge.

Loop Control
0x0	None
0x1	Unroll Strong request, to the extent possible, to unroll or unwind this loop.
0x2	DontUnroll Strong request, to the extent possible, to keep this loop as a loop, without unrolling.

3.24. Function Control

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

Used by OpFunction.

Function Control
0x0	None
0x1	Inline Strong request, to the extent possible, to inline the function.
0x2	DontInline Strong request, to the extent possible, to not inline the function.
0x4	Pure Compiler can assume this function has no side effect, but might read global memory or read through dereferenced function parameters. Always computes the same result for the same argument values.
0x8	Const Compiler can assume this function has no side effects, and will not access global memory or dereference function parameters. Always computes the same result for the same argument values.
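
As an informal illustration of how a Function Control literal mask is formed from multiple rows, the sketch below copies the bit values from the table above into a Python dictionary. The names FUNCTION_CONTROL_BITS and describe_function_control are hypothetical helpers for this example only; they are not part of SPIR-V or of any tool.

```python
# Bit values copied from the Function Control table; the dict itself is
# a hypothetical helper for illustration.
FUNCTION_CONTROL_BITS = {
    0x1: "Inline",
    0x2: "DontInline",
    0x4: "Pure",
    0x8: "Const",
}


def describe_function_control(mask):
    """Return the names of the table rows whose bits are set in mask."""
    names = [name for bit, name in FUNCTION_CONTROL_BITS.items() if mask & bit]
    return names or ["None"]


if __name__ == "__main__":
    # Combine two rows into one literal mask, then decode it again.
    mask = 0x1 | 0x4
    print(hex(mask), describe_function_control(mask))
```

The same pattern would apply to the Selection Control and Loop Control masks, with their own bit tables.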

3.25. Memory Semantics <id>

Must be an <id> of a 32-bit integer scalar that contains a mask. The rest of this description is about that mask.

Memory semantics define memory-order constraints and the storage classes to which those constraints apply. The memory order constrains the allowed orders in which memory operations in this invocation can be made visible to another invocation. The storage classes specify to which subsets of memory these constraints are to be applied. Storage classes not selected are not being constrained.

Despite being a mask and allowing multiple bits to be combined, at most one of the first four (low-order) bits can be set. Requesting both Acquire and Release semantics is done by setting the AcquireRelease bit, not by setting two bits.

This value is a mask; it can be formed by combining the bits from multiple rows in the table below.

Used by:

Memory Semantics	Enabling Capabilities
0x0	None (Relaxed)
0x2	Acquire All memory operations provided in program order after this memory operation will execute after this memory operation.
0x4	Release All memory operations provided in program order before this memory operation will execute before this memory operation.
0x8	AcquireRelease Has the properties of both Acquire and Release semantics. It is used for read-modify-write operations.
0x10	SequentiallyConsistent All observers will see this memory access in the same order with respect to other sequentially-consistent memory accesses from this invocation.
0x40	UniformMemory Apply the memory-ordering constraints to Uniform Storage Class memory.	Shader
0x80	SubgroupMemory Apply the memory-ordering constraints to subgroup memory.
0x100	WorkgroupMemory Apply the memory-ordering constraints to Workgroup Storage Class memory.
0x200	CrossWorkgroupMemory Apply the memory-ordering constraints to CrossWorkgroup Storage Class memory.
0x400	AtomicCounterMemory Apply the memory-ordering constraints to AtomicCounter Storage Class memory.	AtomicStorage
0x800	ImageMemory Apply the memory-ordering constraints to image contents (types declared by OpTypeImage), or to accesses done through pointers to the Image Storage Class.
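
The rule that at most one of the four ordering bits can be set, while storage-class bits combine freely, can be sketched as a small check. This is a minimal sketch, not a validator from any real toolchain: the function name check_memory_semantics and the two dictionaries are hypothetical, and only the bit values come from the table above.

```python
# Bit values copied from the Memory Semantics table; the names of these
# dictionaries and of the function below are hypothetical.
ORDER_BITS = {
    0x2: "Acquire",
    0x4: "Release",
    0x8: "AcquireRelease",
    0x10: "SequentiallyConsistent",
}
STORAGE_BITS = {
    0x40: "UniformMemory",
    0x80: "SubgroupMemory",
    0x100: "WorkgroupMemory",
    0x200: "CrossWorkgroupMemory",
    0x400: "AtomicCounterMemory",
    0x800: "ImageMemory",
}


def check_memory_semantics(mask):
    """Split mask into (ordering name, storage-class names).

    Raises ValueError when more than one ordering bit is set, which the
    text above disallows.
    """
    order = [name for bit, name in ORDER_BITS.items() if mask & bit]
    if len(order) > 1:
        raise ValueError("more than one ordering bit set: " + ", ".join(order))
    storage = [name for bit, name in STORAGE_BITS.items() if mask & bit]
    return (order[0] if order else "None (Relaxed)", storage)


if __name__ == "__main__":
    # AcquireRelease ordering applied to Workgroup memory.
    print(check_memory_semantics(0x8 | 0x100))
```

Passing 0x2 | 0x4 raises, since acquire-and-release ordering is requested with the single AcquireRelease bit rather than with two bits.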

3.26. Memory Access

Memory access semantics.

This value is a literal mask; it can be formed by combining the bits from multiple rows in the table below.

Used by:

OpLoad
OpStore
OpCopyMemory
OpCopyMemorySized

Memory Access
0x0	None
0x1	Volatile This access cannot be eliminated, duplicated, or combined with other accesses.
0x2	Aligned This access has a known alignment, provided as a literal in the next operand.
0x4	Nontemporal Hints that the accessed address is not likely to be accessed again in the near future.
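
Because Aligned takes its alignment as a literal in the next operand, a Memory Access mask may be followed by one extra word. The sketch below shows one way this could be modeled; memory_access_operands is a hypothetical helper, and it returns only the trailing Memory Access operands, not a complete instruction encoding.

```python
# Bit values copied from the Memory Access table.
VOLATILE = 0x1
ALIGNED = 0x2
NONTEMPORAL = 0x4


def memory_access_operands(volatile=False, alignment=None, nontemporal=False):
    """Return the Memory Access operand words (hypothetical helper).

    The first word is the literal mask. When an alignment is given, the
    Aligned bit is set and the alignment literal follows the mask.
    """
    mask = 0
    if volatile:
        mask |= VOLATILE
    if alignment is not None:
        mask |= ALIGNED
    if nontemporal:
        mask |= NONTEMPORAL
    words = [mask]
    if alignment is not None:
        words.append(alignment)
    return words


if __name__ == "__main__":
    # Volatile access with a known 16-byte alignment: mask 0x3, then 16.
    print(memory_access_operands(volatile=True, alignment=16))
```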

3.27. Scope <id>

Must be an <id> of a 32-bit integer scalar that contains one of the values in the table below. The rest of this description is about that value.

The execution scope or memory scope of an operation. When used as a memory scope, it specifies the distance of synchronization from the current invocation. When used as an execution scope, it specifies the set of executing invocations taking part in the operation. Used by:

OpControlBarrier
OpMemoryBarrier
OpAtomicLoad
OpAtomicStore
OpAtomicExchange
OpAtomicCompareExchange
OpAtomicCompareExchangeWeak
OpAtomicIIncrement
OpAtomicIDecrement
OpAtomicIAdd
OpAtomicISub
OpAtomicSMin
OpAtomicUMin
OpAtomicSMax
OpAtomicUMax
OpAtomicAnd
OpAtomicOr
OpAtomicXor
OpGroupAsyncCopy
OpGroupWaitEvents
OpGroupAll
OpGroupAny
OpGroupBroadcast
OpGroupIAdd
OpGroupFAdd
OpGroupFMin
OpGroupUMin
OpGroupSMin
OpGroupFMax
OpGroupUMax
OpGroupSMax
OpGroupReserveReadPipePackets
OpGroupReserveWritePipePackets
OpGroupCommitReadPipe
OpGroupCommitWritePipe
OpAtomicFlagTestAndSet
OpAtomicFlagClear
OpGroupIAddNonUniformAMD
OpGroupFAddNonUniformAMD
OpGroupFMinNonUniformAMD
OpGroupUMinNonUniformAMD
OpGroupSMinNonUniformAMD
OpGroupFMaxNonUniformAMD
OpGroupUMaxNonUniformAMD
OpGroupSMaxNonUniformAMD

Scope
0	CrossDevice Scope crosses multiple devices.
1	Device Scope is the current device.
2	Workgroup Scope is the current workgroup.
3	Subgroup Scope is the current subgroup.
4	Invocation Scope is the current Invocation.

3.28. Group Operation

Defines the class of workgroup or subgroup operation. Used by:

OpGroupIAdd
OpGroupFAdd
OpGroupFMin
OpGroupUMin
OpGroupSMin
OpGroupFMax
OpGroupUMax
OpGroupSMax
OpGroupIAddNonUniformAMD
OpGroupFAddNonUniformAMD
OpGroupFMinNonUniformAMD
OpGroupUMinNonUniformAMD
OpGroupSMinNonUniformAMD
OpGroupFMaxNonUniformAMD
OpGroupUMaxNonUniformAMD
OpGroupSMaxNonUniformAMD

Group Operation	Enabling Capabilities
0	Reduce A reduction operation for all values of a specific value X specified by invocations within a workgroup.	Kernel
1	InclusiveScan A binary operation with an identity I and n (where n is the size of the workgroup) elements [a₀, a₁, …, aₙ₋₁], resulting in [a₀, (a₀ op a₁), …, (a₀ op a₁ op … op aₙ₋₁)].	Kernel
2	ExclusiveScan A binary operation with an identity I and n (where n is the size of the workgroup) elements [a₀, a₁, …, aₙ₋₁], resulting in [I, a₀, (a₀ op a₁), …, (a₀ op a₁ op … op aₙ₋₂)].	Kernel
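
The three Group Operation classes in the table above can be modeled on an ordinary list, where element i stands for the value contributed by invocation i. This is only a sequential sketch of the results described in the table, not of how a device computes them; the function names are hypothetical.

```python
from functools import reduce as fold
import operator


def group_reduce(values, op):
    """Reduce: combine the values of all invocations into one result."""
    return fold(op, values)


def inclusive_scan(values, op):
    """InclusiveScan: element i combines values 0..i."""
    out = []
    for v in values:
        out.append(v if not out else op(out[-1], v))
    return out


def exclusive_scan(values, op, identity):
    """ExclusiveScan: element i combines values 0..i-1; element 0 is the identity."""
    out = []
    acc = identity
    for v in values:
        out.append(acc)
        acc = op(acc, v)
    return out


if __name__ == "__main__":
    vals = [1, 2, 3, 4]
    print(group_reduce(vals, operator.add))       # 10
    print(inclusive_scan(vals, operator.add))     # [1, 3, 6, 10]
    print(exclusive_scan(vals, operator.add, 0))  # [0, 1, 3, 6]
```

For addition the identity I is 0; for a minimum or maximum operation it would be the largest or smallest representable value of the operand type.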

3.29. Kernel Enqueue Flags

Specify when the child kernel begins execution.

Note: Implementations are not required to honor this flag. However, implementations must not schedule a kernel launch earlier than the point specified by this flag. Used by OpEnqueueKernel.

Kernel Enqueue Flags	Enabling Capabilities
0	NoWait Indicates that the enqueued kernels do not need to wait for the parent kernel to finish execution before they begin execution.	Kernel
1	WaitKernel Indicates that all work-items of the parent kernel must finish executing and all immediate side effects must be committed before the enqueued child kernel may begin execution. Note: "Immediate" excludes side effects resulting from child kernels. Immediate side effects include stores to global memory and pipe reads and writes.	Kernel
2	WaitWorkGroup Indicates that the enqueued kernels wait only for the workgroup that enqueued the kernels to finish before they begin execution. Note: This acts as a memory synchronization point between work-items in a work-group and child kernels enqueued by work-items in the work-group.	Kernel

3.30. Kernel Profiling Info

Specify the profiling information to be queried. Used by OpCaptureEventProfilingInfo.

This value is a mask; it can be formed by combining the bits from multiple rows in the table below.

Kernel Profiling Info	Enabling Capabilities
0x0	None
0x1	CmdExecTime Indicates that the profiling info queried is the execution time.	Kernel

3.31. Capability

Capabilities a module can declare it uses. All used capabilities must be declared, either directly or through a dependency: all capabilities that a declared capability depends on are automatically implied.

The Depends On column lists the dependencies for each capability. These are the ones implicitly declared. It is not necessary (but allowed) to declare a dependency for a declared capability.

See the capabilities section for more detail. Used by OpCapability.
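
The implicit declaration of dependencies can be pictured as a transitive closure over the Depends On column. The sketch below uses a handful of rows copied from the Capability table in this section; the DEPENDS_ON dictionary and the implied_capabilities function are hypothetical names chosen for this example, and the dictionary is deliberately incomplete.

```python
# A few rows of the Depends On column, copied from the Capability table.
# This subset is for illustration only.
DEPENDS_ON = {
    "Matrix": [],
    "Shader": ["Matrix"],
    "Geometry": ["Shader"],
    "Tessellation": ["Shader"],
    "GeometryStreams": ["Geometry"],
    "SampledCubeArray": ["Shader"],
    "ImageCubeArray": ["SampledCubeArray"],
    "Kernel": [],
    "ImageBasic": ["Kernel"],
    "ImageReadWrite": ["ImageBasic"],
}


def implied_capabilities(declared):
    """Return the declared capabilities plus everything they depend on."""
    result = set()
    stack = list(declared)
    while stack:
        cap = stack.pop()
        if cap in result:
            continue
        result.add(cap)
        # Follow the Depends On entries; unknown names have no dependencies here.
        stack.extend(DEPENDS_ON.get(cap, []))
    return result


if __name__ == "__main__":
    # Declaring GeometryStreams alone also implies Geometry, Shader, and Matrix.
    print(sorted(implied_capabilities(["GeometryStreams"])))
```

Under this model, a module that declares only GeometryStreams is treated as if it had also declared Geometry, Shader, and Matrix; declaring them explicitly as well is allowed but changes nothing.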

Capability	Depends On	Enabled by Extension
0	Matrix Uses OpTypeMatrix.
1	Shader Uses Vertex, Fragment, or GLCompute Execution Models.	Matrix
2	Geometry Uses the Geometry Execution Model.	Shader
3	Tessellation Uses the TessellationControl or TessellationEvaluation Execution Models.	Shader
4	Addresses Uses physical addressing, non-logical addressing modes.
5	Linkage Uses partially linked modules and libraries.
6	Kernel Uses the Kernel Execution Model.
7	Vector16 Uses OpTypeVector to declare 8 component or 16 component vectors.	Kernel
8	Float16Buffer Allows a 16-bit OpTypeFloat instruction for the sole purpose of creating an OpTypePointer to a 16-bit float. Pointers to a 16-bit float cannot be dereferenced directly; they must only be dereferenced via an extended instruction. All other uses of 16-bit OpTypeFloat are disallowed.	Kernel
9	Float16 Uses OpTypeFloat to declare the 16-bit floating-point type.
10	Float64 Uses OpTypeFloat to declare the 64-bit floating-point type.
11	Int64 Uses OpTypeInt to declare 64-bit integer types.
12	Int64Atomics Uses atomic instructions on 64-bit integer types.	Int64
13	ImageBasic Uses OpTypeImage or OpTypeSampler in a Kernel.	Kernel
14	ImageReadWrite Uses OpTypeImage with the ReadWrite access qualifier.	ImageBasic
15	ImageMipmap Uses non-zero Lod Image Operands.	ImageBasic
17	Pipes Uses OpTypePipe, OpTypeReserveId, or pipe instructions.	Kernel
18	Groups Uses group instructions.
19	DeviceEnqueue Uses OpTypeQueue, OpTypeDeviceEvent, and device side enqueue instructions.	Kernel
20	LiteralSampler Samplers are made from literals within the module. See OpConstantSampler.	Kernel
21	AtomicStorage Uses the AtomicCounter Storage Class, allowing use of only the OpAtomicLoad, OpAtomicIIncrement, and OpAtomicIDecrement instructions.	Shader
22	Int16 Uses OpTypeInt to declare 16-bit integer types.
23	TessellationPointSize Tessellation stage exports point size.	Tessellation
24	GeometryPointSize Geometry stage exports point size.	Geometry
25	ImageGatherExtended Uses texture gather with non-constant or independent offsets.	Shader
27	StorageImageMultisample Uses multi-sample images for non-sampled images.	Shader
28	UniformBufferArrayDynamicIndexing Block-decorated arrays in uniform storage classes use dynamically uniform indexing.	Shader
29	SampledImageArrayDynamicIndexing Arrays of sampled images use dynamically uniform indexing.	Shader
30	StorageBufferArrayDynamicIndexing BufferBlock-decorated arrays in uniform storage classes use dynamically uniform indexing.	Shader
31	StorageImageArrayDynamicIndexing Arrays of non-sampled images are accessed with dynamically uniform indexing.	Shader
32	ClipDistance Uses the ClipDistance BuiltIn.	Shader
33	CullDistance Uses the CullDistance BuiltIn.	Shader
34	ImageCubeArray Uses the Cube Dim with the Arrayed operand in OpTypeImage, without a sampler.	SampledCubeArray
35	SampleRateShading Uses per-sample rate shading.	Shader
36	ImageRect Uses the Rect Dim without a sampler.	SampledRect
37	SampledRect Uses the Rect Dim with a sampler.	Shader
38	GenericPointer Uses the Generic Storage Class.	Addresses
39	Int8 Uses OpTypeInt to declare 8-bit integer types.	Kernel
40	InputAttachment Uses the SubpassData Dim.	Shader
41	SparseResidency Uses OpImageSparse… instructions.	Shader
42	MinLod Uses the MinLod Image Operand.	Shader
43	Sampled1D Uses the 1D Dim with a sampler.
44	Image1D Uses the 1D Dim without a sampler.	Sampled1D
45	SampledCubeArray Uses the Cube Dim with the Arrayed operand in OpTypeImage, with a sampler.	Shader
46	SampledBuffer Uses the Buffer Dim with a sampler.
47	ImageBuffer Uses the Buffer Dim without a sampler.	SampledBuffer
48	ImageMSArray An MS operand in OpTypeImage indicates multisampled, used without a sampler.	Shader
49	StorageImageExtendedFormats One of a large set of more advanced image formats is used, namely one of those in the Image Format table listed as requiring this capability.	Shader
50	ImageQuery The sizes, number of samples, lod, etc. of an image are queried.	Shader
51	DerivativeControl Uses fine or coarse-grained derivatives, e.g., OpDPdxFine.	Shader
52	InterpolationFunction Uses one of the InterpolateAtCentroid, InterpolateAtSample, or InterpolateAtOffset GLSL.std.450 extended instructions.	Shader
53	TransformFeedback Uses the Xfb Execution Mode.	Shader
54	GeometryStreams Uses multiple numbered streams for geometry-stage output.	Geometry
55	StorageImageReadWithoutFormat OpImageRead can use the Unknown Image Format.	Shader
56	StorageImageWriteWithoutFormat OpImageWrite can use the Unknown Image Format.	Shader
57	MultiViewport Multiple viewports are used.	Geometry
4423	SubgroupBallotKHR		SPV_KHR_shader_ballot
4427	DrawParameters		SPV_KHR_shader_draw_parameters
4431	SubgroupVoteKHR		SPV_KHR_subgroup_vote
4433	StorageBuffer16BitAccess		SPV_KHR_16bit_storage
4433	StorageUniformBufferBlock16		SPV_KHR_16bit_storage
4434	UniformAndStorageBuffer16BitAccess	StorageBuffer16BitAccess, StorageUniformBufferBlock16	SPV_KHR_16bit_storage
4434	StorageUniform16	StorageBuffer16BitAccess, StorageUniformBufferBlock16	SPV_KHR_16bit_storage
4435	StoragePushConstant16		SPV_KHR_16bit_storage
4436	StorageInputOutput16		SPV_KHR_16bit_storage
4437	DeviceGroup		SPV_KHR_device_group
4439	MultiView	Shader	SPV_KHR_multiview
4441	VariablePointersStorageBuffer	Shader	SPV_KHR_variable_pointers
4442	VariablePointers	VariablePointersStorageBuffer	SPV_KHR_variable_pointers
4445	AtomicStorageOps		SPV_KHR_shader_atomic_counter_ops
4447	SampleMaskPostDepthCoverage		SPV_KHR_post_depth_coverage
5009	ImageGatherBiasLodAMD	Shader	SPV_AMD_texture_gather_bias_lod
5010	FragmentMaskAMD	Shader	SPV_AMD_shader_fragment_mask
5013	StencilExportEXT	Shader	SPV_EXT_shader_stencil_export
5015	ImageReadWriteLodAMD	Shader	SPV_AMD_shader_image_load_store_lod
5249	SampleMaskOverrideCoverageNV	SampleRateShading	SPV_NV_sample_mask_override_coverage
5251	GeometryShaderPassthroughNV	Geometry	SPV_NV_geometry_shader_passthrough
5254	ShaderViewportIndexLayerEXT	MultiViewport	SPV_EXT_shader_viewport_index_layer
5254	ShaderViewportIndexLayerNV	MultiViewport	SPV_NV_viewport_array2
5255	ShaderViewportMaskNV	ShaderViewportIndexLayerNV	SPV_NV_viewport_array2
5259	ShaderStereoViewNV	ShaderViewportMaskNV	SPV_NV_stereo_view_rendering
5260	PerViewAttributesNV	MultiView	SPV_NVX_multiview_per_view_attributes
5265	FragmentFullyCoveredEXT	Shader	SPV_EXT_fragment_fully_covered
5568	SubgroupShuffleINTEL		SPV_INTEL_subgroups
5569	SubgroupBufferBlockIOINTEL		SPV_INTEL_subgroups
5570	SubgroupImageBlockIOINTEL		SPV_INTEL_subgroups
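
The Depends On column can be read as a dependency graph. The following is a minimal sketch, assuming that declaring a capability implicitly declares every capability it depends on, directly or indirectly. The `DEPENDS_ON` dictionary is a hand-copied subset of the table, and `implied_capabilities` is a hypothetical helper name, not part of any SPIR-V tool.

```python
# Subset of the "Depends On" column, copied by hand from the capability table.
DEPENDS_ON = {
    "Matrix": [],
    "Shader": ["Matrix"],
    "Geometry": ["Shader"],
    "SampledCubeArray": ["Shader"],
    "ImageCubeArray": ["SampledCubeArray"],
    "MultiViewport": ["Geometry"],
    "ShaderViewportIndexLayerNV": ["MultiViewport"],
    "ShaderViewportMaskNV": ["ShaderViewportIndexLayerNV"],
    "ShaderStereoViewNV": ["ShaderViewportMaskNV"],
}


def implied_capabilities(name):
    """Return every capability reachable from `name` through Depends On."""
    seen = set()
    stack = list(DEPENDS_ON[name])
    while stack:
        cap = stack.pop()
        if cap not in seen:
            seen.add(cap)
            stack.extend(DEPENDS_ON[cap])
    return seen


print(sorted(implied_capabilities("ImageCubeArray")))
# ['Matrix', 'SampledCubeArray', 'Shader']
```

Under that assumption, a module declaring ImageCubeArray would not also need to declare SampledCubeArray, Shader, or Matrix explicitly.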


3.32. Instructions

Form for each instruction:

Opcode Name

Instruction description.

Word Count is the high-order 16 bits of word 0 of the instruction, holding its total WordCount. If the instruction takes a variable number of operands, Word Count will also say "+ variable", after stating the minimum size of the instruction.

Opcode is the low-order 16 bits of word 0 of the instruction, holding its opcode enumerant.

Results, when present, are any Result <id> or Result Type created by the instruction. Each one is always 32 bits.

Operands, when present, are any literals, other instructions' Result <id>s, etc., consumed by the instruction. Each one is always 32 bits.

Capability: Enabling Capabilities (when needed)

Word Count	Opcode	Results	Operands
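
The packing of Word Count and Opcode into word 0 can be sketched as follows. This is an illustrative example only; `split_word0` and `make_word0` are hypothetical helper names, and each word is assumed to have already been read as an unsigned 32-bit integer.

```python
def split_word0(word0):
    """Split word 0 of an instruction into (word_count, opcode)."""
    word_count = (word0 >> 16) & 0xFFFF  # high-order 16 bits
    opcode = word0 & 0xFFFF              # low-order 16 bits
    return word_count, opcode


def make_word0(word_count, opcode):
    """Pack a word count and an opcode into a single 32-bit word."""
    return ((word_count & 0xFFFF) << 16) | (opcode & 0xFFFF)


print(split_word0(0x00030001))  # (3, 1)
```

The word count includes word 0 itself, so the next instruction starts `word_count` words after the current one.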

3.32.1. Miscellaneous Instructions

OpNop

This has no semantic impact and can safely be removed from a module.
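
Because OpNop has no semantic impact, a tool may drop it while copying an instruction stream. The sketch below assumes that OpNop has opcode enumerant 0 (a value not stated in this section) and that `words` holds only instructions, with the module header already skipped. `strip_nops` is a hypothetical helper name.

```python
OP_NOP = 0  # assumed opcode enumerant of OpNop; not stated in this section


def strip_nops(words):
    """Return a copy of an instruction stream with every OpNop removed.

    `words` is a list of unsigned 32-bit integers holding instructions only.
    """
    out = []
    i = 0
    while i < len(words):
        word_count = words[i] >> 16   # high-order 16 bits of word 0
        opcode = words[i] & 0xFFFF    # low-order 16 bits of word 0
        if word_count == 0 or i + word_count > len(words):
            raise ValueError("malformed instruction at word %d" % i)
        if opcode != OP_NOP:
            out.extend(words[i:i + word_count])
        i += word_count
    return out


stream = [0x00010000, 0x00030001, 7, 9, 0x00010000]
print(strip_nops(stream) == [0x00030001, 7, 9])  # True
```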

OpUndef

Make an intermediate object whose value is undefined.

Result Type is the type of object to make.

Each consumption of Result <id> yields an arbitrary, possibly different bit pattern.

<id>
Result Type
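
As a rough illustration, an OpUndef instruction could be encoded as three words. The sketch assumes opcode enumerant 1, a word count of 3, and that the Result Type <id> precedes the Result <id>; none of these values is stated in this section. `encode_undef` is a hypothetical helper name.

```python
OP_UNDEF = 1        # assumed opcode enumerant of OpUndef
OP_UNDEF_WORDS = 3  # assumed size: word 0 + Result Type + Result <id>


def encode_undef(result_type_id, result_id):
    """Encode OpUndef as a list of three 32-bit words."""
    word0 = (OP_UNDEF_WORDS << 16) | OP_UNDEF
    return [word0, result_type_id, result_id]


print([hex(w) for w in encode_undef(7, 9)])  # ['0x30001', '0x7', '0x9']
```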
