The Java Virtual Machine Instruction Set
A Java virtual machine instruction consists of an opcode specifying the operation to be performed, followed by zero or more operands embodying values to be operated upon. This chapter gives details about the format of each Java virtual machine instruction and the operation it performs.
6.1 Assumptions: The Meaning of «Must»
The description of each instruction is always given in the context of Java virtual machine code that satisfies the static and structural constraints of Chapter 4, «The class File Format.» In the description of individual Java virtual machine instructions, we frequently state that some situation «must» or «must not» be the case: «The value2 must be of type int .» The constraints of Chapter 4 guarantee that all such expectations will in fact be met. If some constraint (a «must» or «must not») in an instruction description is not satisfied at run time, the behavior of the Java virtual machine is undefined.
The Java virtual machine checks that Java virtual machine code satisfies the static and structural constraints at link time using a class file verifier (see Section 4.9, «Verification of class Files»). Thus, a Java virtual machine will only attempt to execute code from valid class files. Performing verification at link time is attractive in that the checks are performed just once, substantially reducing the amount of work that must be done at run time. Other implementation strategies are possible, provided that they comply with The Java TM Language Specification and The Java TM Virtual Machine Specification.
6.2 Reserved Opcodes
In addition to the opcodes of the instructions specified later in this chapter, which are used in class files (see Chapter 4, «The class File Format»), three opcodes are reserved for internal use by a Java virtual machine implementation. If Sun extends the instruction set of the Java virtual machine in the future, these reserved opcodes are guaranteed not to be used.
Two of the reserved opcodes, numbers 254 (0xfe) and 255 (0xff), have the mnemonics impdep1 and impdep2, respectively. These instructions are intended to provide «back doors» or traps to implementation-specific functionality implemented in software and hardware, respectively. The third reserved opcode, number 202 (0xca), has the mnemonic breakpoint and is intended to be used by debuggers to implement breakpoints.
Although these opcodes have been reserved, they may be used only inside a Java virtual machine implementation. They cannot appear in valid class files. Tools such as debuggers or JIT code generators (§3.13) that might directly interact with Java virtual machine code that has been already loaded and executed may encounter these opcodes. Such tools should attempt to behave gracefully if they encounter any of these reserved instructions.
6.3 Virtual Machine Errors
A Java virtual machine implementation throws an object that is an instance of a subclass of the class VirtualMachineError when an internal error or resource limitation prevents it from correctly implementing the Java programming language. The Java virtual machine specification cannot predict where resource limitations or internal errors may be encountered and does not mandate precisely when they can be reported. Thus, any of the virtual machine errors listed as subclasses of VirtualMachineError in Section 2.16.4 may be thrown at any time during the operation of the Java virtual machine.
6.4 Format of Instruction Descriptions
Java virtual machine instructions are represented in this chapter by entries of the form shown in Figure 6.1, in alphabetical order and each beginning on a new page. For example:
mnemonic
Short description of the instruction
A longer description detailing constraints on operand stack contents or constant pool entries, the operation performed, the type of the results, etc.
If any linking exceptions may be thrown by the execution of this instruction, they are set off one to a line, in the order in which they must be thrown.
If any runtime exceptions can be thrown by the execution of an instruction, they are set off one to a line, in the order in which they must be thrown.
Other than the linking and runtime exceptions, if any, listed for an instruction, that instruction must not throw any runtime exceptions except for instances of VirtualMachineError or its subclasses.
Comments not strictly part of the specification of an instruction are set aside as notes at the end of the description.
Each cell in the instruction format diagram represents a single 8-bit byte. The instruction’s mnemonic is its name. Its opcode is its numeric representation and is given in both decimal and hexadecimal forms. Only the numeric representation is actually present in the Java virtual machine code in a class file.
Keep in mind that there are «operands» generated at compile time and embedded within Java virtual machine instructions, as well as «operands» calculated at run time and supplied on the operand stack. Although they are supplied from several different areas, all these operands represent the same thing: values to be operated upon by the Java virtual machine instruction being executed. By implicitly taking many of its operands from its operand stack, rather than representing them explicitly in its compiled code as additional operand bytes, register numbers, etc., the Java virtual machine’s code stays compact.
Some instructions are presented as members of a family of related instructions sharing a single description, format, and operand stack diagram. As such, a family of instructions includes several opcodes and opcode mnemonics; only the family mnemonic appears in the instruction format diagram, and a separate forms line lists all member mnemonics and opcodes. For example, the forms line for the lconst_ family of instructions, giving mnemonic and opcode information for the two instructions in that family (lconst_0 and lconst_1), is
lconst_0 = 9 (0x9)
lconst_1 = 10 (0xa)
In the description of the Java virtual machine instructions, the effect of an instruction’s execution on the operand stack (§3.6.2) of the current frame (§3.6) is represented textually, with the stack growing from left to right and each value represented separately. Thus,
Operand Stack
shows an operation that begins by having value2 on top of the operand stack with value1 just beneath it. As a result of the execution of the instruction, value1 and value2 are popped from the operand stack and replaced by result value, which has been calculated by the instruction. The remainder of the operand stack, represented by an ellipsis (. ), is unaffected by the instruction’s execution.
Values of types long and double are represented by a single entry on the operand stack. 1
1 Note that, in the first edition of this specification, values on the operand stack of types long and double were each represented in the stack diagram by two entries.
The Java TM Virtual Machine Specification
Copyright © 1999 Sun Microsystems, Inc. All rights reserved
Please send any comments or corrections through our feedback form