WDC 65C02


The Western Design Center 65C02 microprocessor is an enhanced CMOS version of the popular nMOS-based 8-bit MOS Technology 6502. It uses less power than the original 6502, fixes several problems, and adds new instructions and addressing modes. The power usage is on the order of 10 to 20 times less than the original 6502 running at the same speed; its reduced power consumption has made it useful in portable computer roles and industrial microcontroller systems. The 65C02 has also been used in some home computers, as well as in embedded applications, including implanted medical devices.
Development of the WDC 65C02 began in 1981 with samples released in early 1983. The 65C02
was officially released sometime shortly after. WDC licensed the design to Synertek, NCR, GTE Microcircuits, and Rockwell Semiconductor. Rockwell's primary interest was in the embedded market and asked for several new commands to be added to aid in this role. These were later copied back into the baseline version, at which point WDC added two new commands of their own to create the W65C02. Sanyo later licensed the design as well, and Seiko Epson produced a further modified version as the HuC6280.
Early versions used 40-pin DIP packaging, and were available in 1, 2 and 4 MHz versions, matching the speeds of the original nMOS versions. Later versions were produced in PLCC and QFP packages, as well as PDIP, and with much higher clock speed ratings. The current version from WDC, the W65C02S-14 has a fully static core and officially runs at speeds up to 14 MHz when powered at 5 volts.

Introduction and features

The 65C02 is a low cost, general-purpose 8-bit microprocessor with a 16-bit program counter and address bus. The register set is small, with a single 8-bit accumulator, two 8-bit index registers, an 8-bit status register, and a 16-bit program counter. In addition to the single accumulator, the first 256 bytes of RAM, the "zero page", allow faster access through addressing modes that use an 8-bit memory address instead of a 16-bit address. The stack lies in the next 256 bytes, page one, and cannot be moved or extended. The stack grows downward with the stack pointer starting at $01FF and decrementing with each byte that is pushed. The 65C02 has a variable-length instruction set, varying between one and three bytes per instruction.
The basic architecture of the 65C02 is identical to the original 6502, and may be considered a low-power implementation of that design. At 1 MHz, the most popular speed for the original 6502, the 65C02 requires only 20 mW, while the original uses 450 mW, a reduction of over twenty times. The manually optimized core and low power use is intended to make the 65C02 well suited for low power system-on-chip designs.
A Verilog hardware description model is available for designing the W65C02S core into an application-specific integrated circuit or a field-programmable gate array. As is common in the semiconductor industry, WDC offers a development system, which includes a developer board, an in-circuit emulator and a software development system.
The W65C02S6T is the production version, and is available in PDIP-40, PLCC-44 and QFP-44 packages. The maximum officially supported Ø2 clock speed is 14 MHz when operated at 5 volts, indicated by a –14 part number suffix. The "S" designation indicates that the part has a fully static core, a feature that supports stopping the Ø2 clock in either phase with no loss of state.  Typical microprocessors not implemented in CMOS have dynamic cores and will lose state if they are not continuously clocked at a rate between some minimum and maximum specified values.
The "6T" designation indicates the process geometry and that Taiwan Semiconductor Manufacturing Company is the foundry that produces WDC's wafers.

General logic features

[Image:Sitronix ST2064B silicon die.jpg|thumb|Die photograph of a Sitronix ST2064B microcontroller showing embedded W65C02S core in the upper right]

Logic features

  • Vector pull output indicates when interrupt vectors are being addressed.
  • Memory lock output indicates to other bus masters when a read-modify-write instruction is being processed.WAit-for-Interrupt and STo'P' instructions reduce power consumption, decrease interrupt latency and enable synchronization with external events.

Electrical features

Clocking features

The W65C02S may be operated at any convenient supply voltage between 1.8 and 5 volts. The data sheet AC characteristics table lists operational characteristics at 5 V at 14 MHz, 3.3 V or 3 V at 8 MHz, 2.5 V at 4 MHz, and 1.8 V at 2 MHz.  This information may be an artifact of an earlier data sheet, as a graph indicates that typical devices are capable of operation at higher speeds than suggested by the AC characteristics table, and that reliable operation at 20 MHz should be readily attainable with VDD at 5 volts, assuming the supporting hardware will allow it.
The W65C02S support for arbitrary clock rates allows it to use a clock that runs at a rate ideal for some other part of the system, such as 13.5 MHz, 14.31818 MHz, 14.75 MHz, 14.7456, etc., as long as VDD is sufficient to support the frequency. Designer Bill Mensch has pointed out that FMAX is affected by off-chip factors, such as the capacitive load on the microprocessor's pins. Minimizing load by using short signal tracks and fewest devices helps raise FMAX.  The PLCC and QFP packages have less pin-to-pin capacitance than the PDIP package, and are more economical in the use of printed circuit board space.
WDC has reported that FPGA realizations of the W65C02S have been successfully operated at 200 MHz.

Comparison with the NMOS 6502

Basic architecture

Although the 65C02 can mostly be thought of as a low-power 6502, it also fixes several bugs found in the original and adds new instructions, addressing modes and features that can assist the programmer in writing smaller and faster-executing programs. It is estimated that the average 6502 assembly language program can be made 10 to 15 percent smaller on the 65C02 and see a similar improvement in performance, largely through avoided memory accesses through the use of fewer instructions to accomplish a given task.

Undocumented instructions removed

The original 6502 has 56 instructions, which, when combined with different addressing modes, produce a total of 151 opcodes of the possible 256 8-bit opcode patterns. The remaining 105 unused opcodes are undefined, with the set of codes with low-order 4-bits with 3, 7, B or F left entirely unused, the code with low-order 2 having only a single opcode.
On the 6502, some of these leftover codes actually perform computation. Due to the way the 6502's instruction decoder works, simply setting certain bits in the opcode causes parts of the instruction processing to take place. Some of these opcodes immediately crash the processor, while others perform useful functions and were even given unofficial assembler mnemonics by some programmers.
The 65C02 adds new opcodes that use some of these previously undocumented instruction slots. For example, $FF is used for the new BBS instruction. Those which remain truly unused are equivalent to NOPs. 6502 programs using those opcodes will not work on the 65C02.

Bug fixes

A flaw that is present in all NMOS variants of the 6502 involves the jump instruction when using indirect addressing. In this addressing mode, the target address of the JMP instruction is fetched from memory, the jump vector, rather than being an operand to the JMP instruction. For example, JMP would fetch the value in memory locations and and load those values into the program counter, which would then cause the processor to continue execution at the address stored in the vector.
The flaw, which some consider a bug, appears when the vector address ends in, which is the boundary of a memory page. In this case, JMP will fetch the most significant byte of the target address from of the original page rather than of the new page. Hence JMP would get the least significant byte of the target address at and the most significant byte of the target address from rather than. The original 6502 documentation does not state that the address will cross pages in this fashion, so one cannot consider it to be a bug per-se. But many 6502 users perceived this complication to be a weakness, so it was eliminated in the 65C02 at the cost of spending another cycle to update the pointer.
Another by-design weakness that was revised by popular demand, the state of the ecimal flag in the NMOS 6502's status register is undefined after a reset or interrupt. This means programmers have to set the flag to a known value in order to avoid random errors caused by arithmetic operations performed in the mode other than the one intended, constituting software bugs. As a result, one finds a CLD instruction in almost all 6502 interrupt handlers, as well as early in the reset code. The 65C02 automatically clears this flag after pushing the status register onto the stack in response any interrupt or in response to a hardware reset, thus placing the processor back into binary arithmetic mode. This usually saves a few bytes in the software and eliminates the possibility of a common programming mistake, at the cost of increasing the size of code that runs in decimal mode as programmers have to remember to SED in places they previously assumed it would still be set.
During decimal mode arithmetic, the NMOS 6502 will put the egative, oerflow and ero flags into officially undefined states. Programmers found that the CPU updates these three flags to reflect the result of underlying binary arithmetic, that is, the flags reflect a result computed prior to the processor performing decimal correction. In contrast, the 65C02 sets these flags according to the result of decimal arithmetic, at the cost of an extra clock cycle per arithmetic instruction. Some writers assert that the V flag on the 65C02 is still incorrect in decimal mode, but the flag may also be considered to be meaningless because decimal arithmetic is always unsigned.
When executing a read-modify-write instruction, such as INC addr, all NMOS variants will do a double write on addr, first rewriting the current value found at addr and then writing the modified value. This behavior can result in difficult-to-resolve bugs if addr is a hardware register. This may occur if the hardware is watching for changes to the value in the register and then performs an action, in this case, it will perform two actions, one with the original value and then again with the new value. The 65C02 instead performs a double read of addr, followed by a single write.
When performing indexed addressing, if indexing crosses a page boundary all NMOS variants will read from an invalid address before accessing the correct address. As with a R-M-W instruction, this behavior can cause problems when accessing hardware registers via indexing. The 65C02 fixed this problem by performing a dummy read of the instruction opcode when indexing crosses a page boundary. However, this fix introduced a new bug that occurs when the base address is on an even page boundary. With the new bug, a dummy read is performed on the base address prior to indexing, such that LDA $1200,X will do a dummy read on prior to the value of X being added to. Again, if indexing on hardware register addresses, this bug can result in undefined behavior.
If an NMOS 6502 is fetching a BRK opcode at the same time a hardware interrupt occurs, the BRK will be ignored as the processor reacts to the hardware interrupt. The 65C02 correctly handles this situation by servicing the interrupt and then executing BRK.

New addressing modes

The 6502 has two indirect addressing modes which dereference through 16-bit addresses stored in page zero:
  • Indexed indirect e.g.. Adds the X register to the given page zero address before reading the 16-bit vector. In this example, if X is 5 then the 16-bit address is read from locations $15/$16. This is useful when there is an array of pointers in page zero.
  • Indirect indexed e.g.. Adds the Y register to the 16-bit vector read from the given page zero address. In this example, if Y is 5 and locations contain the vector then the read address will be. This performs pointer-offset addressing.
A downside of this model is that if indexing is not needed but the address is in the zero page, one of the index registers must still be set to zero and used in one of these instructions. Therefore the 65C02 adds a non-indexed indirect addressing mode, e.g., to all instructions that can use indexed indirect and indirect indexed modes. This leaves the index registers free for other uses.
The 6502's instruction has a unique addressing mode known as "absolute indirect" that reads a 16-bit value from a given memory address and then jumps to the address in that 16-bit value. For instance, if memory location holds and holds, will read those two bytes, construct the value, and then jump to that location.
One common use for indirect addressing is to build branch tables, a list of entry points for subroutines that can be accessed using an index. For instance, a device driver might list the entry points for,,, etc in a table at. is the third entry, zero indexed, and each address requires 16-bits, so to call one would use something similar to. If the driver is updated and the subroutine code moves in memory, any existing code will still work as long as the table of pointers remains at.
The 65C02 adds the new "indexed absolute indirect" mode which eases the use of branch tables. This mode adds the value of the X register to the absolute address and takes the 16-bit address from the resulting location. For instance, to access the function from the table above, one stores 4 in X, then executes. This style of access makes accessing branch tables simpler as a single base address is used in conjunction with an 8-bit offset. The same can be achieved in the NMOS version using indexed indirect mode, but only if the table is in the zero page, a limited resource. Allowing these tables to be constructed outside zero page not only lessens the demand for this resource but also allows the tables to be placed in ROM.

New and modified instructions

In addition to the new addressing modes, the "base model" 65C02 also adds a set of new instructions.INC and DEC with no parameters now increment or decrement the accumulator. This was an odd oversight in the original instruction set, which only included INX/DEX, INY/DEY, and INC addr/DEC addr. Some assemblers use the alternate forms INA/DEA or INC A/DEC A.STZ addr, STore Zero in addr, replaces the need to LDA #0;STA addr and doesn't require changing the value of the accumulator. As this task is common in most programs, using STZ can reduce code size, both by eliminating the LDA as well as any code needed to save the value of the accumulator, typically a PHA PLA pair.PHX,PLX,PHY,PLY push and pull the X and Y registers to and from the stack. Previously, only the accumulator and status register had push and pull instructions. X and Y could be stacked only by moving them to the accumulator first with TXA or TYA, thereby changing the accumulator contents, then using PHA.BRA, branch always, operates like a JMP but uses a 1-byte relative address like other branches, saving a byte. The speed is often the same as the 3 cycle absolute JMP unless a page is crossed which would make the BRA version 1 cycle longer. As the address is relative, it is also useful when writing relocatable code.

Bit manipulation instructions

Both WDC and Rockwell contributed improvements to the bit testing and manipulation functions in the 65C02. WDC added new addressing modes to the BIT instruction that was present in the 6502, as well two new instructions for convenient manipulation of bit fields, a common activity in device drivers.
in the 65C02 adds immediate mode, zero page indexed by X and absolute indexed by X addressing. Immediate mode addressing is particularly convenient in that it is completely non-destructive. For example:
may be used in place of:
The operation changes the value in the accumulator, so the original value loaded from $1234 is lost. Using leaves the value in the accumulator unchanged, so subsequent code can make additional tests against the original value, avoiding having to re-load the value from memory.
In addition to the enhancements of the instruction, WDC added two instructions designed to conveniently manipulate bit fields:TSB addr and TRB addr, Test and Set Bits and Test and Reset Bits.
Rockwell's changes added more bit manipulation instructions for any bit in zero page, to directly set or reset a bit with a 2-byte instruction, or to test and branch on a bit with a single 3-byte instruction. The new instructions were available from the start in Rockwell's R65C00 family, but were not part of the original 65C02 specification and not found in versions made by WDC or its other licensees. These were later copied back into the baseline design, and were available in later WDC versions.
Rockwell-specific instructions are:SMBbit# ''zp and RMBbit# zp. Set or Reset bit number bit# in zero page byte zp.BBSbit# zp,addr and BBRbit# zp,addr''. Branch on Bit Set/Reset.
Each of,,, and replaces a sequence of three instructions.

Low-power modes

In addition to the new commands above, WDC also added the STP and WAI instructions for supporting low-power modes.
, STop the Processor, halts all processing until a hardware reset is issued. This can be used to put a system to "sleep" and then rapidly "wake" it with a reset.
t has a similar effect, halting all processing, but this instruction resumes normal execution on the reception of an interrupt. Without this instruction, waiting for a hardware interrupt generally involves running a loop suspend the program until interrupt processing breaks out of the loop, sometimes known as "spinning". This means the processor runs during the entire process, using power while doing nothing, even when no interrupts are occurring. In the 65C02, interrupt code can be written by having a followed immediately by a or to the handler. When the is encountered, processing stops and the processor goes into low-power mode. When an interrupt is received, the processor immediately executes the and handles the request.
This has the added advantage of slightly improving performance. In the spinning case, the interrupt might arrive in the middle of one of the loop's instructions, and to allow it to restart after returning from the handler, the processor spends three cycles to save its location. With, the processor enters the low-power state in a known location where all instructions are guaranteed to be complete, so when the interrupt arrives it cannot possibly interrupt an instruction and the interrupt response can be immediate. Plus, since the program expects the interrupt, the processor can safely continue without spending time saving state; the program is responsible to perform any necessary state-saving before the.

65SC02

The 65SC02 is a variant of the WDC 65C02 without WAI, STP and bit instructions.

Uses

Home computers

Video game consoles

Other products