Memories in Computers Part 3: Flash Memories: Dr. William R. Huber, P.E
Memories in Computers Part 3: Flash Memories: Dr. William R. Huber, P.E
Memories in Computers Part 3: Flash Memories: Dr. William R. Huber, P.E
Memories in Computers
Part 3: Flash Memories
by
1
D. Kahng and S.M. Sze, "A floating-gate and its application to memory devices," The Bell System Technical
Journal, vol. 46, no. 6, July-August 1967, pp. 1288-1295
2
Masuoka, F. and H. Iizuka, “Semiconductor Memory Device and Method for Manufacturing the Same,” U. S.
Patent 4,531,203; Application date Nov. 13, 1981; FAP date Dec. 20, 1980; Issue date July 23, 1985; NOR-
structure flash memory
3
Masuoka, F. et al., “A New Flash EEPROM Cell Using Triple Polysilicon Technology,” IEEE IEDM Technical
Digest, 1984, p. 464
4
Momodomi, M., Masuoka, F., et al., “Electrically Erasable Programmable Read-Only Memory with NAND Cell
Structure,” U. S. Patent 5,050,125; Application date Nov. 17, 1988; FAP date Nov. 18, 1987; Issue date Sep.
17, 1991
5
Fulford, B., “Unsung Hero,” Forbes.com, June 24, 2002 (www.forbes.com/global/2002/0624/030_print.html)
6
Prince, B., Semiconductor Memories, Second Edition, ISBN 0 471 92465 2, 1983, John Wiley & Sons, West
Sussex, England, page 586
B. Storage Mechanism
As in a DRAM, information in a flash memory is stored as charge (or lack of
charge) on a capacitor. In a DRAM, charge leaks off of the capacitor through the
access transistor, and thus must be refreshed on the order of every 100 milliseconds.
In a flash memory, the charge is stored on a floating conductive layer, usually
polycrystalline silicon (polysilicon) that is surrounded by a high quality silicon dioxide
insulator. Leakage from a flash floating gate is so small that retention times are on the
order of 10 years! Starting with the analogy for DRAM storage of a balloon filled with air
that leaks out over time, flash memory storage would be akin to a pressurized steel
tank.
The basic floating gate structure is shown below. The green gate regions are
typically doped polysilicon; the gray
regions between and below the gates are
thin silicon dioxide; and the N+ source
and drain are doped by ion implantation
into the P- substrate. Without the floating
gate, the structure is a standard NMOS
transistor. Thus the floating gate
transistor is a simple addition to standard
MOS processing. The trick is getting
electrons onto and off of the floating gate, which would seem to be electrically isolated.
1. Moving Electrons Through an Insulator
The process of moving electrons to the floating gate is called “Programming” the
memory cell, and removing electrons from the floating gate is called “Erasing” the
memory cell. Programming acts to increase the threshold voltage of the MOS
transistor; erasing reduces the threshold voltage to its lowest value.
There are two primary methods for getting electrons to move through the gate
dielectric: hot electron injection and Fowler-Nordheim tunneling.
high energy electrons are able to jump over the 3.2 eV energy barrier at the interface
between the silicon substrate and the silicon dioxide. The positive control gate voltage
attracts the electrons toward the floating gate, where they lodge in trap sites. As the
floating gate becomes more negatively charged, additional electrons from the channel
are repelled, so HEI is self-limiting. Because of the high source-drain current, the +12V
supply is quite difficult to generate on-chip, so an off-chip high voltage supply is usually
required.
b. Fowler-Nordheim (F-N) Tunneling7,10
With the simple structure at left, F-N tunneling
is useful only for erasing, i.e., removing
electrons from the floating gate. To erase the
cell, the control gate is grounded and the drain
is biased to a high positive voltage (the source
is usually floating). The high electric field
across the gate-dielectric sandwich repels
electrons from the floating gate and attracts
them to the drain. F-N tunneling is not self-
limiting, so over-erasure (so that the transistor becomes depletion mode) is a common
problem. Depending on the transistor and array structure (more on array structures
7
Brown, W.D. and J.E. Brewer, Editors; Nonvolatile Semiconductor Memory Technology, ISBN 0-7803-
1173-6, 1998, IEEE Press, New York, NY
8
Cottrell, P.E. et al., “Hot Electron Emission in N-Channel IGFETs,” IEEE Journal of Solid-State Circuits,
Volume SC-14, 1979, page 442
9
Eitan, B. and D. Frohman-Bentchkowsky, “Hot Electron Injection into the Oxide in N-Channel MOS Devices,”
IEEE Transactions on Electron Devices, Volume ED-23, 1981, page 328
10
Lenzlinger, M. and E.H. Snow, “Fowler-Nordheim Tunneling in Thermally Grown SiO2,” Journal of Applied
Physics, Vol. 40, 1969, p. 278
below), F-N tunneling can occur from the floating gate to the source, drain, source and
drain, the substrate or even to a dedicated erase electrode.
2. More Complex Structures
The diagrams at the right
show the simple structure we
have been discussing [(a)],
along with three more complex
structures.11 The three more
complex structures [(b), (c) and
(d)] all include a “split gate”
transistor, in which a series
transistor directly controlled by
the control gate prevents
conduction of the floating gate
transistor in case it is over-
erased. The following table summarizes the important properties of each of these cell
structures.
Drawing (a) (b) (c) (d)
Programming HEI HEI HEI F-N Tunneling
Method from Drain
Erasing F-N Tunneling F-N Tunneling to F-N Tunneling F-N Tunneling
Method to Drain separate poly to Drain to Drain
erase gate
Advantages Simple Less stress on Low current
thin gate oxide allows on-chip
high voltage
generation
Disadvantages HEI could HEI could require HEI could Programming
require off-chip off-chip high require off-chip by F-N
high voltage; voltage; high voltage Tunneling is
No protection High voltage slower that HEI
against over- required for poly-
erasure to-poly erase
11
Prince, B., Semiconductor Memories, Second Edition, ISBN 0 471 92465 2, 1983, John Wiley & Sons,
West Sussex, England, page 186
12
Brown, W.D. and J.E. Brewer, Editors; op. cit.; pp. 255-281
13
Ibid., p. 256; Originally presented by Chang, C.; “Flash Memory Reliability;” Tutorial at International Reliability
Physics Symposium; 1993
These measures and others have pushed endurance to beyond 105 cycles, sufficient for
all but the most critical applications.
If a memory claims to be nonvolatile, it had better retain data for a long time.
Data retention in nonvolatile memory cells has improved from the “longer than one hour”
of Kahng and Sze’s first floating gate memory device14 to 10-20 years on current
devices. Retention time is influenced by many factors, but the only one under user
control is ambient temperature. In general, the lower the temperature (to the minimum
allowable temperature), the longer the data retention.
14
D. Kahng and S.M. Sze, op. cit., page 1295
voltage pulses of the same or different amplitude and duration (depending on the
specific device design) are applied to the erase gate.
3. Programming in a NOR Array
First we have to recognize that erased memory cells have a low threshold
voltage, thus are set to a “1”. Thus, only those cells intended to store a “0” have to be
programmed to a high threshold voltage by moving electrons to their floating gate. To
achieve the channel hot electron programming, the selected word line (and its
associated control gates) is biased to a boosted voltage of 12V; and the bit lines of
those cells to be programmed are pulsed to 7V. With source terminals grounded, a high
source-to-drain current will flow, and some of the resulting hot electrons will be injected
into the gate oxide and onto the floating gate. Usually a “program verify” operation is
performed; and if the desired threshold voltage level (above VDD) has not been
achieved, additional pulses of the same or different amplitude and duration will be
applied until the desired threshold results.
4. Reading in a NOR Array
There are different possibilities for reading the cells in a NOR array. As the
readout process is non-destructive, there is no need to restore the information in the
cells. Therefore we do not necessarily need a sense amplifier dedicated to each bit
line. One approach is to precharge the bit lines to a voltage above the sense point of
the sense amplifiers, typically VDD/2. Then the selected word line is driven to VDD. Any
memory cell storing a “1”, with a low threshold voltage, will turn on and discharge its bit
line. The associated sense amplifier will detect that low voltage and invert it to output a
“1”. Any memory cell that was programmed to a high threshold voltage, a “0”, will not
turn on and will not discharge its bit line; the sense amplifier will detect that high voltage,
invert it and output a “0”.
15
Spansion Application Note , “Flash Memory: An Overview,” November 10, 2005
a. Erasing
With the indicated voltages, all cells will be erased to “1”. In fact, all cells on the
entire length of the word lines and between the Select Gate devices will be erased. The
20V bias on the P-well will attract electrons from the floating gates via F-N tunneling.
The resulting threshold voltage of those erased cells must be negative, so that they will
conduct with zero applied gate voltage. As with the NOR array, an “erase verify”
operation insures that all cells are erased properly.
b. Programming
With the voltages indicated, only the cell on Bit Line 1 and Word Line 2 will be
programmed to a high threshold voltage (storing a “0”). Of course, any or all other cells
along the same word line can be programmed at the same time by biasing their bit lines
at 7V. Cells in other locations can be programmed with the appropriate voltages.
Electrons will travel from the P-well to the floating gate by F-N tunneling. The objective
of the programming is to increase the threshold voltage to about VDD/2, or 2.5V, so that
the programmed cells will be non-conductive with zero control gate voltage and will
conduct with VDD (5V) on the word line/control gate. Again, as with the NOR array, a
“program verify” operation is performed; and if the desired threshold voltage level
(VDD/2) has not been achieved, additional pulses of the same or different amplitude and
duration will be applied until the desired threshold results.
c. Reading
During reading, all word lines/control gates except for the one containing the cell
(or cells) to be read are biased at VDD. Because the threshold of even the programmed
cells is VDD/2, all of these transistors will be conducting. The selected word line is
biased at zero volts. Thus, if the cell to be read is erased (storing a “1”), it will also
conduct and the bit line, which had been precharged to 2.5V, will discharge to ground.
If the cell to be read is programmed (storing a “0”), it will not conduct and the bit line will
remain at its precharged voltage of 2.5V. The sense amplifier, connected to the top of
the bit line, will detect either case and output the appropriate result. To read multiple
bits along the selected word line, it is only necessary to connect a sense amplifier to
each of the desired bit lines.
Note that the bit line discharge path is through 18 transistors (or more, depending
on the memory architecture) in series. Thus the time to discharge the bit line (read the
selected cell) will be much longer than that for the NOR array where a single transistor
discharges the bit line.
As the table shows, NOR flash excels in random access performance; while
NAND flash has much faster block erase times and faster page program times. The
most important factor for most applications doesn’t appear in the table: cost per bit.
Because the silicon area per bit for NAND is only about 40% of that for NOR, NAND has
a very significant density and cost advantage.
16
Numonyx Embedded Flash Memory (J3 65 nm) Single Bit Cell (SBC) Data Sheet, March 2010
17
Micron MT29F2G08A NAND Flash Data Sheet, Rev. 1, 1/06 EN
The main memory array consists of 128 blocks, and a block is the smallest
erasable unit. Each block contains 128K Bytes, and a Byte is the smallest
addressable/programmable unit.
18
Numonyx Embedded Flash Memory (J3 65nm) Single Bit per Cell (SBC) Flash Data Sheet, March 2010
How do the address inputs map to the memory array? The following memory
map provides the details:
The memory array is byte-
addressable. The most
significant 7 bits of the address,
A[23:17], select the block; and
the remaining 17 bits, A[16:0],
select the byte within that block.
The addresses shown to the left
of each block are expressed in
hexadecimal notation. Each
symbol (0 thru 9, A thru F)
represents 4 binary bits. For
example, 6h corresponds to
0110 binary (or 6 decimal); and
Eh corresponds to 1110 binary
or 14 decimal. Appendix A
provides conversions among
hexadecimal, decimal and
binary notations.
The functions of some of the other pins shown on the Functional Block Diagram
are as follows.
CE[2:0] are the Chip Enable inputs. When appropriate logic levels are applied to
the CE inputs, the device is activated.
DQ[7:0] is the low byte data bus. Data input and output, as well as command
inputs are carried on these pins.
DQ[15:8] is the high byte data bus. In the x8 mode we are examining, these pins
float and are not used.
RP# is the RESET pin. When low it resets internal circuits and places the device
in the power-down mode. When high, it enables normal operation.
OE# is Output Enable. When low, it activates the device outputs during a read
operation. When high, the outputs are disabled.
WE# is Write Enable. When low, it enables writing to the memory array, the
Command User Interface and the write buffer. When high, no writes can occur.
STS is an open-drain status pin. In its default mode, it acts as a Ready/Busy#
(R/B#) pin. The pin must be tied to VDD through a pull-up resistor.
VPEN is Program Enable. When low, it prevents erasing or programming.
The signals (CE, RP#, OE#, WE#, VPEN) control bus operations, as tabulated
below:
2. Device Commands
Many different commands are available for this NOR flash memory, as tabulated
below. We will examine the most important ones.
Note that most commands require a Setup Cycle and a Confirm Cycle. This is
done to prevent accidental erasure or programming. Because the NOR device has
dedicated address pins, addresses and command codes can be loaded into the device
simultaneously. All commands load zeros into the highest 8 bits, so command codes
are often identified without the two leading zeros.
19
Programming converts a stored “1” to a “0.” Only erasing can convert a “0” to a “1,” so erasing usually
precedes programming.
First notice that, unlike DRAMs and the NOR-based device, there are no dedicated
address pins on this NAND device. Addresses, Data In and Data Out (as well as
commands) are all multiplexed on the pins labeled I/O [7:0]. Addressing is best
understood by looking at the internal organization of the memory array, as shown here:
20
Micron MT29F2G08A NAND Flash Data Sheet, Rev. 1, 1/06 EN
The Cache Register and Data Register are specialized memory arrays21 used in
the reading and programming of the main memory. The main memory array consists of
2,048 blocks, and a block is the smallest erasable unit. Each block contains 64 pages;
and each page contains 2K bytes plus 64 “extra” bytes that can be used for error
management functions such as bad byte address storage. A page is the smallest unit
that can be programmed, although some devices offer special options to program
smaller storage units.
How do the address inputs map to the memory array? The following memory
map provides the details:
21
Although the data sheet does not say so, the Cache Register and Data Register are probably static RAMs.
As shown on the Memory Map above, the Block is selected by addresses A18-
A28 (11 addresses; 211 = 2048). The page is selected by addresses A12-A17 (6
addresses; 26 = 64); and the column within a page is selected by addresses A0-A11 (12
addresses; 212 = 4096).
As shown in the Array Addressing table above, the addresses are input via
I/O [7:0] on 5 successive clock cycles; CA0 –CA7 on cycle 1; CA8-CA11 on cycle 2;
RA12-RA19 on cycle 3; RA20-RA27 on cycle 4; and RA28 on cycle 5.
The functions of the other pins shown on the Functional Block Diagram are as
follows.
CE# is the Chip Enable input (active low). When CE# is low, transfers can occur
between the system and the NAND device.
CLE is the Command Latch Enable. When CLE is high, information is
transferred from the I/O [7:0] pins to the Command Register. Otherwise it must be
driven low.
ALE is the Address Latch Enable. When ALE is high, information is transferred
from the I/O [7:0] to the Address Register. Otherwise it must be driven low.
WE# is Write Enable, and when low, gates transfers from the system to the
NAND device.
RE# is Read Enable, and when low, gates transfers from the NAND device to the
system.
WP# is Write Protect. All PROGRAM and ERASE operations are disabled when
WP# is low.
The R/B# (Ready/Busy#) output of the device indicates the status of internal
operations. When that signal is high (the pin must be tied to VDD through a pull-up
resistor), the device is ready to accept an external command. R/B# is normally tied to
an interrupt pin on the system controller.
The six signals entering the Control Logic box in the functional block diagram on
page 21 (CE#, CLE, ALE, WE#, RE#, WP#) control the device mode, as tabulated
below:
2. Command Sequence
Operations of the memory are controlled by a Command Sequence, consisting
of:
Command cycle 1;
Address (5 or 3 cycles);
Command cycle 2 (if required);
Data cycles (either READ or WRITE).
Commands, which appear on the I/O [7:0] pins, are written to the Command
Register when:
CE# and ALE are LOW, and
CLE is HIGH, and
The device is not busy, and
WE# transitions LOW to HIGH.
3. READ Operations
For a PAGE READ operation, the command input 00h22 (I/O [7:0] pins all LOW)
would be written to the Command Register. Then the address inputs would follow on
the next 5 cycles. Then the command input 30h (I/O [7:0] pins set to 0-0-1-1-0-0-0-0)
would be written to the command register. Finally RE# would cycle to read out
sequential bytes of data starting at the initial column address and ending at the end of
the page.
22
The suffix “h” indicates that the preceding numbers (and/or letters) are in hexadecimal notation. The first “0”
describes the contents of I/O pins 7 to 4 and the second “0” describes the contents of I/O pins 3 to 0.
Appendix A provides conversions among hexadecimal, decimal and binary notations.
23
Color-coding matches the waveforms shown on the next page
causes the data from the Data Register (which came from the memory page
to be read) to be transferred again, this time to the Cache Register.
While data is being output from the Cache Register by pulsing RE#, the next
sequential page of data is being loaded into the Data Register.
The second page (and subsequent pages if desired) of data is transferred
from the Data Register to the Cache Register by issuing additional PAGE
READ CACHE MODE START (31h) commands.
To read out the last page of data, a PAGE MODE CACHE READ START
LAST (3Fh) command is issued. This command causes transfer of the last
desired page of data from the Data Register to the Cache Register so it can
be read out by pulsing RE#.
The waveforms for the PAGE READ CACHE MODE are shown below.
4. PROGRAM Operations
Similar to the PAGE READ operation, the memory array is also programmed on
a page basis by the PROGRAM PAGE command. After the block, page and starting
column address are loaded into the internal Address Register, data is sequentially
written to the Data Register, up to the end of the page. Then the data is programmed
into the selected destination page.
Within a block, pages must be programmed from the least significant bit (LSB)
page of the block to the most significant bit (MSB) page of the block.
Here are the steps for the PROGRAM PAGE (80h-10h) operation:
Issue the SERIAL DATA INPUT (80h) command into the command register
(CLE, RE# and WP# high, ALE and CE# low, and WE# low-to-high transition);
Load the address (block, page and starting column address) into the Address
Register using 5 cycles (ALE, RE# and WP# high, CLE and CE# low, and
WE# cycling 5 times);
Input the data to be programmed, starting at the selected address, byte-by
byte (RE# and WP# high, CLE, ALE, and CE# low and WE# cycling);
After all data is loaded, issue the PROGRAM (10h) command into the
command register (CLE, RE# and WP# high, ALE and CE# low, and WE#
low-to-high transition). The internal program circuits automatically execute
the proper algorithm and control all the necessary timing to program and
verify the operation. Write verification detects any “ones” that are not
successfully written to “zeros.”
At the end of programming, R/B# goes high. Then issue the READ STATUS
REGISTER (70h) command to determine if the programming was successful.
Bit 0 of the Status Register (I/O pin 0) reports the results of programming. If
I/O 0 = 0, the programming was successful. If I/O 0 = 1, the programming
had an error.
5. ERASE Operation
Erasing occurs at the block level, one block at a time. Here are the steps for the
BLOCK ERASE (60h-D0h) operation:
Issue the ERASE SETUP (60h) command;
Load the address (block and page) into the Address Register using 3 cycles.
Although the page addresses A[17:12] are loaded, they are ignored.
Issue the ERASE CONFIRM (D0h) command. The internal erase circuits
automatically control the timing and erase-verify operations.
At the end of erasing, R/B# goes high. Then issue the READ STATUS
REGISTER (70h) command to determine if the erasing was successful. If bit
0 of the Status Register (I/O pin 0) = 0, the erasing was successful.
If I/O 0 = 1, the erasing had an error.
24
http://en.wikipedia.org/wiki/Greinacher_multiplier
25
Cockcroft, J.D. and E.T. Walton, “Production of high velocity positive ions,” Proceedings of the Royal Society,
A, Vol. 136, 1932, pp. 619-630
26
Dickson, J. “On-chip High-Voltage Generation in NMOS Integrated Circuits Using an Improved Voltage
Multiplier Technique” IEEE Journal of Solid-State Circuits, Vol. 11, No. 6, June 1976, pp. 374–378
Charge pumps involve capacitors which are connected in different ways during
charging and discharging. The diagram below left shows the charging of capacitor
C1.27 Then switches S1 and S2 (which are actually implemented by MOS transistors)
change connections, as shown at below right, with the following results:
The voltage at the bottom of capacitor C1 immediately rises from ground to
5V;
Because the voltage across a capacitor cannot change instantaneously (that
would take infinite current; i = C dv/dt), the voltage at the top of capacitor C1
rises to 10V;
When capacitors C1 and C2 are connected via switch S1, they share the
charge stored on C1. The resulting voltage at the load, represented by
resistor RL, will rise.
After several cycles from the left configuration to the right configuration, and
assuming RL is very large, the output voltage will approach 10V. More stages
can be added to achieve higher voltages. Practical charge pump circuits can
achieve conversion efficiencies up to 90% or so.
27
From http://www.answers.com/topic/charge-pump
A common factor in all charge pump circuits is that the DC supply voltage has to
be converted to AC, which is then used to drive either the switches (as in the circuit on
p. 29) or the capacitors (as in Dickson’s circuit above). Dickson uses a single stage
oscillator to accomplish the DC-AC conversion; most modern devices use a ring
oscillator (an odd-stage inverter loop).
Most integrated charge pump circuits have very low driving capability. That is the
characteristic that limits their capability to support hot electron injection in flash
memories. The following table from Dickson’s paper illustrates this limitation:29
Dickson’s charge
pump boosts the
14V power supply
voltage to about
35V, but it can
supply less than
10 µa of current
to the load.
28
Dickson, op. cit., p. 377
29
Ibid., p. 378
I. Multi-Level Storage
From the first days of integrated circuit development in 1958, one primary
objective has been to increase the density (number of transistors/bits/functions per chip)
of the devices. This increase in density was observed by Gordon Moore, the co-founder
of Intel, in 1965. Moore observed30 that the number of transistors per square inch of
silicon had doubled every year since the integrated circuit was invented, and predicted
that such doubling would continue for the next 10 years. His observation was later
dubbed “Moore’s Law.” The density increase has slowed a bit, to a doubling every 18
months, but continues to this day. In fact, future roadmaps of semiconductor innovation
are based on the continued validity of Moore’s Law.
Flash memories follow, and indeed have extended Moore’s Law. The devices
we have discussed thus far store one bit per memory cell and achieve densities up to
about 4 GB (32 x 109 bits) per chip. Devices using single bit per cell are abbreviated as
either SBC or SLC (for Single Level Cell).
Because the charge on the floating gate of a flash memory doesn’t leak off,
circuit designers have been able to design circuits that enable storage of more than one
bit per memory cell. Devices using this Multi-Level Cell (MLC) (4 levels yielding 2 bits-
per-cell) technology now are available with capacities up to 8 GB (64 x 109 bits) per
chip.
Designs with even more bits per cell are possible. At the February 2011
International Solid State Circuits Conference, Samsung discussed an 8 level, 3 bit-per-
cell device with a capacity of 8GB (64 x 109 bits). The downside of resolving so many
levels with a single cell is reduced endurance. Instead of 100,000 erase/program cycles
common on an SLC device, 2 bit-per-cell and 3 bit-per-cell devices might achieve as
few as 3,000 cycles.
30
Moore, G., Electronics Magazine, "Cramming more components onto integrated circuits," April 19, 1965
How does the MLC technology work? Let’s compare SLC and MLC cell storage:31
In a single-level cell, there are
two possible states per cell:
either “1” (low threshold voltage;
erased) or “0” (high threshold
voltage; programmed). The
reference point, used by the
sense amplifier to distinguish
between the two states, is set at
a voltage between the two states.
31
http://www.micron.com/products/nand_flash/nandcom.html#mlcnand
The table below compares some characteristics of SLC and MLC NAND flash
devices:32
As expected, the precise programming to achieve four distinct threshold voltage levels
takes longer, 900 µs for MLC vs. 220 µs for SLC. A more serious limitation is the
endurance, or number of program/erase cycles—5,000 for MLC vs. 100,000 for SLC.
Therefore, MLC is typically used in low-performance consumer applications such as
media players, cell phones and flash memory cards (USB, SD/MMC and CF—more
about flash memory cards in a later section). Finally, the error rate of MLC flash is
much higher than SLC; so much more robust error detection and correction is required
for MLC applications.
32
Micron Technical Note TN-29-19, “NAND Flash 101,” Rev. B, 4/10 EN, page 24
number) will be a “1.” So the transmission becomes 0-0-1-0-1, with the final “1” being
the parity bit. Now suppose a single bit error occurs in bit 2 during the transmission,
and the received data stream is 0-1-1-0-1. That received data stream has odd parity
(three “1’s”), thus indicating that an error occurred. But we don’t know which bit is in
error, because all single bit errors give the same result—“bad parity.”
2. Hamming Code
If more “extra” bits are included with a message, and if those bits can be
arranged such that different bad bits produce different error results, then bad bits could
be identified and corrected. This is the basis of the Hamming Code, invented in 1950
by Richard W. Hamming of Bell Labs. Hamming Code adds three check bits to every
four data bits of the data being transmitted. This structure is denoted as (7,4), and can
correct any single-bit error, or detect all single-bit and two-bit errors. The goal of
Hamming codes in general is to create a set of parity bits that overlap such that a
single-bit error in a data bit or a parity bit can be detected and corrected. Here is a
Venn diagram pictorial representation of the (7,4) Hamming Code:33
Data bits are denoted by d1, d2, d3
and d4. Parity bits are denoted by p1, p2 and
p3. The following table shows which data
bits are covered by which parity bits:
33
From Wikipedia, http://en.wikipedia.org/wiki/Hamming(7,4)
Now assume that a transmission error has caused the fifth bit (bold and
underlined in the sent information above) to flip to a 1. Thus, the received information is
0 1 1 0 1 1 1. The corresponding
Venn diagram is:
Bit 5, the bad bit, is shown in blue type. This bad
bit results in parity errors (shown in red type) in the
red and green circles; i.e., both of these circles
have three 1’s, or odd parity instead of the desired
even parity. If bad parity is detected, then the data
bit that overlaps only the bad parity circles is the bit
with the error. It is now a simple matter to correct
the bad bit by inversion, and to extract the original
data bits.
34
Micron Technical Note TN-29-19, “NAND Flash 101,” Rev. B, 4/10 EN, pp. 24-25
storage on each page (64 bytes per 2K-byte page). This spare area can be used to
store the ECC code as well as other software information, such as wear-leveling or
logical-to-physical block-mapping information. ECC can be performed in hardware or
software; however, hardware implementation provides a performance advantage.
During a programming operation, the ECC circuit calculates the ECC code based
on the data stored on the page. The ECC code for the data area is then written to the
corresponding spare area. When the data is read out, the ECC code is also read out,
and the reverse operation is applied to check that the data is correct. It is possible for
the ECC algorithm to correct data errors. The number of data errors that can be
corrected depends on the correction strength of the algorithm used. The inclusion of
ECC in hardware or software provides a robust solution at the system level.
Simple Hamming codes provide the easiest hardware implementation for error
correction; however, they can correct only single-bit errors. Reed-Solomon codes
provide more robust error correction capability and are used in many controllers on the
market today. BCH codes are also becoming popular due to their improved efficiency
over Reed-Solomon codes.
The next table shows the number of bits required for various ECC correction
levels.
Note: Codes in the shaded table cells can fit in the spare area of the NAND flash device
discussed in Section G.
35
http://extmedia.micron.com/webmedia/clearnand_intro/clearnand.html
36
http://www.eetimes.com/General/DisplayPrintViewContent?contentItemId=4211180
37
http://extmedia.micron.com/webmedia/clearnand_intro/clearnand.html
The number of bits of flash used are staggering. The vertical scale of the chart is
“Million GB.” So the first major division—10,000 Million GB—is 1019 Bytes of memory!
Where does all of the flash memory come from? The US International Trade
Commission monitors world trade in various products, and has published a report that
provides insight into the flash memory market. The following charts show the major
flash memory vendors up to 2005.41 It is interesting to note how Intel’s market share
has dropped, from 75% in 1992 to just 11% in 2005. The other interesting point is that
only Intel has had a significant market share throughout the history of flash memory.
38
http://www.databeans.net/reports/current_reports/SemiProd_Flash.php as of August 2010
39
Clarke, Peter; “NAND flash market set to grow 18% in 2011;”
http://www.eetimes.com/General/DisplayPrintViewContent?contentItemId=4212348; January 20, 2011
40
Micron Technical Note TN-29-19, “NAND Flash 101,” Rev. B, 4/10 EN, page 1
41
Yinug, F., “The Rise of the Flash Memory Market: Its Impact on Firm Behavior and Global Semiconductor
Trade Patterns,” United States International Trade Commission, July 2007
42
Estimated at $2.5 Billion, Ibid., footnote on page 11
43
Numonyx was acquired by Micron on February 9, 2010
The following table lists the most important characteristics of all of the card formats (as
of 2009):44
Primary Entry
Card family Varieties Major features
Vendors date
Thinner (3.3 mm), flash based only, sizes
I
available up to 128 GB
CompactFlash SanDisk 1994 Thicker (5.0 mm), older flash based, but
II usually Microdrives, sizes available up to
128 GB
Very thin (45.0 × 37.0 × 0.76 mm thick), no
SmartMedia Toshiba 3.3/5 V 1995 wear leveling controller, sizes available up
to 128 MB
Thin and small (24 mm × 32 mm ×
MMC 1997
1.4 mm), sizes available up to 16 GB
RS-
2003/ Compact size (24 mm × 18 mm × 1.4 mm),
MMC/MMC
2005 sizes available up to 16 GB
Siemens AG, Mobile
MultiMediaCard
SanDisk Compact size (24 mm × 32 mm × 1.4 mm),
MMCplus 2005 faster, optional DRM, sizes available up to
16 GB
Sub compact size (14 mm × 12 mm ×
MMCmicro 2005
1.1 mm), optional DRM, 16 MB – 4 GB
Small (32 mm × 24 mm × 2.1 mm), DRM,
SD 1999
sizes available up to 4 GB
Compact size (21.5 mm x 20 mm x
miniSD 2003
1.4 mm), DRM, available up to 16 GB
Sub compact size (11 mm x 15 mm x
microSD 2005
1 mm), DRM, available up to 16 GB
Physically the same as SD, but offers
Panasonic, SDHC 2006 higher capacity and transfer speed,
SanDisk, available up to 32 GB
Secure Digital Physically the same as miniSD, but offers
Toshiba,
Kodak miniSDHC 2008 higher capacity and transfer speed,
available up to 32 GB
Physically the same as microSD, but offers
microSDHC 2007 higher capacity and transfer speed,
available up to 32 GB
Physically the same as SD, but offers
higher capacity and transfer speed,
SDXC 2009
currently available up to 64 GB although
the standard goes up to 2 TB
44
http://en.wikipedia.org/wiki/Comparison_of_memory_cards This website provides extensive information
about all types of removable flash memory cards.
Primary Entry
Card family Varieties Major features
Vendors date
Thin and narrow (50 mm x 21.5 mm x
Standard 1998 2.8 mm), optional DRM, available up to
128 MB
HDD SSD
Flash memory has no moving mechanical parts, its read and write speeds are
100 to 1000 times faster, it makes no noise, has comparable reliability, and consumes
much less power. But it costs much, much more per GB. And in consumer electronics,
cost is an overriding factor. The following graph compares the cost history of DRAMs,
Flash and HDD:46
45
Ekker, N, T. Coughlin and J. Handy; “Solid State Storage 101—An Introduction to Solid State Storage,”
SNIA, January 2009
46
http://www.storagesearch.com/semico-art1.html
1
From this graph, it looks like flash prices will never catch up with HDD prices. If
this graph tells the whole story, why would anyone ever consider flash as an HDD
replacement?
However, if we look at the bigger picture and consider System Cost vs. Storage
Capacity, the picture is somewhat different.47
47
loc. cit.
From this graph we see that HDDs have a floor price, regardless of capacity, of
about $100. And we can buy a flash-based system with 512 MB of memory for that
same $100. Of course, we could buy an HDD-based system with 64 GB of capacity for
$100 also.
So will flash-based SSDs ever replace HDDs? The answer is not yet clear. But
in some systems where the advantages of SSDs are essential, the answer is yes. For
example, the Apple MacBook Air comes standard with a 64 GB SSD, and can be
upgraded to 128 GB or 256 GB. The Apple iPad has a standard 16 GB SSD, and can
be upgraded to a 32 GB or 64 GB SSD. The Amazon Kindle has a 2 GB NAND flash
memory to store all those books.
In late 2010, Intel announced the 310 Series of SSDs, including a 40 GB version
for $99; an 80 GB model for $179; a 120 GB model for $249; and a 160 GB model for
$415. In January 2011, Micron announced sample availability of its RealSSD C400
SSDs in capacities from 64 to 512 GB. Memoright, a Taiwan-based company, recently
announced a line of 2.5” form factor SSDs based on MLC NAND flash memory chips.
Capacities are 60, 120, 240 and 400 GB.
Beyond these all-flash solutions is a hybrid SSD/HDD approach that takes
advantage of the speed and low power of SSDs and the low cost of HDDs. Future
consumer systems are likely to use this approach, at least until flash memory gets much
less expensive.
Possibly the strongest evidence that SSDs will become a vital part of future
computer systems is the involvement of JEDEC, the most important industry-wide
standards-making body for semiconductor devices. In September 2010, JEDEC
released two standards for determining the endurance of SSDs, JESD218 and
JESD219. These standards were developed by JEDEC’s JC-64.8 Subcommittee for
Solid State Drives.48 In January 2011, JEDEC announced that the JC-64.8
Subcommittee will develop standards for SSDs for applications beyond conventional
disk drive form factors, such as tablets and ultra thin notebook PCs.49
48
“Solid State Drives” at http://www.jedec.org/standards-documents/focus/flash/solid-state-drives
49
http://www.eetimes.com/electronics-news/4212163/Jedec-to-create-standards-for-smaller-
SSDs?cid=NL_EETimesDaily
Appendix A
Hexadecimal-Decimal-Binary Conversions
Cited References
1. D. Kahng and S.M. Sze, "A floating-gate and its application to memory devices,"
The Bell System Technical Journal, vol. 46, no. 6, July-August 1967, pp. 1288-1295
2. Masuoka, F. and H. Iizuka, “Semiconductor Memory Device and Method for
Manufacturing the Same,” U. S. Patent 4,531,203; Application date Nov. 13, 1981;
FAP date Dec. 20, 1980; Issue date July 23, 1985
3. Masuoka, F. et al., “A New Flash EEPROM Cell Using Triple Polysilicon
Technology,” IEEE IEDM Technical Digest, 1984
4. Momodomi, M., Masuoka, F., et al., “Electrically Erasable Programmable Read-
Only Memory with NAND Cell Structure,” U. S. Patent 5,050,125; Application date
Nov. 17, 1988; FAP date Nov. 18, 1987; Issue date Sep. 17, 1991
5. Fulford, B., “Unsung Hero,” Forbes.com, June 24, 2002
(www.forbes.com/global/2002/0624/030_print.html)
6. Prince, B., Semiconductor Memories, Second Edition, ISBN 0 471 92465 2, 1983,
John Wiley & Sons, West Sussex, England
7. Brown, W.D. and J.E. Brewer, Editors; Nonvolatile Semiconductor Memory
Technology, ISBN 0-7803-1173-6, 1998, IEEE Press, New York, NY
8. Cottrell, P.E. et al., “Hot Electron Emission in N-Channel IGFETs,” IEEE Journal of
Solid-State Circuits, Volume SC-14, 1979
9. Eitan, B. and D. Frohman-Bentchkowsky, “Hot Electron Injection into the Oxide in
N-Channel MOS Devices,” IEEE Transactions on Electron Devices, Volume ED-23,
1981
10. Lenzlinger, M. and E.H. Snow, “Fowler-Nordheim Tunneling in Thermally Grown
SiO2,” Journal of Applied Physics, Vol. 40, 1969
11. Spansion Application Note , “Flash Memory: An Overview,” November 10, 2005
12. Numonyx Embedded Flash Memory (J3 65 nm) Single Bit Cell (SBC) Data Sheet,
March 2010
13. Micron MT29F2G08A NAND Flash Data Sheet, Rev. 1, 1/06 EN
14. http://en.wikipedia.org/wiki/Greinacher_multiplier
15. Cockcroft, J.D. and E.T. Walton, “Production of high velocity positive ions,”
Proceedings of the Royal Society, A, Vol. 136, 1932, pp. 619-630
16. Dickson, J. “On-chip High-Voltage Generation in NMOS Integrated Circuits Using
an Improved Voltage Multiplier Technique” IEEE Journal of Solid-State Circuits, Vol.
11, No. 6, June 1976, pp. 374–378
17. http://www.answers.com/topic/charge-pump
18. Moore, G., Electronics Magazine, "Cramming more components onto integrated
circuits," April 19, 1965
19. http://www.micron.com/products/nand_flash/nandcom.html#mlcnand
20. Micron Technical Note TN-29-19, “NAND Flash 101,” Rev. B, 4/10 EN
21. Wikipedia, http://en.wikipedia.org/wiki/Hamming(7,4)
22. http://extmedia.micron.com/webmedia/clearnand_intro/clearnand.html
23. http://www.eetimes.com/General/DisplayPrintViewContent?contentItemId=4211180
24. http://www.databeans.net/reports/current_reports/SemiProd_Flash.php
25. Clarke, Peter; “NAND flash market set to grow 18% in 2011;”
http://www.eetimes.com/General/DisplayPrintViewContent?contentItemId=4212348;
January 20, 2011
26. Yinug, F., “The Rise of the Flash Memory Market: Its Impact on Firm Behavior and
Global Semiconductor Trade Patterns,” United States International Trade
Commission, July 2007
27. “Solid State Drives” at http://www.jedec.org/standards-documents/focus/flash/solid-
state-drives
28. http://www.eetimes.com/electronics-news/4212163/Jedec-to-create-standards-for-
smaller-SSDs?cid=NL_EETimesDaily
Other References