Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, ChinaInstitute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China
Abstract: A vertical two-terminal silicon PNPN diode is presented for use in a high-density memory cell. The device design for high-speed operations was studied with experiments and calibrated simulations, which proves that the proposed memory cell can be operated at nanosecond range. The static and dynamic power dissipations were also studied, which indicated the availability of the proposed memory cell for VLSI applications. Moreover, the memory cell is compatible with CMOS process, has little impact from process variation, and has good reliability.
To meet the increasing demands for large volume density SRAM in CMOS technology, two approaches can be taken at device level. The first approach is to scale down the device's dimensions[1], which has proved effective, but comes at a high cost and became difficult for the deca-nanometer scaled CMOS process[2, 3]. Another approach is to implement an alternative functional device in the cell, so as to obtain a less complex cell structure with smaller SRAM area. In this work, a simple two-terminal vertical memory cell is proposed, which provides excellent potential for high-density integration. Moreover, the new memory cell has only two terminals, such that the cross-point array[4] with the proposed memory cell becomes conceivable. The cross-point design can achieve the highest integration density in planar technology to date and has great potential for 3-D integration[5].
2.
New memory cell concept
The new memory cell has a vertical four-layer alternatively doped structure with only two terminals, as shown in Fig. 1(a). It exhibits the hysteresis curve shown in Fig. 1(b)[6]. When the voltage between terminals A and C (VAC) increases in the forward direction, the memory cell stays at a low conductance branch until VAC reaches the latch up voltage (VLU). At VLU, the latch-up[6] is triggered and the current flowing through the diode (IA) increases abruptly. As VAC increases beyond VLU, the memory cell gets into a high conductance state and is latched until VAC returns to the latch down voltage (VLD) as VAC decreases in the reverse direction, resulting in a bistable region between the latchup and latchdown voltages (VLD < VAC < VLU). The memorization mechanism is as following. When VAC is applied between VLD and VLU, there are two stable states for the leakage current that can be defined as “1” or “0” (I1 or I0 respectively). “1” or “0” can be stably conserved once VAC keeps constant. To change the cell into “1” state from “0” state, i.e. to write “1” state, a positive voltage pulse is used to latch up the diode. To change the cell into “0” state from “1” state, i.e. to write “0” state, a negative voltage pulse is used to latch down the PNPN diode. For reading the stored state, the large I1/I0 ratio (≈104, which will be shown in a later experiment) can be utilized for read operation.
Figure
1.
(a) The structure of the proposed memory cell, and (b) the hysteresis curve of the new memory cell and the write voltage pulses.
This section is divided into four parts: write-1 optimization, write-0 optimization, read operation and discussion. Write-1 optimization discusses the optimization methods used to increase the write-1 speed. Write-0 optimization discusses the optimization method used to increase write-0 speed. The read operation paragraph discusses the read method and the read speed. Finally, the discussion paragraph describes the process variation, reliability, and power dissipation in the retention and writing “1”/“0” operations.
3.1
Write-1 optimization
To begin the study, the PNPN diode, named Cell 1, was fabricated on the CMOS line. Only a few steps were needed, as shown in Fig. 2. The fabrication flow is compatible with the traditional CMOS process and there is no high-temperature process when integrated with the CMOS technology. The doping concentration DP1 and DN2 are at 1020 cm−3 level, and the N1 and P1 layer are separately at 1 × 1017 cm−3 and 1 × 1018 cm−3 doping level with the thickness of 0.8 μm and 0.2 μm, respectively. For the convenience of direct probe testing, the cell was fabricated with a 50 × 50 μm2 active cell area, as shown in Fig. 3. Although it has a large area, the fabricated cell can reflect the electronic performances of a small device well due to its quasi 1-D structure (the effect of the diode sidewall surface can be ignored). The measured static I–V curve of Cell 1 is given in Fig. 3, which exhibits a good hysteresis property with a high/low conductance current ratio of about 104 times.
Figure
2.
The process flow of the PNPN diode memory cell.
Figure
3.
The measured hysteresis curve and the SEM view of Cell 1. Note that at VAC> 1.6 V, the IA is constant because the actual current through the cell hits the compliance level of the SMU.
Write-1 measurements of Cell 1 were thus made. The measuring circuit is shown in Fig. 4(a). It is hard to measure IA directly, so a resistor was connected in series with the diode and the voltage VA was measured to see the state transition. The first attempt was at hundreds of μs speed level with a VH= 4.4 V >VLU (Vhold is chosen to be 0.8 V and the reason why will be discussed later). The measurement result is shown in Fig. 4(a). We can see that Cell 1 got into the transition state (the definition of transition state is given in Fig. 1(b)) at first and got into “1” state finally as the pulse was over. The measuring result demonstrates that Cell 1 is functional feasible, based on this measure, the measuring speed was increased to hundreds of ns level, which is the limit of our equipment. At this measuring level, we can see that an obvious delay time (Tdr)[7] exists before Cell 1 gets into the transition state (the blue line in Fig. 4(b)). Tdr is the time needed to build up the base regions' charge distribution of the transition state. Tdr limits the speed increase of write-1 operation. To increase the write-1 speed, Tdr must be reduced. We tried increasing VH to reduce Tdr. The measuring result is shown Fig. 4(b), we can see that although Tdr was reduced to about 50 ns, this delay is still large for ns range operation, moreover, a VH of 4.6 V is high for VLSI and may cause reliability problems.
Figure
4.
(a) The pulsed measuring circuit and the low speed measure, (b) the increased frequency write-1 measure, in which delay time was observed.
Besides VH, the delay time is also determined by VLU. Lower VLU can provide a faster switch-on speed. So we further tried decreasing VLU to reduce Tdr. VLU can be reduced by decreasing the centre two layers' doping concentrations or increasing the outer two layers' doping concentrations[6]. In this work, the DP2 was reduced and Cell 2 was obtained with a VLU of 0.95 V, as shown in Fig. 5(a). Pulsed transient measure was thus conducted. In Fig. 5(b), Tdr was not observed because it is too small, which indicates that the optimized cell has good potential for high-speed write-1 operation. To study its ns-level performance, a calibrated simulation was used. The fabricated memory cell was first rebuilt with Sentaurus Process simulation[8]. Then, the Sentaurus Device simulation tool[8] was used for this rebuilt cell to mimic the measure. The physical models used in device simulations were Fermi statistics; basic mobility model describing the phonon scattering; Masetti model describing the doping dependence of mobility; Canali model describing mobility saturation at high electric field; Slotboom model describing the bandgap narrowing; doping dependent SRH recombination with field enhancement; Auger recombination; surface recombination; nonlocal tunneling; and avalanche multiplication[9]. The simulation result is given in Fig. 6(a), which shows that a Tdr of about 0.6 ns was obtained with a VH of 2.5 V. The simulation indicated that reducing VLU is an effective method to decease Tdr and Cell 2 is available for ns level write-1 operation. Moreover, the Tdr–VH relationship was simulated and is given in Fig. 6(b). It can be seen that Cell 2 has the same Tdr–VH trend as Cell 1. Based on this relationship, optimization can be used between Tdr and VH for practical application.
Figure
5.
(a) The measured hysteresis curve of Cell 2 and its simulated hysteresis curve. Note that at VAC> 1 V, the IA is constant because the actual current through the cell hits the compliance of the SMU. (b) The write-1 operation measure of Cell 2
From Fig. 6(a), it can be seen that there are other two pieces of time that also limit write-1. They are Ts and Tdf. In next two paragraphs, we will discuss Ts and Tdf and their optimization method respectively.
Figure
6.
(a) The write-1 simulation indicates that Cell 2 can be operated at ns level with an available voltage and (b) the simulated Tdr–VH trend of Cell 2.
Ts is important for array operation, it is the time needed to buffer the Tdr fluctuation caused by process induced VLU variation. To determine the value of Ts, the distribution of VLU on the same wafer is measured, as shown in Fig. 7(a). According to the simulated Tdr–VLU relationship in Fig. 7(b), it can be concluded that the Tdr fluctuates between 0.55–0.80 ns. To ensure that all the cells are written into “1” with the same 2.5 V pulse, we only need to ensure the cell with the largest Tdr has enough time to be latched-up. Therefore, Ts is 0.8−0.6= 0.2 ns. Ts can be reduced by decreasing the Tdr fluctuating range, which can be realized by improving process accuracy. Note that Cell 2 was fabricated on our experimental line and thus process variations could be better controlled in industry.
Figure
7.
(a) The measured VLU distribution and (b) the simulated Tdr–VLU relationship.
Tdf is the time needed to move out the excess charges for the “1” state (the base-region charge amount of the transition state is larger than the “1” states, when the cell starts form the transition state to “1” state, these excess charges needed to be moved out). To increase write-1 speed, Tdf should be reduced too. Tdf is determined by Vhold. The base-region charge amount of the “1” state is directly proportional to Vhold, which means that more charges need to be moved out when Vhold is lower. To demonstrate this point, Cell 2 was simulated with different Vhold. The simulation results are directly given in Fig. 8(a) and summarized in Fig. 8(b), which demonstrates the Tdf–Vhold trend well. Increasing Vhold can reduce Tdf and also increases I1 at the same time. Larger I1 causes a higher 1-retention power, which will be discussed in later section. Furthermore, as Vhold approaches VLU, a storage stability problem also occurs. Therefore, comprehensive design is needed when deciding Vhold. After discussing Tdr, Ts and Tdf separately, we can clearly see that the necessary time for a successful write-1 operation is the sum of these three pieces of time. For Cell 2, this time is about 1.4 ns, which is available as a SRAM memory cell.
Figure
8.
(a) The simulated write-1 operations with different Vholds, and (b) the simulated Tdf–Vhold trend.
The write-0 speed is as important as the write-1 speed for a memory cell. To study Cell 2's write-0 speed, negative pulsed transient measures were conducted. From Fig. 9(a), it can be seen that the memory cell was not written into “0” state even when the voltage pulse was increased to −4 V towards the negative direction. The reason is that when the memory cell is in “1” state, the base regions are full of excess charges that need a long time (μs level) to be moved out. To solve this problem, we can reduce the base regions' thicknesses to introduce a punch-through design (PTD)[9]. The PTD is that the entire base regions are depleted at the designed write-0 voltage, as shown in Fig. 9(b). The PTD can improve the turn-off time by about three orders of magnitude over a traditional PNPN diode. With the PTD, the PNPN diode can be turned off in nanoseconds simply by using the anode to a negative voltage exceeding a certain threshold (Vth). Based on Cell 2, the N1 layer and P2 layer were reduced to 50 nm and 150 nm respectively in fabrication to introduce the PTD. The newly optimized cell was named as Cell 3. The measured I–V curve of Cell 3 is given in Fig. 9(b), from which we can see that a good hysteresis property was kept after the introduction of the PTD. Then, a pulsed transient measure was conducted upon Cell 3, as shown in Fig. 10(a). It can be seen that the write-0 speed was obviously improved; the punch-through threshold voltage Vth is about –1.8 V. Once again, simulation was used to study Cell 3's ns-level performance. The simulation result is given in Fig. 10(b), which demonstrates that Cell 3 has an ns-level write-0 speed with available operating voltage for SRAM application. Actually, the write-0 voltage can be further reduced by decreasing base-region thicknesses. The decrease of VL also causes the decrease of storing-0 stability. Therefore, optimization is needed between VL and storing-0 stability. To study whether Cell 3 kept the high-speed write-1 property, the alternative write operations were simulated, as shown in Fig. 11. The simulation demonstrated Cell 3 can be used as a bistable memory cell with steady high-speed programming ability. Table 1 lists the studied cells in the optimization patch.
Figure
9.
(a) The write-0 measures of Cell 2, and (b) the hysteresis curve of the Cell 3 in comparison with Cell 2 and the illustration of the punch-trough design.
A current current-latch-based sense amplifier (CLSA)[10] can be used for read operation. In read operation, the VAC is kept at Vhold, a CLSA can mirror the current flowing through the memory cell I1/I0 and output VDD/0 V to drive the circuit of next level. The read speed is determined by I1, which is about 2 × 10−5 A/μm2. This current level is available for ns level read operation with a nano-level CMOS process.
3.4
Discussion
Besides operation speed, the process variation, reliability and power dissipation (PD) are also important for a memory cell's practical application. The process variation of Cell 3 was measured, which can be directly reflected by the VLU and VLD distributions, as shown in Fig. 12(a). As a simple vertical structure, the process variation can be well controlled. Measures about reliability were also conducted. The VLU and VLD variation with the 3 V to −3 V to 3 V sweep (10−8 s period) times is given in Fig. 12(b), which indicates the high reliability of the proposed memory cell. At last, the power dissipated in the operations of the proposed cell was studied. To compare with the 45 nm 6T SRAM cell, it was supposed that Cell 3 has a 45 × 45 nm2 active cell area, which can be realized by using state-of-the-art lithography technology[11]. There are three kinds of PD in single cell operations. They are retention PD, write PD and read PD. The retention PD is the product of the retention leakage current (I1 or I0 that depends on the stored state) and Vhold. Therefore, the 1-retention PD is 40 nW and the 0-retention PD is 100 pW. There are two conditions of write PD: write-1 PD and write-0 PD. Write-1 PD is mainly dissipated in Ts. Therefore, the average PD in the write-1 cycle is approximately (VHIH)(TS/T1) (T1 is the write-1 period, IH definition is shown in Fig. 1(b)), which is about 400 nW when T1= 2 ns. For write-0 operation, the PD can be obtained by the integration of (VACIAC)/T0 (T0 is the write-0 period), which is about 100 nW when T0= 2 ns. For read operation, the PD is different in read-1 and read-0 operation. The retention current is used for read, so the read-1 PD and read-0 PD is at the same value with the 1-retention and 0-retention PD separately.
4.
Conclusion
Table 2 summarizes the main performance indicators of the proposed memory cell (Cell 3) and the 6T SRAM cell in 45 nm process[12, 13] (in fact, the 6T SRAM cell performance spreads over a wide range depending on the practical application, the data given in Table 2 is a typical range). We can see that the proposed cell offers excellent packing density, low power dissipations with available speed. Moreover, the proposed memory cell is compatible with traditional CMOS technology, has good design flexibility and good reliability due to its gateless structure. It has a great potential in a broad range of applications including embedded memory and stand-alone memory.
Table
2.
The main performance of the proposed cell and the 6T SRAM cell in 45 nm process.
Kuhn K J. Considerations for ultimate CMOS scaling. IEEE Trans Electron Devices, 2012, 59(7):1813 doi: 10.1109/TED.2012.2193129
[2]
Asenov A, Kaya S, Brown A R. Intrinsic parameter fluctuations in decananometer MOSFETs introduced by gate line edgeroughness. IEEE Trans Electron Devices, 2003, 50(5):1254 doi: 10.1109/TED.2003.813457
[3]
Kampen C, Evanschitzky P, Burenkov A, et al. Lithographyinduced layout variations in 6-t SRAM cells. International Conference on Simulation of Semiconductor Processes and Devices (SISPAD), 2010:149
[4]
Liang J, Gnana R, Jeyasingh D, et al. An ultra-low reset current cross-point phase change memory with carbon nanotube electrodes. IEEE Trans Electron Devices, 2012, 59(4):1155 doi: 10.1109/TED.2012.2184542
Gibbons J F. A critique of the theory of p-n-p-n devices. IEEE Trans Electron Devices, 1964, 11(9):406 doi: 10.1109/T-ED.1964.15352
[7]
Tserng H Q, Plumlee H R. The turn-on delay time of silicon p-n-p-n switches. Proc IEEE, 1970, 58(5):792 doi: 10.1109/PROC.1970.7736
[8]
Synopsys Inc. TCAD Sentaurus user manual. 2010
[9]
Heremans P L. Fast turn-off of two-terminal double heterojunction optical thyristors. Appl Phys Lett, 1992, 61(11):1326 doi: 10.1063/1.107581
[10]
Tsai M F, Tsai J H, Fan M L, et al. Variation tolerant CLSAs for nanoscale bulk-CMOS and FinFET SRAM. APCCAS, 2012:471
[11]
Ronse K, Jansen P, Gronheid R, et al. Lithography options for the 32 nm half pitch node and beyond. IEEE Trans Circuits Syst Ⅰ:Regular Papers, 2009, 56(8):1884 doi: 10.1109/TCSI.2009.2028417
[12]
Akashe S, Rastogi S, Sharma S. Specific power illustration of proposed 7T SRAM with 6T SRAM using 45 nm technology. ICONSET, 2011
[13]
Razavipour G, Afzali-Kusha A, Pedram M. Design and analysis of two low-power SRAM cell structures. IEEE Trans VLSI, 2009, 17(10):1551 doi: 10.1109/TVLSI.2008.2004590
Fig. 1.
(a) The structure of the proposed memory cell, and (b) the hysteresis curve of the new memory cell and the write voltage pulses.
Fig. 3.
The measured hysteresis curve and the SEM view of Cell 1. Note that at VAC> 1.6 V, the IA is constant because the actual current through the cell hits the compliance level of the SMU.
Fig. 5.
(a) The measured hysteresis curve of Cell 2 and its simulated hysteresis curve. Note that at VAC> 1 V, the IA is constant because the actual current through the cell hits the compliance of the SMU. (b) The write-1 operation measure of Cell 2
Fig. 6.
(a) The write-1 simulation indicates that Cell 2 can be operated at ns level with an available voltage and (b) the simulated Tdr–VH trend of Cell 2.
Fig. 9.
(a) The write-0 measures of Cell 2, and (b) the hysteresis curve of the Cell 3 in comparison with Cell 2 and the illustration of the punch-trough design.
Table 1.
The studied cells in the optimization patch.
Table 2.
The main performance of the proposed cell and the 6T SRAM cell in 45 nm process.
[1]
Kuhn K J. Considerations for ultimate CMOS scaling. IEEE Trans Electron Devices, 2012, 59(7):1813 doi: 10.1109/TED.2012.2193129
[2]
Asenov A, Kaya S, Brown A R. Intrinsic parameter fluctuations in decananometer MOSFETs introduced by gate line edgeroughness. IEEE Trans Electron Devices, 2003, 50(5):1254 doi: 10.1109/TED.2003.813457
[3]
Kampen C, Evanschitzky P, Burenkov A, et al. Lithographyinduced layout variations in 6-t SRAM cells. International Conference on Simulation of Semiconductor Processes and Devices (SISPAD), 2010:149
[4]
Liang J, Gnana R, Jeyasingh D, et al. An ultra-low reset current cross-point phase change memory with carbon nanotube electrodes. IEEE Trans Electron Devices, 2012, 59(4):1155 doi: 10.1109/TED.2012.2184542
Gibbons J F. A critique of the theory of p-n-p-n devices. IEEE Trans Electron Devices, 1964, 11(9):406 doi: 10.1109/T-ED.1964.15352
[7]
Tserng H Q, Plumlee H R. The turn-on delay time of silicon p-n-p-n switches. Proc IEEE, 1970, 58(5):792 doi: 10.1109/PROC.1970.7736
[8]
Synopsys Inc. TCAD Sentaurus user manual. 2010
[9]
Heremans P L. Fast turn-off of two-terminal double heterojunction optical thyristors. Appl Phys Lett, 1992, 61(11):1326 doi: 10.1063/1.107581
[10]
Tsai M F, Tsai J H, Fan M L, et al. Variation tolerant CLSAs for nanoscale bulk-CMOS and FinFET SRAM. APCCAS, 2012:471
[11]
Ronse K, Jansen P, Gronheid R, et al. Lithography options for the 32 nm half pitch node and beyond. IEEE Trans Circuits Syst Ⅰ:Regular Papers, 2009, 56(8):1884 doi: 10.1109/TCSI.2009.2028417
[12]
Akashe S, Rastogi S, Sharma S. Specific power illustration of proposed 7T SRAM with 6T SRAM using 45 nm technology. ICONSET, 2011
[13]
Razavipour G, Afzali-Kusha A, Pedram M. Design and analysis of two low-power SRAM cell structures. IEEE Trans VLSI, 2009, 17(10):1551 doi: 10.1109/TVLSI.2008.2004590
Ma Jiehui, Fang Gaozhan, Lan Yongsheng and, Ma Xiaoyu
Chinese Journal of Semiconductors , 2005, 26(3): 476-479.
Search
GET CITATION
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006
X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Export: BibTexEndNote
Share:
Article Metrics
Article views: 2445 TimesPDF downloads: 15 TimesCited by: 0 Times
History
Received: 13 June 2013Revised: 26 July 2013Online:Published: 01 January 2014
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006 ****X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Citation:
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006
****
X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006 ****X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Citation:
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006
****
X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
A vertical two-terminal silicon PNPN diode is presented for use in a high-density memory cell. The device design for high-speed operations was studied with experiments and calibrated simulations, which proves that the proposed memory cell can be operated at nanosecond range. The static and dynamic power dissipations were also studied, which indicated the availability of the proposed memory cell for VLSI applications. Moreover, the memory cell is compatible with CMOS process, has little impact from process variation, and has good reliability.
To meet the increasing demands for large volume density SRAM in CMOS technology, two approaches can be taken at device level. The first approach is to scale down the device's dimensions[1], which has proved effective, but comes at a high cost and became difficult for the deca-nanometer scaled CMOS process[2, 3]. Another approach is to implement an alternative functional device in the cell, so as to obtain a less complex cell structure with smaller SRAM area. In this work, a simple two-terminal vertical memory cell is proposed, which provides excellent potential for high-density integration. Moreover, the new memory cell has only two terminals, such that the cross-point array[4] with the proposed memory cell becomes conceivable. The cross-point design can achieve the highest integration density in planar technology to date and has great potential for 3-D integration[5].
2.
New memory cell concept
The new memory cell has a vertical four-layer alternatively doped structure with only two terminals, as shown in Fig. 1(a). It exhibits the hysteresis curve shown in Fig. 1(b)[6]. When the voltage between terminals A and C (VAC) increases in the forward direction, the memory cell stays at a low conductance branch until VAC reaches the latch up voltage (VLU). At VLU, the latch-up[6] is triggered and the current flowing through the diode (IA) increases abruptly. As VAC increases beyond VLU, the memory cell gets into a high conductance state and is latched until VAC returns to the latch down voltage (VLD) as VAC decreases in the reverse direction, resulting in a bistable region between the latchup and latchdown voltages (VLD < VAC < VLU). The memorization mechanism is as following. When VAC is applied between VLD and VLU, there are two stable states for the leakage current that can be defined as “1” or “0” (I1 or I0 respectively). “1” or “0” can be stably conserved once VAC keeps constant. To change the cell into “1” state from “0” state, i.e. to write “1” state, a positive voltage pulse is used to latch up the diode. To change the cell into “0” state from “1” state, i.e. to write “0” state, a negative voltage pulse is used to latch down the PNPN diode. For reading the stored state, the large I1/I0 ratio (≈104, which will be shown in a later experiment) can be utilized for read operation.
Figure
1.
(a) The structure of the proposed memory cell, and (b) the hysteresis curve of the new memory cell and the write voltage pulses.
This section is divided into four parts: write-1 optimization, write-0 optimization, read operation and discussion. Write-1 optimization discusses the optimization methods used to increase the write-1 speed. Write-0 optimization discusses the optimization method used to increase write-0 speed. The read operation paragraph discusses the read method and the read speed. Finally, the discussion paragraph describes the process variation, reliability, and power dissipation in the retention and writing “1”/“0” operations.
3.1
Write-1 optimization
To begin the study, the PNPN diode, named Cell 1, was fabricated on the CMOS line. Only a few steps were needed, as shown in Fig. 2. The fabrication flow is compatible with the traditional CMOS process and there is no high-temperature process when integrated with the CMOS technology. The doping concentration DP1 and DN2 are at 1020 cm−3 level, and the N1 and P1 layer are separately at 1 × 1017 cm−3 and 1 × 1018 cm−3 doping level with the thickness of 0.8 μm and 0.2 μm, respectively. For the convenience of direct probe testing, the cell was fabricated with a 50 × 50 μm2 active cell area, as shown in Fig. 3. Although it has a large area, the fabricated cell can reflect the electronic performances of a small device well due to its quasi 1-D structure (the effect of the diode sidewall surface can be ignored). The measured static I–V curve of Cell 1 is given in Fig. 3, which exhibits a good hysteresis property with a high/low conductance current ratio of about 104 times.
Figure
2.
The process flow of the PNPN diode memory cell.
Figure
3.
The measured hysteresis curve and the SEM view of Cell 1. Note that at VAC> 1.6 V, the IA is constant because the actual current through the cell hits the compliance level of the SMU.
Write-1 measurements of Cell 1 were thus made. The measuring circuit is shown in Fig. 4(a). It is hard to measure IA directly, so a resistor was connected in series with the diode and the voltage VA was measured to see the state transition. The first attempt was at hundreds of μs speed level with a VH= 4.4 V >VLU (Vhold is chosen to be 0.8 V and the reason why will be discussed later). The measurement result is shown in Fig. 4(a). We can see that Cell 1 got into the transition state (the definition of transition state is given in Fig. 1(b)) at first and got into “1” state finally as the pulse was over. The measuring result demonstrates that Cell 1 is functional feasible, based on this measure, the measuring speed was increased to hundreds of ns level, which is the limit of our equipment. At this measuring level, we can see that an obvious delay time (Tdr)[7] exists before Cell 1 gets into the transition state (the blue line in Fig. 4(b)). Tdr is the time needed to build up the base regions' charge distribution of the transition state. Tdr limits the speed increase of write-1 operation. To increase the write-1 speed, Tdr must be reduced. We tried increasing VH to reduce Tdr. The measuring result is shown Fig. 4(b), we can see that although Tdr was reduced to about 50 ns, this delay is still large for ns range operation, moreover, a VH of 4.6 V is high for VLSI and may cause reliability problems.
Figure
4.
(a) The pulsed measuring circuit and the low speed measure, (b) the increased frequency write-1 measure, in which delay time was observed.
Besides VH, the delay time is also determined by VLU. Lower VLU can provide a faster switch-on speed. So we further tried decreasing VLU to reduce Tdr. VLU can be reduced by decreasing the centre two layers' doping concentrations or increasing the outer two layers' doping concentrations[6]. In this work, the DP2 was reduced and Cell 2 was obtained with a VLU of 0.95 V, as shown in Fig. 5(a). Pulsed transient measure was thus conducted. In Fig. 5(b), Tdr was not observed because it is too small, which indicates that the optimized cell has good potential for high-speed write-1 operation. To study its ns-level performance, a calibrated simulation was used. The fabricated memory cell was first rebuilt with Sentaurus Process simulation[8]. Then, the Sentaurus Device simulation tool[8] was used for this rebuilt cell to mimic the measure. The physical models used in device simulations were Fermi statistics; basic mobility model describing the phonon scattering; Masetti model describing the doping dependence of mobility; Canali model describing mobility saturation at high electric field; Slotboom model describing the bandgap narrowing; doping dependent SRH recombination with field enhancement; Auger recombination; surface recombination; nonlocal tunneling; and avalanche multiplication[9]. The simulation result is given in Fig. 6(a), which shows that a Tdr of about 0.6 ns was obtained with a VH of 2.5 V. The simulation indicated that reducing VLU is an effective method to decease Tdr and Cell 2 is available for ns level write-1 operation. Moreover, the Tdr–VH relationship was simulated and is given in Fig. 6(b). It can be seen that Cell 2 has the same Tdr–VH trend as Cell 1. Based on this relationship, optimization can be used between Tdr and VH for practical application.
Figure
5.
(a) The measured hysteresis curve of Cell 2 and its simulated hysteresis curve. Note that at VAC> 1 V, the IA is constant because the actual current through the cell hits the compliance of the SMU. (b) The write-1 operation measure of Cell 2
From Fig. 6(a), it can be seen that there are other two pieces of time that also limit write-1. They are Ts and Tdf. In next two paragraphs, we will discuss Ts and Tdf and their optimization method respectively.
Figure
6.
(a) The write-1 simulation indicates that Cell 2 can be operated at ns level with an available voltage and (b) the simulated Tdr–VH trend of Cell 2.
Ts is important for array operation, it is the time needed to buffer the Tdr fluctuation caused by process induced VLU variation. To determine the value of Ts, the distribution of VLU on the same wafer is measured, as shown in Fig. 7(a). According to the simulated Tdr–VLU relationship in Fig. 7(b), it can be concluded that the Tdr fluctuates between 0.55–0.80 ns. To ensure that all the cells are written into “1” with the same 2.5 V pulse, we only need to ensure the cell with the largest Tdr has enough time to be latched-up. Therefore, Ts is 0.8−0.6= 0.2 ns. Ts can be reduced by decreasing the Tdr fluctuating range, which can be realized by improving process accuracy. Note that Cell 2 was fabricated on our experimental line and thus process variations could be better controlled in industry.
Figure
7.
(a) The measured VLU distribution and (b) the simulated Tdr–VLU relationship.
Tdf is the time needed to move out the excess charges for the “1” state (the base-region charge amount of the transition state is larger than the “1” states, when the cell starts form the transition state to “1” state, these excess charges needed to be moved out). To increase write-1 speed, Tdf should be reduced too. Tdf is determined by Vhold. The base-region charge amount of the “1” state is directly proportional to Vhold, which means that more charges need to be moved out when Vhold is lower. To demonstrate this point, Cell 2 was simulated with different Vhold. The simulation results are directly given in Fig. 8(a) and summarized in Fig. 8(b), which demonstrates the Tdf–Vhold trend well. Increasing Vhold can reduce Tdf and also increases I1 at the same time. Larger I1 causes a higher 1-retention power, which will be discussed in later section. Furthermore, as Vhold approaches VLU, a storage stability problem also occurs. Therefore, comprehensive design is needed when deciding Vhold. After discussing Tdr, Ts and Tdf separately, we can clearly see that the necessary time for a successful write-1 operation is the sum of these three pieces of time. For Cell 2, this time is about 1.4 ns, which is available as a SRAM memory cell.
Figure
8.
(a) The simulated write-1 operations with different Vholds, and (b) the simulated Tdf–Vhold trend.
The write-0 speed is as important as the write-1 speed for a memory cell. To study Cell 2's write-0 speed, negative pulsed transient measures were conducted. From Fig. 9(a), it can be seen that the memory cell was not written into “0” state even when the voltage pulse was increased to −4 V towards the negative direction. The reason is that when the memory cell is in “1” state, the base regions are full of excess charges that need a long time (μs level) to be moved out. To solve this problem, we can reduce the base regions' thicknesses to introduce a punch-through design (PTD)[9]. The PTD is that the entire base regions are depleted at the designed write-0 voltage, as shown in Fig. 9(b). The PTD can improve the turn-off time by about three orders of magnitude over a traditional PNPN diode. With the PTD, the PNPN diode can be turned off in nanoseconds simply by using the anode to a negative voltage exceeding a certain threshold (Vth). Based on Cell 2, the N1 layer and P2 layer were reduced to 50 nm and 150 nm respectively in fabrication to introduce the PTD. The newly optimized cell was named as Cell 3. The measured I–V curve of Cell 3 is given in Fig. 9(b), from which we can see that a good hysteresis property was kept after the introduction of the PTD. Then, a pulsed transient measure was conducted upon Cell 3, as shown in Fig. 10(a). It can be seen that the write-0 speed was obviously improved; the punch-through threshold voltage Vth is about –1.8 V. Once again, simulation was used to study Cell 3's ns-level performance. The simulation result is given in Fig. 10(b), which demonstrates that Cell 3 has an ns-level write-0 speed with available operating voltage for SRAM application. Actually, the write-0 voltage can be further reduced by decreasing base-region thicknesses. The decrease of VL also causes the decrease of storing-0 stability. Therefore, optimization is needed between VL and storing-0 stability. To study whether Cell 3 kept the high-speed write-1 property, the alternative write operations were simulated, as shown in Fig. 11. The simulation demonstrated Cell 3 can be used as a bistable memory cell with steady high-speed programming ability. Table 1 lists the studied cells in the optimization patch.
Figure
9.
(a) The write-0 measures of Cell 2, and (b) the hysteresis curve of the Cell 3 in comparison with Cell 2 and the illustration of the punch-trough design.
A current current-latch-based sense amplifier (CLSA)[10] can be used for read operation. In read operation, the VAC is kept at Vhold, a CLSA can mirror the current flowing through the memory cell I1/I0 and output VDD/0 V to drive the circuit of next level. The read speed is determined by I1, which is about 2 × 10−5 A/μm2. This current level is available for ns level read operation with a nano-level CMOS process.
3.4
Discussion
Besides operation speed, the process variation, reliability and power dissipation (PD) are also important for a memory cell's practical application. The process variation of Cell 3 was measured, which can be directly reflected by the VLU and VLD distributions, as shown in Fig. 12(a). As a simple vertical structure, the process variation can be well controlled. Measures about reliability were also conducted. The VLU and VLD variation with the 3 V to −3 V to 3 V sweep (10−8 s period) times is given in Fig. 12(b), which indicates the high reliability of the proposed memory cell. At last, the power dissipated in the operations of the proposed cell was studied. To compare with the 45 nm 6T SRAM cell, it was supposed that Cell 3 has a 45 × 45 nm2 active cell area, which can be realized by using state-of-the-art lithography technology[11]. There are three kinds of PD in single cell operations. They are retention PD, write PD and read PD. The retention PD is the product of the retention leakage current (I1 or I0 that depends on the stored state) and Vhold. Therefore, the 1-retention PD is 40 nW and the 0-retention PD is 100 pW. There are two conditions of write PD: write-1 PD and write-0 PD. Write-1 PD is mainly dissipated in Ts. Therefore, the average PD in the write-1 cycle is approximately (VHIH)(TS/T1) (T1 is the write-1 period, IH definition is shown in Fig. 1(b)), which is about 400 nW when T1= 2 ns. For write-0 operation, the PD can be obtained by the integration of (VACIAC)/T0 (T0 is the write-0 period), which is about 100 nW when T0= 2 ns. For read operation, the PD is different in read-1 and read-0 operation. The retention current is used for read, so the read-1 PD and read-0 PD is at the same value with the 1-retention and 0-retention PD separately.
4.
Conclusion
Table 2 summarizes the main performance indicators of the proposed memory cell (Cell 3) and the 6T SRAM cell in 45 nm process[12, 13] (in fact, the 6T SRAM cell performance spreads over a wide range depending on the practical application, the data given in Table 2 is a typical range). We can see that the proposed cell offers excellent packing density, low power dissipations with available speed. Moreover, the proposed memory cell is compatible with traditional CMOS technology, has good design flexibility and good reliability due to its gateless structure. It has a great potential in a broad range of applications including embedded memory and stand-alone memory.
Table
2.
The main performance of the proposed cell and the 6T SRAM cell in 45 nm process.
Kuhn K J. Considerations for ultimate CMOS scaling. IEEE Trans Electron Devices, 2012, 59(7):1813 doi: 10.1109/TED.2012.2193129
[2]
Asenov A, Kaya S, Brown A R. Intrinsic parameter fluctuations in decananometer MOSFETs introduced by gate line edgeroughness. IEEE Trans Electron Devices, 2003, 50(5):1254 doi: 10.1109/TED.2003.813457
[3]
Kampen C, Evanschitzky P, Burenkov A, et al. Lithographyinduced layout variations in 6-t SRAM cells. International Conference on Simulation of Semiconductor Processes and Devices (SISPAD), 2010:149
[4]
Liang J, Gnana R, Jeyasingh D, et al. An ultra-low reset current cross-point phase change memory with carbon nanotube electrodes. IEEE Trans Electron Devices, 2012, 59(4):1155 doi: 10.1109/TED.2012.2184542
Gibbons J F. A critique of the theory of p-n-p-n devices. IEEE Trans Electron Devices, 1964, 11(9):406 doi: 10.1109/T-ED.1964.15352
[7]
Tserng H Q, Plumlee H R. The turn-on delay time of silicon p-n-p-n switches. Proc IEEE, 1970, 58(5):792 doi: 10.1109/PROC.1970.7736
[8]
Synopsys Inc. TCAD Sentaurus user manual. 2010
[9]
Heremans P L. Fast turn-off of two-terminal double heterojunction optical thyristors. Appl Phys Lett, 1992, 61(11):1326 doi: 10.1063/1.107581
[10]
Tsai M F, Tsai J H, Fan M L, et al. Variation tolerant CLSAs for nanoscale bulk-CMOS and FinFET SRAM. APCCAS, 2012:471
[11]
Ronse K, Jansen P, Gronheid R, et al. Lithography options for the 32 nm half pitch node and beyond. IEEE Trans Circuits Syst Ⅰ:Regular Papers, 2009, 56(8):1884 doi: 10.1109/TCSI.2009.2028417
[12]
Akashe S, Rastogi S, Sharma S. Specific power illustration of proposed 7T SRAM with 6T SRAM using 45 nm technology. ICONSET, 2011
[13]
Razavipour G, Afzali-Kusha A, Pedram M. Design and analysis of two low-power SRAM cell structures. IEEE Trans VLSI, 2009, 17(10):1551 doi: 10.1109/TVLSI.2008.2004590
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006 ****X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Xiaodong Tong, Hao Wu, Qingqing Liang, Huicai Zhong, Huilong Zhu, Chao Zhao, Tianchun Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. Journal of Semiconductors, 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006
****
X D Tong, H Wu, Q Q Liang, H C Zhong, H L Zhu, C Zhao, T C Ye. Design of two-terminal PNPN diode for high-density and high-speed memory applications[J]. J. Semicond., 2014, 35(1): 014006. doi: 10.1088/1674-4926/35/1/014006.
Figure Fig. 1. (a) The structure of the proposed memory cell, and (b) the hysteresis curve of the new memory cell and the write voltage pulses.
Figure Fig. 2. The process flow of the PNPN diode memory cell.
Figure Fig. 3. The measured hysteresis curve and the SEM view of Cell 1. Note that at VAC> 1.6 V, the IA is constant because the actual current through the cell hits the compliance level of the SMU.
Figure Fig. 4. (a) The pulsed measuring circuit and the low speed measure, (b) the increased frequency write-1 measure, in which delay time was observed.
Figure Fig. 5. (a) The measured hysteresis curve of Cell 2 and its simulated hysteresis curve. Note that at VAC> 1 V, the IA is constant because the actual current through the cell hits the compliance of the SMU. (b) The write-1 operation measure of Cell 2
Figure Fig. 6. (a) The write-1 simulation indicates that Cell 2 can be operated at ns level with an available voltage and (b) the simulated Tdr–VH trend of Cell 2.
Figure Fig. 7. (a) The measured VLU distribution and (b) the simulated Tdr–VLU relationship.
Figure Fig. 8. (a) The simulated write-1 operations with different Vholds, and (b) the simulated Tdf–Vhold trend.
Figure Fig. 9. (a) The write-0 measures of Cell 2, and (b) the hysteresis curve of the Cell 3 in comparison with Cell 2 and the illustration of the punch-trough design.
Figure Fig. 10. (a) The write-0 measures of Cell 3, and (b) the ns-level write-0 operation simulation of Cell 3.
Figure Fig. 11. (a) The process variation and (b) the reliability measure of Cell 3.