1. Introduction
Along with the development of integrated circuit (IC) technology, more and more functions are able to be integrated in a single chip, termed system on a chip (SoC). In an SoC, analog circuits are normally compliant with digital circuits in terms of manufacturing process, and therefore some analog architectures which are suitable for scaling become more popular. Taking analog-to-digital converters (ADCs) as an example, among different topologies of data converters, the successive approximation register (SAR) ADC with its advantage of low power, is well compatible with advanced processes because it has no other active analog circuits and only one comparator. Thanks to new switching schemes and faster processes, SAR ADCs have achieved several tens of, MS/s to low GS/s sampling rates with 5-bit to 10-bit resolutions, which are necessary building blocks for 802.11/a/b/g wireless networks, digital TV applications, and mobile applications, such as DVB-T, DVB-H, and TDMB[1-3]. In battery-powered SoC applications, SAR ADCs are widely adopted for their high power efficiency. Another feature of an ADC is that its sampling frequency can be adjusted for different applications.
2. ADC architecture and function
Figure 1 shows the architecture of the 10-bit SAR ADC. It is comprised of differential capacitor arrays, a dynamic latch comparator with a special preceding source-follower and SAR logic with a high-speed clock inside. The capacitor array functions as a 9-bit DAC (digital to analog converter), which generates the required voltages for comparison. A differential structure is applied to overcome the non-ideal effects of the switched capacitor circuit. A latch structure is widely used in SAR ADC comparators for its high speed. The special source-follower is proposed to diminish the leakage current. SAR logic requires a clock with at least ten times the sampling clock frequency and a tunable high speed clock generator is implemented. Detailed information about the circuit design will be described in the following sections and this section focuses on the function of the SAR ADC.
The fact, that SAR ADC can achieve more than 10 MS/s sampling rate nowadays, mainly owing to the possibility of using small total capacitance and the speed benefit from the advanced technology. As we know, the most time-consuming part of a whole SAR conversion is the settling time taken by the inner DAC. As the settling time is proportional to time constant, the smaller the total capacitance, the faster speed the SAR ADC can achieve. Further, smaller total capacitance also leads to less energy being stored on the capacitive array. Therefore, a small total capacitance is of great importance for high speed and low power SAR ADCs. Two techniques are used in this design to obtain small total capacitance, the top-plate-sample switching scheme and the split capacitor structure.
Reference [2] did an excellent job with the top-plate-sample switching procedure. Rather than using a split capacitor structure, it obtained small total capacitance by making use of three metal layers to create only a 4.8 fF unit capacitor (
The split structure with the unit bridge capacitor (
2.1 Switching scheme
Figure 2 shows the voltage change of the capacitive arrays in a cycle, which helps to understand how the ADC works. During the sampling phase, sampling switches
vp0−vn0=vin. |
(1) |
Then sampling switches
vpni=vpi−vni=vpn(i−1)−b(i−1)×29−i511×(vref+−vref−), |
(2) |
where the bit comparing result is
bi={1,vpi>vni,−1,vpi<vni. |
Equations (2) are derived from Eqs. (4) and (5) presented in Section 3 with
The ADC converts analog signal ranging from
2.2 Timing strategy
More details about the timing strategy of the ADC are illustrated in Fig. 3. As shown in the upper part of Fig. 3, there are nine equal conversion phases (cnv1 to cnv9) and one sample phase in each cycle. The sampling clock (sampleclk) is frequency-fixed and provided outside the chip, while the oscillating clock (selfclk) is generated inside the chip and its frequency can be adjusted. The positive edge of "sampleclk" ends the sample phase and starts the conversion phase. After nine equal conversion phases, the tunable clock generator (selfclk) stops oscillating and the rest time of this cycle is for the sample phase. As the cycle is determined by the "sampleclk" which is fixed, the sum of nine conversion phases and sample phase is constant. Therefore the time distribution between the conversion phases and the sample phase can be controlled by adjusting the "selfclk" frequency. The faster the "selfclk" is, the longer the sample phase is.
During each conversion phase, several actions are operated one by one shown in the lower part of Fig. 3. Firstly, the rising edge of "selfclk" triggers the start of comparison (strobe). When the comparison is finished (take
The frequency of "selfclk" can be used to adjust the time distribution between the conversion phases and the sample phase, so can the duty cycle of "selfclk". The falling edge of "selfclk" triggers the preamplifier (pre-amp) in the comparator to turn from reset stage (reset) to release stage (release). So the lengths of the two stages of the preamplifier can be adjusted by the duty cycle of "selfclk".
3. Parasitic analysis in 9-bit DAC
3.1 Parasitic capacitance
The 10-bit SAR ADC needs a 9-bit DAC to generate voltage to compare. Usually a capacitor array is a reasonable choice for medium resolution and high-speed applications. Instead of using a binary weighted structure, a split capacitive array is adopted in this design to achieve small total capacitance, which is shown in Fig. 4.
The biggest feature of the split structure is that a bridge capacitor
When a switch for
ΔVLOWER_TOP_CiL=CiL[4∑k=1C2k−1L+5∑k=1C2k−1HCbridge×(5∑k=1C2k−1H+Cbridge)−1+Cparasitic]−1ΔV. |
(3) |
Thus, the voltage change on the top plate of the higher 5-bit side is
ΔVHIGHER_TOP_CiL=Cbridge5∑k=1C2k−1H+Cbridge×ΔVLOWER_TOP_CiL=iCu511Cu+32CparasiticΔV. |
(4) |
Similarly, when a switch for
ΔVHIGHER_TOP_CiH=CiH[5∑k=1C2k−1H+(4∑k=1C2k−1L+Cparasitic)Cbridge(4∑k=1C2k−1L+Cparasitic)+Cbridge]ΔV=i(16Cu+Cparasitic)511Cu+32CparasiticΔV. |
(5) |
When
To fairly compare the two gains, the gain of
16ΔVHIGHER_TOP_CiLΔVHIGHER_TOP_CiH=16Cu16Cu+Cparasitic<1. |
(6) |
The gain difference leads to a large DNL (differential nonlinearity) error between every "xxxxx1111" and "xxxxx0000". Figure 5 illustrates such
LSB=5∑j=1ΔVj−1HIGHER_TOP_C2H+4∑j=1ΔVj−1HIGHER_TOP_C2L511, |
(7) |
DNLmax_Cparasitic=(ΔVHIGHER_TOP_C1H−4∑j=1ΔVj−1HIGHER_TOP_C2L−LSB)/LSB=480Cparasitic511Cu+31Cparasitic. |
(8) |
From Eq. (8), small
The three-row layout method successfully eases the connection complexity, which makes the key parasitic capacitance very small. In this design, each capacitive array consists of three capacitor rows, and each row contains 35 unit capacitors (Fig. 6). Three rows from the top down correspond to lower 4 bits, the bridge capacitor and higher 5 bits (Fig. 4), which employ 15, 1, and 31 unit capacitors, respectively. At each row, the used unit capacitors are symmetrically distributed in the middle, while the redundant unit capacitors are used as dummies and distributed on two sides. In this way, the connection of the common top plate is short and the key parasitic capacitance to ground (
However, the measurement DNL
16ΔVHIGHER_TOP_CiLΔVHIGHER_TOP_CiH=16(Cu+ΔCbridge)16Cu+Cparasitic+ΔCbridge=1,ifΔCbridge=Cparasitic15. |
(9) |
From Eq. (9), the nonlinearity is able to be calibrated if
3.2 Parasitic resistance
The charge redistribution time is critical for a high speed SAR ADC. As the higher bits have larger voltage to settle, the higher the bit is, the stricter its requirement is. To simply the analysis, the bridge capacitor and the lower 4-bit side are modeled as a
The sampling switch uses the bootstrapping technique and others are common MOS switches. The on-resistance of each MOS switch is carefully designed and follows the next principle. The principle is that the product of
During the sample phase, the input signal is sampled onto the capacitive array through
τsample=(Rsample+Ru32)×32Cu=RsampleCtotal+RuCu. |
(10) |
During the conversion phase, the sample switch is turned off. And one branch with the weight of x changes its bottom-plate voltage from
τconversion=(Rux+Ru32−x)xCu(32−x)CuxCu+(32−x)Cu=RuCu. |
(11) |
From Eq. (11) we can see that all the branches have the same
The phase time should be long enough to make sure that the dynamic error is smaller than one LSB,
errordynamic=Vtransferexp(−Tphaseτphase)<1,LSB, |
(12) |
where
4. The proposed comparator
Figure 8 shows the schematic of the proposed comparator. The comparator is comprised of four parts, an anti-leakage source follower, a preamplifier, a dynamic latch and an SR latch to lock the comparing result. The source follower in the first stage, whose gain is smaller than 1, seems to make no sense here, because the less-than-1 gain cannot help reduce offset but lowers speed and adds additional current. The key lies in the differential input MOS transistors of the first stage, which are native 2.5 V transistors. They are designed to eliminate the gate leakage current of the comparator.
4.1 Overcome leakage current
The total 720 fF capacitive array is sensitive to charge leakage, and a 115.2 nA leakage current can result in one LSB voltage drop. As shown in Fig. 1, the differential input transistors of the comparator are connected to the capacitive arrays and charge can leak out through the transistor gate. The standard transistor in the 65 nm GP process has more than 100 nA leakage current in extreme corner and temperature for its ultra-thin gate oxide, so the 2.5 V IO transistor with a thick gate oxide is a good choice. However, there comes another problem that the IO transistor has higher than 0.7 V threshold voltage and the 0 V to 1 V range input cannot guarantee to turn it on. Finally a pair of native 2.5 V IO transistors is used to form a pseudo differential input stage of the comparator.
Another current leakage source is the channel leakage current of the bootstrap switch, which is not that serious and is diminished to an endurance amount by using high threshold voltage transistors in place of standard transistors.
4.2 Meta-stability
All latching comparators have the problem of meta-stability. The comparator speed requirement for the meta-stability effect can be calculated by the following simple model in Fig. 9.
From the model we can achieve the following expressions,
i=Cdvdt=(A−1)vR, |
(13) |
v=v0exp(A−1)tτ, |
(14) |
where
In the SAR ADC timing strategy, output voltage
Vini_mim=Vlogic_trigexp[−(A−1)Tcompare_maxτ]≪1,LSB. |
(15) |
The actual comparator behavior is much more complicated. Three operation phases can be divided[7]. Usually, the last phase during which only the cross-coupled inverters M10–13 are active takes most of the time and performs as the simple latching model. The actual time constant of the comparator can be achieved by simulation. In our design, the 65 nm GP process provides a rather fast comparator and Equation (15) can be easily satisfied.
However, no matter how fast the comparator is, it always has the possibility of taking more than
5. SAR logic and clock
Comparing to conventional SAR logic, a flip-flop delay is saved by making better use of the logical information. Another important part of the SAR logic is the inner ultra-high speed clock (selfclk). As there is no PLL (phase locking loop) provided inside the chip, a tunable clock generator is designed.
5.1 Critical path optimization
A small logical optimization is depicted in Fig. 10. The arc in the upper circuits shows the data critical path in the conventional SAR logic. By making further use of the "begin" signal information, the lower circuits are implemented with a one-OR-and-one-"D flip–flop"-long data path. For stage i, the OR result of “begin <i>” and "cmp_done" triggers “D <i>”. As “begin <i>” lasts high once "start" passed by, the OR result remains high and “D <i>” will not be disturbed by the next "cmp_done" rising edges. At the same time, “begin<i+1>” is sent to reset “D <i>”, which guarantees that no "cmp_done" rising edge can trigger “D <i>” until "start" has passed by “DB<i+1>” (“begin<i+1>” remains high). The critical path is relaxed for one flip-flop delay. But given that this logical delay will repeat 9 times, so every tiny improvement makes a small contribution to speeding up the conversion. Besides, buffering should be paid attention as the scaling process leads to a smaller buffer capability. The buffering time cost on a wire-to-wire parasitic capacitor is almost as long as the logical delay time.
5.2 Tunable high speed clock generator
The schematic in Fig. 11 is the tunable clock generator used in the design. M1–M9 mirrors the current
6. Measurement results
The prototype was fabricated using 1P9M (one-poly-nine-metal) 65 nm GP CMOS technology. The full micrograph of the core is shown in Fig. 12. The total core area of the chip is 0.13 mm
Figure 13 shows the ADC output spectrum with the input frequency at 1.8 MHz under 80 MS/s sampling rate. The dynamic performance of the ADC is shown in Fig. 14 with sampling frequency sweeping from 10 to 85 MHz. The measured DNL and INL are shown in Fig. 15. All the measured performance results are summarized in Table 1 and Table 2 shows the comparison between the proposed ADC and previous work. From Table 2 we can see that this paper achieves the highest sampling speed among the comparative literature. Its power efficiency is much higher than the pipelined ADC with a similar spec and the SAR ADC in low speed. The state-of-the-art work[11] benefits from the SOI process while this design overcomes the new problems resulting from the ultra-deep sub-micron standard CMOS process, which has an urgent need in communicating and consumer electronics application. Reasons and improvement solutions to the relatively lower SINAD are presented in Section 7.
![]() |
![]() |
7. Improvement solutions
In this section, the main reasons for the relatively lower SINAD are summarized and feasible improvement solutions are provided.
7.1 Bonding wire and reference oscillation
The DAC in the 10-bit 80-MS/s SAR ADC works in more than 800 MHz frequency, so its two references
Figure 16 illustrates the circuit model.
7.2 The DAC and comparator's non-ideal effect
An obvious observation of the INL diagram in Fig. 15 is that it has 32 regular slopes. Careful analysis finds that all of, the 32 slopes fall between the bit "xxxxx1111" and "xxxxx0000". This is a strong evidence for split-structure distortion. Based on the analysis in Section 3.1, the distortion is mainly caused by parasitic capacitance and bridge capacitor mismatch. Because of the high sensitivity of the structure and the randomness of mismatch, calibration is a necessity to achieve 10-bit resolution using a split structure. The calibration can be implemented by adjusting the
The comparator meta-stability mentioned in Section 4.2 is also a factor that affects measurement results. Another significant factor of the comparator is mismatch. Its statistic mismatch causes ADC offset, while its dynamic mismatch leads to ADC distortion. Similar to the DAC, the comparator mismatch needs to be calibrated as well.
8. Conclusion
A 10-bit 80-MS/s reference-free PLL-free SAR ADC for communication and consumer electronics application is presented. The top-plate sampling scheme and the split capacitive structure lead to a small total capacitance of 720 fF. All of the design is in 1-V supply and special transistors are chosen to overcome the charge leakage of the 65 nm GP process. Without using a PLL, a tunable high-speed clock generator is implemented. The prototype ADC occupies an active area of 0.13 mm