A 224-Gb S DAC-Based PAM-4 Quarter-Rate Transmitter With 8-Tap FFE in 10-Nm FinFET
A 224-Gb S DAC-Based PAM-4 Quarter-Rate Transmitter With 8-Tap FFE in 10-Nm FinFET
1, JANUARY 2022
Abstract— This article presents analysis, design details, and applications continue increasing exponentially, and this will
measurement result of a 224-Gb/s four-level pulse amplitude continue to drive the need for higher per-pin bandwidth
modulation (PAM-4) transmitter (TX) consisting of a 7-bit voltage density. Therefore, as 100–116-Gb/s transceiver standards and
digital-to-analog converter (DAC) driver, digital 8-tap feed-
forward equalizer (FFE), and a 28-GHz inductively peaked clock implementation mature, development has already started on
distribution network. The TX DAC uses quarter-rate clocking the next generation of copper signaling, which will double the
with a 4:1 pulse-based data serialization architecture. Design per-pin data rate to 200–232 Gb/s. This article describes a 7-bit
techniques for generating and distributing low-jitter CMOS digital-to-analog converter (DAC)-based transmitter (TX) with
clocks up to 29 GHz, timing closure in the serializer, 112-Gbaud a digital feed-forward equalizer (FFE) capable of sending data
4:1 data MUX using 1-UI pulse generator, and bandwidth/return
loss/group delay optimized output pad network using a 9th-order up to 232 Gb/s using PAM-4 modulation.
LC filter are described. Fabricated in the Intel 10-nm FinFET Doubling the data rate for a wireline TX to 200–232 Gb/s
process technology, the TX demonstrates random jitter (RJ) requires addressing several fundamental design challenges.
of 65 fsrms with nominal output swing of 1.0 Vppd at 224 Gb/s The first challenge is increasing the analog bandwidth of
achieving 1.88-pJ/b energy efficiency including an on-die LC the data path while maintaining about 1-Vppd nominal output
phase-locked loop (PLL). To the best of authors’ knowledge, this
TX achieved the highest data rate with the lowest RJ for CMOS swing and adequate linearity for PAM-4 signaling. The band-
SerDes TXs reported to date. width requirement depends on the data modulation scheme
and the baud rate used for data transmission. For example,
Index Terms— 10 nm, 4:1 serializer, CMOS, digital-to-analog
converter (DAC), feed-forward equalizer (FFE), FinFET, four- the shift from non-return-to-zero (NRZ) to PAM-4 signaling
level pulse amplitude modulation (PAM-4), I/O, LC filter, match- in the 56-Gb/s generation of LR copper SerDes standards
ing network, quarter rate, SerDes, transmitter (TX). provided a path to double the data rate without (to first order)
increasing the baud rate or analog bandwidth for 28-Gb/s
NRZ transceivers. However, this change in modulation came
I. I NTRODUCTION at the expense of signal-to-noise ratio (SNR) and raw bit-error
rate (BER), which must be offset by forward error correction
W IRELINE IOs have doubled per-lane data rate every
3–4 years over the past two decades due to increasing
aggregate bandwidth demand in high-performance computing,
(FEC) logic that adds power and delay/latency. Standards for
200–232 Gb/s such as IEEE Ethernet and OIF-CEI are in
networking/communications, and most recently from machine development, and the choice for modulation is still being
learning and AI [1]. Recent publications have demonstrated investigated and debated taking into consideration of the
complete long-reach (LR) electrical transceivers operating up capability of circuit/channel components. To demonstrate the
to 112–116 Gb/s using four-level pulse amplitude modulation capability of CMOS TX, we chose PAM-4 modulation because
(PAM-4) [2]–[7]. Aggregate bandwidth requirements in these 1) it is the simplest modulation (compared with other higher
order modulations, such as PAM-5/6/8) that is backward
Manuscript received April 19, 2021; revised July 1, 2021; accepted compatible to 56/112-G standards and 2) it exercises the
August 20, 2021. Date of publication September 14, 2021; date of current upper bound of the bandwidth and noise/jitter requirements for
version December 29, 2021. This article was approved by Associate Editor
Amir Amirkhany. (Jihwan Kim and Sandipan Kundu contributed equally to SerDes. If a DAC-based TX can achieve 200–232 Gb/s with
this work.) (Corresponding author: Jihwan Kim.) PAM-4, the same data rate can be achieved with higher order
Jihwan Kim, Sandipan Kundu, Ajay Balankutty, Bong Chan Kim, modulation provided that the DAC has sufficient dynamic
Stephen T. Kim, Yutao Liu, Savyasaachi Keshava Murthy, Priya Wali, Kai Yu,
Hyung Seok Kim, Chuan-Chang Liu, Dongseok Shin, Yongping Fan, and range and resolution.
Frank O’Mahony are with Intel Corporation, Hillsboro, OR 97124 USA The second fundamental challenge for doubling the TX data
(e-mail: [email protected]). rate is generating clocks with adequate phase spacing and jitter
Matthew Beach is with Foundation Devices Inc., Boston, MA 02109 USA.
Ariel Cohen and Yoav Segal are with Intel Corporation, Jerusalem 97774, to serialize and re-time the transmitted symbols to 112 Gbaud.
Israel. A 14-GHz quarter-rate clocking architecture is commonly
Peng Li is with Intel Corporation, Santa Clara, CA 95054 USA. used in 112-Gb/s PAM-4 (56 Gbaud) TXs [4], [8]–[10]. For
Color versions of one or more figures in this article are available at
https://doi.org/10.1109/JSSC.2021.3108969. 112-Gbaud operation, the same clock frequency could be
Digital Object Identifier 10.1109/JSSC.2021.3108969 maintained by doubling the number of phases (octal clocking
0018-9200 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 7
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
8 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
lower frequency [20]. Furthermore, multi-phase clocks relax power/area-efficient, but it introduces a drawback of large
the timing constraint in final data serialization [20]–[22]. output loading capacitance from the multi-segment driver. If a
Quadrature clock calibration is more complex than just four-way time interleaving scheme is used for data serializa-
correcting clock duty cycle for half-rate architectures, but tion, three out of four driver slices will be idle during any given
many designs have achieved <100-fs detection/correction time. This results in ∼4× device and interconnect capacitance
resolution for QEC/DCC [8], [9], [15], [18]. The advantage overhead at the pad compared with a single driver. Our
of quarter-rate clocking over half-rate clocking becomes simulations indicated that the single-stage approach would not
more prominent when the baud rate of the TX is doubled be effective for the 224-Gb/s TX even when using inductors
to 100–116 Gbaud. A half-rate clocking architecture with for pad bandwidth extension. On the other hand, the two-stage
a 2:1 serializer requires 50–58-GHz clock generation and approach minimizes the output capacitance with usage of a
distribution, which would not be a power-efficient option in dedicated driver. However, the internal 4:1 MUX pre-driver
modern CMOS technologies. Moreover, meeting the 1-UI must support the full analog bandwidth of the TX signal. The
timing window at the final 2:1 MUX across PVT variation size of the pre-driver and the driver must be carefully chosen
would be a strenuous design challenge. to minimize the overall power consumption while satisfying
Another possible clocking option for a 100–116-Gbaud TX the full-rate bandwidth requirement.
is using the eight-phase (octal) clocking and an 8:1 data To maximize the swing, linearity, and bandwidth, we used a
serialization. This approach would further relax the design two-stage output stage consisting of the actively peaked CML
burden for clock generation, distribution, and timing constraint 4:1 MUX cascaded with the NMOS CML driver.
in the data path. But it would increase the complexity of
the clock calibration scheme since eight clock phases have
to be accurately calibrated. The bandwidth of the 8:1 MUX III. C IRCUIT I MPLEMENTATION : C LOCKING
is another design hurdle to tackle since higher multiplexing
A. Inductively Peaked CMOS Clock Buffer
factors come with more parasitic capacitance at the MUX
output. Despite these challenges, octal clocking is still a viable This section analyzes properties of inductively peaked
solution for future development of 100–116-Gbaud TXs. CMOS clock buffers that were used within the clock distrib-
The quarter-rate clocking with 4:1 serialization architecture ution network. Although a clock buffer is a non-linear time-
was chosen in this prototype TX because it achieves a good varying system, useful insights can still be drawn by treating
compromise between the circuit bandwidth, power, jitter, and its output network as a linear time-invariant (LTI) system with
design complexity. proper input excitations. Fig. 2 shows three types of CMOS
clock buffers: inductor-less, shunt-series peaked, and series-
D. Driver shunt peaked buffers with their simplified models and transfer
The source-series terminated (SST) driver and current-mode functions. From the analysis using the models and circuit
logic (CML) driver are the two main circuit topologies used simulation results, we can explain the following properties of
in SerDes TX drivers. The SST driver as in [3], [15], and [18] the inductively peaked clock buffers.
is straightforward to implement and works seamlessly with 1) Jitter Filtering (Attenuation): Fig. 3 shows the mag-
CMOS logic in pre-driving stages. The SST driver generally nitude and phase responses of the inductor-less and the
exhibits good linearity because it does not suffer from voltage shunt-series peaked buffers. The series-shunt peaked buffer
headroom and mismatch in bias current that limit the linearity exhibits similar frequency response to the shunt-series one,
performance of the CML-based counterpart. However, good so it is not shown for this comparison. The presence of a
linearity and large swing require a relatively large SST switch zero in the transfer functions of the inductively peaked buffers
size such that the discrete resistors contribute most of the pull- creates a bandpass characteristic, which attenuates HF random
up/down impedance. This sizing trade-off leads to high power jitter (RJ) caused by thermal noise in the clock buffers (i.e.,
consumption in the pre-driving stage and clock distribution. it reduces the integrated voltage noise at the output). At the
In addition, the output swing of the SST driver is set by same time, a sharper slope at the transition point due to
the supply voltage of the driver. To increase the swing level extended bandwidth reduces the conversion of intrinsic buffer
beyond nominal supply voltage, special circuits are required voltage noise into jitter. Because of these two effects, the
for level shifting and device protection from overstress. inductively peaked buffers attenuate jitter as the clock passes
On the other hand, the NMOS-type CML driver as in [4], through them. This is a key advantage over conventional
[8]–[10], and [16] can generate high output swing without CMOS clock buffers which tend to amplify high-frequency
requiring level shifters or device protection circuits in the pre- jitter due to incomplete voltage level settling.
driver. The 4:1 data serialization can be performed in current 2) Lower Buffer Delay: The presence of a zero in the
domain using a transconductance (G m ) stage [9], [23] or in the transfer function of inductive buffers also provides a phase
voltage domain using CMOS logic circuits [3]. An important lead (positive phase), which makes the output edge appear
decision is whether to use the CML 4:1 MUX as a direct faster than that of inductor-less buffer with phase lag (negative
driver as in [20] or to cascade it with a dedicated output CML phase). Thus, the input-to-output delay for the inductively
driver as in [9] and [23]. The first option removes internal, full- peaked clock buffer is smaller than the inductor-less one,
rate nets except for the output pad where passive inductors which enables less susceptibility to the supply noise. We use
are used to extend the bandwidth. This approach can be this delay property in the quadrature clock generator to vary
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 9
Fig. 2. Three types of CMOS clock buffers. (a) Inductor-less. (b) Shunt-series inductively peaked. (c) Series-shunt inductively peaked. Here, F(Vin ) represents
a describing function to model the non-linear, time-varying voltage-dependent, voltage-source-based clock buffer.
Fig. 3. Magnitude and phase responses of (a) inductor-less and (b) shunt-
series inductively peaked CMOS clock buffers using their transfer functions.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
10 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 11
Fig. 8. Data path: (a) overall architecture and (b) timing loops.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 13
Fig. 11. (a) Block diagram of the 8:1 and 4:1 data serialization path.
(b) Timing diagram of the 4:1 serialization path.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
14 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
Fig. 12. DAC: (a) segmentation and layout floor plan of the DAC, (b) a schematic of the unit slice, (c) waveforms of pulse generator with VSSHI, and
(d) simulated waveforms at nodes X and Y of DAC unit slice.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 15
Fig. 14. (a) Jitter amplification simulation method. (b) Simulated jitter ampli-
fication. (c) Simulated magnitude of jitter transfer function for falling–rising
and falling–falling edge-based pulse generators.
Fig. 17. (a) Schematic. (b) Layout floor plan with a full 3-D EM simulated
mode. (c) Simulated voltage gain and group delay. (d) Simulated PRBS13 eye
diagram of the output pad network.
Fig. 15. (a) Schematic diagrams. (b) Simulated S21 and S22 of cascaded LC
networks.
Fig. 18. (a) Block diagram and a picture of test platform. (b) Measured S21
of the package and on-package connector.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
16 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
Fig. 19. Measurement results of the TX. (a) Eye diagrams for 56, 112, 116 G baud operations. (b) RJ with clock distribution supply and temperature sweep.
(c) Output reflection of the TX.
V. M EASUREMENT R ESULTS
The TX was fabricated in the Intel 10-nm FinFET tech- Fig. 20. Measured 168-Gb/s PAM-8 eye diagram (56 Gbaud).
nology. All measurements were done through the on-package
connector, in a similar way to what was used in [8]. This measured to be 2.4/2.8/2.4 ps and 90/110/90 mV, respectively,
connector brings out the high-speed signal with less than at 224 Gb/s or 1/9/2.3/1.8 ps and 75/92/75 mV, respectively,
4-dB insertion loss at 56 GHz not including the extra cable, at 232 Gb/s for BER of 1e-4. Fig. 19(b) shows the 112-Gbaud
SMA connector, and the dc block. The measurement setup is clock pattern RJ performance over a temperature and clock
illustrated in Fig. 18 along with measured insertion loss of the distribution supply voltage sweep. The TX demonstrates open
package and the on-package connector. eyes and <100-fsrms RJ until the temperature was increased
The eye diagrams measured using Keysight’s 100-GHz to 110 ◦ C and the supply voltage was lowered to 0.8 V. The
bandwidth real-time oscilloscope (UXR1002A) are presented return loss was measured with Keysight’s 60-GHz bandwidth
in Fig. 19(a). Using 1.8-V (PLL), 0.8-V (clock distribu- vector network analyzer (VNA), and the result is presented
tion), 1-V (data path), and 1.5-V (driver) supplies, the TX in Fig. 19(c). It shows good correlation to simulation while
achieved 154-fsrms RJ, 376-fspp DJ at 56-Gbaud (half-rate) meeting the design target scaled up in frequency from the
clock pattern with a 4-MHz 1st-order CDR filter applied at Ethernet 100G return loss guideline. Note that the frequency-
the scope. Configuring the clock distribution to the HF mode, domain return loss guideline will be replaced by effective
the TX achieved 65-fsrms RJ, 247-fspp DJ at 112 Gbaud, and return loss (ERL) for the compliance specification in upcom-
69-fsrms RJ, 276-fspp DJ at the 116-Gbaud operations. This ing standard. A measured 168-Gb/s (56 Gbaud) QPRBS-13
result confirms that the HF mode clock distribution has lower PAM-8 eye diagram using the LF clock distribution is shown
HF jitter due to the jitter filtering effect of the inductively in Fig. 20. TX FFE was enabled, but the channel loss
peaked CMOS buffers. The NRZ and PAM-4 eye diagrams de-embedding and scope’s equalization were not used.
with a QPRBS-13 data pattern are also presented in Fig. 19(a). The clock spacing error calibration was done by FSM, and
The TX demonstrated 1.0-Vppd swing and shows no sign the measured DCC/QEC range and resolution are presented
of bandwidth degradation at 56 Gbaud. For 112/116-Gbaud in Fig. 21. The coarse and fine control of quadrature clock
operations, we applied TX FFE (C−1 /C0 /C+1 /C+2 /C+3 = generator (quad-gen) demonstrate 3.3-ps range with <300-fs
−0.01/0.86/−0.1/−0.02/0.01 at 224 Gb/s) and channel loss resolution and 2-ps range with <60-fs resolution, respectively.
de-embedding to maximize the eye opening but did not use any The resistor control for the quad-gen provided additional 7-ps
scope equalization. The PAM-4 eye widths and heights were range with <700-fs step size. The DCC control through 7-bit
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 17
TABLE I
C OMPARISON TABLE OF THE S TATE - OF - THE -A RT S ER D ES TX S W ITH D ATA R ATE H IGHER T HAN 100 Gb/s
Fig. 23. (a) Chip micro-photograph. (b) Power break-down of the TX.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
18 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
jitter performance among SerDes TXs reported to date. The [16] C. Loi et al., “A 400 Gb/s transceiver for PAM-4 optical direct-detect
performance of the TX is compared with other state-of-the-art application in 16 nm FinFET,” in IEEE Int. Solid-State Circuits Conf.
(ISSCC) Dig. Tech. Papers, Feb. 2019, pp. 120–121.
TXs in Table I. The results demonstrate the effectiveness of the [17] E. Groen et al., “A 10-to-112 Gb/s DSP-DAC-based transmitter with
architecture choices and design techniques using PAM-4 mod- 1.2 Vppd output swing in 7 nm FinFET,” in IEEE Int. Solid-State Circuits
ulation. It provides a feasible path for the next-generation Conf. (ISSCC) Dig. Tech. Papers, Feb. 2020, pp. 120–121.
[18] M. A. Kossel et al., “An 8b DAC-based SST TX using metal gate
SerDes TX that can support direct backward compatibility resistors with 1.4 pJ/b Efficiency at 112 Gb/s PAM-4 and 8-tap FFE
with the existing 56/112 Gb/s PAM-4 signaling ecosystem. in 7 nm CMOS,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig.
Tech. Papers, Feb. 2021, pp. 130–131.
[19] T. Toifl et al., “A 0.3 pJ/bit 112GB/S PAM4 1+0.5D TX-DFE precoder
ACKNOWLEDGMENT and 8-tap FFE in 14NM CMOS,” in Proc. IEEE Symp. VLSI Circuits,
Jun. 2018, pp. 53–54.
The authors thank Tzu-Chien Hsueh, Gurmukh [20] J. Kim et al., “A 112 Gb/s PAM-4 56 Gb/s NRZ reconfigurable
Singh, Zeev Toroker, Vladislav, Tsirkin, Eran Maday, transmitter with three-tap FFE in 10-nm FinFET,” IEEE J. Solid-State
Noam Familia, Alexander Pogrebinsk, Yoel Krupnik, Circuits, vol. 54, no. 1, pp. 29–42, Jan. 2019.
[21] A. A. Hafez, M.-S. Chen, and C.-K. Yang, “A 32–48 Gb/s serializing
Ziguo Qian, Cemil Geyik, Ling Li Ong, Kin Wai Lee, transmitter using multiphase serialization in 65 nm CMOS technology,”
Dennis Baker, Byron Grossnickle, Jonathan Fernow Jr, IEEE J. Solid-State Circuits, vol. 50, no. 3, pp. 763–775, Mar. 2015.
Eric Karl, Ying Zhang, and Gary Patton for their contribution [22] J. Kim et al., “A 16-to-40 Gb/s quarter-rate NRZ/PAM4 dual-mode
transmitter in 14 nm CMOS,” in IEEE Int. Solid-State Circuits Conf.
and support for this work. (ISSCC) Dig. Tech. Papers, Feb. 2015, pp. 60–61.
[23] Y. Frans et al., “A 40-to-64 Gb/s NRZ transmitter with supply-regulated
R EFERENCES front-end in 16 nm FinFET,” IEEE J. Solid-State Circuits, vol. 51, no. 12,
pp. 3167–3177, Dec. 2016.
[1] IEEE International Solid-State Circuits Conference 2020 Technology [24] F. H. Raab, “Class-F power amplifiers with maximally flat waveforms,”
Trends. Accessed: Apr. 8, 2021. [Online]. Available: http://isscc.org/wp- IEEE Trans. Microw. Theory Techn., vol. 45, no. 11, pp. 2007–2012,
content/uploads/sites/17/2020/03/isscc2020.press_kit_final.pdf Nov. 1997.
[2] Y. Krupnik et al., “112-Gb/s PAM4 ADC-based SERDES receiver with [25] E. Groen et al., “10-to-112-Gb/s DSP-DAC-based transmitter in 7-nm
resonant AFE for long-reach channels,” IEEE J. Solid-State Circuits, FinFET with flex clocking architecture,” IEEE J. Solid-State Circuits,
vol. 55, no. 4, pp. 1077–1085, Apr. 2020. vol. 56, no. 1, pp. 30–42, Jan. 2021.
[3] T. Ali et al., “A 460 mW 112 Gb/s DSP-based transceiver with 38 dB [26] P.-C. Chiang, H.-W. Hung, H.-Y. Chu, G.-S. Chen, and J. Lee, “60 Gb/s
loss compensation for next-generation data centers in 7 nm FinFET NRZ and PAM4 transmitter for 400 GbE in 65 nm CMOS,” in IEEE
technology,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2014,
Papers, Feb. 2020, pp. 118–119. pp. 42–43.
[4] J. Im et al., “A 112 Gb/s PAM-4 long-reach wireline transceiver using a [27] Industry Council on ESD Target Levels. Accessed: Apr. 8, 2021.
36-way time-interleaved SAR-ADC and inverter-based RX analog front- [Online]. Available: https://www.esdindustrycouncil.org/ic/en/
end in 7 nm FinFET,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) [28] M.-S. Chen and C.-K. Yang, “A 50–64 Gb/s serializing transmitter with
Dig. Tech. Papers, Feb. 2020, pp. 116–117. a 4-tap, LC-ladder-filter-based FFE in 65 nm CMOS technology,” IEEE
[5] M.-A. LaCroix et al., “A 116 Gb/s DSP-based wireline transceiver in J. Solid-State Circuits, vol. 50, no. 8, pp. 30–42, Aug. 2015.
7 nm CMOS achieving 6 pJ/b at 45 dB loss in PAM-4/duo-PAM-4 and
52 dB in PAM-2,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig.
Tech. Papers, Feb. 2021, pp. 132–134.
[6] P. Mishra et al., “A 112 Gb/s ADC-DSP-based PAM-4 transceiver for
long-reach applications with >40 dB channel loss in 7 nm FinFET,” Jihwan Kim (Member, IEEE) received the B.S.
in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, degree in electrical and computer engineering from
Feb. 2021, pp. 138–140. Hanyang University, Seoul, South Korea, in 2005,
[7] D. Xu et al., “A scalable adaptive ADC/DSP-based and the M.S. and Ph.D. degrees in electrical and
1.25-to-56 Gbps/112 Gbps high-speed transceiver architecture using computer engineering from Georgia Institute of
decision-directed MMSE CDR in 16 nm and 7 nm,” in IEEE Int. Technology, Atlanta, GA, USA, in 2007 and 2011,
Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2021, respectively.
pp. 134–136. His doctoral research focused on design techniques
[8] J. Kim et al., “A 112 Gb/s PAM-4 transmitter with 3-tap FFE in 10 nm for RF and mm-wavefront-end integrated circuits,
CMOS,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. including power amplifiers (PAs), mixers, low-noise
Papers, Feb. 2018, pp. 102–103. amplifiers (LNAs), and voltage-controlled oscilla-
[9] Z. Toprak-Deniz et al., “A 128 Gb/s 1.3 pJ/b PAM-4 transmitter with tors (VCOs) using CMOS/SiGe technologies. Since 2011, he has been
reconfigurable 3-tap FFE in 14 nm CMOS,” in IEEE Int. Solid-State with Intel’s Advanced Design, Hillsboro, OR, USA, working on designing
Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2019, pp. 122–123. integrated circuits and systems for ultrahigh-speed and low-power wireline
[10] K. Tan et al., “A 112-Gb/S PAM4 transmitter in 16 nm FinFET,” in data communications.
Proc. IEEE Symp. VLSI Circuits, Jun. 2018, pp. 45–46.
[11] C. Auth et al., “A 10 nm high performance and low-power CMOS
technology featuring 3rd generation FinFET transistors, self-aligned
quad patterning, contact over active gate and cobalt local interconnects,”
in IEDM Tech. Dig., Dec. 2017, pp. 29.1.1–29.1.4.
[12] D. Shin, H. S. Kim, C.-C. Liu, P. Wali, S. K. Murthy, and Y. Fan,
“A 23.9-to-29.4 GHz digital LC-PLL with a coupled frequency doubler Sandipan Kundu (Member, IEEE) received
for wireline applications in 10 nm FinFET,” in IEEE Int. Solid-State the B.Tech. degree in electronics and electrical
Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2021, pp. 188–189. communication engineering from IIT Kharagpur,
[13] IEEE P802.3ck 400Gb/s Ethernet Task Force. Accessed: Apr. 8, 2021. Kharagpur, India, in 2007, and the Ph.D. degree in
[Online]. Available: http://www.ieee802.org/3/ck/ electrical and computer engineering from Carnegie
[14] M. Choi et al., “An output-bandwidth-optimized 200 Gb/s PAM-4 Mellon University, Pittsburgh, PA, USA, in 2013.
100 Gb/s NRZ transmitter with 5-tap FFE in 28 nm CMOS,” in IEEE He is currently with Intel Corporation, Hillsboro,
Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers, Feb. 2021, OR, USA, working on I/O circuit research. His
pp. 128–129. current research interests include RF, high-speed
[15] C. Menolfi et al., “A 112 Gb/S 2.6 pJ/b 8-tap FFE PAM-4 SST TX in I/O, and mixed-signal IC design.
14 nm CMOS,” in IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Dr. Kundu was a recipient of the Analog Devices
Tech. Papers, Feb. 2018, pp. 104–105. Outstanding Student Designer Award in 2011.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 224-Gb/s DAC-BASED PAM-4 QUARTER-RATE TRANSMITTER WITH 8-Tap FFE 19
Ajay Balankutty (Member, IEEE) received the Savyasaachi Keshava Murthy received the
B.Tech. degree in electronic and communication B.S. degree in electronics and communication
engineering from the National Institute of Technol- from Visvesvaraya Technological University,
ogy Calicut, India, in 2001, and the M.S. and Ph.D. India, in 2005, and the M.S. degree in electrical
degrees in electrical engineering from Columbia and computer engineering from Portland State
University, New York City, NY, USA, in 2010. University, Portland, OR, USA, in 2010.
From 2001 to 2005, he was with Analog Devices, Since 2014, he has been a Component Design
Inc., Bengaluru, India. He is currently with Intel Cor- Engineer with Intel Corporation, Hillsboro, OR,
poration, Hillsboro, OR, USA. His current research USA, focused on post silicon validation of high-
interests include high-speed IOs and RF/millimeter- speed mixed-signal I/O circuits, including SerDes,
wave circuits. DDR, and PLLs for wireline applications.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.
20 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 57, NO. 1, JANUARY 2022
Dongseok Shin received the B.S. degree in electron- Peng (Mike) Li (Fellow, IEEE) received the B.S.
ics engineering from the University of Seoul, South degree in space physics from the University of
Korea, in 2004, the M.S. degree in electronics and Science and Technology of China, Hefei, China, in
computer engineering from Korea University, South 1985, the M.S. degree in physics and the M.S.E.
Korea, in 2006, and the Ph.D. degree in electrical degree in electrical and computer engineering, and
engineering from Virginia Institute of Technology, the Ph.D. degree in physics from The University of
VA, USA, in 2017. Alabama in Huntsville (UAH), in 1987, 1991, and
From 2006 to 2012, he was with the Graph- 1991, respectively.
ics Design Team, SK Hynix Semiconductor, South He began his career in 1991 as a Post-Doctorate
Korea. He is currently with the Advanced Design Researcher on high-energy astrophysics with the
Group, Intel Corporation, Hillsboro, OR, USA, Space Sciences Laboratory, University of California
where he is involved in designing digital PLLs. at Berkeley, Berkeley. In 2015, he joined Intel Corporation with the acquisition
of Altera Corporation, where he had held a similar role since 2012. Before
joining Altera in 2007, he spent nearly a decade at Wavecrest Corporation,
culminating in his seven-year tenure as the Chief Technology Officer (CTO).
Ariel Cohen received the B.Sc. degree in electrical- He has been elected as an Affiliated Professor with the Department of
engineering from Ben-Gurion University of the Electrical Engineering, University of Washington, Seattle, since 2010. He is
Negev, Israel, in 1997, and the M.Sc. and Ph.D. an Intel Fellow and the Technologist for high-speed I/O and interconnects at
degrees in neuroscience from The Hebrew Uni- Intel Corporation. He serves as a Intel’s Technical Expert and an Adviser in
versity of Jerusalem, Israel, in 2003 and 2008, high-speed I/O and link technology, standards, SerDes architecture, electrical
respectively. and optical signaling and interconnects, silicon photonics integration, optical
He is a Senior Principal Engineer with the field-programmable gate arrays (OFPGAs), high-speed simulation, debug and
Mixed-Signal IP Group, Intel, and leads the tech- test for jitter, noise, signaling, and power integrity, from design validation
nologies for 112/224-Gb/s ADC-based SerDes since to high-volume manufacturing (HVM). As a Distinguished Scientist and
2015. From 2008 to 2015, he led the development Technologist, he has contributed extensively to standards during his industry
of integrated 10-Gb/s Ethernet PHY, high-accuracy career, including PCI Express, Ethernet, Optical Internetworking Forum (OIF),
thermal sensors, and sigma–delta ADCs and DACs teams. From 2005 to 2008, JEDEC, Fiber Channel, and SATA/SAS. He has also published widely,
he took part in establishing the Bioelectronics Laboratory and developed ultra- including more than 110 referred articles, five books, and book chapters on
sensitive silicone-based biosensors for protein detection. Since 2012, he has jitter and high-speed architecture, testing, modeling, and analysis and holds
been an External Lecturer with the Department of Computing Engineering, more than 40 patents.
The Hebrew University of Jerusalem. His research interests include SerDes, Dr. Li was named as an Altera Fellow in 2012, an Intel Fellow in 2015, and
analog circuits, ADCs, thermal sensors, biosensors, and neuroelectronic an Engineer of the year in 2018 (Designcon). He served as the BOD Member
hybrids. for OIF in 2018.
Dr. Cohen was a recipient of the 2016 and 2021 Intel Achievement Awards.
Authorized licensed use limited to: Synopsys. Downloaded on November 12,2023 at 19:57:59 UTC from IEEE Xplore. Restrictions apply.