

# Low Swing and Column Multiplexed Bitline Techniques for Low-Vmin, Noise-Tolerant, High-Density, 1R1W 8T-bitcell SRAM in 10nm FinFET CMOS

Jaydeep P. Kulkarni, Andres Malavasi, Charles Augustine, Carlos Tokunaga, Jim Tschanz, Muhammad Khellah, Vivek De

Circuit Research Lab, Intel Corporation, Hillsboro, OR

2020 Symposia on VLSI Technology and Circuits

# Outline

- Introduction: 8T SRAM Array
- Proposed Low Swing (LS) Bitline Design
- Proposed Column Multiplexed (CM) Bitline Design
- Measurement Results
- Summary

# Introduction: 1R1W 8T SRAM Bitcell Array



- Decoupled read/write ports for lower Vmin (register files, L1 caches)
- Hierarchical read bitlines for high-performance design
- Large-signal sensing limits the number of bits per LBL, degrading bit density

# Review: High Density, Low Vmin 1R1W 8T SRAM



PU (Pull-Up) = 1Fin, PG (Pass-Gate) = 1Fin, PD (Pull-Down) = 1Fin, RD (Read Port) = 2Fins

Ref: K-H. Koo, VLSI Symposium 2015

- High Density 1-1-1-2-2 fin bitcell with 32b/LBL, large-signal sensing
- Array-level bit density can be increased to 256b/BL using charge-share or asymmetric sense amplifiers, at the expense of increased power

# **Baseline Design: Read Path Timing Diagram**



- Large-signal, single-ended sensing, 1-cycle domino read
- NAND-gate-driven keeper stack is turned OFF during precharge to mitigate aging

# Low Swing Bitline Evolution



- A series NMOS clipper (N<sub>1</sub>) in the read path $\rightarrow$ reduced LBL $C_{\text{DYN}}$
- $C_{\text{DYN}}$ reduction partially offsets the delay degradation caused by the series clipper
- If LS\_LBL drives the NAND, delay improves but noise tolerance is reduced

# Low Swing Bitline with Split Input NAND Keeper-1/2



- NAND inputs for P3 and N3 are split for read delay vs. noise trade-off
- Low swing LBL node drives NAND PMOS ( $P_3$ )  $\rightarrow$  Early keeper turn off

# Low Swing Bitline with Split Input NAND Keeper-2/2



- Transient noise event: Clipper (N<sub>1</sub>) in sub-threshold  $\rightarrow$  shields LBL node
- Split input NAND with NMOS (N<sub>3</sub>) strongly connected to full swing LBL

# Column Multiplexed + Low Swing BL Technique-1/2

![](_page_14_Figure_1.jpeg)

- The statically biased clipper is reconfigured to act as a column multiplexer control
- Split input NAND with 2 keeper control pull-up paths
- Reduced  $C_{DYN}$  due to BL multiplexing + low swing operation

# Column Multiplexed + Low Swing BL Technique-2/2

![](_page_15_Figure_1.jpeg)

- Reduced bitline leakage due to stacking effect of inactive clipper
- Keeper can be downsized due to reduced BL leakage  $\rightarrow$  lower Vmin
- Vmin,  $C_{\text{DYN}}$  savings can be utilized for increased bit density
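
A toy current budget illustrates why multiplexing can relax the keeper. The sketch below assumes the keeper must source at least the worst-case leakage on the node it holds, times a safety margin. With column multiplexing, only the selected sub-LBL leaks directly onto that node, while each unselected sub-LBL sits behind an OFF clipper whose series stack attenuates its leakage. The function name, the per-cell leakage, the stack attenuation factor, and the margin are all hypothetical and are not taken from the test chip.

```python
def keeper_current_needed(n_bits, i_cell, n_sub=1, stack_factor=0.1, margin=2.0):
    """Minimum keeper current for a sensing node fed by n_sub sub-bitlines.

    n_bits       -- total cells attached across all sub-bitlines
    i_cell       -- assumed worst-case leakage per unselected cell (amps)
    n_sub        -- number of multiplexed sub-bitlines (1 = no multiplexing)
    stack_factor -- assumed leakage attenuation through an OFF clipper (0..1)
    margin       -- assumed safety factor on top of the summed leakage
    """
    if n_sub < 1 or n_bits < n_sub:
        raise ValueError("need n_sub >= 1 and n_bits >= n_sub")
    bits_per_sub = n_bits / n_sub
    # The selected sub-bitline leaks straight onto the sensing node.
    selected = bits_per_sub * i_cell
    # Unselected sub-bitlines leak through a stacked, OFF clipper.
    unselected = (n_sub - 1) * bits_per_sub * i_cell * stack_factor
    return margin * (selected + unselected)


baseline = keeper_current_needed(n_bits=64, i_cell=1e-9)
muxed = keeper_current_needed(n_bits=64, i_cell=1e-9, n_sub=2)
print(f"keeper current, no mux : {baseline:.3e} A")
print(f"keeper current, 2:1 mux: {muxed:.3e} A")
```

Under these assumptions, a smaller required keeper current means a weaker keeper for the read pull-down to overcome, which is the direction of the Vmin benefit named in the bullets above.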

#### **Vmin and Read Delay Statistical Simulations**

![](_page_17_Figure_1.jpeg)

- Read Vmin lowered by 60mV for LS BL and by 80mV for LS+CM BL technique
- Lower Vmin across multiple process corners

#### 10nm FinFET Test-chip

![](_page_19_Figure_1.jpeg)

CM1.3

#### **Measured Read Vmin Results**

![](_page_21_Figure_1.jpeg)

- 30mV lower read Vmin with LS BL technique alone @960MHz, 1Mb target
- 40mV lower read Vmin with combined LS+CM BL technique
- Consistent Vmin savings across operating frequency range

#### Measured Bitline Power Results

![](_page_23_Figure_1.jpeg)

- 18% savings for LS BL, 30% savings for LS+CM across the voltage range
- BL power savings increase at lower Vcc as Vcc-Vt swing reduces
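
The trend in the last bullet can be reproduced with a first-order model. As a sketch, assume the energy drawn from the supply per bitline restore is C × Vcc × ΔV, where ΔV is Vcc for a full-swing bitline and about Vcc − Vt for a bitline precharged through an NMOS clipper. The fractional saving is then Vt/Vcc, which grows as Vcc falls. The threshold voltage below is a hypothetical value, the capacitance is assumed equal in both cases, and the model ignores every other contribution to the measured savings.

```python
def swing_limited_saving(vcc: float, vt: float) -> float:
    """Fractional bitline energy saving of a (vcc - vt) swing versus full swing.

    First-order model: energy ~ C * vcc * swing, with C assumed unchanged.
    """
    if not 0.0 < vt < vcc:
        raise ValueError("expected 0 < vt < vcc")
    full_swing = vcc
    low_swing = vcc - vt
    return 1.0 - low_swing / full_swing  # algebraically equal to vt / vcc


# Hypothetical clipper threshold, chosen only for illustration.
VT_ASSUMED = 0.15  # volts

for vcc in (0.9, 0.7, 0.5):
    saving = swing_limited_saving(vcc, VT_ASSUMED)
    print(f"Vcc = {vcc:.1f} V -> modeled saving = {saving:.1%}")
```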

#### Noise Induced Failures in Read-0 Operation

![](_page_24_Figure_1.jpeg)

- Low frequency read-0 test with all '1's stored on unselected bits
- WLVss node voltage of wordline drivers is gradually increased
- Increased BL leakage due to weakly turning ON unselected RWLs
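
The mechanism behind this test can be illustrated with the standard subthreshold relation, in which drain current rises by one decade for every S volts of added gate drive. The sketch below assumes that raising WLVss lifts the gate of every unselected read-port access transistor by the same amount. The subthreshold swing of 70 mV/decade, the per-cell off current, and the count of unselected cells are hypothetical values, not measurements from the test chip.

```python
def leakage_multiplier(wlvss: float, swing_per_decade: float = 0.070) -> float:
    """Factor by which subthreshold leakage grows when the gate rises by wlvss volts."""
    if swing_per_decade <= 0:
        raise ValueError("subthreshold swing must be positive")
    return 10.0 ** (wlvss / swing_per_decade)


def total_bitline_leakage(i_off: float, n_unselected: int, wlvss: float) -> float:
    """Summed leakage of n_unselected read ports (all storing '1') on one bitline."""
    return n_unselected * i_off * leakage_multiplier(wlvss)


# Sweep the wordline-driver ground level, as in the stress test described above.
for wlvss_mv in (0, 35, 70, 140):
    wlvss = wlvss_mv / 1000.0
    total = total_bitline_leakage(i_off=1e-9, n_unselected=31, wlvss=wlvss)
    print(f"WLVss = {wlvss_mv:3d} mV -> x{leakage_multiplier(wlvss):.1f}, total {total:.2e} A")
```

Because the dependence is exponential, a modest rise in WLVss quickly makes the aggregate leakage large enough to compete with the keeper, which is why the sweep serves as a noise-tolerance stress.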

#### **Measured Noise Induced Read-0 Failures**

![](_page_26_Figure_1.jpeg)

- LS+CM BL, with fewer bits per sub-LBL, achieves better noise tolerance than LS BL, although both are better than the baseline
- Sustained noise tolerance improvement even at lower Vcc

#### Measured Vmin, Power Sensitivity to LBL Precharge

![](_page_28_Figure_1.jpeg)

- Vmin reduced further by lowering LBL precharge level (changing Vbias)
- 20-30mV additional read Vmin savings with 100mV lower LBL precharge level
- BL power savings increase across the operating voltage range

#### Measured Noise Sensitivity to LBL Precharge

![](_page_29_Figure_1.jpeg)

- With lower LBL precharge, noise-induced failures are not degraded for WLVss < 50mV and Vcc > 460mV, showing robust noise tolerance

#### Summary: Low Swing and Column Multiplexed Bitline

- Series clipper in read bitline path with split input NAND Keeper
- Further saving by configuring clipper as column multiplexer
- 10nm FinFET CMOS measurements: 1.09Mb HD 1R1W 8T SRAM Array

| Figure of Merit (FoM)        | LS BL | LS+CM BL    |
|------------------------------|-------|-------------|
| Read Vmin savings            | 30mV  | 40mV        |
| Bitline power savings        | 18%   | 30%         |
| Noise tolerance increase     | 44%   | 72%         |
| Extra transistors in LBL I/O | 1     | 5           |
| Array level area overhead    | 0%    | 1.8%        |

- Simultaneous improvement in all FoMs with no (LS BL) or minor (LS+CM BL) area increase