### 國立清華大學 電機工程學系 102 學年度第一學期 ### EE-6250 超大型積體電路測試 VLSI Testing 授課教師: 黃錫瑜 2013 年 Fall Semester 清華大學電機系 超大型積體電路測試 講義 ### What You can Benefit from this Course? ### Values of Acquired Knowledge - Making ICs more testable - Making ICs/Boards/Systems more debuggable - Making ICs Faster Time-to-Market - Making ICs Faster Time-to-Volume ### Academic Training - Testing is a rich field as you will know. - Testing is a good topic for MS/Ph.D. theses. ### Career Development - IC 設計公司 (讓晶片的可測試性更高) - 半導體廠(故障診斷,良率追蹤與分析與改善) - 測試產業 (量產測試之規劃與執行) - 電子系統廠 (系統故障診斷,可靠度分析與改善) Ch1-3 2 ### Design Verification, Testing and Diagnosis ### • Design Verification: Ascertain the design perform its specified behavior ### Testing: Exercise the system and analyze the response to ascertain whether it behaves correctly after manufacturing ### • Diagnosis: To locate the cause(s) of misbehavior after the incorrect behavior is detected Ch1-5 ### Manufacturing Defects ### Material Defects - bulk defects (cracks, crystal imperfections) - surface impurities ### Processing Faults - missing contact windows - parasitic transistors - oxide breakdown ### Time-Dependent Failures - dielectric breakdown - electro-migration ### Packaging Failures - contact degradation - seal leaks Ch1-6 ### Faults, Errors and Failures - Fault: - A physical defect within a circuit or a system - May or may not cause a system failure - Error: - Manifestation of a fault that results in incorrect circuit (system) outputs or states - Caused by faults - Failure: - Deviation of a circuit or system from its specified behavior - Fails to do what it should do - Caused by an error - Fault ---> Error ---> Failure Ch1-7 ### **Reliability Test** - Temperature Related - Hi-Temperature Life Test - Low-Temperature Life Test - Temperature-Cycling Test - Humidity Test - Salt Mist Test - UV (Ultra-Violet) Test - ESD Test - ESD stands for Electro-Static Discharge - Whole Mechanical Test Ch1-8 ### **Detailed Reliability Test Items** ### Temperature Related - Operation: 0°C/120hr ~ 70°C/120hr (商規) - Operation: -40°C/120hr ~ 85°C/120hr (工規) - Storage: -40°C/200hr ~ 85°C/500hr - Junction Temperature: Max. 95°C ### Humidity Test - Operation: 25°C/95% humidity (商規) - Operation: 40°C/95% humidity (工規) - Storage: 85°C/95% humidity ### Salt Mist Test - Salt Water Spray ### UV Test - UV (254nm), 15Ws/cm<sup>2</sup> - X-ray exposure, 0.1Gy/1hr ### ESD Test - For example, For Contact Pads, ±4KV, Human Body Mode ### Whole Mechanical Test - Vibration (15G, 10 to 2KHz), Impact, Torque, Bending, Drop test Ch1-9 4 ### Courses on Agilent 93000 at CIC ### Sample Information: (What to expect from that kind of course) | 上課地點 | 新竹CIC - 訓練教室B ( <u>新竹市科學園區展業—路26號八樓</u> ) | |------|-------------------------------------------------------------------------------------------------------------------------| | 課程說明 | 本課程將介紹如何利用Agilent 93000 SoC Tester來進行數位晶片的測試,<br>課程內容包含loadboard使用方式、測試程式開發、量測結果分析。 | | 課程大綱 | (1) Agilent 93000 SoC Tester 介紹<br>(2) Loadboard 使用方法<br>(3) Test Pattern 轉換<br>(4) Test flow 建立<br>(5) Result Analysis | | | Ch1-11 | ### Purpose of Testing - Verify Manufacturing of Circuit - Improve System Reliability - Diminish System Cost - · Cost of repair - goes up by an order of magnitude each step away from the fab. line Ch1-12 ### Fault Coverage - Fault Coverage T - Is the measure of the ability of a set of tests to detect a given class of faults that may occur on the device under test (DUT) $$T = \frac{\text{No. of detected faults}}{\text{No. of all possible faults}}$$ Ch1-14 ### **Defect Level** ### Defect Level - Is the fraction of the shipped parts that are defective (單位 ppm or ppb) $$DL = 1 - Y^{(1-T)}$$ Y: yield T: fault coverage Ch1-15 8 | DPM v.s. | Yield | and | Coverage | |----------|-------|-----|----------| |----------|-------|-----|----------| | Yield | Fault Coverage | Defective PPM | | |-------|----------------|---------------|--| | 50% | 90% | 67,000 | | | 75% | 90% | 28,000 | | | 90% | 90% | 10,000 | | | 95% | 90% | 5,000 | | | 99% | 90% | 1,000 | | | 90% | 90% | 10,000 | | | 90% | 95% | 5,000 | | | 90% | 99% | 1,000 | | | 90% | 99.9% | 100 | | Ch1-17 ### Why Testing Is Difficult? - Test application time could explode for exhaustive testing of VLSI - For a combinational circuit with 50 inputs, we need $2^{50} = 1.126 \times 10^{15}$ test patterns. - Assume one test per 10<sup>-7</sup>sec, it takes 1.125x10<sup>8</sup>sec = 3.57yrs. to test such a circuit. - Test generation for sequential circuits are even more difficult due to the lack of controllability and observability at flip-flops (latches) - Functional testing - may NOT be able to detect the physical faults Ch1-18 ### DEC Alpha Chip (1994) - 64-bit RISC - 200 MHz - 400 MIPS - 200 Mflops - 16.8 x 13.9 mm<sup>2</sup> die - 0.68 million transistors - 431-pin package - 3.3 V - 30 W power consumption Ch1-19 ### **Functional v.s. Structural Testing** - I/O functional tests inadequate for manufacturing - Exhaustive testing is prohibitively expensive **Question: How to Generate Compact yet High-Quality Test Vectors?** ### **Why Fault Model?** - Fault model identifies target faults - Model faults most likely to occur - Fault model limits the scope of test generation - Create tests only for the modeled faults - Fault model makes effectiveness measurable by experiments - Fault coverage can be computed for specific test patterns to reflect its effectiveness - Fault model makes analysis possible - Associate specific defects with specific test patterns Scientific Study: Hypothesis (Assumption) → Evaluation → Refinement Ch2-3 ### **Fault Modeling** - Fault Modeling - Model the effects of physical defects on the logic function and timing - Physical Defects - Silicon Defects - Photolithographic Defects - Mask Contamination - Process Variation - Defective Oxides ### Common Fault Types Used To Guide Test Generation - Stuck-at Faults - Bridging Faults - Open Faults - Transistor Stuck-On Faults - Delay Faults - IDDQ Faults (Quiescent current at VDD pin) - Memory Faults IDDQ Testing: canary in the coalmine, alarming of un-modeled defects 金絲雀 Ch2-5 ### **Single Stuck-At Fault** ### Test Vector Fault-Free Response 1/0 1/0 stuck-at-0 ### **Assumptions:** - Only One line is faulty - Faulty line permanently set to 0 or 1 - Fault can be at an input or output of a gate ### **Multiple Stuck-At Faults** - Several stuck-at faults occur at the same time - Mostly used in logic diagnosis - For a circuit with k lines - there are 2k single stuck-at faults - there are 3k-1 multiple stuck-at faults - A line could be stuck-at-0, stuck-at-1, or fault-free - One out of 3k resulting circuits is fault-free Ch2-7 ### Why Single Stuck-At Fault Model? - Complexity is greatly reduced - Many different physical defects may be modeled by the same logical single stuck-at fault - Stuck-at fault is technology independent - Can be applied to TTL, ECL, CMOS, BiCMOS etc. - Design style independent - Gate array, standard cell, custom VLSI - Detection capability of un-modeled defects - Empirically, many defects accidentally detected by test derived based on single stuck-at fault - Cover a large percentage of multiple stuck-at faults Single SA model survives well (due to its simplicity and effectiveness) ### **Multiple Faults** - Multiple stuck-fault coverage by single-fault tests of combinational circuit: - 4-bit ALU (Hughes & McCluskey, ITC-84) All double and most triple-faults covered. - Large circuits (Jacob & Biswas, ITC-87) Almost 100% multiple faults covered for circuits with 3 or more outputs. Ch2-9 ### **Bridging Faults** - Two or more normally distinct points (lines) are shorted together erroneously - Logic effect depends on technology - Wired-AND for TTL - Wired-OR for ECL - CMOS? ## Pridging Faults For CMOS Logic • The result - could be AND-bridging or OR-bridging - depends on the inputs VDD (f and g) are AND-bridging fault pull to VDD GND Ch2-11 # CMOS Transistor Stuck-Open (I) • Transistor stuck-open • May cause the output to be floating • The fault exhibits sequential behavior • Need two-pattern test (to set it to a known value first) Responses: Fault-free 0→1 Faulty 0→0 Ch2-13 ### **Summary of Stuck-Open Faults** - First Report: - Wadsack, Bell System Technology, J., 1978 - Recent Results - Woodhall et. al, ITC-87 (1-micron CMOS chips) - 4552 chips passed the test - 1255 chips (27.57%) failed tests for stuck-at faults - 44 chips (0.97%) failed tests for stuck-open faults - 4 chips with stuck-open faults passed tests for stuck-at faults - Conclusion - Stuck-at faults are about 20 times more frequent than stuckopen faults - About 91% of chips with stuck-open faults may also have stuck-at faults - Faulty chips escaping tests for stuck-at faults = 0.121% Ch2-15 ### **Functional Faults** - Fault effects modeled at a higher level than logic for functional modules, such as - Decoder - Multiplexers - Adders - Counters - ROMs ### **Functional Faults of Decoders** - **f(L<sub>i</sub>/L<sub>k</sub>):** One active output, but wrong one - Instead of input line Li, Lk is selected - $f(L_i/L_{i+k})$ : More than one active outputs - In addition to line L<sub>i</sub>, L<sub>k</sub> is also selected - **f(L<sub>i</sub>/0):** No active output - None of the lines is selected Ch2-17 ### **Memory Faults** - Parametric Faults - Any fault that causes the response to deviate from its fault-free nominal value by some amount - Ex. A cell with parametric delay fault (with for example 93% more than normal) - Due to all kinds of factors like PVT variation - Functional Faults - Stuck Faults in Address Register, Data Register, and Address Decoder - Cell Stuck Faults - Adjacent Cell Coupling Faults - Pattern-Sensitive Faults ### **Memory Faults** - Pattern-sensitive faults: the presence of a faulty signal depends on the signal values of the neighboring cells - Mostly in DRAMs | 0<br>0<br>0 | 0 | 0 | a=b=0 <b>→</b> d= | |-------------|---|---|----------------------------| | 0 | d | b | a=b=0 → d=0<br>a=b=1 → d=1 | | 0 | а | 0 | u===1 2 u=1 | - Adjacent cell coupling faults - Pattern sensitivity between a pair of cells Ch2-19 ### **Memory Testing** - Test could be time-consuming - The length of the test sequence for memory testing could be prohibitively long - Example: - A pattern sensitive test is 5n<sup>2</sup> long for an n-bit RAM - Testing a 1-M bit chip at 10ns pattern would take 14 hours - For a 64-M bit chip, it would take 6 years ### **PLA Faults** - Stuck-at Faults - Cross-point Faults - Extra/Missing Transistors - Bridging Faults - Break Faults Ch2-21 Ch2-22 ### • s-a-0 & s-a-1 faults - on inputs, input inverters, product lines, and outputs are easy to simulate in its gate-level model A B C f1 f2 A B C Gate-level model A B C Gate-level model AND-Array OR-Array - Missing Crosspoint in OR-array - Disappearance fault ### **Summary of PLA Faults** - Cross-Point Faults - 80 ~ 85% covered by stuck-fault tests - Layout-dependence in folded PLA - Bridging Faults - 99% covered by stuck-fault tests - Layout-dependence in all PLAs - (Ref: Agrawal & Johnson, ICCD-86) Ch2-25 ### **Delay Testing** - Chip with Timing Defects - may pass the DC stuck-fault testing, but fail when operated at the system speed - For example, a chip may pass the test under 10 MHz operation, but fail under 100 MHz - Delay Fault Models - Gate-Delay Fault - Path-Delay Fault ### **Gate-Delay Fault (I)** ### Slow to Rise $-\overline{x}$ is slow to rise when channel resistance R1 is abnormally high Ch2-27 ### **Gate-Delay Fault (II)** ### Test Based on Gate-Delay Fault May not detect those delay faults that result from the accumulation of a number of small incremental delay defects along a path !! (Disadvantage) ### **Path-Delay Fault** - Associated with a Path (e.g., A-B-C-Z) - Whose delay exceeds the clock interval - More complicated than gate-delay fault - Because the number of paths grows exponentially Ch2-29 ### **Fault Detection** - Fault Activation - Fault Propagation ### **Definition Of Fault Detection** - A test (vector) t detects a fault f iff - t detects $f \Leftrightarrow z(t) \neq z_f(t)$ - Example The test $(x_1,x_2,x_3) = (100)$ detects f because $z_1(100)=0$ while $z_{1f}(100)=1$ Ch2-31 ### **Fault Detection Requirement** - A test t that detects a fault f - (1) Activate f (or generate a fault effect at the site of the fault) - (2) Propagate the fault effect to a primary output w - Sensitized Line: - A line whose faulty value is different from its fault-free one is said to be sensitized by the test in the faulty circuit - Sensitized Path: - A path composed of sensitized lines is called a sensitized path z (1011)=0 $z_f$ (1011)=1 1011 detects the fault f (G $_2$ stuck-at 1) $v/v_f$ : v = signal value in the fault free circuit $v_f = \text{signal value in the faulty circuit}$ Ch2-33 ### **Detectability** - A fault f is said to be detectable - if there exists a test t that detects f; otherwise, f is an undetectable fault - For an undetectable fault f - No test can simultaneously activate f and create a sensitized path to a primary output ### **Undetectable Fault** - G₁ output stuck-at-0 fault is undetectable - Undetectable faults do not change the function of the circuit - The related circuit can be deleted to simplify the circuit Ch2-35 ### **Test Set** - Complete detection test set: - A set of tests that detect any detectable faults in a class of faults - The quality of a test set - is measured by fault coverage - Fault coverage: - Fraction of faults that are detected by a test set - The fault coverage - can be determined by fault simulation - >95% is typically required for single stuck-at fault model - >99.9% in IBM ### **Fault Collapsing** - Fault Equivalence - Fault Dominance - Checkpoint Theorem ### **Fault Equivalence** ### Distinguishing test – A test t distinguishes faults $\alpha$ and $\beta$ if $$Z_{\alpha}(t) \oplus Z_{\beta}(t) = 1$$ ### • Equivalent Faults - Two faults, $\alpha$ & $\beta$ are said to be equivalent in a circuit, iff the function under $\alpha$ is equal to the function under $\beta$ for any input combination (sequence) of the circuit. - No test can distinguish between $\alpha$ and $\beta$ Ch2-39 ### **Fault Equivalence** - AND gate: - all s-a-0 faults are equivalent - OR gate: - all s-a-1 faults are equivalent - NAND gate: - all the input s-a-0 faults and the output s-a-1 faults are equivalent - all input s-a-1 faults and the output s-a-0 faults are equivalent - Inverter: - input s-a-1 and output s-a-0 are equivalent input s-a-0 and output s-a-1 are equivalent ### **Equivalence Fault Collapsing** n+2 instead of 2(n+1) faults need to be considered for n-input gates Ch2-41 ### **Equivalent Fault Group** - In a combinational circuit - Many faults may form an equivalent group - These equivalent faults can be found by sweeping the circuit from the primary outputs to the primary inputs Three faults shown are equivalent! #### **Fault Dominance** - AND gate: - Output s-a-1 dominates any input s-a-1 - NAND gate: - Output s-a-0 dominates any input s-a-1 - OR gate: - Output s-a-0 dominates any input s-a-0 - NOR gate: - Output s-a-1 dominates any input s-a-0 - Dominance fault collapsing: - The reduction of the set of faults to be analyzed based on dominance relation Ch2-45 #### Stem v.s. Branch Faults C: stem of a multiple fanout A & B: branches Detect A sa1: $$z(t) \oplus z_f(t) = (\mathbf{C}\mathbf{D} \oplus \mathbf{C}\mathbf{E}) \oplus (\mathbf{D} \oplus \mathbf{C}\mathbf{E}) = \mathbf{D} \oplus \mathbf{C}\mathbf{D} = \mathbf{1}$$ $$\Rightarrow$$ (C = 0, D = 1) • Detect C sa1: $$z(t) \oplus z_f(t) = (\mathbf{C}\mathbf{D} \oplus \mathbf{C}\mathbf{E}) \oplus (\mathbf{D} \oplus \mathbf{E}) = \mathbf{1}$$ $$\Rightarrow$$ (C = 0, D = 1) or (C = 0, E = 1) - Hence, C sa1 dominates A sa1 - Similarly - C sa1 dominates B sa1 - C sa0 dominates A sa0 - C sa0 dominates B sa0 - In general, there might be no equivalence or dominance relations between stem and branch faults Ch2-46 #### **Analysis of a Single Gate** | AB | C | A<br>sa1 | B<br>sa1 | C<br>sa1 | A<br>sa0 | B<br>sa0 | C<br>sa0 | |----|---|----------|----------|----------|----------|----------|----------| | 00 | 0 | | | 1 | | | | | 01 | 0 | 1 | | 1 | | | | | 10 | 0 | | 1 | 1 | | | | | 11 | 1 | | | | 0 | 0 | 0 | - Fault Equivalence Class - **Negligible fault** - (A s-a-0, B s-a-0, C s-a-0) - Fault Dominance Relations - (C s-a-1 > A s-a-1) and (C s-a-1 > B s-a-1) - Faults that can be ignored: - A s-a-0, B s-a-0, and C s-a-1 Ch2-47 #### **Fault Collapsing** - Equivalence + Dominance - For each n-input gate, we only need to consider n+1 faults during test generation # Rule When fault α dominates fault β, then an arrow is pointing from α to β Application Find out the transitive dominance relations among faults a s-a-0 d s-a-0 d s-a-1 ch2-49 #### **Prime Fault** $\square$ $\alpha$ is a prime fault if every fault that is dominated by $\alpha$ is also equivalent to $\alpha$ Ch2-51 #### Why Fault Collapsing? - Memory and CPU-time saving - Ease testing generation and fault simulation \* 30 total faults $\rightarrow$ 12 prime faults #### **Checkpoint Theorem** #### Checkpoints for test generation - A test set detects every fault on the primary inputs and fanout branches is complete - I.e., this test set detects all other faults too - Therefore, primary inputs and fanout branches form a sufficient set of checkpoints in test generation - In fanout-free combinational circuits, primary inputs are the sole checkpoints Ch2-53 #### Why Inputs + Branches Are Enough? #### Example - Checkpoints are marked in blue - Sweeping the circuit from PI to PO to examine every gate, e.g., based on an order of (A->B->C->D->E) - For each gate, output faults are detected if every input fault is detected #### Fault Collapsing + Checkpoint #### • Example: - 10 checkpoint faults - a s-a-0 <=> d s-a-0 , c s-a-0 <=> e s-a-0 b s-a-0 > d s-a-0 , b s-a-1 > d s-a-1 - 6 tests are enough #### Outline - Fault Simulation for Comb. Ckt - Basic of Logic Simulation - Parallel Fault Simulation - Deductive Fault Simulation - Concurrent Fault Simulation - Approximation Approach - Techniques for Sequential Circuits Note: Comb. Ckt: Combinational Circuits #### Why Fault Simulation? - To evaluate the quality of a test set - I.e., to compute its fault coverage - Part of an ATPG program - A vector usually detects multiple faults - Fault simulation is used to compute the faults accidentally detected by a particular vector - To construct fault-dictionary - For post-testing diagnosis - To Evaluate the fault coverage of a functional patterns Ch3-3 2 #### Some Basics for Logic Simulation - · For fault simulation purpose, - mostly the gate delay is assumed to be zero unless the delay faults are considered. Our main concern is the functional faults - The logic values - can be either two (0, 1) or three values (0, 1, X) - Two simulation mechanisms: - Oblivious compiled-code: - circuit is translated into a program and all gates are executed for each pattern. (may have redundant computation) - Interpretive event-driven: - Simulating a vector is viewed as a sequence of value-change events propagating from the PI's to the PO's - Only those logic gates affected by the events are re-evaluated Ch3-5 #### Compiled-Code Simulation - Compiled code - LOAD A /\* load accumulator with value of A \*/ - AND B /\* calculate A and B \*/ - AND C /\* calculate E = AB and C \*/ - OR D /\* calculate $Z = E \text{ or } D^*$ / - STORE Z /\* store result of Z\*/ #### **Characteristics of Fault Simulation** - Fault activity with respect to fault-free circuit - is often sparse both in time and in space. - For example - F1 is not activated by the given pattern, while F2 affects only the lower part of this circuit. #### Fault Simulation Techniques - Serial Fault Simulation - trivial single-fault single-pattern - Parallel Fault Simulation - Deductive Fault Simulation - Concurrent Fault Simulation #### Parallel Fault Simulation - Simulate multiple circuits at a time: - The inherent parallel operation of computer words to simulate faulty circuits in parallel with fault-free circuit - The number of faulty circuits, or faults, can be processed simultaneously is limited by the word length, e.g., 32 circuits for a 32-bit computer - Extra Cost: - An event, a value-change of a single fault or fault-free circuit leads to the computation of the entire word - The fault-free logic simulation is repeated for each pass Ch3-11 6 #### **Deductive Fault Simulation** - Simulate all faulty circuits in one pass - For each pattern, sweep the circuit from PI's to PO's. - During the process, <u>a list of faults</u> is associated with each line - The list contains faults that would produce a fault effect on this line - The union fault list at every PO contains the detected faults by the simulated input vector - Major operation: fault list propagation - Related to the gate types and values - The size of the list may grow dynamically, leading to a potential memory explosion problem Ch3-15 #### Controlling Value of a Logic Gate Whenever there is a '0' in the inputs, Z will be '0' - → Controlling value for NAND gate is '0' - → Non-Controlling value is '1' | Gate Type | Controlling | Non-Controlling | | | |-----------|-------------|-----------------|--|--| | | Value | Value | | | | AND | <b>'0'</b> | <b>'1'</b> | | | | OR | '1' | <b>'0'</b> | | | | NAND | <b>'0'</b> | '1' | | | | NOR | '1' | <b>'</b> 0' | | | #### **Example: Fault List Propagation** Fault-free simulation results: {A=0, B=0, C=0} Q: What is the detected fault list at line C? (Reasoning) To create a fault effect at line C, we need {A=1, B=1} - → which means that we need a fault effect at A as well as B - → It can be achieved in faulty circuits LA · LB - → Also C/1 is a new fault to be included in the fault list of C LA, LB, LC are fault list propagated to their respective lines LA is the set of all faults not in LA Ch3-17 #### **Example: Fault List Propagation** LA, LB, LC are detected fault list at their respective lines Consider a two-input AND-gate: Non-controlling case: **Case 1:** A=1, B=1, C=1 at fault-free, $LC = LA + LB + \{C/0\}$ Controlling cases: Case 2: A=1, B=0, C=0 at fault-free, $LC = \overline{LA} \cdot LB + \{C/1\}$ Case 3: A=0, B=0, C=0 at fault-free, $LC = LA \cdot LB + \{C/1\}$ LA is the set of all faults not in LA • Consider 3 faults: B/1, F/0, and J/0 Fault List at PI's: $$LB = \{B/1\}, LF = \{F/0\}, LA = \emptyset, LC = LD = \{B/1\}$$ Ch3-19 #### Example: Deductive Simulation (2) • Consider 3 faults: B/1, F/0, and J/0 Fault Lists at G and E: $$\begin{split} LB &= \{B/1\}, \, LF = \{F/0\}, \, LA = \varphi, \, LC = LD = \{B/1\}, \\ LG &= (\overline{L}A * LC) = \{B/1\} \\ LE &= (LD) = \{B/1\} \end{split}$$ # • Consider 3 faults: B/1, F/0, and J/0 Computed Fault List at H: $$LB = \{B/1\}, LF = \{F/0\}, LC=LD = \{B/1\}, LG = \{B/1\}, LE = \{B/1\}$$ $LH = (LE + LF) = \{B/1, F/0\}$ Ch3-21 #### Example: Deductive Simulation (4) • Consider 3 faults: B/1, F/0, and J/0 Final Fault List at the output J: LB = {B/1}, LF = {F/0}, LC=LD = {B/1}, LG = {B/1}, LE = {B/1} LH = {B/1, F/0}, LJ = ( $$\overline{L}G \cdot LH$$ ) {F/0, J/0} ### Example: Even-Driven Deductive Fault Simulation • When A changes from 1 to 0 Event-driven operation: $$\begin{split} LB &= \{B/1\}, \ LF = \{F/0\}, \ LA = \varphi \\ LC &= LD = \{B/1\}, \ LG = \varphi, \\ LE &= \{B/1\}, \ LH = \{B/1, F/0\}, \ LJ = \{B/1, F/0, J/0\} \end{split}$$ Ch3-23 #### **Concurrent Fault Simulation** - Simulate all faulty circuits in one pass: - Each gate retains a list of fault copies, each of which stores the status of a fault exhibiting difference from fault-free values - Simulation mechanism - is similar to the conceptual fault simulation except that only the dynamical difference w.r.t. fault-free circuit is retained. - Theoretically, - all faults in a circuit can be processed in one pass - Practically, - memory explosion problem may restrict the number of faults that can be processed in each pass #### Outline - Fault Simulation for Comb. Circuits - Approximation Approach - Critical Path Tracing - Probabilistic Approach - Techniques for Sequential Circuits #### Sensitive Input and Critical Path - Sensitive Input of a gate: - A gate input i is sensitive if complementing the value of i changes the value of the gate output - Critical line - Assume that the fault-free value of w is v in response to t - A line w is critical w.r.t. a pattern t iff t detects the fault w stuck-at $\overline{v}$ - · Critical paths - Paths consisting of critical lines only Ch3-33 #### **Basics of Critical Path Tracing** - A gate input i is critical w.r.t. a pattern t if - (1) the gate output is critical and - (2) *i* is a sensitive input to *t* - Use recursion to prove that *i* is also critical - In a fanout-free circuit - the criticality of a line can be determined by backward traversal to the sensitive gate's inputs from PO's, in linear time #### **Analysis of Critical Path Tracing** - Three-step Procedure: - Step 1: Fault-free simulation - Step 2: Mark the sensitive inputs of each gate - Step 3: Identification of the critical lines by backward critical path tracing) - Complexity is O(G) - Where G is the gate count - for fanout-free circuits --- very rare in practice - Application - Applied to fanout-free regions, while stem faults are still simulated by parallel-pattern fault simulator. #### **Anomaly of Critical Path Tracing** • Stem criticality is hard to infer from branches. E.g. is B/1 detectable by the given pattern? - It turns out that B/1 is not detectable even though both C and D are critical, because their effects cancel out each other at gate J, (i.e., fault masking problem) - There is also a so-called multiple path sensitization problem. Ch3-37 #### Multiple Path Sensitization Both C and D are not critical, yet B is critical and B/0 can be detected at J by multiple path sensitization. #### Parallel and Distributed Simulation #### · To share the fault simulation effort by a number of processors either tightly connected as in parallel computation or loosely connected as in distributed computation. #### The speed-up with respect to the processor number depends on the degree of duplicated computation, and the communication overhead among processors. #### The distributed simulation - on a cluster of networked workstations is especially appealing. Ch3-39 #### Distributed Simulation Techniques #### Fault Partition - Distributes faults among many processors. - Works relatively well for both combinational and sequential circuits. #### · Pattern Partition - Distributes patterns among processors. - no duplicated logic simulation - Works well for combinational circuits. #### Circuit Partition - Difficult to achieve synchronization without incurring excessive communication overhead. #### **Fault Grading** - Approximate fault coverage - Can be obtained in much shorter computational time than regular fault simulation. - Not suitable for high fault-coverage requirement. - Typical fault grading methods: - Toggle test, e.g. DATAS - Detection probability computation, e.g. STAFAN - Fault sampling - estimate from a selected subset of total faults - Test set sampling - estimate from a subset of complete test sequence #### **STAFAN** - Compute fault detection probability from logic simulation. - $-d_l$ = detection probability of s-a-0 on l = C1(l)O(l) - $-d_l$ = detection probability of s-a-1 on l = CO(l)O(l) $$C0(l) = \frac{0 - count}{n}, \quad C1(l) = \frac{1 - count}{n}$$ $$S(l) = \frac{sensitization - count}{n}$$ $$O(l) = S(l)O(m)$$ - m is the immediate successor of $\boldsymbol{l}$ - observability can be computed backwards from POs Ch3-43 #### STAFAN (cont.) $$d_f^n = 1 - (1 - d_f)^n$$ n is the no. of vectors Statistical Fault Coverage $$=\frac{\sum_{\Phi} d_f^n}{|\Phi|}$$ the summation of each fault's detection probability $\Phi$ is the set of faults of interest More sophisticated than toggle test with same computation complexity #### Outline - Fault Simulation for Comb. Circuits - Approximation Approach - Toggle Counting - Critical Path Tracing - Probabilistic Approach - Techniques for Sequential Circuits Ch3-45 ## Fault Grading for Functional Input Sequence #### **Inputs:** - (1) A test application program - (2) A sequential design **Output: The fault coverage** **Application: High-Performance CPU Designs** Major challenge: often too time-consuming #### Hypertrophic Faults - A hypertrophic fault - Is a fault that diverges from the fault-free circuit with a large number of Xs, which usually is a stuck-at fault occurring at a control line and thus prevents the circuit initialization - A small number of hypertrophic faults - account for a large percentage of fault events and CPU time - These faults are sometimes dropped - as potentially detected faults to reduce simulation time. However, the resultant fault coverage then becomes approximate A potentially detected fault is a fault detected only when the circuit is powered on in certain states, not every state. Ch3-49 #### Fault Emulation We can utilize FPGA to speed up the sequential fault grading #### Fault Injection Should Be Efficient! #### Fault Injection - Is to convert a fault-free FPGA implementation to a faulty one - If not efficient, could become the new bottleneck #### • (1) Static Fault Injection Directly changes the configuration of the fault-free implementation to a faulty one #### • (2) Dynamic Fault Injection - Do not change the configuration directly - Fault inject is injected through the control of some hardware originally built-in to the netlist #### 國立清華大學電機系 #### **EE-6250** 超大型積體電路測試 VLSI Testing # Chapter 4 Automatic Test Pattern Generation #### **General ATPG Flow** - ATPG (Automatic Test Pattern Generation) - Generate a set of vectors for a set of target faults - Basic flow Initialize the vector set to NULL #### Repeat Generate a new test vector **Evaluate fault coverage for the test vector** If the test vector is acceptable, then add it to the vector set Until required fault coverage is obtained - To accelerate the ATPG - Random patterns are often generated first to detect easyto-detect faults, then a deterministic TG is performed to generate tests for the remaining faults ch4-2 ## **Combinational ATPG** - Test Generation (TG) Methods - Based on Truth Table - Based on Boolean Equation - Based on Structural Analysis - Milestone Structural ATPG Algorithms - D-algorithm [Roth 1967] - 9-Valued D-algorithm [Cha 1978] - **PODEM** [Goel 1981] - FAN [Fujiwara 1983] ch4-3 Ex: How to generate tests for the stuck-at 0 fault (fault $\alpha$ )? | abc | f | $\textbf{f}\alpha$ | |-----|---|--------------------| | 000 | 0 | 0 | | 001 | 0 | 0 | | 010 | 0 | 0 | | 011 | 0 | 0 | | 100 | 0 | 0 | | 101 | 1 | 1 | | 110 | 1 | 0 | | 111 | 1 | 1 | ch4-5 x stuck-at 0 ## **Test Generation Methods** (Using Boolean Equation) f = ab+ac, $f\alpha = ac$ $T_{\alpha}$ = the set of all tests for fault $\alpha$ $= ON_set(f \oplus f\alpha)$ = $ON_set(f) * OFF_set(f\alpha) + OFF_set(f) * ON_set(f\alpha)$ = $\{(a,b,c) \mid (ab+ac)(ac)' + (ab+ac)'(ac) = 1\}$ Boolean equation $= \{(a,b,c) \mid abc'=1\}$ $= \{ (110) \}.$ High complexity !! Since it needs to compute the faulty function for each fault. ON\_set(f): All input combinations to which f evaluates to 1. OFF\_set(f): All input combinations to which f evaluates to 0. Note: a function is characterized by its ON\_SET ### **Boolean Difference** - Physical Meaning of Boolean Difference - For a logic function $F(X)=F(x_1,...,x_i,...,x_n)$ , find all the input combinations that make a value-change at $x_i$ also cause a value-change at F. - Logic Operation of Boolean Difference - The Boolean difference of F(X) w.r.t. input xi is $$\begin{aligned} dF(x)/dx_i &= F_i(0) \oplus F_i(1) = F_i(0) \cdot F_i(1)' + F_i(0)' \cdot F_i(1) \\ & \text{Where} \\ F_i(0) &= F(x_1, ..., 0, ..., x_n) \\ F_i(1) &= F(x_1, ..., 1, ..., x_n) \end{aligned}$$ • Illustrations of Boolean Difference .5 # Test Generation By Boolean Difference (con't) #### Case 2: Faults are present at internal lines. G(i.e., F with h floating) = h + ac dG/dh = G(h=0) $$\oplus$$ G(h=1) = (ac $\oplus$ 1) = (a'+c') Test-set for h s-a-1 is $$\{ (a,b,c)| \ h' \bullet (a'+c')=1 \ \} = \{ \ (a,b,c)| \ (a'+b') \bullet (a'+c')=1 \ \} = \{ \ (0xx), \ (x00) \ \}.$$ Test-set for h s-a-0 is $$\{(a,b,c)| \frac{h}{t} \cdot \underline{(a'+c')} = 1\} = \{(110)\}.$$ For fault activation For fault sensitization ch4-11 ## **Outline** - Test Generation (TG) Methods - Based on Truth Table - Based on Boolean Equation - Based on Structural Analysis - D-algorithm [Roth 1967] - 9-Valued D-algorithm [Cha 1978] - **PODEM** [Goel 1981] - FAN [Fujiwara 1983] # **Test Generation Method** (From Circuit Structure) - Two basic goals - (1) Fault activation (FA) - (2) Fault propagation (FP) - Both of which requires Line Justification (LJ), I.e., finding input combinations that force certain signals to their desired values - Notations: - 1/0 is denoted as D, meaning that good-value is 1 while faulty value is 0 - Similarly, 0/1 is denoted D' - Both D and D' are called fault effects (FE) ch4-13 ## **Common Concepts for Structural TG** - Fault activation - Setting the faulty signal to either 0 or 1 is a Line Justification problem - Fault propagation - (1) select a path to a PO → decisions - (2) Once the path is selected → a set of line justification (LJ) problems are to be solved - Line Justification - Involves decisions or implications - Incorrect decisions: need backtracking To justify c=1 $\rightarrow$ a=1 and b=1 (implication) To justify c=0 $\rightarrow$ a=0 or b=0 (decision) #### **Branch-and-Bound Search** #### Test Generation - Is a branch-and-bound search - Every decision point is a branching point - If a set of decisions lead to a conflict (or bound), a backtrack is taken to explore other decisions - A test is found when - (1) fault effect is propagated to a PO - (2) all internal lines are justified - No test is found after all possible decisions are tried → Then, target fault is undetectable - Since the search is exhaustive, it will find a test if one exists For a combinational circuit, an undetectable fault is also a redundant fault $\rightarrow$ Can be used to simplify circuit. ## **Implications** ### Implications - Computation of the values that can be uniquely determined - Local implication: propagation of values from one line to its immediate successors or predecessors - Global implication: the propagation involving a larger area of the circuit and re-convergent fanout #### Maximum Implication Principle - Perform as many implications as possible - It helps to either reduce the number of problems that need decisions or to reach an inconsistency sooner ch4-19 | | 484E12.6.2 | | | STAR | | |----------|-----------------------------|---------------------------------|------------|----------------------------|-----------------| | Decision | Implication | Comments | 1 | ı | ı | | | a=0<br>h=1<br>b=1 | Active the fault Unique D-drive | e=1 | k=D'<br>e'=0<br>j=1 | Propagate via k | | | c=1<br>g=D | | l=1<br>m=1 | , | Propagate via n | | | i=D'<br>d'=0 | Propagate via i | | n=D<br>f'=0<br>f=1 | | | j=1 | | Propagate via n | | m=D' | Contradiction | | | n=D<br>e' <b>=</b> 0<br>e=1 | | f=1 | m=D'<br>f'=0<br>l=1<br>n=D | Propagate via m | ## 9-Value D-Algorithm - Logic values (fault-free / faulty) - {0/0, 0/1, <mark>0/u</mark>, 1/0, 1/1, <mark>1/u</mark>, u/0, u/1, u/u}, - where 0/u={0,D'}, 1/u={D,1}, u/0={0,D}, u/1={D',1}, u/u={0,1,D,D'}. ## Advantage: - Automatically considers multiple-path sensitization, thus reducing the amount of search in D-algorithm - The speed-up is NOT very significant in practice because most faults are detected through singlepath sensitization # Final Step of 9-Value D-Algorithm #### To derive the test vector - A = $(0/1) \rightarrow 0$ (take the fault-free one) - B = (1/u) → 1 - C = (1/u) → 1 - D = (u/1) → 1 - E = (u/1) → 1 - F = (u/1) → 1 #### The final vector - (A,B,C,D,E,F) = (0, 1, 1, 1, 1, 1) #### **Outline** - Test Generation (TG) Methods - Based on Truth Table - Based on Boolean Equation - Based on Structural Analysis - D-algorithm [Roth 1967] - 9-Valued D-algorithm [Cha 1978] - **PODEM** [Goel 1981] - FAN [Fujiwara 1983] ch4-33 # PODEM: Path-Oriented DEcision Making - Fault Activation (FA) and Propagation (FP) - lead to sets of Line Justification (LJ) problems. The LJ problems can be solved via value assignments. - In D-algorithm - TG is done through indirect signal assignment for FA, FP, and LJ, that eventually maps into assignments at PI's - The decision points are at internal lines - The worst-case number of backtracks is exponential in terms of the number of decision points (e.g., at least 2<sup>k</sup> for k decision nodes) - In PODEM - The test generation is done through a sequence of direct assignments at PI's - Decision points are at PIs, thus the number of backtracking might be fewer ## **Search Space of PODEM** - Complete Search Space - A binary tree with 2<sup>n</sup> leaf nodes, where n is the number of Pl's - Fast Test Generation - Need to find a path leading to a SUCCESS terminal quickly # Objective() and Backtrace() #### PODEM - Also aims at establishing a sensitization path based on fault activation and propagation like D-algorithm - Instead of justifying the signal values required for sensitizing the selected path, objectives are setup to guide the decision process at PI's - Objective - is a signal-value pair (w, v<sub>w</sub>) - Backtrace - Backtrace maps a desired objective into a PI assignment that is likely to contribute to the achievement of the objective - Is a process that traverses the circuit back from the objective signal to PI's - The result is a PI signal-value pair $(x, v_x)$ 往輸入端追蹤 No signal value is actually assigned during backtrace! # **Objective Routine** - Objective Routine Involves - The selection of a D-frontier, G - The selection of an unspecified input gate of G ``` Objective() { /* The target fault is ws-a-v*/ /* Let variable obj be a signal-value pair */ if (the value of w is x) obj = (w, v'); else { select a gate (G) from the D-frontier; select an input (j) of G with value x; c = controlling value of G; obj = (j, c'); } return (obj); } ch4-37 ``` ## 後追蹤 Backtrace Routine - Backtrace Routine - Involves finding an all-x path from objective site to a PI, I.e., every signal in this path has value x ``` Backtrace(w, v<sub>w</sub>) { /* Maps objective into a PI assignment */ G = w; /* objective node */ v = v<sub>w</sub>; /* objective value */ while (G is a gate output) { /* not reached PI yet */ inv = inversion of G; select an input (j) of G with value x; G = j; /* new objective node */ v = v⊕inv; /* new objective value */ } /* G is a PI */ return (G, v); } ``` ## **Decision Tree in PODEM** - Decision node: the PI selected through backtrace for value assignment - Branch: the value assignment to the selected PI ch4-45 # **Terminating Conditions** ## D-algorithm - Success: - (1) Fault effect at an output (D-frontier may not be empty) - (2) J-frontier is empty - Failure: - (1) D-frontier is empty (all possible paths are false) - (2) J-frontier is not empty #### PODEM - Success: - · Fault effect seen at an output - Failure: - Every PI assignment leads to failure, in which D-frontier is empty while fault has been activated ## **PODEM: Recursive Algorithm** ``` PODEM () /* using depth-first-search */ If(error at PO) return(SUCCESS); If(test not possible) return(FAILURE); (k, v_k) = Objective(); /* choose a line to be justified */ (j, v_i) = Backtrace(k, v_k); /* choose the PI to be assigned */ /* make a decision */ Imply (j, v<sub>i</sub>); If ( PODEM()==SUCCESS ) return (SUCCESS); Imply (j, v<sub>i</sub>'); /* reverse decision */ If ( PODEM()==SUCCESS ) return(SUCCESS); Imply (j, x); What PI to assign? Return (FAILURE); end Recursive-call Recursive-call If necessary ch4-47 ``` ## **Overview of PODEM** #### PODEM - examines all possible input patterns implicitly but exhaustively (branch-and-bound) for finding a test - It is complete like D-algorithm (I.e., will find one if a test exists) #### Other Key Features - No J-frontier, since there are no values that require justification - No consistency check, as conflicts can never occur - No backward implication, because values are propagated only forward - Backtracking is implicitly done by simulation rather than by an explicit and time-consuming save/restore process - Experimental results show that PODEM is generally faster than the D-algorithm ## The Selection Strategy in PODEM - In Objective() and Backtrace() - Selections are done arbitrarily in original PODEM - The algorithm will be more efficient if certain guidance used in the selections of objective node and backtrace path - Selection Principle - Principle 1: Among several unsolved problems - → Attack the hardest one - Ex: to justify a '1' at an AND-gate output - Principle 2: Among several solutions for solving a problem - → Try the easiest one - Ex: to justify a '1' at OR-gate output ch4-49 ## **Controllability As Guidance** - Controllability of a signal w - CY1(w): the probability that line w has value 1. - CYO(w): the probability that line w has value 0. - Example: - f = ab - Assume CY1(a)=CY0(a)=CY1(b)=CY0(b)=0.5 - $\rightarrow$ CY1(f)=CY1(a)xCY1(b)=0.25, - $\rightarrow$ CY0(f)=CY0(a)+CY0(b)-CY0(a)xCY0(b)=0.75 - Example of Smart Backtracing - Objective (c, 1) $\rightarrow$ choose path c $\rightarrow$ a for backtracing - Objective (c, 0) $\rightarrow$ choose path c $\rightarrow$ a for backtracing ## **Testability Analysis** #### Applications - To give an early warning about the testing problems that lie ahead - To provide guidance in ATPG #### Complexity Should be simpler than ATPG and fault simulation, i.e., need to be linear or almost linear in terms of circuit size #### Topology analysis - Only the structure of the circuit is analyzed - No test vectors are involved - Only approximate, reconvergent fanouts cause inaccuracy ch4-51 #### SCOAP (Sandia Controllability/Observability Analysis Program) #### Computes six numbers for each node N - CC<sup>0</sup>(N) and CC<sup>1</sup>(N) - Combinational 0 and 1 controllability of a node N - SC<sup>0</sup>(N) and SC<sup>1</sup>(N) - Sequential 0 and 1 controllability of a node N - CO(N) - Combinational observability - SO(N) - Sequential observability 值越大→代表越困難 ## **Controllability Measure (con't)** - CC<sup>0</sup>(N) and CC<sup>1</sup>(N) - The number of combinational nodes that must be assigned values to justify a 0 or 1 at node N - SC<sup>0</sup>(N) and SC<sup>1</sup>(N) - The number of sequential nodes that must be assigned values to justify a 0 or 1 at node N $$\begin{split} &CC^0(Y) = CC^0(x1) + CC^0(x2) + CC^0(x3) + 1 \\ &CC^1(Y) = min \left[ \ CC^1(x1), \ CC^1(x2), \ CC^1(x3) \ \right] + 1 \\ &SC^0(Y) = SC^0(x1) + SC^0(x2) + SC^0(x3) \\ &SC^1(Y) = min \left[ \ SC^1(x1), \ SC^1(x2), \ SC^1(x3) \ \right] \end{split}$$ ch4-55 ## **Observability Measure** - CO(N) and SO(N) - The observability of a node N is a function of the output observability and of the cost of holding all other inputs at non-controlling values Example: X1 observable: (Y observable) + (side-inputs 配合) $$CO(x1) = CO(Y) + CC^{0}(x2) + CC^{0}(x3) + 1$$ $SO(x1) = SO(Y) + SC^{0}(x2) + SC^{0}(x3)$ ## **Outline** - Test Generation (TG) Methods - Based on Truth Table - Based on Boolean Equation - Based on Structural Analysis - D-algorithm [Roth 1967] - 9-Valued D-algorithm [Cha 1978] - **PODEM** [Goel 1981] - **FAN** [Fujiwara 1983] ## **FAN (Fanout Oriented) Algorithm** #### • FAN Introduces two major extensions to PODEM's backtracing algorithm #### 1st extension Rather than stopping at PI's, backtracing in FAN may stop at an internal lines #### 2nd extension FAN uses multiple backtrace procedure, which attempts to satisfy a set of objectives simultaneously ch4-67 ## **Headlines and Bound Lines** #### Bound line - A line reachable from at least one stem - Free line - A line that is NOT bound line - Head line - A free line that directly feeds a bound line # **Why Stops at Head Lines?** - Head lines are mutually independent - Hence, for each given value combination at head lines, there always exists an input combination to realize it. - FAN has two-steps - Step 1: PODEM using headlines as pseudo-PI's - Step 2: Generate real input pattern to realize the value combination at head lines. ## **Why Multiple Backtrace?** - Drawback of Single Backtrace - A PI assignment satisfying one objective →may preclude achieving another one, and this leads to backtracking - Multiple Backtrace - Starts from a set of objectives (Current\_objectives) - Maps these multiple objectives into a head-line assignment k=v<sub>k</sub> that is likely to - Contribute to the achievement of a subset of the objectives - Or show that some subset of the original objectives cannot be simultaneously achieved ch4-71 #### **Multiple Backtrace Algorithm** Mbacktrace (Current\_objectives) { while (Current\_objectives ≠ ∅) { remove one entry (k, v<sub>k</sub>) from Current\_objectives; switch (type of entry) { 1. HEAD\_LINE: add (k, v<sub>k</sub>) to Head\_objectives; 2. FANOUT\_BRANCH: j = stem(k); increment no. of requests at j for v<sub>k</sub>; /\* count 0s and 1s \*/ add j to Stem\_objectives; 3. OTHERS: inv = inversion of k; c = controlling value of k; select an input (j) of k with value x; if $((v_k \oplus inv) == c)$ add(j, c) to Current\_objectives; else { for every input (j) of k with value x add(j, c') to Current\_objectives; } } TO BE CONTINUED ... # **Multiple Backtrace (con't)** ch4-7 ``` Mbacktrace (Current_objectives) { while (Current_objectives ≠ ∅) {body in previous page} if(Stem_objectives ≠ ∅) { remove the highest-level stem (k) from Stem_Objectives; v<sub>k</sub> = most requested value of k; /* recursive call here */ add (k, v<sub>k</sub>) to Current_objectives; return (Mbacktrace(Current_objectives); } else { remove one objective (k, v<sub>k</sub>) from Head_objectives; return (k, v<sub>k</sub>) } } ``` ### References - [1] Sellers et al., "Analyzing errors with the Boolean difference", IEEE Trans. Computers, pp. 676-683, 1968. - [2] J. P. Roth, "Diagnosis of Automata Failures: A Calculus and a Method", IBM Journal of Research and Development, pp. 278-291, July, 1966. - [2"] J. P. Roth et al., "Programmed Algorithms to Compute Tests to Detect and Distinguish Between Failures in Logic Circuits", IEEE Trans. Electronic Computers, pp. 567-579, Oct. 1967. - [3] C. W. Cha et al, "9-V Algorithm for Test Pattern Generation of Combinational Digital Circuits", IEEE TC, pp. 193-200, March, 1978. - [4] P. Goel, "An Implicit Enumeration Algorithm to Generate Tests for Combinational Logic Circuits", IEEE Trans. Computers, pp. 215-222, March, 1981. - [5] H. Fujiwara and T. Shimono, "On the Acceleration of Test Generation Algorithms", IEEE TC, pp. 1137-1144, Dec. 1983. - [6] M. H. Schulz et al., "SOCRATES: A Highly Efficient Automatic Test Pattern Generation System", IEEE Trans. on CAD, pp. 126-137, 1988. - [6'] M. H. Schulz and E. Auth, "Improved Deterministic Test Pattern Generation with Applications to Redundancy Identification", IEEE Trans CAD, pp. 811-816, 1989. ch4-75 # Introduction Why DFT? What is DFT? Ad-Hoc Approaches Full Scan Partial Scan ### Why DFT? - Direct Testing is Way Too Difficult! - Large number of FFs - Embedded memory blocks - Embedded analog blocks - Design For Testability is inevitable - · Like death and tax ch5-3 ### Design For Testability - Definition - Design For Testability (DFT) refers to those design techniques that make test generation and testing cost-effective - DFT Methods - Ad-hoc methods - Scan, full and partial - Built-In Self-Test (BIST) - Boundary scan - Cost of DFT - Pin count, area, performance, design-time, test-time # Why DFT Isn't Universally Used Previously? - Short-sighted view of management - Time-to-market pressure - Life-cycle cost ignored by development management/contractors/buyers - Area/functionality/performance myths - Lack of knowledge by design engineers - Testing is someone else's problem - Lack of tools to support DFT until recently We don't' have to worry about this management barrier any more → Most design teams now have DfT people ch5-5 ### **Important Factors** - Controllability - Measure the ease of controlling a line - Observability - Measure the ease of observing a line at PO - Predictability - Measure the ease of predicting output values - DFT deals with ways of improving - Controllability - Observability - Predictability ### Outline - Introduction - Ad-Hoc Approaches - Test Points - Design Rules - Full Scan - Partial Scan ch5-7 ### Ad-Hoc Design For Testability - Design Guidelines - Avoid redundancy - Avoid asynchronous logic - Avoid clock gating (e.g., ripple counter) - Avoid large fan-in - Consider tester requirements (tri-stating, etc.) - Disadvantages - High fault coverage not guaranteed - Manual test generation - Design iterations required ### Some Ad-Hoc DFT Techniques Test Points Initialization Delay outpu Monostable multivibrators - One-shot circuit Oscillators and clocks One-shot Counters / Shift-Registers - Add control points to long counters input Partition large circuits output Logical redundancy · Break global feedback paths ch5-9 ### On-Line Self-Test & Fault Tolerance By Redundancy ### Information Redundancy - Outputs = (information-bits) + (check-bits) - Information bits are the original normal outputs - Check bits always maintains a specific pre-defined logical or mathematical relationship with the corresponding information bits - Any time, if the information-bits and check-bits violate the pre-defined relationship, then it indicates an error ### Hardware Redundancy Use extra hardware (e.g., duplicate or triplicate the system) so that the fault within one module will be masked (I.e., the faulty effect never observed at the final output) # O/1 Injection Circuitry Normal operation When CP\_enable = 0 Inject 0 Set CP\_enable = 1 and CP = 0 Inject 1 Set CP\_enable = 1 and CP = 1 7 # Sharing Between Test Points & Normal I/O - Advantage: Even fewer I/O pins for Test Points - Overhead: Extra MUX delay for normal I/O ### **Control Point Selection** - Impact - The controllability of the fanout-cone of the added point is improved - Common selections - Control, address, and data buses - Enable / Hold inputs - Enable and read/write inputs to memory - Clock and set/clear signals of flip-flops - Data select inputs to multiplexers and demultiplexers ### Example: Use CP to Fix DFT Rule Violation DFT rule violations - The set/clear signal of a flip-flop is generated by other logic, instead of directly controlled by an input pin - Gated clock signals Violation Fix - Add a control point to the set/clear signal or clock signals Q Violation CK clear logic CLEAR ch5-17 CK - logic clear # Example: Fixing Tri-State Bus Contention Bus Contention A stuck-at-fault at the tri-state enable line may cause bus contention – multiple active drivers are connected to the bus simultaneously Fix Add CPs to turn off tri-state devices during testing (A Bus Contention Scenario in the presence of a fault) Enable line stuck-at-1 Unpredictable voltage on bus may cause a fault to go unnoticed ### **Observation Point Selection** - Impact - The observability of the fanin-cone (or transitive fanins) of the added OP is improved - Common choice - Stem lines having a large number of fanouts - Global feedback paths - Redundant signal lines - Output of logic devices having many inputs - MUX, XOR trees - Output from state devices - Address, control and data buses (常為電路區塊間之介面訊號) ### Outline - Introduction - Ad-Hoc Approaches - Full Scan - The Concept - Scan Cell Design - Random Access Scan - Partial Scan ch5-23 ### What Is Scan? ### • Objective - To provide controllability and observability at internal state variables for testing - Method - Add test mode control signal(s) to circuit - Connect flip-flops to form shift registers in test mode - Make inputs/outputs of the flip-flops in the shift register controllable and observable - Types - Internal scan - Full scan, Partial scan, Random access - Boundary scan ### **Procedure of Applying Test Patterns** Pl's PPI's - Comb. portion ### Notation - Test vectors $T = \langle t_i^I, t_i^F \rangle i = 1, 2, ...$ - Output Response $R = \langle r_i^0, r_i^F \rangle i = 1, 2, ...$ ### Test Application - (1) i = 1; - (2) Scan-in t<sub>1</sub>F /\* scan-in the first state vector for PPI's \*/ - (3) Apply t<sub>i</sub><sup>I</sup> /\* apply current input vector at PI's \*/ - (4) Observe r<sub>i</sub><sup>O</sup> /\* observe current output response at PO's \*/ - (5) Capture PPOs to FFs as r<sub>i</sub>F /\* capture the response at PPO's to FFs \*/ - (Set to 'Normal Mode' by raising SE to '1' for one clock cycle) - (6) Scan-out $r_{i}^{\,F}$ while scanning-in $t_{i+1}^{\,F}/^{*}$ overlap scan-in and scan-out $^{*}/$ - (7) i = i+1; Goto step (3) ch5-29 PO's PPO's ### Testing Scan Chain? ### Common practice - Scan chain is often first tested before testing the core logic by a so-called flush test - which pumps random vectors in and out of the scan chain - Procedure (flush test of scan chain) - (1) i = 0; - (2) Scan-in 1st random vector to flip-flops - (3) Scan-out (i)<sup>th</sup> random vector while scanning-in (i+1)<sup>th</sup> vector for flip-flops. - The (i)<sup>th</sup> scan-out vector should be identical to (i)<sup>th</sup> vector scanned in earlier, otherwise scan-chain is mal-functioning - (4) If necessary i = i+1, goto step (3) ### Some Problems With Full Scan - Problems - Area overhead - Synopsys Mentor-Graphics SynTest (華騰科技) Cadence **Major Commercial Test Tool Companies** - Possible performance degradation - High test application time - Power dissipation - Features of Commercial Tools - Scan-rule violation check (e.g., DFT rule check) - Scan insertion (convert a FF to its scan version) - ATPG (both combinational and sequential) - Scan chain reordering after layout ch5-37 ### Performance Overhead - The increase of delay along the normal data paths include: - Extra gate delay due to the multiplexer - Extra delay due to the capacitive loading of the scan-wiring at each flip-flop's output - Timing-driven partial scan - Try to avoid scan flip-flops that belong to the timing critical paths - The flip-flop selection algorithm for partial scan can take this into consideration to reduce the timing impact of scan to the design ### Overhead of Scan Design - Number of CMOS gates = 2000 - Fraction of flip-flops = 0.478 | Scan<br>implementation | Predicted overhead | Actual area overhead | Normalized operating frequency | |------------------------|--------------------|----------------------|--------------------------------| | None | 0 | 0 | 1.0 | | Hierarchical | 14.05% | 16.93% | 0.87 | | Optimized | 14.05% | 11.9% | 0.91 | ### Partial Scan - · Basic idea - Select a subset of flip-flops for scan - Lower overhead (area and speed) - Relaxed design rules - Cycle-breaking technique - Cheng & Agrawal, IEEE Trans. On Computers, April 1990 - Select scan flip-flops to simplify sequential ATPG - Overhead is about 25% off than full scan - Timing-driven partial scan - Jou & Cheng, ICCAD, Nov. 1991 - Allow optimization of area, timing, and testability simultaneously ### Partial Scan For Cycle-Free Structure - Select minimal set of flip-flops - To eliminate some or all cycles - Self-loops of unit length - Are not broken to reduce scan overhead - The number of self-loops in real design can be quite large - · Limit the length of - Consecutive self-loop paths - Long consecutive self-loop paths in large circuits may pose problems to sequential ATPG ch5-47 ### Test Generation for Partial Scan Circuits - Separate scan clock is used - · Scan flip-flops are removed - And their input and output signals are added to the PO/PI lists - · A sequential circuit test generator - is used for test generation - The vector sequences - Are then converted into scan sequences - Each vector is preceded by a scan-in sequence to set the states of scanned flip-flops - A scan-out sequence is added at the end of applying each vector ### Summary of Partial-Scan - Partial Scan - Allows the trade-off between test generation effort and hardware overhead to be automatically explored - Breaking Cycles - Dramatically simplifies the sequential ATPG - Limiting the Length of Self-Loop Paths - Is crucial in reducing test generation effort for large circuits - Performance Degradation - Can be minimized by using timing analysis data for flip-flop selection ## Chapter 6 # **Delay Testing** Acknowledgements: Mainly based on the lecture notes of "VLSI Test Principles and Architectures" ch6-1 ## Introduction of Delay Testing - □ Delay Faulty: - Fault that cause delay across a circuit to violate certain timing constraint - □ Delay Fault Models: - Path delay fault - Too much delay along a path - Transition fault (or Gate delay fault) - Too much delay across a particular gate ### # Applications of Delay Tests - □ Launch-off shifting (LOS) - Aka (also known as) skewed-load - v1 is arbitrary, v2 is derived by a 1-bit shift of v1 - □ Launch-off capture (LOC) - Aka broadside or double-capture - v1 is arbitrary, v2 is derived from v1 through the circuit function ### Transition Fault Model - Assumption: - a large/gross delay is present at a circuit node - □ Path independence: - Irrespective of which path the effect is propagated, the gross delay defect will be arriving late at an observable point - De-facto standard in Industry - Simple and the number of faults is linear to circuit size - Also needs 2 vectors to test a fault - □ Formulation of transition-fault test generation: - Node x slow-to-rise (x-STR) can be modeled simply as two stuck-at faults - (1) First time-frame: x/1 needs to be excited - (2) Second time-frame: x/0 needs to be excited and propagated ## **Summary** - More and more ICs require delay testing (or called timing testing, performance testing), to ensure that an IC can perform up to its target speed. - □ Better understand what LOS, LOC means, since It's industrial practice. - □ Some IC, e.g., CPU, needs to go through speed binning process, to determine the "quality bin" of each IC and its sell price. - Delay test is still a tough issue and still evolving. Rigorous delay testing also aims to detect "small defects" so as to reduce the test escape of latent defects that might hurt an IC's reliability in its field. ### Design-for-Testability - Design activities for generating a set of test patterns with a high fault coverage. - Methodology - Logic - Automatic Test Pattern Generation (ATPG) - Scan Insertion (to ease the ATPG process) - Built-In Self-Test - Memory (SRAM, DRAM, ...) - Built-In Self-Test ### Outline - Basics - Test Pattern Generation - Response Analyzers - BIST Examples - Memory BIST ch7-3 ### **Definition & Advantages of BIST** - Built-In Self-Test (BIST) is a design-fortestability (DFT) technique in which testing (test generation, test application) is accomplished through built-in hardware features. - [ V.D. Agrawal, C.R. Kime, and K.K. Saluja ] - Can lead to significant test time reduction Especially attractive for embedded cores ch7-4 ### Good Things About BIST - At-Speed Testing - catching timing defects - Fast - reduce the testing time and testing costs - a major advantage over scan - Board-level or system-level testing - can be conducted easily in field ch7-5 9 # Why Compression? - Motivation - Bit-to-bit comparison is infeasible for BIST - Signature analysis - Compress a very long output sequence into a single signature - Compare the compressed word with the pre-stored golden signature to determine the correctness of the circuit - Problems - Many output sequences may have the same signature after the compression leading to the aliasing problem - Poor diagnosis resolution after compression ch7-7 # Aliasing Effect in Response Compression Aliasing - the probability that a faulty response is mapped to the same signature as the fault-free circuit (魚目混珠) 錯變成對的機率 Response compression is a mapping from the output response space to the signature space In this example, aliasing prob. = 1/4 = 25% ch7-8 4 ### **BIST Issues** - Area Overhead - Performance Degradation - Fault Coverage - Most on-chip generated patterns may not achieve a very high fault coverage - Diagnosability - The chip is even harder to diagnose due to response compression ch7-9 # Pseudo-random pattern length An RPRF cannot be detected by random patterns is a major cause of low fault coverage in BIST Fault coverage inadequate coverage can be boosted by test points, ATPG patterns, ...? .5 # Example: Hard-To-Detect Fault - · Hard-to-detect faults - Faults that are not covered by random testing - E.g., an output signal of an 18-input AND gate # Reality of Logic BIST - BIST is NOT a replacement for scan - it is built on top of full-scan - BIST does NOT result in fewer patterns - it usually uses many more patterns than ATPG patterns - BIST does NOT remove the need for testers - tester still required to - · initiate test - read response - apply ATPG patterns to other part of IC # **BIST Techniques** - Stored-Vector Based - Micro-instruction support - Stored in ROM - Hardware-Based Pattern Generators - Counters - Linear Feedback Shift Registers - Cellular Automata ch7-13 # Linear Feedback Shift Register (LFSR) • Flip-Flop: one cycle delay - · XOR gate: modulo-2 addition • Connection: modulo-2 multiplication Type 1: Out-Tap Type 2: In-Tap $z = y4 + y1 = D^4(z) + D(z)$ $z = y4 = D(y3 + y4) = D(D^3(z) + z)$ $= D^4(z) + D(z)$ ch7-14 | Primitive Polynomials<br>(Up to Degree 100) | | | | | | | | |---------------------------------------------|------------|------|-----------|------|---------------|--------|---------------| | Not | e: "24 4 3 | 1 0″ | means | p(x) | $=x^{24}+x^4$ | $+x^3$ | $+ x^1 + x^0$ | | 72 | Exponents | n | Exponents | 72 | Exponents | 72 | Exponents | | 1 | 0 | 26 | 8 7 1 0 | 51 | 16 15 1 0 | 76 | 36 35 1 0 | | 2 | 1 0 | 27 | 8 7 1 0 | 52 | 3 0 | 77 | 31 30 1 0 | | 3 | 1 0 | 28 | 3 0 | 53 | 16 15 1 0 | 78 | 20 19 1 0 | | 4 | 1 0 | 29 | 2 0 | 54 | 37 36 1 0 | 79 | 9 0 | | 5 | 2. 0 | 30 | 16 15 1 0 | 55 | 24 0 | 80 | 38 37 1 0 | | 6 | 1 0 | 31 | 3 0 | 56 | 22 21 1 0 | 81 | 4 0 | | 7 | 1 0 | 32 | 28 27 1 0 | 57 | 7 0 | 82 | 38 35 3 0 | | 8 | 6 5 1 0 | 33 | 13 0 | 58 | 19 0 | 83 | 46 45 1 0 | | 9 | 4 0 | 34 | 15 14 1 0 | 59 | 22 21 1 0 | 84 | 13 0 | | 10 | 3 0 | 35 | 2 0 | 60 | 1 0 | 85 | 28 27 1 0 | | 11 | 2 0 | 36 | 11 0 | 61 | 16 15 1 0 | 86 | 13 12 1 0 | | 12 | 7 4 3 0 | 37 | 12 10 2 0 | 62 | 57 56 1 0 | 87 | 13 0 | | 13 | 4 3 1 0 | 38 | 6 5 1 0 | 63 | 1 0 | 88 | 72 71 1 0 | | 14 | 12 11 1 0 | 39 | 4 0 | 64 | 4 3 1 0 | 89 | 38 0 | | 15 | 1 0 | 40 | 21 19 2 0 | 65 | 18 0 | 90 | 19 18 1 0 | | 16 | 5 3 2 0 | 41 | 3 0 | 66 | 10 9 1 0 | 91 | 84 83 1 0 | | 17 | 3 0 | 42 | 23 22 1 0 | 67 | 10 9 1 0 | 92 | 13 12 1 0 | | 18 | 7 0 | 43 | 6 5 1 0 | 68 | 9 0 | 93 | 2 0 | | 19 | 6 5 1 0 | 44 | 27 26 1 0 | 69 | 29 27 2 0 | 94 | 21 0 | | 20 | 3 0 | 45 | 4 3 1 0 | 70 | 16 15 1 0 | 95 | 11 0 | | 21 | 2 0 | 46 | 21 20 1 0 | 71 | 6 0 | 96 | 49 47 2 0 | | 22 | 1 0 | 47 | 5 0 | 72 | 53 47 6 0 | 97 | 6 0 | | 23 | 5 0 | 48 | 28 27 1 0 | 73 | 25 0 | 98 | 11 0 | | 24 | 4 3 1 0 | 49 | 9 0 | 74 | 16 15 1 0 | 99 | 47 45 2 0 | | 25 | 3 0 | 50 | 27 26 1 0 | 75 | 11 10 1 0 | 100 | 37 0 | | | | | | | | | • | ( # Galois Field GF(2) - Operation - Modulo-2 addition, subtraction, multiplication, and division of binary data - Properties - Modulo-2 addition and subtraction are identical - 0+0=0, 0+1=1, 1+0=1, 1+1=0 - 0-0=0, 0-1=1, 1-0=1, 1-1=0 # Why LFSR? - · Simple and regular structure - D-flip-flops and XOR gates - Compatible with scan DFT design - Capable of exhaustive and/or pseudo exhaustive testing - If the LFSR is properly configured - Low aliasing probability - The fault coverage lost due to the response compression is less than other compression schemes ### LFSR - Definitions ### Maximum-length sequence - A sequence generated by an n-stage LFSR is called a maximum-length sequence if it has a period of 2<sup>n</sup>-1 - A maximum-length sequence is called m-sequence ### Primitive polynomial The characteristic polynomial associated with a maximum-length sequence is called a primitive polynomial ### Irreducible polynomial A polynomial is irreducible if it cannot be factorized into two (or more) parts, I.e., it is not divisible by any polynomial other than 1 and itself. ch7-21 ### LFSR - Properties ### No. of 1s and 0s - The number of 1s in an *m*-sequence differs from the number of 0s by only one ### Pseudo-random sequence The sequence generated by an LFSR is called a pseudorandom sequence ### The correlation - Between any two output bits is very close to zero ### Consecutive run of 1s and 0s - An *m*-sequence produces an equal number of runs of 1s and 0s. - In every *m*-sequence, one half the runs have length 1, one fourth have length 2, one eighth have length 3, and so forth # LFSR - Summary - LFSRs have two types - In-tap and Out-tap - LFSRs - Can be used to implement polynomial multiplication and division in GF(2) - As polynomial multiplier - LFSRs are capable of generating pseudo random vectors - As polynomial divisors - LFSRs are capable of compressing test response ch7-25 # Cellular Automaton (CA) - An one-dimensional array of cells - Each cell contains a storage device and next state logic - Next state is a function of current state of the cell and its neighboring cells Three-cell neighbor - Name of CA functions - Is determined by its truth table | State | Αo | <u>A1</u> | <b>A</b> 2 | <b>А</b> 3 | <u>A4</u> | <u>A5</u> | A <sub>6</sub> | <b>A</b> 7 | |--------------------|----|-----------|------------|------------|-----------|-----------|----------------|------------| | Ci+1<br>Ci<br>Ci-1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | | Ci | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | | Ci-1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | | Next State K-Map FcA | | | | | | |----------------------|----|------------|------------|--|--| | Ao | A2 | A4 | <b>A</b> 6 | | | | A1 | Аз | <b>A</b> 5 | <b>A</b> 7 | | | $Name = \sum_{i=0}^{7} A_i 2^i$ (defined by Wolfram) Example: $F_{CA} = C_{i-1} \oplus C_i$ Name = 64+32+4+2 = 102 # Outline - Basics - Test Pattern Generation - How to generate patterns on chip using minimum hardware, while achieving high fault coverage - Response Analyzers - BIST Examples - Memory BIST # PG Hardware Pattern Generated Stored Patterns Counter Based LFSR Based Cellular Automata Pseudo Random Patterns: Random patterns with a specific sequence defined by a seed Pattern Generated Deterministic Pseudo-Exhaustive Pseudo-Random Pseudo-Random Ch7-31 # On-Chip Exhaustive Testing - Exhaustive testing - Apply all possible input combinations to CUD - A complete functional testing - 100% coverage on all possible faults - Limitation - Only applicable for circuits with medium number of inputs ch7-33 # Pseudo Exhaustive Testing (PET) - Apply all possible input combinations to every partitioned sub-circuits - 100% fault coverage on single faults and multiple faults within the sub-circuits - Test time is determined by the number of sub-circuits and the number of inputs to the sub-circuit - Partitioning is a difficult task # Example for Pseudo-Exhaustive Testing 10 vectors are enough to pseudo-exhaustively test this circuit, Compared to 26=64 vectors for naive exhaustive testing ch7-35 ### LFSR-Based Pattern Generation - Apply random test sequence generated by LFSR/CA - Simplest to design and implement - Lowest in hardware overhead - Fault coverage - Is a function of the test length and the random testability of the circuits - Certain circuits are more resistant to random patterns than others ### Outline - Basics - Test Pattern Generation - Response Analyzers - How to compress the output response without losing too much accuracy - BIST Examples - Memory BIST ch7-41 # Types of Response Compression - Ones-counting compression - Transition-counting compression - Signature Analysis ### **Ones-Counting Signature** Procedure - Apply the predetermined patterns - Count the number of ones in the output sequence R0=00000000 R1=11000000 R2=10000000 **Test** CUT Pattern Counter Clock OC(R0) = 0signature OC(R1) = 2OC(R2) = 1ch7-43 # Zero-Aliasing Test Set for Ones-Counting ### Notations - T0: set of test vectors whose fault-free response is 0 - T1: set of test vectors whose fault-free response is 1 ### Theorem - The following new test set does NOT suffer from fault masking using ones count testing - T = {T0, (|T0|+1) copies of every pattern in T1} - Note that the fault masking only occurs when a fault is detected by the same number of patterns in TO and T1; the above new test set avoid this condition # Aliasing of Transition-Counting Consider a sub-sequence of bits $$(... r_{j-1} r_j r_{j+1} ...)$$ If $r_{j+1}$ is not equal to $r_{j+1}$ , then an error occurring at $r_{j}$ will not be detected by transition counting. Example 1. $(0, 1, 1) \rightarrow (0, 0, 1)$ 2. $(0, 0, 1) \rightarrow (0, 1, 1)$ 3. $(1, 1, 0) \rightarrow (1, 0, 0)$ 4. $(1, 0, 0) \rightarrow (1, 1, 0)$ # **Aliasing of Transition Counting** ### Aliasing Probability - Notations - m: the test length - r: the number of transitions - Highest when r=m/2 - No aliasing when r=0 or r=m - For combinational circuits, permutation of the input sequence results in a different signature - One can reorder the test sequence to minimize the aliasing probability # **Example: Aliasing Probability** - · Assume that - Output number to be compressed has m=4 bits - The compression is done by dividing output number by a divisor of 2<sup>n</sup>-1, (e.g., the divisor is 2<sup>2</sup>-1 = 3 when n=2) - The remainder is taken as the signature - Possible signatures ``` output = 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 remainder = 0 1 2 0 1 2 0 1 2 0 1 2 0 1 2 0 aliasing prob. when signature is 0 = (2^m/(2^n-1)) / 2^m = 1/(2^n-1) \sim 2^{-n} ``` ch7-49 # Multiple Input Shift Register (MISR) (Temporal Compression) A MISR compacts responses from multiple circuit outputs into a signature Aliasing probability of m stage = $2^{-m}$ ### Outline - Basics - Test Pattern Generation - Response Analyzers - BIST Examples - Memory BIST ch7-51 # Key Elements in a BIST Scheme - Test pattern generator (TPG) - Output response analyzer (ORA) - Also called Signature Analyzer (SA) - The circuit under test (CUT) - A distribution system (DIST) - which transmits data from TPG's to CUT's and from CUT's to ORA's - e.g., wires, buses, multiplexers, and scan paths - A BIST controller - for controlling the BIST circuitry during self-test - could be off-chip # HP Focus Chip (Stored Pattern) - Chip Summary - 450,000 NMOS devices, 300,000 Nodes - 24MHz clocks, 300K-bit on-chip ROM - Used in HP9000-500 Computer - BIST Micro-program - Use microinstructions dedicated for testing - 100K-bit BIST micro-program in CPU ROM - Executes 20 million clock cycles - Greater than 95% stuck-at coverage - A power-up test used in wafer test, system test, field test # Outline - Basics - Test Pattern Generation - Response Analyzers - BIST Examples - Memory BIST ch7-59 # The Density Issues - Historical π-Rule - The number of bits per chip has quadrupled roughly every 3.1 (or $\pi$ ) years - Density Induced Faults - The cells are closer together - More sensitive to influences of neighbors - More vulnerable to noise on the address and data lines # Test Time May Get Too Long! - For today's memory chips - Test time becomes a big issue! - We can afford nothing but linear test algorithm - Example - assume that the clock cycle time is 100 ns | Algorithm complexity | Testing time (in seconds) | | | | | | |----------------------|--------------------------------------------|-------|-----------------|---------------|--|--| | Capacity <b>n</b> | 64n n•log <sub>2</sub> n 3n <sup>3/2</sup> | | | 2n² | | | | 16k | 0.1 | 0.023 | 0.63 | 54 | | | | 64k | 0.4 | 0.1 | 5.03 | 14 Mins | | | | 256k | 1.7 | 0.47 | 40.3 | 3.8 Hrs | | | | 1M | 6.7 | 2.1 | <b>5.4</b> Mins | <b>61</b> Hrs | | | | 4M | 26.8 | 9.2 | 43 Mins | 41 Days | | | | 16M | 1.8 Mins | 40.3 | 5.7 Hrs | 2 Years | | | ### **Fault Models** - Stuck-At Faults (SAF) - cell, data line, address line, etc. - Open Faults (SAF) - open in data line or in address line - Transition Faults (TF) - Cell can be set to 0, but not to 1 - Address Faults (AF) - faults on decoders - Coupling Faults (CF) - short or cross-talk between data (or address) lines - A cell is affected by one of its neighboring cells - Neighborhood Pattern Sensitive Fault (NPSF) - A cell is affected by when its neighbors form a pattern ch7-65 cell is affected Fault Models ### **Example Faults** - SAF : Cell stuck - SAF : Driver stuck - SAF : Read/write line stuck - SAF: Chip-select line stuck - SAF : Data line stuck - · SAF: Open in data line - CF: Short between data lines - CF: Cross-talk between data lines - AF : Address line stuck - · AF: Open in address line - AF : Open decoder - · AF: Shorts between address lines - AF: Wrong access - AF : Multiple access - TF: Cell can be set to 0 but not to 1 (or vice-versa) - NPSF: Pattern sensitive interaction between cells 117 -66 # Simple Test Algorithms - Test Algorithm - is an abstract description of a sequence of test patterns. - Commonly Used Algorithms - Background patterns - Checkerboard patterns - March Patterns # **Quality Measures of BIST** | BIST-vsTester<br>Profile | | Tester | | | | | |--------------------------|------|------------|-------------|--|--|--| | | | pass | fail | | | | | B | pass | (I) | (III) •漏網之魚 | | | | | S<br>T | fail | (II) 。 誤殺者 | (IV) • • | | | | ### To minimize region (II) and (III): 1. False Negative Ratio: (II) / #chips e.g., (1/20) = 5% 2. False Positive Ratio: (III) / #chips e.g., (2/20) = 10% # Chapter 8 # **Test Compression** Acknowledgements: Mainly based on the lecture notes of Chapter 6, "VLSI Test Principles and Architectures" ch8-1 # What is this Chapter about? - □ Introduce the basic concepts of test data compression - □ Focus on stimulus compression and response compaction techniques - Present and discuss commercial tools on test compression ch8-2 # **Test Compression** - □ Introduction - □ Test Stimulus Compression - **☐** Test Response Compaction - □ Industry Practices - □ Concluding Remarks ch8-3 # Introduction - □ Why do we need test compression? - Test data volume - Test time - Test pins - □ Why can we compress test data? - Test vectors have a lot of "don't care" (X's) ch8-4 ## Test Compression Categories #### □ Test Stimulus Compression - (1) Code-based schemes - (2) Linear-decompression-based schemes - (3) Broadcast-scan-based schemes #### □ Test Response Compaction - Space compaction - Time compaction - Mixed time and space compaction ## Test Stimulus Compression - Code-based schemes - Dictionary code (fixed-to-fixed) - Huffman code (fixed-to-variable) - Run-length code (variable-to-fixed) - Golomb code (variable-to-variable) - □ Linear-decompression-based schemes - □ Broadcast-scan-based schemes #### Golomb Code #### □ Golomb code (variable-to-variable) | Group | Run-Length | Group Prefix | Tail | Codeword | |----------------|------------|--------------|------|----------| | A <sub>1</sub> | 0 | 0 | 0.0 | 000 | | | 1 | | 0.1 | 001 | | | 2 | | 10 | 010 | | | 3 | | 11 | 011 | | $A_2$ | 4 | 10 | 0.0 | 1000 | | - | 5 | | 0.1 | 1001 | | | 6 | | 10 | 1010 | | | 7 | | 11 | 1011 | | A <sub>3</sub> | 8 | 110 | 0.0 | 11000 | | | 9 | 1 1 | 0.1 | 11001 | | | 10 | 1 1 | 10 | 11010 | | | 11 | 1 | 11 | 11011 | | | | | | | ch8-13 # Example of Golomb Code #### □ Golomb code (variable-to-variable) $T_E$ = 010 1000 011 1000 1000 1001 010 1011 011 The length of $T_D$ is 43 bits The length of $T_E$ is 32 bits ### Test Stimulus Compression - □ Code-based schemes - **□** Linear-decompression-based schemes - □ Broadcast-scan-based schemes # Test Stimulus Compression - □ Code-based schemes - □ Linear-decompression-based schemes - □ Broadcast-scan-based schemes ch8-21 # Basic Concept: Broadcast-Scan $\{SC_1, SC_2, ..., SC_k\}$ shares the same test patterns applied by ATE ### ATPG Supporting Broadcast-Scan Force ATPG tool to generate patterns for broadcast scan (by binding certain PI's together) ch8-23 # Reconfigurable Broadcast Scan #### □ Reconfigurable broadcast scan - Static reconfiguration - The reconfiguration can only be done when a new pattern is to be applied - Dynamic reconfiguration - The configuration can be changed while scanning in a pattern #### Broadcast-Scan Based Scheme - First configuration is: 1->{2,3,6}, 2->{7}, 3->{5,8}, 4->{1,4} - Second configuration is: 1->{1,6}, 2->{2,4}, 3->{3,5,7,8} First Partition Second Partition # Test Response Compaction (or Called Output Compaction) - □ Space compaction - □ Time compaction - □ Mixed time and space compaction Unlike lossless input stimulus compression, Output compaction is often lossy, leading to aliasing... # Space (Output) Compaction - □ Space (output) compaction - Zero-aliasing output compaction - X-compactor - X-blocking & X-masking techniques - X-impact-aware ATPG ## **Zero-Aliasing Output Compaction** #### Theorem 6.1 For any test set T, for a circuit that implements function C, there exists a zero-aliasing output space compactor for C with q outputs where $q = \lceil \log_2(|T|+1) \rceil$ . #### Theorem 6.2 Let G be a response graph. If G is $2^q$ colorable, then there exists a q-output zero-aliasing space compactor for the circuit C. ## X-Blocking or Masking Techniques - □ X-blocking (or X-bounding, X-avoiding) - X's can be blocked before reaching the response compactor - To ensure that no X's will be observed - May still have fault coverage loss - Add area overhead and may impact delay # X-Impact-Aware ATPG #### **□** Concept - Simply use ATPG to algorithmically handle the impact of residual X's on the space compactor - Without adding any extra circuitry ch8-39 # Example: Handling X in ATPG Path (G5→G6→SC4→G8→q) might be contaminated by 'X' at f - (1) Propagate the fault effect through $(f1 \rightarrow G3 \rightarrow G2 \rightarrow SC2 \rightarrow G7 \rightarrow p) \Rightarrow b=0, c=1$ - (2) Kill the X by assigning g to '1' $\rightarrow$ SC4=0 $\rightarrow$ q is observable # **Output-Compactor-Aware ATPG** - □ f<sub>2</sub>/1 fault could be masked as propagated to p - □ Block aliasing by assigning a to '0' ch8-41 # Time Compaction - **□** Time compaction - A time compactor uses sequential logic to compact test responses - MISR is most widely adopted - n-stage MISR can be described by specifying a characteristic polynomial of degree n ## **Industry Practices** - □ OPMISR+ - □ Embedded Deterministic Test - □ Virtual Scan and UltraScan - Adaptive Scan - □ ETCompression ch8-45 ## Industry Solutions Categories - □ Linear-decompression-based schemes - Two steps - ETCompression, LogicVision - TestKompress, Mentor Graphics - SOCBIST, Synopsys - □ Broadcast-scan-based schemes - Single step - SPMISR+, Cadence - VirtualScan and UltraScan, SynTest - DFT MAX, Synopsys # Concluding Remarks #### □ Test compression is - An effective method for reducing test data volume and test application time with relatively small cost - An effective test structure for embedded hard cores - Easy to implement and capable of producing highquality tests - Successful as part of standard design flow #### History - 1985 - Joint European Test Action Group (JETAG, Philips) - 1986 - VHSIC Element-Test & Maintenance (ETM) bus standard (IBM et al.) - VHSIC Test & Maintenance TM Bus Structure (IBM et al.) - 1988 - Joint Test Action Group (JTAG) proposed Boundary Scan Standard - 1990 - Boundary Scan approved as IEEE Std. 1149.1-1990 - Boundary Scan Description Language (BSDL) proposed by HP - 1993 - 1149.1a -1993 approved to replace 1149.1-1990 - 1994 - 1149.1b BSDL approved - 1995 - 1149.5 approved ch9-5 #### Overview of P1149 Family | Number | Title | Status | | |--------|-------------------------------------------------------------------|-------------------------------------------------------------------|--| | 1149.1 | Testing of digital chips and<br>Interconnections between<br>Chips | Std. 1149.1-1990<br>Std. 1149.1a-1993<br>Std. 1149.1b-1994 (BSDL) | | | 1149.2 | Extended Digital Serial Interface | Near Completion | | | 1149.3 | Direct Access Testability Interface | Discontinue | | | 1149.4 | Mixed-Signal Test Bus | Started Nov. 1991 | | | 1149.5 | Standard Module Test and<br>Maintenance (MTM) Bus<br>Protocol | Std. 1149.5-1995 | | | 1149 | Unification | Not yet started | | | | | chs | | #### Hardware Components of 1149.1 - TAP (Test Access Port) - TMS, TCK, TDI, TDO, TRST\* (optional) - TAP Controller - A finite state machine with 16 states - Input: TCK, TMS - Output: 9 or 10 signals including ClockDR, UpdateDR, shiftDR, ClockIR, UpdateIR, ShiftIR, Select, Enable, TCK, and the optional TRST\* - IR (Instruction Register) - TDR (Test Data Register) - Mandatory: boundary scan register and bypass register - Optional: device-ID register, design-specific registers, etc. ch9-9 .5 #### States of TAP Controller - Test-Logic-Reset: normal mode - Run-Test/Idle: wait for internal test such as BIST - Select-DR-Scan: initiate a data-scan sequence - Capture-DR: load test data in parallel - Shift-DR: load test data in series - Exit1-DR: Finish phase-1 shifting of data - Pause-DR: Temporarily hold the scan operation (allow the bus master to reload data) - Exit2-DR: finish phase-2 shifting of data - Update-DR: parallel load from associated shift registers ch9-13 #### Instruction Set - EXTEST - Test Interconnection between chips and board - SAMPLE/PRELOAD - Sample and shift out data or shift data only - BYPASS - Bypass data through a chip - Optional - Intest, RunBist, CLAMP, Idcode, usercode, High-Z, etc. ch9-14 #### 國立清華大學電機系 EE-6250 超大型積體電路測試 VLSI Testing #### **Outline** - **♦** Introduction - ◆ Problem, Objective, Review, and Motivation - **♦** Pulse-Vanishing Test (PV-Test) - **♦ VOT-Based Oscillation Test** ### Objective and Challenge Objective: To detect parametric faults (e.g., <1ns delay fault) - → May need to maintain a <u>pitcher-catcher timing relationship</u> across dies (This type of cross-die clock synchronization may not be easy) - → There are so other choices... Note: test clocks TCK1 and TCK2 are low-speed test clocks (e.g., 10MHz) 7 #### **Outline** - **♦** Introduction - → Pulse-Vanishing Test (PV-Test) - At-speed testing for high-speed interconnects - **♦ VOT-Based Oscillation Test** An interposer wire is decomposed into multiple segments of r and c $r_{mb}$ is the resistance of the micro-bump 'IW-delay': interposer wire delay from A to WO 9 ### Resistive Open Fault Model (a) Fault-free model of an interconnect. (b) Faulty model of an interconnect with a resistive open fault. ### Test Time ◆ A PV-test session using 10MHz test clock is about 0.82 ms for 1024 interposer wires 26.21 ms for 32K (32,768) interposer wires #### Area Overhead #### Estimation is based on a 90nm CMOS process | Area overhead | | | | | |---------------------------|-----------------------------------|---------------------|--|--| | Type | Cell Name | Layout Area (μm*μm) | | | | Basic<br>Cells | INVERTER | 2.82 | | | | | 2-input NAND Cell | 2.94 | | | | | MUX Cell | 8.47 | | | | | FF Cell | 17.64 | | | | Basic<br>Macros | <b>Boundary Scan Cell</b> | 52.22 | | | | | Launch Cell | 92.56 | | | | | Capture Cell | 69.16 | | | | | PV-test controller | 670.3 | | | | Overhead | 55.55% for 1024 interposer wires | | | | | Percentage<br>Over 1149.1 | 54.9% for 32,768 interposer wires | | | | 21 ### Summary of PV-Test The interposer needs to be tested alone and thoroughly. And also, when a 2.5-D IC fails, We know if the interposer should be responsible. #### Advantages of Pulse-Vanishing Test - Simple fault detection scheme (No post-processing) - Delay Test without die-to-die high-speed clock synchronization - Boundary-Scan-Like Test Architecture (55.55% overhead) - On-the-spot Diagnosis (good for future self-repair) #### **Outline** - **♦** Introduction - **♦** Pulse-Vanishing Test (PV-Test) - **→ VOT-Based Oscillation Test** - Characterization-based parametric fault testing # Brief Summary of our Idea ### TSV Delay → Transition Time ### Transition Time → Oscillation Period Change (from normal to Schmitt-Trigger) (Easily Measurable) | Three Test Strategies | | | | | | |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------|-------------------------------------------|-----------------------------------------------------------------------------------------|--|--| | Principles: (1) All ROs oscillate concurrently to detect "resistive open faults" (2) One RO oscillates at a time to detect "inter-RO resistive bridging faults" (3) No RO oscillates to detect "intra-RO resistive bridging faults" | | | | | | | | Test Strategy | RO Settings | Test Actions | | | | Test<br>OPEN | AO-strategy<br>(All Oscillation) | Every RO is Active | Measure {T <sub>REF</sub> ,T <sub>ST1</sub> ,T <sub>ST2</sub> } of every RO in sequence | | | | Test<br>Inter-RO<br>BRIDGING | OO-strategy<br>(One Oscillation) | Target RO is Active<br>(One RO at a time) | Measure<br>{T <sub>REF</sub> ,T <sub>ST1</sub> ,T <sub>ST2</sub> }<br>of the target RO | | | | | | The others are<br>Grounded | NA | | | | Test<br>Intro-RO<br>BRIDGING | NO-strategy<br>(No Oscillation) | Every RO is Half-<br>Floating | Measure {T <sub>REF</sub> } of every RO | | | | 34 | | | | | | # Normalized $\Delta T_{drift}$ for Outlier Analysis For each IW $w_i$ , we have two versions of $\Delta T$ : $$\Delta T_{sim}(w_i) = T_{ST\_sim}(w_i) - T_{REF\_sim}(w_i)$$ $$\Delta T_{measure}(w_i) = T_{ST\_measure}(w_i) - T_{REF\_measure}(w_i)$$ $\Delta T_{drift}(w_i)$ respresents the drifting amount of a measurement version of $\Delta T$ away from its simulation version: $$\Delta T_{drift}(w_i) = \Delta T_{measure}(w_i) - \Delta T_{sim}(w_i)$$ To take into account of the wire-length diversity, we further normalize it: $$(Normalized \Delta T_{drift}) = \left(\frac{\Delta T_{measure} - \Delta T_{sim}}{\Delta T_{sim}}\right) \cdot 100\%$$ # Fault Detection Capability (For Resistive Open Faults) MDR<sub>open</sub>: Minimum Detectable Open Fault Resistance This metric refers to the open fault resistance value beyond which the proposed test method can detect the fault successfully based on the outlier analysis using $3\sigma$ rule. # Detectable Extra-RC: (MDR<sub>open</sub>) \* (C<sub>downstream</sub>) 50.7 ps ### Resistive Open Fault Detection Capability A resistive open fault occurring at the micro-bump of the driver side of a 1000um long interposer wire. | Pseudo<br>Chip Conditions | MDR <sub>open</sub><br>(Min. Detectable Open<br>Fault Resistance) | Detectable<br>Extra-RC | |---------------------------|-------------------------------------------------------------------|------------------------| | #1 (FF & -10% RC) | 245 Ω | 50.7 ps | | #2 (FF) | 76 Ω | 17.5 ps | | #3 (SS) | 113 Ω | 26.0 ps | | #4 (SS & +10% RC) | 78 Ω | 19.7 ps | | Average | 145 Ω | 31.4 ps | ### Summary of VOT-Based Oscillation Test The interposer needs to be tested alone and thoroughly. And also, when a 2.5-D IC fails, We know if the interposer should be responsible. #### **Conclusion** | Criterion | PV-test | VOT-based oscillation test | |---------------------------|-------------------------------------------------------------------|-----------------------------------------------| | Basic Concept | Check if pulse will vanish | Measure ΔT | | Fault Detection<br>Scheme | Test threshold based | Outlier analysis | | Area overhead | 55.5% over IEEE-1149.1 | 55.7% over IEEE-1149.1 | | Test time | <u>0.82</u> ms for 1024 wires<br><u>26.21</u> ms for 32K wires | 4.7 ms for 1024 wires<br>177 ms for 32K wires | | Other benefits | No post-processing<br>On-the-spot diagnosis<br>Easier self-repair | Delay characterization<br>Process tracking | Outlier analysis: A measurement sample that significantly deviates away from the entire population indicates a fault ### What would you do when chips fail? - □ Is it due to design bugs? - If most chip fails with the same syndrome when running an application - □ Is it due to parametric yield loss? - Timing-related failure? - Insufficient silicon speed? - Noise-induced failure? - supply noise, cross-talk, leakage, etc.? - Lack of manufacturability? - inappropriate layout? - □ Is it due to random defects? - Via misalignment, Via/Contact void, Mouse bite, - Unintentional short/open wires, etc. Ch11-3 # Problem: Fault Diagnosis This chapter focuses more on diagnosis of defects or faults, not design bugs Question: Where are the fault locations? ## **Quality Metrics of Diagnosis** #### Success rate - The percentage of hitting at least one defect in the physical failure analysis - This is the ultimate goal of failure analysis #### Diagnostic resolution - Total <u>number of fault candidates reported</u> by a tool - The perfect diagnostic resolution is 1 - Though perfect resolution does not necessarily imply high hit rate #### First-hit index - Used for a tool that reports a ranked list of candidates - Refers to the index of the first candidate in the ranked list that turns out to be a true defect site - Smaller first-hit index indicates higher accuracy #### □ Top-10 hit - Used when there are multiple defects in the failing chip - The number of true defects in the top 10 candidates # Possible Assumptions Used in Diagnosis - □ Stuck-At Fault Model Assumption - The defect behaves like a stuck-at fault - □ Single Fault Assumption - Only one fault affecting any faulty output - □ Logical Fault Assumption - A fault manifests itself as a logical error - □ Full-Scan Assumption - The chip under diagnosis has to be full-scanned Note: A diagnosis approach less dependent on the fault assumptions is more capable of dealing with practical situations. #### **Outline** #### Introduction - Combinational Logic Diagnosis - Cause-Effect Analysis - Effect-Cause Analysis - Chip-Level Strategy - Diagnostic Test Pattern Generation - □ Scan Chain Diagnosis - □ Logic BIST Diagnosis - Conclusion ## Cause-Effect Analysis - □ Fault dictionary (pre-analysis of all causes) - Records test response of every fault under the applied test set - Built by intensive fault simulation process - □ A chip is diagnosed (effect matching) - By matching up the failing syndromes observed at the tester with the pre-stored fault dictionary ## Backtrace Algorithm #### □ Trace back from each mismatched PO To find out suspicious faulty locations #### □ Functional Pruning During the traceback, some signals can be disqualified from the fault candidate set based on their signal values. #### □ Rules - (1) At a controlling case (i.e., 0 for a NAND gate): Its fanin signals with non-controlling values (i.e., 1) are excluded from the candidate set. - (2) At a non-controlling case (i.e., 1 for a NAND gate): Every fanin signal remains in the candidate set. # Why Curable Vector? #### □ Information theory - A less probable event contains more information - Curable output is an easy-to-satisfy criterion, high aliasing - Curable vector is a hard-to-satisfy criterion, low aliasing ### □ Not all failing input vectors are equal! #### □ Niche input vector - Is an failing input vector that activates only one fault - Likely to be a curable vector of certain signals - Few, but tells more about the real fault locations | Failing<br>Input<br>Vectors | Signals in the CUD | | | | | | | | |-----------------------------|-----------------------|----------------|-----------------------|----------------|-----------------------|------------------------------------|-----------------------|--| | | <i>f</i> <sub>1</sub> | f <sub>2</sub> | <b>f</b> <sub>3</sub> | f <sub>4</sub> | <b>f</b> <sub>5</sub> | <b>f</b> <sub>6</sub> | <b>f</b> <sub>7</sub> | | | <b>V</b> <sub>1</sub> | * | | | | ( * | | | | | <b>V</b> <sub>2</sub> | * | * | * | | | | | | | <b>V</b> <sub>3</sub> | | | * | * | | | * | | | <b>V</b> <sub>4</sub> | | | | | * | * | | | | <b>V</b> <sub>5</sub> | | * | | | * | | | | | <b>v</b> <sub>6</sub> | | * | | | * | | | | | <b>V</b> <sub>7</sub> | * | | * | | | | | | | <b>v</b> <sub>8</sub> | | | * | | | | * | | | <b>V</b> <sub>9</sub> | | | * | | * | | | | | <b>V</b> <sub>10</sub> | | | | | * | | * | | | A mark *<br>a SLAT ve | | | | | is a va | (f <sub>3</sub> and f<br>lid fault | | | ### **Outline** - Introduction - □ Combinational Logic Diagnosis - Cause-Effect Analysis - Effect-Cause Analysis - Chip-Level Strategy - Diagnostic Test Pattern Generation - □ Scan Chain Diagnosis - □ Logic BIST Diagnosis - Conclusion # Main Strategy: Detach-Divide-and-then-Conquer - □ Phase 1: Isolate Independent Faults - Search for prime candidates - Use word-level information - □ Phase 2: Locate Dependent Faults As Well - Perform partitioning - Aim at finding one fault in each block # Word-Level Registers and Outputs Signals in a design are often defined in words. This property can be used to differentiate fake prime candidates from the real ones. **Word-Level Output: 01** Word-Level Registers: R1, R2, State ``` module design( O1, ...) output[31:0] O1; reg[31:0] R1, R2; reg[5:0] State ... endmodule ``` # Efficiency of Using Word-Level Info. - **□** Without word-level Information - 2.4 real faults out of 72.3 candidates - **□** With word-level Information - 1.23 real faults out of 3.65 candidates | # of candidates | Original | After<br>Filtering | Filtering<br>Ratio | |--------------------------|----------|--------------------|--------------------| | Prime<br>Candidates | 2.375 | 1.23 | 48.2 % | | Fake Prime<br>Candidates | 69.96 | 2.42 | 96.5 % | ## **Summary** - Strategy - (1) Search For Word-Level Prime Candidates - (2) Identify Independent Faults First - (3) Locate Dependent Faults As Well - Effectiveness - identify 2.98 faults in 5 signal inspections - find 3.8 faults in 10 signal inspections ## Computation of Average-Sum Filtering $\square$ (Average-sum filtering) Assume that the difference profile is given and denoted as D[i], where i is the index of a flip-flop. We use the following formula to compute a smoothed difference profile, SD[i]: $$SD[i] = 0.2*(D[i-2]+D[i-1]+D[i]+D[i+1]+D[i+2])$$ Ch11-69 ## Computation of Edge Detection - □ The true location of the faulty flip-flop is likely to be the *left-boundary of the transition region in the difference profile*. To detect this boundary, we can use a simply *edge detection formula* defined below. - $\Box$ (Edge detection) On the smoothed difference profile SD[i], the following formula can be used to compute the faulty frequency of each flip-flop as a suspicious profile. $$suspicion [i] = [-1,-1,-1,1,1] \cdot \begin{vmatrix} |SD[i] - SD[i-3]| \\ |SD[i] - SD[i-2]| \\ |SD[i] - SD[i-1]| \\ |SD[i] - SD[i+1]| \\ |SD[i] - SD[i+2]| \\ |SD[i] - SD[i+3] \end{vmatrix}$$ ### Summary of Scan Chain Diagnosis - Hardware Assisted - Extra logic on the scan chain - Good for stuck-at fault - Fault Simulation Based - To find a faulty circuit matching the syndromes [Kundu 1993] [Cheney 2000] [Stanley 2000] - Tightening heuristic → upper & lower bound [Guo 2001][Y. Huang 2005] - Use single-excitation pattern for better resolution [Li 2005] - Profiling-Based Method - Locate the fault directly from the difference profiles obtained by run-and-scan test - Applicable to bridging faults - Use signal processing techniques such as filtering and edge detection Ch11-71 ### **Outline** - Introduction - □ Combinational Logic Diagnosis - □ Scan Chain Diagnosis - Logic BIST Diagnosis - Overview - Interval-Based Method - Masking-Based Method - Conclusion # Diagnosis for BISTed Logic #### □ Diagnosis in a BIST environment requires - determining from compacted output responses which test vectors have produced a faulty response (time information) - determining from compacted output responses which scan cells have captured errors (space information) #### □ The true fault location inside the logic Can then be inferred from the above space and time information using previously discussed combinational logic diagnosis ### **Conclusions** - □ Logic diagnosis for combinational logic - Has been mature - Good for not just stuck-at faults, but also bridging faults - □ Scan chain diagnosis - Making good progress ... - Fault-simulation-based, or signal-profiling based - □ Diagnosis of scan-based logic BIST - Hardware support is often required - Interval-unloading, or masking-based - **□** Future challenges - Performance (speed) debug - Diagnosis for logic with on-chip test compression and decompression - Diagnosis for parametric yield loss due to nanometer effects