Optimization-based Test Scheduling for IEEE 1687 Multi-Power Domain Networks Using Boolean Satisfiability

Payam Habiby∗
Sebastian Huhn†
Rolf Drechsler†

∗University of Bremen, Germany
{habiby,huhn,drechsler}@uni-bremen.de
†Cyber-Physical Systems, DFKI GmbH
28359 Bremen, Germany

Abstract—The IEEE 1687 Std. provides an efficient access methodology for embedded instruments in complex system-on-a-chip designs by introducing reconfigurable scan networks. This flexibility enables the reduction of the overall test access time, which significantly decreases the test costs compared to the conventional daisy-chaining method. However, the new access methodology strictly requires effective test schedulers that consider multi-power domains with individual constraints. This work proposes a novel test scheduler that orchestrates the Boolean Satisfiability problem in conjunction with Pseudo-Boolean optimization techniques. The effectiveness of the proposed scheduler is proven by considering networks with over one thousand of so-called instruments forming industrial representative benchmark candidates.

Index Terms—Boolean Satisfiability, IEEE 1687, IJTAG, Pseudo-Boolean Optimization, Reconfigurable Scan Networks, Test Scheduling & Scheduling Algorithms

I. INTRODUCTION

The steadily increasing number of cores in System-on-a-Chip (SoC) designs has been boosting the demand for highly efficient instrument access methodologies since the last years. The IEEE 1687 Std. [1] – also known as IJTAG – was introduced to integrate reconfigurable scan networks into SoCs. The faster instruments’ access in IJTAG networks is realized by employing a new component called Segment Insertion Bit (SIB), which is programmed to include or exclude a section of the network. This principle aims at the scan chain’s shortening and, hence, the reduction of the access time overhead. Due to the power, temperature and further system limitations, not all of the SoC’s components can be simultaneously tested. A further power demand is introduced when the network components are considered in addition to the instruments’ scan and self-test components and, hence, the design is even more prone to issues like IR-drop and test failures [2], [3]. As the overall test time directly affects the test cost, determining an optimized access sequence to the cores massively improves the test procedure and reduces the resulting test costs. Several techniques [4]–[15] have been introduced to reduce the overall test time in IJTAG networks. For example, a formulation for calculating the overall access time is presented in [4]. In work [5], it is tried to minimize the SIB programming overhead by using a hybrid scheduling scheme for instruments with exclusive SIBs. The scan architecture of [7] uses a reconfigurable broadcast mechanism to reduce the overall test data transfer time. A modification scheme for the underlying SIB structures has been proposed in [6] aiming at a higher access bandwidth by introducing a parallel architecture. Boolean satisfiability is exploited in works like [8], [10] to model the reconfigurable scan networks for pattern retargeting and network verification tasks. The SAT-based algorithm of [8] has been further improved in [11] by proposing a new bound calculation method to reduce the number of SAT solver invocations. In particular, significant research has been carried out on the optimization of the test scheduling [12]–[14]. The authors in [13] propose a session-less test scheduling method by sorting the instruments according to their resource conflict and their number of test patterns. In [14], a pre-emptive scheduling method is introduced, which invokes an exhaustive search over all possible concurrent instruments.

However, all these works do neither consider any power constraints nor incorporate multi-power domains, which is strictly required for an effective test of state-of-the-art SoCs. In [15], a graph-based model of IJTAG networks has been proposed for the first time, which invokes a heuristic method to detect concurrently accessible instruments in multi-power domain IJTAG networks. Although the approach [15] delivers already quite promising results, it is computationally intensive when exploring a huge search space.

In contrast, the proposed scheduling method of this paper is applicable for multi-power domain IJTAG networks embracing ScanMixes, SCBs, and SIBs regardless of the structure of concurrency groups. This work proposes a novel optimization method using Boolean Satisfiability (SAT) in conjunction with Pseudo-Boolean Optimization (PBO) techniques. By this, it can be taken advantage of a powerful formal solving engine to, finally, calculate an optimal test schedule yielding highly effective but power-safe tests.

The remainder of this paper is structured as follows: Section II gives a brief introduction to IJTAG, SAT and PBO. Section III describes the proposed formal modeling scheme for IJTAG networks and the SAT-based optimization model. The experimental results are reported in Section IV and, finally, the paper is concluded in Section V.

II. BACKGROUND

Fig. 1 presents an exemplary IJTAG network named Mingle which is one of the instances of the ITC’16 benchmark set [16]. This network embraces ten SIBs labeled as sib1-sib10 and eight instruments indicated by w1–w8, which are connected to the network via parallel Test Data Registers (TDRs) performing the read, write and select operations. The elements of the network are divided into three power domains. During a capture operation the network is loaded with data from the instruments through the parallel interface. While this data is transferred to SO, the new scan data, including the network configuration, is shifted in through a shift operation. Finally, an update signal loads the instrument with the new data. This sequence of capture, shift, and update is called a CSU. Typically, the instruments in the network require different number of accesses. As soon as an instrument does not need further access, the corresponding SIB would be closed by resetting its programming bit to 0 during the last scan process. By this, IJTAG provides shorter paths for accessing the rest of instruments in the network.

The SAT problem investigates whether a given Boolean function can be satisfied by a set of assignments to its variables. If such an assignment to all variables exists, the function is classified as satisfiable (sat). Otherwise, it is classified as
FORMATIONS FORMING THE SAT INSTANCE, WHICH IS A CONJUNCTION \( \bigwedge \) OF CLAUSES \( \Phi = (\neg x_1 \lor \neg x_2 \lor \ldots \lor \neg x_n) \). A BOOLEAN FUNCTION \( F \) OF ITS VARIABLES.

THE PSEUDO-BOOLEAN (PB) SATISFIABILITY PROBLEM DETERMINES WHETHER A BOOLEAN FUNCTION \( \Psi : \{0, 1\}^n \rightarrow \{0, 1\} \) WITH WEIGHTED LITERALS CAN BE SATISFIED BY A SET OF ASSIGNMENTS TO ITS VARIABLES.

THE PBO PROBLEM EXTENDS THE PB-SAT PROBLEM SUCH THAT THE CONJUNCTION OF THE STATED PB CO-FACTORS AND, HENCE, CAN BE DEFINED AS FOLLOWS:

\[
F(x_1, \ldots, x_k) = \sum_{i=1}^{k} w_i \cdot x_i \quad \text{with} \quad w_1, \ldots, w_k \in \mathbb{Z}
\]

MODERN PBO SOLVING ENGINES SUPPORT MULTIPLE OBJECTIVE FUNCTION \( F_1, \ldots, F_n \), WHICH ARE IMPLEMENTED BY A PRIORITIZED EVALUATION ORDER. DIFFERENT PBO-BASED PROCEDURES HAVE ALREADY BEEN SUCCESSFULLY APPLIED IN THE TESTING DOMAIN LIKE [18]–[20].

III. PROPOSED SAT-BASED OPTIMIZATION MODEL

 THIS SECTION PRESENTS THE PROPOSED MODEL FOR DESCRIBING A GENERIC IJTAG NETWORK CONTAINING INSTRUMENTS, SIBs, SCBs, SCANMUXES AND BRANCHING NODES. THIS MODEL IS THEN USED TO OBTAIN AN OPTIMAL TEST SCHEDULE PER TEST SLICE, WHICH COVERS ALL INSTRUMENTS WITHIN A MINIMUM OVERALL ACCESS TIME WITHOUT EXCEEDING THE POWER DOMAINS’ LIMITS. THE PROPOSED SAT-BASED OPTIMIZATION SCHEME INVOLVES THE FOUR STEPS AS FOLLOWS:

1) THREE BUILDING BLOCKS ARE INTRODUCED ALLOWING FOR THE CREATION OF A HIGHLY REDUCED BASIC NETWORK STRUCTURE USING A (PURE) BOOLEAN REPRESENTATION.

2) THE INDIVIDUAL POWER DOMAIN LIMIT IS REFLECTED BY NEWLY INTRODUCED ENUMERATION CONSTRAINTS, WHICH INVALIDATE THE PART OF THE SEARCH SPACE THAT EXCEEDS THE CONSIDERED POWER LIMITS.

3) ONE OBJECTIVE FUNCTION IS DEFINED, WHICH GUIDES THE OPTIMIZATION PROCEDURE, I.E., THE MAXIMIZATION OF THE CONCURRENTLY ACTIVE INSTRUMENTS.

4) THE TEST SCHEDULING PROCESS IMPLEMENTS AN INCREMENTAL INVOCATION OF THE UNDERLYING SAT-SOLVING ENGINE TO ENSURE FULL COVERAGE OF ALL REACHABLE INSTRUMENTS. THE RECENTLY COVERED INSTRUMENTS OF TEST SLICE \( i-1 \) ARE THEN MASKED IN THE \( i \)-TH SLICE.

A. Building Blocks

A SERIAL PATH BETWEEN SI AND SO MAY INCLUDE THREE MAIN FEATURES, NAMELY BRANCHING POINTS, MERGING ELEMENTS AND SWITCHING NODES. THESE FEATURES CAN BE COMBINED TO CREATE COMPLEX HIERARCHICAL STRUCTURES DEFINED BY THE IEEE 1687 STANDARD. IN FIG. 2(a), \( S \) REPRESENTS A SIB AND \( x \) CAN BE ANOTHER SIB, AN INSTRUMENT OR A SUB-NETWORK. HERE, A LOGICAL VALUE ‘1’ MEANS THAT THE ITEM IS ON AN ACTIVE CHAIN AND IS ACCESSIBLE WHILE A VALUE ‘0’ STATES THAT THE ELEMENT CANNOT BE ACCESSED. SINCE SIBS OPERATE LIKE SWITCHES TO CONNECT/DISCONNECT THE ELEMENTS OF THE NEXT HIERARCHY, IN THE CASE THAT \( S \) IS NOT ACCESSIBLE, \( x \) IS NEITHER ACCESSIBLE. HOWEVER, IF \( S \) IS ACCESSIBLE – DEPENDING ON \( S \) BEING OPENED OR CLOSED – \( x \) WILL BE EITHER ACCESSIBLE OR INACCESSIBLE. THIS IS EXPRESSED BY THE FOLLOWING EQUATION:

\[
\Phi = s \lor \neg x
\]

THE SAME CNF IS USED TO MODEL THE RELATION BETWEEN THE WRAPPED INSTRUMENTS (USING NEGATION) AND THEIR RELATED TDRS. THE MODELS USED FOR BRANCHES AND MUXPLEXERS ARE DERIVED FROM THE FORMAL MODELS THAT ARE USED FOR RETARGETING AND ACCESS OPTIMIZATION IN [10]. HOWEVER, IN THIS PAPER THE PLAIN CHAINS ARE CONSIDERED AS A SPECIAL CASE OF BRANCHES IN WHICH THE ELEMENTS HAVE ONLY ONE SUCCESSOR. FIG. 2(b) PRESENTS A BRANCHING NODE WITH TWO ELEMENTS THAT CANNOT BE PLACED ON A COMMON CHAIN. FOR A BRANCHING NODE, THREE FEATURES ARE BEING CONSIDERED: AT FIRST, ONLY ONE ACTIVE CHAIN IN AN IJTAG NETWORK CAN EXIST AT A TIME. THEREFORE, ALL STATES THAT INCLUDE MORE THAN ONE CHAIN ARE INVALID. THIS IS EXPRESSED WITH THE CLAUSE \( (\neg x_2 \lor \neg x_3) \). SECONDLY, ONE OF THE SUCCESSOR NODES MUST NECESSARILY BE ON THE ACTIVE CHAIN IF A SOURCE NODE IS ACTIVATED. THIS CIRCUMSTANCE IS IMPLEMENTED BY \( (\neg x_1 \lor x_2 \lor x_3) \), AS STATED IN FIG. 2(b). FINALLY, THE SELECTION OF ONE NODE (ON A BRANCH) IMPLIES THAT THE PREDECESSOR IS ACTIVATED, WHICH IS REFLECTED BY \( (x_1 \lor \neg x_2) \land (x_2 \lor \neg x_3) \). THE CONJUNCTION OF THE STATED CLAUSES YIELDS THE CNF REPRESENTATION OF A BRANCHING NODE IN THE IJTAG NETWORK, WHICH CAN BE GENERALIZED AS FOLLOWS:

\[
\Phi = (\neg x_1 \lor x_2 \lor \ldots \lor x_n) \land \bigwedge_{i,j \in \{2, n\} \setminus \{i\}} (\neg x_i \lor \neg x_j) \land \bigwedge_{i=2}^{n} (x_1 \lor \neg x_i)
\]
merging node, if one of the components is accessible, the other components on this chain are reachable. Analogously, if one component is not reachable, the other components on the chain are neither reachable. In this case, the stated equation can be reduced to the bi-implication as follows:

\[ \Phi = x_1 \leftrightarrow x_2 = (x_1 \lor \neg x_2) \land (\neg x_1 \lor x_2) \]

Fig. 2(c) presents the state in which a ScanMux selects be- on the same chain. On the other hand, the active state of

For a ScanMux with

reduced to the bi-implication as follows:

\[ \Phi = (\neg x_1 \lor \neg x_2) \land (\neg x_2 \lor x_3) \land (\neg x_1 \lor x_3) \land (x_1 \lor x_2 \lor \neg x_3) \]

For a ScanMux with \( n = 1 \) inputs and \( x_n \) as output, this equation can be generalized as follows:

\[ \Phi = (x_1 \lor x_2 \cdots \lor \neg x_n) \land \bigwedge_{i=1}^{n} (\neg x_i \lor \neg x_j) \land \bigwedge_{i=1}^{n} (x_i \lor \neg x_n) \]

After these three building blocks are gradually applied to
generate the network model, the test scheduling problem is
modeled, as shown in Fig. 3. The horizontal axis is divided to test pattern units. Each strip is assigned to one test pattern, which is applied to the active chain over a CSU and every instrument occupies a number of strips based on its required number of accesses. As long as the access to none of the instruments on the current chain is completed, the path from SI to SO remains unchanged. \( w_4 \) and \( w_6 \) in Fig. 3 have this condition over the first twelve access. Once the access to these instruments is completed, the new configuration is applied to exclude them from the next scans. The SIB programming bits required for the configuration of the next chain are appended to the test data in each unit. The access to the chains can be reordered and no dependency between them is required.

B. Enumeration Constraints

The overall power limit \( P_{\text{max},j} \) of each power domain \( j \) is modeled as one enumeration constraint \( C_i \), which is added to the overall instance. This follows the basic idea that the accumulated power consumption of (active) instruments should not exceed the domain’s power limit during a CSU as follows:

\[ \sum_{p \in C\text{SU}_i} p_i \leq P_{\text{max}} \]

More precisely, a Pseudo-Boolean co-factor – the weight \( p \) – is introduced to the Boolean variable \( v \), which reflects the instrument’s activation status while \( p \) represents the weighted (classified) instrument’s power consumption. Consequently, a valid solution must fulfill all enumeration constraints:

\[ \bigwedge \left( \bigvee_{i \in \{1 \ldots n\}} (p_i \cdot v_i) \right) \leq P_{\text{max},j} \]

where \( I_{ij} \) is the set of reachable instruments of power domain \( j \). This concept is shown in Fig. 3, in which instruments are organized in two power domains \( j_2 \) and \( j_3 \). The sum of instruments’ power consumption over strips for domain \( j_2 \) and \( j_3 \) are shown with dashed and solid lines respectively and each line stays within the limit of its own domain constraint. As the power consumption of the scan chain elements is negligible compared with the one of instruments, it is not considered in Fig. 3.

C. Objective Functions

The optimization criterion is defined as follows: Let \( I_i(a_i, p_i) \) be the \( i \)-th instrument with \( a_i \) number of accesses and power consumption \( p_i \). The Pseudo-Boolean objective function \( F \) addresses the maximization of the allocated power consumption in every strip. To implement \( F \), a weight \( p_i \) is added to every of the \( N \) Boolean variable \( v_i \), representing their power consumption. The weight is accumulated if the instrument is active. \( F \) is defined as follows:

\[ F(v_1, \ldots, v_N) = \sum_{i=1}^{N} (p_i \cdot v_i) \]

D. Incremental Invocation

The blocks of Fig. 3 have to be distributed over the strips such that the minimum number of CSUs is required. This is obtained by allocating the maximum power consumption within each strip while observing the concurrent accessibility of the elements. Since no strip initially exists, 1 is assumed for the SI port’s corresponding variable. This yields a path to SO including the maximum sum of power consumption such that the active instrument do not violate the power limit, as determined by the SAT solver. The set of instruments that have been activated by the SAT solver are considered as the first strip. The next strip is created when the access to at least one of the instruments (of the current set) has been completed. Accordingly, the corresponding literals are deactivated to discard the already covered instruments in the next run of the solver.

IV. EXPERIMENTAL RESULTS

This section describes the experimental evaluation of the proposed test scheduler, which is solely written in C++ and utilizes clasp (v. 3.1.4) [21] as Pseudo-Boolean optimization solver. All experiments have been tried on different networks from the ITC’16 benchmark set [16] and conducted on a machine holding an Intel Xeon E3-1270 processor and 32GB of main memory.

Table I presents rightmost the number of variables and clauses, which have been introduced while processing the individual ICL netlist. The benchmark networks have been classified into three groups of complexity, namely basic,
classic and advanced. The values – being presented as nodes and edges – represent the total number of elements and their connections in the corresponding graphs. These metrics are meant to provide a general overview of the network scale.

Despite being small, the Mingle network [16] contains all structural features like branching and merging points, SIBs, SCBs and parallel branches, as earlier discussed. For the sake of a comprehensible presentation, the experimental evaluation of the proposed test scheduling is described for this network in detail. However, other networks are evaluated as well and their results are briefly summarized in Table II. The Mingle network has artificially been divided into three power domains, as is shown in Fig. 1. Every data path’s element is assigned to domain $j_1$ while the instruments are organized in domains $j_2$ and $j_3$. The power requirements and the number of required accesses are randomly generated and represented as tuples inside the first incidence of each instrument. These values are used as a side-input to the framework, albeit they can be replaced by the real numbers obtained from a power analyzer. As the power domains’ limits, 20 for the first domain and 140 for the other two domains have been assumed. The scheduling results are already stated in Fig. 3, in which a sum of 165 accesses is executed in 90 CSUs.

Table II gives the results of the experiments, for which all networks have been divided into three different power domains with individual power limits, analogously to the Mingle network as described above. The table presents the sum of the power consumption in each domain during all accesses. Note that there is not a single violation of the power domains’ limits. Additionally, the column #accesses reports the sum of all instruments’ accesses in each scenario and column #chains gives the number of required scan chains (test slices) to cover these accesses. The values in column #CSU shows the total number of strips that are required to complete the access to all instruments of the IJTAG network. Finally, the rightmost column presents the CPU time (in seconds) for conducting the complete test scheduling process including the model’s creation and the solving process.

### V. CONCLUSION

This paper proposed a novel test scheduler for large IJTAG test networks with multi-power domains by using SAT-based and PBO-based techniques. More precisely, a seamless SAT model for complex IJTAG network structures had been designed, which was then enhanced with Pseudo-Boolean aspects to model specific optimization criteria and enumeration constraints. The scalability of the proposed scheduler was proven by considering even large networks with over one thousand of instruments forming industrial representative benchmark candidates. In the end, the proposed test scheduler allows for determining highly effective but power-safe tests and, by this, contributes to reduced test costs when testing complex and power-critical SoC designs.

### VI. ACKNOWLEDGMENT

This work was supported by the AI initiative of the Free Hanseatic City of Bremen. The authors gratefully thank Stephan Enggersglls from Siemens Digital Industries Software for the inspiring research discussions and thoughtful feedback.

### REFERENCES


### TABLE II  BENCHMARK RESULTS

<table>
<thead>
<tr>
<th>network</th>
<th>power domain</th>
<th>#accesses</th>
<th>#CSU</th>
<th>#chains</th>
<th>run-time [s]</th>
</tr>
</thead>
<tbody>
<tr>
<td>Mingle</td>
<td>1  27  20</td>
<td>3  210  140</td>
<td>165  90  8</td>
<td>0.019</td>
<td></td>
</tr>
<tr>
<td>BasicSCB</td>
<td>3  34  30</td>
<td>3  328  180</td>
<td>88   52  5</td>
<td>0.012</td>
<td></td>
</tr>
<tr>
<td>TreeNat</td>
<td>1  13  30</td>
<td>3  473  180</td>
<td>158  32  7</td>
<td>0.020</td>
<td></td>
</tr>
<tr>
<td>t51250</td>
<td>1  39  30</td>
<td>3  5040 180</td>
<td>1684 486 105</td>
<td>3.215</td>
<td></td>
</tr>
<tr>
<td>p22810</td>
<td>3  3458 2300</td>
<td>3  9874 180</td>
<td>3090 831 200</td>
<td>12.654</td>
<td></td>
</tr>
<tr>
<td>N1TD3</td>
<td>1  53  55</td>
<td>3  1378 200</td>
<td>488 174 27</td>
<td>0.087</td>
<td></td>
</tr>
<tr>
<td>N32D6</td>
<td>1  79  100</td>
<td>3  1812 250</td>
<td>655 140 40</td>
<td>0.223</td>
<td></td>
</tr>
<tr>
<td>N7TD14</td>
<td>1  94  100</td>
<td>3  1478 200</td>
<td>900 120 12</td>
<td>0.843</td>
<td></td>
</tr>
<tr>
<td>N132D4</td>
<td>3  367 250</td>
<td>3  1572 200</td>
<td>2453 380 134</td>
<td>5.008</td>
<td></td>
</tr>
<tr>
<td>NE600P150</td>
<td>3  795 500</td>
<td>3  7201 2200</td>
<td>10524 191 176</td>
<td>159.622</td>
<td></td>
</tr>
<tr>
<td>NE1200P300</td>
<td>3  1561 700</td>
<td>3  63040 3100</td>
<td>21187 278 269</td>
<td>555.825</td>
<td></td>
</tr>
</tbody>
</table>


