Low Quantum Cost Construction for Adder and Symmetric Boolean Function

Kandakatla Sravanthi
M.Tech (VLSI),
Sidhartha Institute of Science and Technology.

Anam Srinivas Reddy, M.Tech
Assistant Professor,
Sidhartha Institute of Science and Technology.

Abstract:
Reversible logic design has been one of the promising technologies gaining greater interest due to less dissipation of heat and low power consumption. Quantum computing necessitates the design of circuits via reversible logic gates. Efficient reversible circuit can be constructed by achieving low ancilla count, reducing logical depth and lowering Quantum costs. Generalized Peres gates have recently been realized with very low Quantum Cost (QC) by utilizing Quantum rotation gates. This is utilized in recent literature for efficient reversible circuit constructions for symmetric Boolean functions. In this paper, we extend this line of construction further by demonstrating efficient realization of adder circuits. In particular, we revisit the adder construction of Vedral, Barenco and Eckert to show that improvement of gate count and QC is achievable by exploiting a construction based only on Peres gates. We also report improved constructions of symmetric Boolean functions by following an approach recently proposed in the context of Boolean function complexity analysis. All the synthesis and simulation results of the Proposed Reversible Adder based on Per2 and its 4Bit Adders are performed using Verilog HDL on Xilinx ISE 14.7.

Index Terms: Reversibility, reversible logic circuits, Boolean function, reversible gate.

1. INTRODUCTION:
The growing technologies have increased the demand of high performance computing. According to G. Moore’s low [1], number of transistor counts to be integrated per unit area in devices will almost double in one and half year.

To achieve high speed computation, high packaging density in the logic circuits is required which results in more heat dissipation. The conventional computing is found unable to deal with low power, high compaction and heat dissipation issues of the current computing environment. For example, Shor's factorization [1], has remained a key attraction for experimental Quantum computing groups for obvious reasons [2] and involves modular arithmetic operations [3]. Naturally, nvariable binary arithmetic operations have received significant research attention. Several adder realizations for Quantum circuits have been proposed in the literature [4], [5], [6], [7], [8], [9]. In this paper, we investigate potential improvements in the adder circuit realization. Considering a more general form, symmetric Boolean functions [10] represent an important subclass of Boolean functions, with specific applications in circuits dominant in arithmetic operations, and therefore, deserves closer attention as well. In the second part of this paper, we revisit a recently proposed construction of symmetric Boolean functions [11] and show that further improvements on this construction is possible.

A. Preliminaries
A Boolean function f is of the form \( f : \{0, 1\}^n \rightarrow \{0, 1\} \) (or equivalently \( f : \mathbb{V}^n \rightarrow \mathbb{V} \)). The output of the Boolean function \( f \) can be represented as a string \( s \) of ones and zeros. It can also be represented as a multivariate polynomial over \( \mathbb{GF}(2) \). This polynomial can be expressed as an exclusive disjunction (EXOR) of a constant \( a_0 \) and one or more conjunctions of the function argument. This is called the Exclusive Sum-Of-Product (ESOP) representation. An alternative representation of the ESOP form is known as the Algebraic Normal Form (ANF).
The general ANF for a function \( f(x_1, \ldots, x_n) \) over \( n \)-variables can be written as,

\[
f(x_1, \ldots, x_n) = a_0 \oplus a_1 x_1 \oplus \cdots \oplus a_n x_n
\]

The Hamming weight (or simply weight) of a binary string \( S \) is the number of 1’s present in it, which we denote as \( \text{wt}(S) \). Symmetric Boolean functions \[10\] form an important subclass of Boolean functions. A Boolean function \( f : \{0, 1\}^n \to \{0, 1\} \) is called symmetric if its output is invariant under any permutation of its input bits. Equivalently, we can say that the value of \( f(x) \) is constant for all \( x \)’s having the same weight. Reversible and Irreversible Boolean functions: An \( n \)-variable vectorial Boolean function is termed reversible if all its output patterns map uniquely to an input pattern and vice-versa. It can be expressed as an \( n \)-input, \( n \)-output bijection or alternatively, as a Boolean permutation function over the truth value set \( \{0, 1, \ldots, 2^n-1\} \). An irreversible Boolean function \( f_{irr} : \{0, 1\}^n \to \{0, 1\}^m \) with \( n = m \) can also be made reversible with the help of additional output lines, which adds distinguishing patterns to the irreversible output. Correspondingly, additional inputs are added. If an input line is constant-initialized and the constant is recovered after the circuit execution then, it is termed as ancilla. Otherwise, it is termed as garbage.

B. Reversible Logic Gates

Reversible logic circuit is implemented with the elementary reversible logic gates. The gates are characterized by their implementation cost in quantum technologies, which is denoted as the Quantum Cost (QC). A reversible gate library is a complete set of reversible gates which can be used to implement any reversible circuit. For example the set of NOT, CNOT, controlled-V and Controlled-V +, known as NCV, is a reversible gate library widely used in the literature. Recently, there has been a significant research activity towards the realization of quantum circuits using Clifford+T gates, considering the importance of fault tolerance in quantum computing.

Efficient synthesis for NCV circuits \[12\], \[13\], \[14\], Clifford+T circuits \[15\] and mapping of NCV circuits to Clifford+T gates \[16\] have been proposed in the literature. Few gates from these libraries are outlined below. For detailed discussion on primitive quantum gates and their universality, readers may refer to \[17\], \[18\].

- NOT gate: This is represented using the matrix \[
\begin{bmatrix}
0 & 1 \\
1 & 0
\end{bmatrix}
\]

- CNOT gate: \( CNOT(a, b) = (a, a \oplus b) \). This gate can be generalized with T off\( n \) gate, where first \( n - 1 \) variables are used as control lines. For 2 control lines and 1 target line, ed to as Multiple Control Toffoli (MCT) gate. When both positive control and negative control lines are permitted, the gate is denoted as Mixed-Polarity Multiple Control Toffoli (MPMCT) gate.

- Peres gate: \( P_{er}(a, b, c) = (a, a \oplus b, ab \oplus c) \). This gate can be generalized with \( P_{ern} \) gate (\( n > 2 \)) \[19\], where first \( n - 1 \) variables are used as control lines.

C. Cost Models

For evaluating the performance of the synthesis tools and benchmark circuit implementations, different cost models have been proposed in the literature. The most basic cost model is the number of reversible logic gates needed for the implementation. However, the actual implementation cost of these logic gates could be, with varying number of control lines, can be very different. The QC value is computed for each of these gates, which is nothing but the number of 2-qubit gates \[20\] needed to implement these circuits. In recent fault-tolerant Quantum circuit implementations, the cost is estimated in terms of T gates, corresponding to the realization of the circuits using Clifford+T library. Specifically, the logical depth in terms of the depth of T gates serve as a performance metric. Another important performance indicator is the total number of lines or ancilla/garbage count. In the following complexity analysis, we provide the complexity results in terms of the count of MPMCT gates and the total line count.
Further, we assume a conservative pergate estimate for T-depth, which is $3 \times |C|$, where $|C|$ is the number of control lines [16].

2. REVERSIBLE CIRCUIT FOR ADDER

For the first Quantum ripple-carry adder circuit proposed in [4], in fact, $(2n - 1)$ Peres gates are deployed in the carry computation blocks, though it uses $n$ ancilla lines. Among the ancilla-free $n$-bit binary adder circuits reported so far in the literature, [9] has the least gate cost with $(7n - 6)$ gates, out of which, there are $(2n - 1)$ T of 3 (CCNOT) gates. In the following, we propose a simple ripple-carry adder construction that is fully based on Peres gates. Our design is motivated by two recent results. QC of Per is $n^2$ and QC of T of $n+1$ is $2n^2 - 2n + 1$ [19]. An exemplary structure of cascaded Per gates is shown graphically in Fig. 1. We report a lemma from [11] and show how that can be used for designing adder circuits with low QC.

Lemma 2.1: 2 cascaded Per gates require $n^2 + n$ elementary 1-QC gates.

![Fig.1. Cascaded Peres Gates](image)

Lemma 2.2: An n-bit adder circuit with n ancilla lines can be realized with 2n Per gates, where all the Per gates are in pairwise cascade form.

Proof: For performing the addition, we repeatedly apply the following equations.

$$\text{sum}_i = x_i \oplus y_i \oplus \text{cin} \ (1)$$
$$\text{cout} = \text{Majority}(x_i, y_i, \text{cin}) = x_i \cdot \text{cin} \oplus (y_i \cdot (x_i \oplus \text{cin})) \ (2)$$

The cout is treated as cin for the next level in a simple ripple-carry adder formation. The realization of these Boolean functions can be arranged in form of pairwise cascaded Per gates as shown in the Figure 2. For an n-bit adder, clearly 2n Per gates are required.

![Fig.2. Reversible Adder based on Per2](image)

Considering the state-of-the-art adder constructions, the proposed circuit offers lower QC at the expense of higher ancilla lines (Table I). For the Toffoli (T of 3) and Peres (Per 2) gates, QC values of 5 and 4 are used respectively [19]. Note that, we accounted for the Per 2 gates in the adder structures of [9] and [4]. Furthermore, the input bits for the adder circuit remain unaltered at the output and therefore, can be used for subsequent circuit blocks. The total gate count is reported in MCT equivalence, where each Peres gate accounts for 2 MCT gates.

<table>
<thead>
<tr>
<th>TABLE I. PERFORMANCE OF ADDER CIRCUITS</th>
</tr>
</thead>
<tbody>
<tr>
<td>Circuit</td>
</tr>
<tr>
<td>---------</td>
</tr>
<tr>
<td>Vedral et al. [4]</td>
</tr>
<tr>
<td>Takahashi et al. [9]</td>
</tr>
<tr>
<td>Ciocarla et al [7]</td>
</tr>
<tr>
<td>This work</td>
</tr>
</tbody>
</table>

The proposed realization of adder circuit with only Per gates reduces QC compared to [4], [9], [7] when $n \geq 2$. A similar adder structure has been recently proposed in [21, Fig. 14a], where, however, the cascading of generalized Peres gates have not been explored. Thus, the QC is higher in that case.

3. REVERSIBLE CIRCUITS FOR SYMMETRIC BOOLEAN FUNCTIONS

Implementation of symmetric Boolean functions follows two steps. First, the computation of Hamming weight and second, the comparison of the Hamming weight with a specific outcome of 0 or 1.
It has been shown in [11] that, by using a ripple carry adder circuit, as described in the previous section 2, circuit with more efficient construction than state-of-the-art implementations can be obtained. We improve this result with another implementation of the Hamming weight computation circuit. First, we present a general result for an n-input 1-output symmetric Boolean function.

Lemma 3.1: The reversible circuit corresponding to an n-input 1-output symmetric Boolean function requires at most $n + \log_2 n$ garbage outputs.

Proof: Maximum Hamming weight of an n-variable Boolean function can be n, which can be stored at $\log_2 n$ 2 output lines. Considering these lines to be different from the n input lines, no more than $n + \log_2 n + 1$ output is produced, out of which 1 output contains the result.

In the following we study, two different circuit constructions for symmetric Boolean functions.

A. Construction I

The first construction, proposed in [11], computes the Hamming weights of the input variables in additional lines. The construction is presented in the Figure 3. The Hamming weight computation is followed by a series of Toffoli gates for deriving the output corresponding to each Hamming weight.

![Fig.3. Symmetric Boolean Function: Construction I](image)

A Quantum cost derivation for this circuit design approach is already presented in [11]. For the ease of future benchmarking, we also present the gate count, T-depth and line computations here.

Proposition 3.2: Reversible circuit realization of an n-variable 1-output symmetric Boolean function requires $n + \log_2 n - 1$ garbage lines if construction I is used. The proof follows directly from the Fig. 3, which shows that the least significant bit of the Hamming weight is stored on one of the input lines. Note that, this upper bound is 1 less than the one derived at Lemma 3.1 since, one input line is reused for the output.

Proposition 3.3: As per construction I, reversible circuit realization of an n-variable 1-output symmetric Boolean function needs at most $n + (n-1) \times (\log_2 n + 1)$ MCT gates and has a T-depth of at most $1.5(n-1)(\log_2 n + 1)(\log_2 n + 2) + 3(n)(\log_2 n + 1)$.

Proof: For an n-variable Boolean function, the Hamming weight computation requires $(n-1)$ cascaded Per gates, where the maximum value of k can be $\log_2 n + 1$. Considering the worst case scenario, total $(n-1)$ cascaded Per gates are needed. Since one Per gate is equivalent to k MCT gates, we obtain a total $(n-1) \times (\log_2 n + 1)$ MCT gates. The computation of Symmetric function is composed of Hamming weight calculation followed by a set of comparators. Each comparator is due to one Hamming weight value. There can be at most n different Hamming weights for a Symmetric function realization, as otherwise, it will become a constant function. Each comparator for a specific Hamming weight value require a mixed-polarity T ofk gate, where k is at most $(\log_2 n + 1)$. Hence, another $n \times \text{MPMCT}$ gates are required. In total $n + (n-1) \times (\log_2 n + 1)$ MPMCT gates are used, at most.

For one Per gate, $k \times \text{Toffoli}$ gates are required, with the number of control lines ranging from k to 1 in decreasing order. Hence, the T-depth of a Per gate is $3 \times k(k+1) = 1.5 \times k(k + 1)$. Considering the worst-case value of k to be $\log_2 n + 1$ and total $(n-1)$ Per gates, the T-depth for the Hamming weight computation is $1.5(n-1)(\log_2 n + 1)(\log_2 n + 2)$. 
For the comparator part, total T-depth is \(3(n)(\log_2 n + 1)\). Summing up, we get the result.

**B. Construction II**

Recently, Demenkov et al [22] presented a circuit of size \(4.5n\) using which, the binary representation of the sum of \(n\) input bits can be computed. Note that, there, the size refers to 2-input Boolean functions from the set \{\lor, \land, \oplus\}. The overall complexity for a symmetric Boolean function is presented as \(4.5n + O(n)\), where the \(O(n)\) is contributed by the comparator circuit, as discussed in the previous construction. Hence, we concentrate on the Hamming weight construction part and assume the comparator part to be implemented with exactly the same complexity as in construction I.

![Fig.4. Symmetric Boolean Function: Construction II](image)

The key circuit idea is presented in the Figure 4. An \(n\)-variable Hamming weight computing circuit uses \(n/2\) Modified Double Full Adders (MDFAs), which is shown in detail at the bottom of the figure. Each MDFA is a 5-input, 3-output circuit, which is presented in form of a 8-gate circuit in [22, Figure 3]. In the Figure 5, we present the corresponding reversible circuit realization using MPMCT gates. As it can be observed, for each MDFA, 5 garbage lines are generated. From this construction, we can state the following.

Proposition 3.4: Reversible circuit realization of an \(n\)-variable 1-output symmetric Boolean function requires \(2.5n + \log n - 1\) garbage lines if construction II is used.

Proof: Construction II uses total \(n/2\) MDFA blocks, resulting in \(2.5n\) garbage lines. From the \(\log n\) lines storing the Hamming weight, except 1 output line, the rest lines are not useful. Hence, the result.

![Fig.5. Reversible Circuit Implementation of MDFA](image)

Proposition 3.5: As per construction II, reversible circuit realization of an \(n\)-variable 1-output symmetric Boolean function needs at most \(6n\) MPMCT gates and has a T-depth of at most \(19.5n + 3n(\log_2 n + 1)\).

Proof: For an \(n\)-variable Boolean function, the Hamming weight computation requires \(n/2\) MDFA blocks, each of which has 9 Toffoli gates. Hence, \(4.5n\) Toffoli gates are needed. For the initial \(\oplus\) operations, \(n/2\) Toffoli gates are used. Altogether, for the Hamming weight circuit, \(5n\) MPMCT gates are needed, at most. Adding with the previous results of comparator circuit complexity, we obtain the result.

For each MDFA, including the initial \(\oplus\) operations, a T-depth of \(3 \times 13 = 39\) is obtained. Hence, the total T-depth for the Hamming weight computation circuit is \(39 \times n/2 = 19.5n\). Summing up with the previously established T-depth for the comparator circuit, we get the result. In short, construction I provides an implementation with less garbage lines and construction II, provides an implementation with less gate count and T-depth.

4. **SYNTHESIS AND SIMULATION RESULTS**

All the synthesis and simulation results of the Proposed Reversible Adder based on \(P_{er2}\) and its 4Bit Adders are performed using Verilog HDL.
The synthesis and simulation are performed on Xilinx ISE 14.7. The corresponding simulation results of the Proposed Reversible Adder based on Per2 and its 4Bit Adders are shown below.

5. CONCLUSION
In this paper, we propose a new reversible circuit construction for binary adder, which improves state-of-the-art designs in terms of gate count and QC,
while admitting ancilla lines. We also performed a detailed analysis of symmetric Boolean function implementations in reversible circuits, and proposed an improved circuit construction. In future, we will follow similar implementation techniques for realizing further complex designs of relevance in Quantum algorithms.

REFERENCES


