Achieving High Power Efficiency and Flexible Charge Recycling Using Dynamic Circuit Technique

Shaik Wasim Basha
M.Tech,
Santhiram Engineering College,
Nandyal, Kurnool (Dt), Andhra Pradesh.

O.Mohana Chandrika, M.Tech
Assistant Professor,
Santhiram Engineering College,
Nandyal, Kurnool (Dt), Andhra Pradesh.

Abstract:
In integrated circuit design, dynamic logic is a design methodology in combinatory logic circuits, particularly those implemented in MOS technology. Dynamic circuits are widely used in order to solve the problems occurred in the data path and the critical components of the microprocessor. The power consumption is significantly in dynamic circuits due to their switching activity. In order to obtain high performance dynamic circuits are used in microprocessors because of their special features such as speed and area. In this paper, we proposed flexible charge recycling design methodology and dynamic circuit selection algorithm in order to achieve high efficiency in the data path. According to the proposed methodology the simulation result of the design shows the consumption of the ALU (Arithmetic and Logic Unit) with proposed technique is reduced significantly compared to the conventional ALU.

Keywords:
Dynamic circuits, ALU, Switching, FCR, PNS, CMOS.

Introduction:
Integrated circuit design, or IC design, is a subset of electronics engineering, encompassing the particular logic and circuit design techniques required to design integrated circuits, or ICs. ICs consist of miniaturized electronic components built into an electrical network on a monolithic semiconductor substrate by photolithography. Integrated circuit design involves the creation of electronic components, such as transistors, resistors, capacitors and the metallic interconnect of these components onto a piece of semiconductor, typically silicon.

A method to isolate the individual components formed in the substrate is necessary since the substrate silicon is conductive and often forms an active region of the individual components. The two common methods are p-n junction isolation and dielectric isolation. Attention must be given to power dissipation of transistors and interconnect resistances and current density of the interconnect, contacts and vias since ICs contain very tiny devices compared to discrete components, where such concerns are less of an issue. Electromigration in metallic interconnect and ESD damage to the tiny components are also of concern.

Finally, the physical layout of certain circuit subblocks is typically critical, in order to achieve the desired speed of operation, to segregate noisy portions of an IC from quiet portions, to balance the effects of heat generation across the IC, or to facilitate the placement of connections to circuitry outside the IC. The number of transistors that can be placed inexpensively on an integrated circuit has doubled approximately every two years. The trend has continued for more than half a century and is not expected to stop any time soon.

The capabilities of many digital electronic devices are strongly linked to Moore's law: processing speed, memory capacity, sensors and even the number and size of pixels in digital cameras. All of these are improving at (roughly) exponential rates as well. The transistor count is the number of transistors on an integrated circuit chip. Transistor count is the most common measure of semiconductor integrated circuit complexity. As of 2016, the highest transistor count in a commercially available CPU (in one chip) is over 7.2 billion transistors, in Intel's 22-core Xeon Broadwell-EP.
The number of transistors in a chip has extremely grown continuously with advanced technology, over a past decade. As there is an increase in the transistor count it demands for less power consumption and less area, these are the main issues occur during the design of the circuit, with an increase in transistor count it leads to increase the transistor density, power consumption of the microprocessor, as a result these are major issues as to be carried out to ensure the high performance of the proposed system. In a modern microprocessor the data paths perform computing operations through a critical path. The operating frequency of the microprocessor can be determined by the operating speed of the data path. The data path consumes a significant amount of total power consumption and also it is a most active component. This results in a problem in those applications which as intensive computation on it, such as multimedia processor with multiple core and digital signal microprocessor. Hence, in a modern microprocessor, the low power data path can be achieved.

The dynamic circuit is widely used as a solution in critical data path and critical components in microprocessors due to their features such as speed and area [1], [2]. The n-type dynamic circuit is applied to on-chip memory and arithmetic and logic unit (ALU) of some microprocessors to minimize the latency [1], [3]. A high performance microprocessors use dynamic circuits, which can achieve a timing constraint than a static CMOS. The performance of dynamic circuits is increased due to low switching thresholds, reduced input capacitance and circuit implementation is achieved using high efficient and complex gates. The dynamic circuits are used in the implementation of a high frequency internal processor by using a logic blocks which have a timing requirement. In the proposed paper, a novel p-type/n-type dynamic circuit selection (PNS) algorithm and flexible charge recycling (FCR) methodology is designed in modern microprocessors in order to obtain a low power data path.

**RELATED WORK:**

In earlier research the techniques used were n-p dynamic circuits, n-p dynamic circuits are categorized as n-dynamic and p-dynamic. In the n-dynamic circuits it adopts high-speed NMOS transistor to achieve high performance and in pdynamic circuits it adopts slower PMOS transistor hence the speed is slower, but the power efficiency is enhanced due to suppressed gate and subthreshold leakage current of PMOS transistor. The n-p dynamic circuit has been proposed as a race-free dynamic CMOS technique for pipelined circuits [4]. The n-p dynamic circuit has lower intrinsic delay and requires less silicon area due to the more compact logic than with the static CMOS logic. Binding algorithm is mainly based on framework for low leakage data path. This technique is effective only for some applications and there is a considerable speed loss and not suitable for any high performance application [5].

A macrodriven data path design methodology has developed and it generates different topologies for different macros [6]. Three more methods have developed for synthesizing dynamic circuits, but these only contain n-type dynamic circuits and failed to include p-type dynamic circuits. Crosstalk-aware and speed-aware synthesis methodologies are presented, but neither of them considers power efficiency [7]-[8]. Finally, a dynamic data path is synthesized automatically, but requires a significant silicon area. The common feature of these techniques is that the potential for low power by combining different types of dynamic circuits is not effectively explored.

**PROPOSED METHODOLOGY**

The proposed PNS-FCR, exploring power saving opportunity for data path circuits, is presented in this section. The three-step PNS-FCR design methodology is depicted in fig1.
Design Flow for a Data Path Based on PNS-FCR:

As shown in Fig. 2, the design flow for a data path based on PNS-FCR is as follows:

1) First, the gate library based on a p-type/n-type dynamic circuit is built. Two types of each gate occupy a similar layout area to avoid area penalty.

2) Based on the gate library, the appropriate type of gates is selected using PNS to implement the data path or critical path, satisfying the performance requirements of different applications.

3) Next, the FCR is utilized to achieve high power efficiency in critical path by inserting the charge recycling paths between two independent gates or two neighboring gates. Note that the FCR is a tradeoff between power, performance, and silicon area.

4) Then, apply the proposed PNS-FCR to noncritical paths. The critical path is typically much longer than the uncritical path in the data path, and therefore, the gates in the uncritical path employ p-type for power efficiency. However, if an uncritical path formed by all p-type gates is even slower than the critical path, n-type gates would be inserted to meet the delay constraint based on PNS, and then the FCR is used to enhance the power efficiency.

5) Finally, the routing is completed manually or by CAD tools.

**PN Selection Algorithm**

Based on the gate library, the appropriate type of gate is selected to implement a data path, as shown in Fig. 1. To satisfy the performance requirements of different applications, a PNS algorithm is introduced based on the multidimensional multiple-choice 0-1 Knapsack problem (MMKP) [9]. The delay of the critical path determines the performance of the data path. The gates in the critical path are required to meet the performance constraint, while the gates in the noncritical paths, use a low power version to enhance power efficiency.

**Flexible Charge Recycling technique:**

A key design issue in low power data paths is exploring the choice of different power efficient n-type and p-type gates. Accordingly, the FCR is proposed to achieve high power efficiency, as shown in Fig. 2. Consider a critical data path exist between two cascaded gates. The initial gate is an n-type dynamic gate, while the latter gate is a p-type dynamic gate. During the precharge stage (CLK = 0), the dynamic node of the n-type gate Nn is precharged to Vdd through transistor Pcn, while the dynamic node of the p-type gate Np is discharged to ground through the transistor Ncp.
During the evaluation stage, provided that the necessary input combination is applied, \( N_n \) is discharged to the ground and \( N_p \) is charged to \( V_{dd} \). Otherwise, the high state of \( N_n \) and low state of \( N_p \) are maintained until the next precharge stage. As the evaluation process completes, \( N_n \) discharges from high to low and \( N_p \) charges from low to high. In the following precharge stage, \( N_n \) and \( N_p \) both consume dynamic power by charging \( N_n \) from \( V_{dd} \) and discharging \( N_p \) to ground. If a switch is inserted between the two dynamic gates, \( N_n \) is charged by \( N_p \) through a charge recycling path, thereby reducing the dynamic power.

A dynamic full adder is taken as an example, as shown in Fig.2. When the input vectors of full adder are respectively \((1, 1, 0), (1, 0, 1), \) and \((0, 1, 1)\), at the end of an evaluation stage, \( N_n \) has been discharged to \( G_{nd} \) while \( N_p \) has been charged to \( V_{dd} \), and the switch is turned on. And then, a desirable charge recycling path between \( N_n \) and \( N_p \) is built. With the FCR cell, in the precharge stage, the \( CLKB \) makes the recycle path available. Consequently, two supplies \( V_{dd} \) and \( N_p \) charge \( N_n \) simultaneously, which makes the precharge speed much higher as compared with the conventional circuit with only single supply \( V_{dd} \) charging \( N_n \).

The additional capacitance \( C_r \) between dynamic nodes \( N_n \) and \( N_p \) due to an adding charge recycling path has a negligible effect on evaluation speed. Accordingly, in the evaluation stage, the voltage waveforms of \( N_n \) and \( N_p \) without and with the FCR cell almost overlap. The charge recycling path exists between two independent gates as well as two neighboring gates. If \( r \) p-type gates and \( q \) n-type gates are selected for a critical path, \( \min(r, q) \) charge recycling paths can be inserted to reduce power.

**Verification of FCR:**

To verify the effectiveness of FCR, a full adder with clock-delay as designed, which is usually employed along the critical path in data path. A full adder with FCR includes one n-type Sum cell, one p-type Carry cell (dynamic full adder), and one FCR cell to enhance power efficiency (Fig. 2).

The fig.3 and fig.4 shows the schematic design of the adder circuit without and with FCR respectively. The output waveforms shows the voltage and dynamic node variation with respect to clk signal. In the above figure the adder circuit is designed and it is a dynamic full adder which consumes less power than the conventional adder. This adder circuit has 3 inputs and 2 outputs and the adder is called as ripple carry order where the carry out of one adder is given to the next adder and the carry out is obtained at the last adder circuit. This adder is implemented in ALU in order to achieve high efficiency.

![Fig.3. Schematic design of full adder circuit](image1)

![Fig.4. Schematic design of full adder with FCR circuit](image2)
CONCLUSION

A novel methodology is presented in this paper for designing dynamic circuits in the functional units of modern processors. The proposed PNS-FCR methodology achieves high power efficiency, while satisfying specific timing constraints. Simulation results show that the power consumption of adder in different submicron technology. The proposed adder utilizes low power than the conventional adder and the high performance is achieved. This proposed technique is implemented in ALU which consumes low power and achieves a higher efficiency than the conventional ALU. This methodology can be extended to static CMOS, pass gate, transmission gate, tristate gate, and other logic families.

References:

[1] Jinhui Wang, Member, IEEE, Na Gong, Member, IEEE, and Eby G. Friedman, Fellow, IEEE, PNS-FCR: Flexible Charge Recycling Dynamic Circuit Technique for Low-Power Microprocessors, IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS


