854 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015

Size: px
Start display at page:

Download "854 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015"

Transcription

1 854 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015 Efficient Subquadratic Space Complexity Architectures Parallel MPB Single- and Double-Multiplications All Trinomials Using Toeplitz Matrix-Vector Product Decomposition Chiou-Yng Lee, Senior Member, IEEE, and Pramod Kumar Meher, Senior Member, IEEE Abstract Subquadratic multiplication algorithm has received significant attention of cryptographic hardware researchers efficient implementation public-key cryptosystems. In this paper, we derive a new shifted MPB (SMPB) representation based on modified polynomial basis (MPB). We have shown that by using MPB and SMPB, the proposed double basis multiplication can be transmed into Toeplitz matrix-vector product (TMVP) structure. Furthermore, by employing this mulation of double basis multiplication, we show that three-operand multiplication over all trinomials can be realized efficiently by the recursive TMVP (RTMVP) mulation. To perm the three-operand multiplication with the RTMVP mulation, we have derived a new RTMVP decomposition scheme. The proposed single- and double-multiplications can, respectively, use TMVP and RTMVP decompositions to achieve subquadratic space complexity architectures. By theoretical analysis, it is shown that the proposed subquadratic multipliers involve significantly less space complexity and less computation time compared to the existing subquadratic multipliers using TMVP and Karatsuba algorithms. Moreover, our proposed double-multiplication design can be used in several applications involving successive multiplications, such as exponentiation, inversion, and elliptic curve point multiplication. Index Terms Binary extension field, elliptic curve cryptography, finite field, Galois field, modified polynomial basis multiplication, Toeplitz matrix-vector product. I. INTRODUCTION F INITE FIELD multiplication over is a basic field operation, which is frequently used in elliptic curve cryptography (ECC) to perm point-additions and point-doubling operations on an elliptic curve. The multiplication over can be used further to perm division, exponentiation, and inversion operations. In finite field arithmetic, addition is the simplest operation because addition of any two bits can be permed by logical XOR operation; and there is no carry propagation. Division operations on the other hand can be implemented by a series of multiplications. The area and time complexity involved in perming the multiplications, consequently, contribute most of the area and the time required Manuscript received August 28, 2014; revised November 02, 2014; accepted November 20, Date of current version February 23, This paper was recommended by Associate Editor V. Chandra. P. K. Meher is with the School of Computer Engineering, Nanyang Technological University, Singapore ( aspkmeher@ntu.edu.sg). C.-Y. Lee is with the Department of Computer Inmation and Network Engineering, Lunghwa University of Science and Technology, Taoyuan 33306, Taiwan ( pp010@mail.lhu.edu.tw). Color versions of one or more of the figures in this paper are available online at Digital Object Identifier /TCSI the implementation of ECC. It is, theree, required to design finite field multipliers with greater efficiency in terms of area consumption and speed permance ECC. The Weil and Tate pairings [1] based on elliptic curve arithmetic involves extensive computations of multiplication involving operands in large finite fields. This generates further interests to explore hardware-efficient designs high-permance multiplication over large finite fields. A Toeplitz matrix is a matrix in which elements of each descending diagonal from left to right are identical. It is encountered in many signal processing and image processing applications. It is shown that Toeplitz matrix-vector product (TMVP) approach can lead to efficient hardware architecture multiplication in finite fields based on normal basis (NB) [2] [4], shifted polynomial basis (SPB) [5], [6], and dual basis (DB) (or modified polynomial basis, MPB) [7] [9]. In binary extension fields, multiplication is a twostep operation: naive polynomialmultiplicationfollowedbyreduction using the irreducible polynomial, which generates the field. The naive polynomial multiplication involves space complexity and delay. When the chosen is a low-weight irreducible polynomial, such trinomials and pentanomials, the reduction is a simple operation. Theree, naive polynomial multiplication is generally considered as the major contributor to the hardware implementation of multiplication. The complexity of naive polynomial multiplication is reduced by employing the divide-and-conquer techniques, such as Karatsuba-Ofman algorithm (KA) [10], [11], Toom-Cook algorithm [12], and TMVP decomposition [13], from to,. Recently, Lee et al. [14] have proposed a generalized -way KA decomposition with, which is suitable implementing subquatratic digit-serial multiplication. Based on KA decomposition, multi-partite digit-serial multiplier is introduced in [15]. Recently, it is shown that three-operand multiplication can provide area-delay efficient architectures applications involving successive multiplications, e.g., exponentiation, inversion, and elliptic curve point multiplication. Three-operand multiplication based on KA decomposition is suggested in [16], and is also shown that M-ary exponentiation using three-operand multiplication can be permed in nearly multiplication steps. Fast inversion based on Gaussian normal basis double-multiplication is suggested in [17]. Multi-operand multiplication has been found to be useful hardware implementation of high-permance applications IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See more inmation.

2 LEE AND MEHER: EFFICIENT SUBQUADRATIC SPACE COMPLEXITY ARCHITECTURES 855 Using divide-and-conquer method, (1) can be expressed as (2) Fig. 1. The subquadratic TMVP multiplier architecture [13]. In this paper, we extend the TMVP decomposition of [13] to derive 2-way and 3-way recursive TMVP (RTMVP) decomposition schemes the implementation of three-operand multiplication. Using the RTMVP decomposition approach, in the well-known MPB field representation, we explore a novel areatime efficient multiplication scheme in binary extension fields. We show here that three-operand MPB multiplication can be efficiently realized by the proposed RTMVP mulation. The proposed RTMVP decomposition is found to be a suitable match three-operand modified polynomial basis (MPB) multiplier with subquadratic space complexity. By theoretical analysis, we show that the proposed subquadratic single- and double-multiplications have less computation time and less space complexity compared to the existing subquadratic multipliers. The rest of this paper is organized as follows. Section II presents the preliminaries regarding 2-way and 3-way TMVP. In Section III, we present our proposed new RTMVP decompositions. In Section IV, we extend the well-known MPB representation to derive a new shifted MPB (SMPB) representation. Besides, in this section, we present the architecture of the proposed three-operand multiplication based on MPB and SMPB to be used in the RTMVP mulation. In Section V, time- and space-complexities are analyzed. Finally, we conclude the paper in Section VI. II. REVIEW OF TOEPLITZ MATRIX-VECTOR PRODUCT DECOMPOSITION In linear algebra, an Teoplitz matrix is a matrix with the property of,. The Toeplitz matrix-vector product is widely applied to compute multiplications in finite fields based on dual basis (DB), shifted polynomial basis, and normal basis. A Toeplitz matrix has the following property: Proposition 1: An Toeplitz matrix is determined by the entries in the first row and the first column. We can use the vector todefineatoeplitzmatrix. Using such vector representation, the addition requires XOR gates if and are two Toeplitz matrices. In the following, we briefly review 2-way and 3-way TMVP decompositions in [13]. A. 2-Way TMVP Decomposition Let be a column vector and the matrix vector be used to define a Toeplitx matrix, and are two column vectors, and,, and are three Toeplitz matrices. A Toeplitz matrix-vector product in this case is given by The original TMVP involves four sub-tmvps but the TMVP in (2) involves three sub-tmvps. The TMVP in (2), theree, provides better permance than the original TMVP in (2). Based on the decomposition scheme of (1), we can recursively generate four components [component matrix point (CMP), component vector point (CVP), point-wise multiply (PWM), and reconstruction (R)] of reduced size matrices as As mentioned above, Fig. 1 shows subquadratic TMVP architecture, which involves three stages: the evaluation point generation (EPG) stage, the point-wise multiplication (PWM) stage, and the reconstruction (R) stage. The EPG stage is to perm EMP(T) and EVP(V), the PWM stage computes,andther stage perms the operation. Let symbols and denote space and delay, respectively. Let and in the case of denote the number of bit-multiplications and the number of bit-additions required TMVP multiplication, and and denote the number of AND gate delay and the number of XOR gate delay required TMVP multiplication. In [13], Fan and Hasan have shown that, 2-way TMVP decomposition, CMP component involves XOR gates and delay, CVP component involves XOR gates and delay, PWM component involves space complexity and delay, and R component involves XOR gates and delay. Accordingly, we have obtained the following recurrences on complexities: Lemma 1: Assuming and to be two positive integers which satisfy, the solution of the recurrence relation and is, and are integer constants. Lemma 2: Let,,and be three integers with and, the solution of the recurrence relation and is. We utilize Lemmas 1 and 2 to solve the recurrence equations in (3). The time and space complexities of 2-way TMVP decomposition can be expressed as follows: (3) (1)

3 856 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015 B. 3-Way TMVP Decomposition Let be a column vector and the matrix vector be a Toeplitz matrix, each of s(,1,2)isa column vectors, and each of s(,1,2,3,4)isa Toeplitz matrix. The product can be rewritten as in (4) denotes the sum of.from(4),we can find that the 3-way TMVP decomposition each recurrence involves 6 sub-tmvps. Accordingly, we have obtained the complexities of 3-way TMVP decomposition as follows: III. PROPOSED RECURSIVE TOEPLITZ MATRIX-VECTOR PRODUCT DECOMPOSITION In this section, we extend the TMVP decomposition to derive two new recursive TMVP (RTMVP) decompositions computing three-operand multiplication. For the convenience of presentation, we use the term 3-mult to refer the three-operand multiplication in the rest of the paper. We have derived the proposed 2-way and 3-way RTMVP decompositions as follows. A. 2-Way RTMVP Decomposition Let and be two Toeplitz matrices defined by and, sand s are Toeplitz matrices. Let be a column vector, and are column vectors. Here we consider the 3-mult of the m as a recursive TMVP multiplications. To realize 3-mult computation, let us define a two-step operation: and. Based on 2-way TMVP decomposition, the intermediate product is obtained in the first step as From TMVP computation in (5), we get the intermediate product as a vector. Next, we again use the 2-way TMVP decomposition to compute, then the product can be obtained as follows: The 3-mult in (6) involves 6 three-operand sub-rtmvps (,,,,,and ). We can use this mula to iteratively decompose the 3-mult. Based (4) (5) (6) on the proposed multiplication using three individual stages (the evaluation point generation (EPG) stage, the point-wise product (PWM) stage, and the reconstruction (R) stage), Fig. 2 shows the implementation of a 3-mult using 2-way RTMVP identity. Using a cascaded product approach, the six 3-mults in (6) can be rewritten as,,,,,and,, and.accordingly, the six 3-mults in the PWM stage in Fig. 2 involves three multiplier-1 and six multiplier-2. Multiplier-1 calculates and. Multiplier-2 calculates. Now, we define the complexity parameters in Table I to calculate the complexity of the proposed algorithm. In the following we estimate the complexities of three individual stages in Fig. 2: EPG Stage. Based on 6 three-operand sub-rtmvps in this stage, we compute two evaluation matrix points (EMP) and one evaluation vector point (EVP). EMP component: For any two matrices and,we can define and, respectively, each of and is a Toeplitz matrix. In [13], it is shown that XOR gates are required to generate, which involves two additions: and. Accordingly, the computation of involves the space complexity of XOR gates and the computation time of delay. And the complexity of is the same of.theree, and in total require XOR gates, and involve delay. EVP component: The column vector is split into two parts,, and are two column vectors. From (6), the EVP component is generated by. Theree, the complexity of involves XOR gates and delay. PWM Stage. The proposed splitting method (as given in (6)) involves three multiplier-1 and six multiplier-2 in each step. Note that we use cascaded structure to implement the 3-mult, and each multiplier-1 is associated with two multiplier-2. For example, multiplier-1 computes, and the result is used as input operand of two multiplier-2 computing and. Based on this, the complexity of computing a 3-mult can be expressed as XOR gates and AND gates. Theree, the complexity of PWM unit involves AND gates and XOR gates. It involves delays. R Stage. Each subproduct with is a RTMVP. The width of each of these subproducts is theree bits. The R stage needs XOR gates to evaluate,andrequires delay. Based on the above analysis, we have estimated the XOR and the AND complexities involved in different steps as follows: (7) (8)

4 LEE AND MEHER: EFFICIENT SUBQUADRATIC SPACE COMPLEXITY ARCHITECTURES 857 can be obtained ac- Substituting (13) into (14), the product cording to the following mulation. (15) Fig. 2. The proposed 2-way RTMVP decomposition. In the following, we analyze the complexity of three stages of 3-mult according to (15). In the EPG stage, we find three components as TABLE I LISTS THE SYMBOL PARAMETERS FOR ESTIMATING THE COMPLEXITY OF THE PROPOSED MULTIPLIER Note: denotes the number of bit-length polynomial. For estimating time complexity, the evaluation step requires delay, the PWM step requires delay, and the reconstructionsteprequires delays. Consequently, we can obtain the following recursive relation of the time complexity: Using Lemmas 1 and 2 to solve the recursive (7), (8), and (9), we can find the following complexities 3-mult using 2-way RTMVP decomposition. (9) (10) (11) (12) B. 3-Way RTMVP Decomposition Let and be two Toeplitz matrices defined by and, sand sare Toeplitz matrices. Let be a column vector,, and are column vectors. The 3-mult of the m uses a recursive TMVP decomposition, such as and. Based on 3-way TMVP decomposition, the intermediate product is obtained as (16) (17) (18) Using Lemma 3 in Appendix, the complexity of involves XOR gates. In [13], it is shown that the matrix additions (, and ) in (17) involves XOR gates and delays; and the vector additions in (18) involves XOR gates and delays. Thus, the EPG unit in total requires XOR gates and delay. The PWM stage needs 18 3-mults, and the R stage involves 15 vector additions. Out of 18 3-mults involved in (15), each multiplier-1 is associated with three multiplier-2. Theree, 18 3-mults can be clustered into 6 groups in PWM stage. For example,, and,, can be clustered to m a group of 3-mults. In this group, multiplier-1 is used to compute, three multiplier-2 are used to compute, and. In this case, the complexity of 3-mult can be defined as XOR gates and AND gates. Theree, the PWM unit in each step involves XOR gates and AND gates, and requires gate delays the computation. Since each 3-mult produces a -bit product word, we need XOR gates the R stage, which involves delay. Theree, we obtain the following expressions of the complexities. (19) Using Lemmas 1 and 2 to solve the recursive (19), subquadratic 3-mult based on 3-way RTMVP decomposition can be found to have the following complexities. (20) Again we use 3-way TMVP decomposition to compute as (13) (14) IV. NEW SUBQUADRATIC MPB SINGLE- AND DOUBLE-MULTIPLICATIONS BASED ON TMVP AND RTMVP DECOMPOSITIONS Toeplitz matrix vector product approach can be used efficient realization of multiplications in binary extension fields, some special classes of basis representation, such as shifted polynomial basis, modified polynomial basis, and dual basis.

5 858 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015 In general, multiplication using double basis representation involves TMVP multiplier and basis conversion from/to the polynomials, and the cost of basis conversion, which depends on the chosen irreducible polynomial.amongthem,thempb representation involves significantly less space complexity if is a trinomial. For the sake of simplicity, we use trinomials to derive a new double basis representation and make use of efficient implementation of three-operand multiplication. We use the matrix to perm the basis conversion as is the invertible matrix defined as (23) (24) A. Formulation of New Shifted MPB Representation Let the field be constructed from an irreducible trinomial of the m, the MPB in [18] is defined as follows. Definition 1: Let be the polynomial basis (PB) of, the intermediate is the root of irreducible trinomial. We can define that the ordered set is called the modified polynomial basis with respect to the set. In the context of this basis representation, the MPB is equivalent to the revised order sequence of the triangular basis in [19]. From the mulation of the MPB, we can find that can be represented by mod,and mod.forthis reason, we can define a new shifted MPB as follows. Definition 2: If a given set is the MPB trinomial, then we can define the ordered set to be the shifted MPB (SMPB). Example 1: Let the field be constructed from the irreducible trinomial. We can find that the set is the MPB, and the set is the SMPB. Here, we discuss the basis conversion between MPB and SMPB. Assume that a field is constructed from.let and be two elements in represented by MPB and SMPB representations, respectively. From Definitions 1 and 2, we have obtained and. Moreover, from, we have obtained. Thus, by basis conversion from SMPB to MPB, we can obtain B. MPB Multiplication In this subsection, we use two bases MPB and SMPB to derive a new double basis multiplication. We have shown that the basis conversion in (21) does not involve any cost hardware implementation, and, theree, the double basis multiplication is equivalent to MPB multiplication. Let and be two polynomials in represented by MPB and SMPB, respectively. From Definition 2, we obtain, and the polynomial can be represented by (25) Assuming that the polynomial is represented by MPB, and is the product of and. It can be rewritten as (26) Based on the definition of MPB (Definition 1), we have obtained the following algebraic relations: (27) Let us denote that mod,. By using the algebra of (27), can be computed as (28) Thus, based on (26), the product using matrix-vector representation can be obtained according to the following mulation. (21) It can be noted that the basis conversion from SMPB to MPB is given by the permutation of the coordinate coefficients of the element Based on the basis conversion in (21), we can define the transmation matrix as (22) (29) Matrix in (29) can be transmed into a Toeplitz matrix. For clarity, we illustrate a double basis multiplication mod in Example 2. Example 2: Let a field be generated by. Assume that and are two elements in represented by MPB and SMPB, respectively. We can pre-compute

6 LEE AND MEHER: EFFICIENT SUBQUADRATIC SPACE COMPLEXITY ARCHITECTURES 859 Fig. 3. (a) Traditional MPB multiplier [7]; (b) The proposed MPB multiplier based on subquadratic TMVP multiplier architecture of Fig. 1., and.using(29), the product mod can be computed as follows. Referring to the above, to obtain the matrix,itisrequired to compute the four terms, which involves 4 XOR gates and delay. Fig. 3(b) shows the proposed MPB multiplier, which involves a TMVP multiplier, a matrix transm unit, and a pre-computation circuit. Note that, in our proposed architecture, the matrix transmation unit the implementation of the basis conversion from MPB to SMPB does not involve any hardware cost, but traditional MPB multiplier [7] requires XOR gates to perm the basis conversion from the MPB to the PB. C. Three-Operand MPB Multiplier Using RTMVP Scheme In Section III, we have derived the proposed three-operand multiplication algorithm using RTMVP decomposition. Based on MPB multiplication, we derive here a new three-operand multiplication architecture using the proposed RTMVP decomposition. Let,, and be four elements in,,, and are represented by MPB, is represented by SMPB, and mod. For three-operand multiplication, the product requires two multipliers in cascade, the first multiplier computes mod, and the second multiplier computes an MPB multiplication mod. Based on the structure of double basis multiplier, we can use Lemma 4 (in Appendix) to obtain the MPB multiplication with respect to Toeplitz matrix-vector product. Thus, the first multiplier directly uses the proposed double basis multiplication in (29) to produce the intermediate result, which can be used the computation of mod to perm the three-operand multiplication, provides the first multiplication result to convert the basis transmation from the MPB to the SMPB. Theree, based on Lemma 4, the three-operand multiplication can be expressed alternatively as (30) As mentioned above, it is shown that three-operand multiplication using the proposed double basis multiplier can be realized by an architecture with RTMVP approach. Theree, 3-mult Fig. 4. (a) The proposed three-operand MPB multiplier based on RTMVP decomposition of Fig. 2. (b) The three-operand multiplier using traditional MPB multiplier approach using Subquadratic TMVP multiplier of Fig. 1. TABLE II COMPLEXITIES OF BCC AND PCC UNITS FOR THE PROPOSED AND THE EXISTING MPB MULTIPLIERS using the proposed RTMVP decomposition can be used to derive the architecture of multiplier with subquadratic space complexity. Fig. 4(a) shows the proposed three-operand multiplier, which involves one RTMVP multiplier and two pre-computation units. Each pre-computation unit involve XOR gates. The proposed architecture multi-operand multiplication can be used hardware and time efficient realization of applications involving successive multiplications, such as inversion, exponentiation, and pairing computation. Typically, dual basis multiplication is realized efficiently by Toeplitz matrix-vector product structure, as shown in Fig. 3(a). The traditional MPB multiplier [7] involves a TMVP multiplier, a basis conversion circuit, and a pre-computation circuit. In [7], it is shown that the MPB is a dual basis mulation respect to PB, and the implementation of double basis multiplication requires basis conversion from MPB to PB if input operands are in MPB representation. The proposed dual basis multiplication has the following advantages compared to traditional dual basis multiplier [7]. 1) The proposed double basis multiplication is based on two bases MPB and SMPB, while traditional double basis multiplication is based on two bases MPB and PB. 2) The basis conversion circuit (BCC) our proposed method is permed from MPB to SMPB, which does not involve any cost hardware implementation. The BCC [7] is permed from MPB to PB. In Table II, we have listed the complexity of the BCC all trinomials.

7 860 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015 TABLE III COMPARISON OF SELECTED SUBQUADRATIC PARALLEL MULTIPLIERS FOR TRINIMIALS WITH AND TABLE IV COMPARISON OF SUBQUADRATIC SELECTED PARALLEL MULTIPLIERS FOR COMPUTING THREE-OPERAND MULTIPLICATION FOR TRINIMIALS WITH AND 3) The pre-computation circuit (PCC) is used to compute all entries of Toeplitz matrix. As shown in Example 2, our proposed PCC involves XOR gates and delay, while the complexity of the PCC [7] depends on the selected trinomial, as shown in Table II. 4) Proposed RTMVP approach can be used multi-operand multiplication scheme, while traditional MPB multiplication scheme [as shown in Fig. 4(b)] cannot be used efficient multi-operand multiplication. V. COMPLEXITY ANALYSIS In this section, we analyze the complexity of MPB single- and double-multipliers based on TMVP and RTMVP decompositions, respectively, and compare the corresponding subquadratic multipliers. A. Comparison of Subquadratic Single-Multipliers We use two different bases, MPB and SMPB, to build an efficient single-multiplier [Fig. 3(b)] the field based on MPB, while traditional MPB multiplier [7] [Fig. 3(a)] is based on PB and MPB. Both multipliers have similar architectures, which involve three units such as TMVP multiplier, basis conversion circuit (BCC), and pre-computation circuit (PCC). Although traditional MPB multiplier is derived quadrinomial basis, its architecture is suitable trinomial basis. Here, we analyze the complexity of both multipliers trinomial basis. We assume that the TMVP multiplier is realized by non-recursive TMVP decomposition approach to develop a subquadratic multiplier. Theree, we compare BCC and PCC units only both multipliers, as shown in Table II. The BCC unit of our proposed MPB multiplier does not involve any cost hardware implementation, while traditional MPB multiplier involves additional hardware basis conversion. The delay of our architecture the case of with is less by than traditional MPB multiplier. Other subquadratic parallel multipliers are suggested in [20] Winograd algorithm, and in [21] Karatsuba algorithm. Those algorithms are based on the naive polynomial multiplication to explore an efficient subquadratic space complexity architecture. In [22], it is shown that the reduction polynomial stage trinomial with involves XOR gates and requires gate delays. We use the reduction polynomial stage in [22] to add Winograd and Karatsuba algorithms evaluating the complexity of multiplication. Table III compares the complexities of the proposed and the existing subquadratic parallel multipliers. As shown in this table, our proposed MPB multiplier has significantly less computation time and less space complexity compared to the existing subquadratic multipliers. B. Comparison of Subquadratic Double-Multipliers It is well-known that the subquadratic algorithms are derived from two-operand multiplication. For fast three-operand multiplication, the subquadratic algorithms require two separate multipliers in cascade. Multiplier of Lee et al. [16] is based on recursive Karatsuba algorithm to explore subquadratic three-operand multiplication. Our proposed double-multiplication scheme is based on RTMVP decomposition to derive subquadratic space complexity architecture. Table IV lists the complexity of the proposed and the exiting subquadratic multipliers [13], [16], [20], [21] three-operand multiplication. As shown in this table, TMVP-based multiplier [13] is better than other existing subquadratic architectures. The proposed 2-way RTMVP-based multiplier has nearly 25% less delays and about 9% less space complexity compared to those of the existing 2-way subquadratic multipliers. The proposed 3-way RTMVP-based multiplier has less delay and slightly more space

8 LEE AND MEHER: EFFICIENT SUBQUADRATIC SPACE COMPLEXITY ARCHITECTURES 861 TABLE V COMPARISON OF VARIOUS SUBQUADRATIC DOUBLE-MULTIPLIERS OVER IN THE TERMS OF DELAY,NUMBER OF GATES, AND TOTAL GE GE denotes gate equivalent in terms of number of 2-input NAND gates. complexity compared to those of existing 3-way TMVP-based multiplier [13]. We consider three fields based on the existing trinomials of degree,suchas, and, the comparison of our proposed and the existing subquadratic multipliers three-operand multiplication. In order to reduce the complexity, hybrid subquadratic multiplication approach is introduced in [10], which combines 2-way and 3-way decomposition schemes. By this approach, we use the hybrid of 2-way and 3-way decompositions to construct the proposed and the existing subquadratic multipliers based on the field order of the m,,,and, respectively, synthesis purpose. We have used the NanGate's Library Creator and the 45-nm FreePDK Base Kit from North Carolina State University (NCSU) [23] to synthesize the proposed double-multiplier and the corresponding existing multipliers. From the synthesis results, we obtain the delay, the number of gates, and total GE (gate equivalent), as shown in Table V. In this table, the total GE is estimated based on the used cell area, i.e., a NAND gate, a AND gate ( and ), and a XOR gate ( and ). We find that our proposed double-multiplier has less space complexity compared to the best of the existing subquadratic multipliers the selected field order of the m is with significantly less time complexity. VI. CONCLUSIONS In this paper, we have derived a new SMPB representation. The proposed MPB multiplication is constructed from MPB and SMPB, while traditional MPB multiplier is constructed from PB and MPB. We have shown that the basis conversion from MPB to SMPB does not involve any cost hardware implementation. We have proposed three-operand MPB multiplication using a new mulation of the RTMVP decomposition. We have used the traditional TMVP decomposition to propose two new 2-way and 3-way RTMVP decompositions. The proposed MPB single- and double-multiplications use TMVP and RTMVP decompositions, respectively, to achieve subquadratic space complexity architectures. Note that, based on the proposed MPB multiplication, we can derive a subquadratic multiplier using RTMVP decomposition, while traditional MPB multiplication cannot utilize RTMVP decomposition computing three-operand multiplication. From the theoretical analysis, it is shown that the proposed subquadratic multipliers have significantly less computation time and less space complexity compared to the existing subquadratic multipliers based on Karatsuba algorithm, Winograd algorithm, and splitting TMVP algorithm. Moreover, our proposed three-operand multiplier can be used several applications such as exponentiation, inversion, and elliptic curve point multiplication. APPENDIX Lemma 3: The matrix additions (,,,,,,and ) in (16) can be permed using XOR gates. Proof: Let. Based on Proposition 1, we can use polynomial to represent the corresponding matrix. The matrix is split into block Toeplitz matrix, such as. Using polynomial representation, the five block matrices (,1,2,3,4) can be represented by. We can theree have (31) Note that the matrix additions of this m are involved in computation of,,,,,,and. Based on (31), we can find the reused terms as follows: 1) The term in also appears 2) The term in also appears 3) The term in also appears 4) The term in also appears 5) The term in also appears The seven matrices,,,,,,and can be computed by XOR gates. Lemma 4: Assume that the MPB multiplication is based on matrix-vector product approach and expressed in the m, and we have,, is generated by,and is a matrix transmation from the SMPB to thempb. Proof: in are represented by the MPB polynomials,,andthematrix is generatedby. Using (23), we can have, perms the basis conversion from SMPB into MPB. Thus, the MPB multiplication can be written as. Since double basis multiplication is a Toeplitz matrix-vector product given by, we can find.

9 862 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 62, NO. 3, MARCH 2015 REFERENCES [1] U. Bose, A. K. Bhattacharya, and A. Das, GPU-based implementation of 128-Bit secure eta pairing over a binary field, in Proc. AFRICACRYPT, 2013, pp [2] C.-Y. Lee and C. W. Chiou, Scalable Gaussian normal basis multipliers over using Hankel matrix-vector representation, Signal Process. Syst., vol. 69, no. 2, pp , [3] C.-Y. Lee, Y.-H. Chen, C. W. Chiou, and J.-M. Lin, Unified parallel systolic multiplier over, J. Comput. Sci. Technol. vol. 22, no. 1, pp , 2007 [Online]. Available: s [4] C.-Y. Lee and C. W. Chiou, Efficient design of low-complexity bit-parallel systolic Hankel multipliers to implement multiplication in normal and dual bases of, IEICE Trans., vol. 88-A, no. 11, pp , [5] C.-Y. Lee, Low-complexity parallel systolic montgomery multipliers over using toeplitz matrix-vector representation, IEICE Trans., vol. 91-A, no. 6, pp , [6] J. Han and H. Fan, Toeplitz matrix-vector product based shifted polynomial basis multipliers all irreducible pentanomials, IACR Cryptol. eprint Archive, vol. 2013, p. 427, 2013 [Online]. Available: [7] M. A. Hasan, A. H. Namin, and C. Nègre, Toeplitz matrix approach binary field multiplication using quadrinomials, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 20, no. 3, pp , [8] J.-S. Pan, R. Azarderakhsh, M. M. Kermani, C.-Y. Lee, W.-Y. Lee, C. W. Chiou, and J.-M. Lin, Low-latency digit-serial systolic double basis multiplier over using subquadratic toeplitz matrix-vector product approach, IEEE Trans. Comput., vol. 63, no. 5, pp , [9] S.-M. Park and K.-Y. Chang, Fast bit-parallel shifted polynomial basis multiplier using weakly dual basis over, IEEE Trans. VLSI Syst., vol. 19, no. 12, pp , [10] A. Weimerskirch and C. Paar, Generalizations of the Karatsuba Algorithm efficient implementations, Univ. Ruhr, Bochum, Germany, 2003, Tech. Rep.. [11] Y. Li, G. Chen, and J. Li, Speedup of bit-parallel Karatsuba multiplier in generated by trinomials, Inf. Process. Lett.,vol.111,no. 8, pp , [12] M. Bodrato, Towards optimal toom-cook multiplication univariate and multivariate polynomials in characteristic 2 and 0, in Proc. WAIFI, 2007, pp [13] H. Fan and M. Hasan, A new approach to subquadratic space complexity parallel multipliers extended binary fields, IEEE Trans. Comput., vol. 56, no. 2, pp , [14] C.-Y. Lee, C.-S. Yang, B. K. Meher, P. K. Meher, and J.-S. Pan, Low-complexity digit-serial and scalable SPB/GPB multipliers over large binary extension fields using (b, 2)-way Karatsuba decomposition, IEEE Trans. Circuits Syst. I, Reg. Papers vol. 61, no. 11, pp , 2014 [Online]. Available: [15] J.-S. Pan, C.-Y. Lee, and P. K. Meher, Low-latency digit-serial and digit-parallel systolic multipliers large binary extension fields, IEEETrans.CircuitsSyst.I,Reg.Papers, vol. 60, no. 12, pp , [16] C.-Y. Lee, P. K. Meher, and C.-P. Chang, Efficient M-ary exponentiation over using subquadratic KA-based three-operand montgomery multiplier, IEEETrans.CircuitsSyst.I,Reg.Papers, vol. 61, no. 11, pp , [17] R. Azarderakhsh, K. Järvinen, and V. S. Dimitrov, Fast inversion in with normal basis using hybrid-double multipliers, IEEE Trans. Comput., vol. 63, no. 4, pp , [18] C. Nègre, Quadrinomial modular arithmetic using modified polynomial basis, in Proc. ITCC, 2005, pp [19] R. Furness, S. Fenn, and M. Benaissa, Multiplication using the triangular basis representation over, in Proc. Global Telecommun. Conf. (GLOBECOM'96), vol. 2, pp [20] B. Sunar, A generalized method constructing subquadratic complexity multipliers, IEEE Trans. Comput., vol. 53, no. 9, pp , [21] C. Paar, A new architecture a parallel finite field multiplier with low complexity based on composite fields, IEEE Trans. Comput., vol. 45, no. 7, pp , [22] B. Sunar and C. K. Koç, Mastrovito multiplier all trinomials, IEEE Trans. Comput., vol. 48, no. 5, pp , [23] Nangate Standard Cell Library [Online]. Available: org/openeda.si2.org/projects /nangatelib/ Chiou-Yng Lee (SM 07) received the B.S. degree in medical engineering and the M.S. degree in electronic engineering, both from the Chung Yuan Christian University, Taiwan, in 1986 and 1992, respectively, and the Ph.D. degree in electrical engineering from Chang Gung University, Taiwan, in From 1988 to 2005, he was a Research Associate with Chunghwa Telecommunication Laboratory, Taiwan. From 2001 to 2005, he taught courses related to finite fields at Ching Yun University. Currently, he is a Professor in the Department of Computer Inmation and Network Engineering at Lunghwa University of Science and Technology, Taiwan. His research interests include computations in finite fields, error-control coding, signal processing, and digital transmission systems. He is a Senior Member of the IEEE and the IEEE Computer society. Pramod Kumar Meher (SM 03) received the B.Sc. (Honours) and M.Sc. degree in physics, and the Ph.D. degree in science from Sambalpur University, India, in 1976, 1978, and 1996, respectively. Currently, he is a Senior Research Scientist with Nanyang Technological University, Singapore. Previously, he was a Professor of Computer Applications with Utkal University, India, from 1997 to 2002, and a Reader in electronics with Berhampur University, India, from 1993 to His research interest includes design of dedicated and reconfigurable architectures computation-intensive algorithms pertaining to signal, image and video processing, communication, bio-inmatics and intelligent computing. He has contributed more than 200 technical papers to various reputed journals and conference proceedings. Dr. Meher has served as a speaker the Distinguished Lecturer Program (DLP) of IEEE Circuits and Systems Society during 2011 and 2012, Associate Editor the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS during 2008 to 2011, and Associate Editor the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS during Currently, he is serving as Associate Editor the IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, thejournal of Circuits, Systems, and Signal Processing,andIntegration, the VLSI Journal. Dr. Meher is a Fellow of the Institution of Electronics and Telecommunication Engineers, India. He was the recipient of the Samanta Chandrasekhar Award excellence in research in engineering and technology He has received the 2013 Sydney R. Parker Best Paper Award in the area of signal processing, and the 2013 M.N.S. Swamy Award being the best paper amongst all the papers published in the Journal of Circuits, Systems, and Signal Processing.

ALONG with the progressive device scaling, semiconductor

ALONG with the progressive device scaling, semiconductor IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 57, NO. 4, APRIL 2010 285 LUT Optimization for Memory-Based Computation Pramod Kumar Meher, Senior Member, IEEE Abstract Recently, we

More information

LUT Optimization for Memory Based Computation using Modified OMS Technique

LUT Optimization for Memory Based Computation using Modified OMS Technique LUT Optimization for Memory Based Computation using Modified OMS Technique Indrajit Shankar Acharya & Ruhan Bevi Dept. of ECE, SRM University, Chennai, India E-mail : indrajitac123@gmail.com, ruhanmady@yahoo.co.in

More information

OMS Based LUT Optimization

OMS Based LUT Optimization International Journal of Advanced Education and Research ISSN: 2455-5746, Impact Factor: RJIF 5.34 www.newresearchjournal.com/education Volume 1; Issue 5; May 2016; Page No. 11-15 OMS Based LUT Optimization

More information

Implementation of Memory Based Multiplication Using Micro wind Software

Implementation of Memory Based Multiplication Using Micro wind Software Implementation of Memory Based Multiplication Using Micro wind Software U.Palani 1, M.Sujith 2,P.Pugazhendiran 3 1 IFET College of Engineering, Department of Information Technology, Villupuram 2,3 IFET

More information

Design of Memory Based Implementation Using LUT Multiplier

Design of Memory Based Implementation Using LUT Multiplier Design of Memory Based Implementation Using LUT Multiplier Charan Kumar.k 1, S. Vikrama Narasimha Reddy 2, Neelima Koppala 3 1,2 M.Tech(VLSI) Student, 3 Assistant Professor, ECE Department, Sree Vidyanikethan

More information

A Reed Solomon Product-Code (RS-PC) Decoder Chip for DVD Applications

A Reed Solomon Product-Code (RS-PC) Decoder Chip for DVD Applications IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 36, NO. 2, FEBRUARY 2001 229 A Reed Solomon Product-Code (RS-PC) Decoder Chip DVD Applications Hsie-Chia Chang, C. Bernard Shung, Member, IEEE, and Chen-Yi Lee

More information

A Novel Architecture of LUT Design Optimization for DSP Applications

A Novel Architecture of LUT Design Optimization for DSP Applications A Novel Architecture of LUT Design Optimization for DSP Applications O. Anjaneyulu 1, Parsha Srikanth 2 & C. V. Krishna Reddy 3 1&2 KITS, Warangal, 3 NNRESGI, Hyderabad E-mail : anjaneyulu_o@yahoo.com

More information

Optimization of memory based multiplication for LUT

Optimization of memory based multiplication for LUT Optimization of memory based multiplication for LUT V. Hari Krishna *, N.C Pant ** * Guru Nanak Institute of Technology, E.C.E Dept., Hyderabad, India ** Guru Nanak Institute of Technology, Prof & Head,

More information

LUT Design Using OMS Technique for Memory Based Realization of FIR Filter

LUT Design Using OMS Technique for Memory Based Realization of FIR Filter International Journal of Emerging Engineering Research and Technology Volume. 2, Issue 6, September 2014, PP 72-80 ISSN 2349-4395 (Print) & ISSN 2349-4409 (Online) LUT Design Using OMS Technique for Memory

More information

Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture

Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture Design and Implementation of Partial Reconfigurable Fir Filter Using Distributed Arithmetic Architecture Vinaykumar Bagali 1, Deepika S Karishankari 2 1 Asst Prof, Electrical and Electronics Dept, BLDEA

More information

THE USE OF forward error correction (FEC) in optical networks

THE USE OF forward error correction (FEC) in optical networks IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 8, AUGUST 2005 461 A High-Speed Low-Complexity Reed Solomon Decoder for Optical Communications Hanho Lee, Member, IEEE Abstract

More information

International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013

International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013 International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013 Design and Implementation of an Enhanced LUT System in Security Based Computation dama.dhanalakshmi 1, K.Annapurna

More information

Memory efficient Distributed architecture LUT Design using Unified Architecture

Memory efficient Distributed architecture LUT Design using Unified Architecture Research Article Memory efficient Distributed architecture LUT Design using Unified Architecture Authors: 1 S.M.L.V.K. Durga, 2 N.S. Govind. Address for Correspondence: 1 M.Tech II Year, ECE Dept., ASR

More information

Implementation of Low Power and Area Efficient Carry Select Adder

Implementation of Low Power and Area Efficient Carry Select Adder International Journal of Engineering Science Invention ISSN (Online): 2319 6734, ISSN (Print): 2319 6726 Volume 3 Issue 8 ǁ August 2014 ǁ PP.36-48 Implementation of Low Power and Area Efficient Carry Select

More information

Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method

Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method Reconfigurable FPGA Implementation of FIR Filter using Modified DA Method M. Backia Lakshmi 1, D. Sellathambi 2 1 PG Student, Department of Electronics and Communication Engineering, Parisutham Institute

More information

An Efficient Reduction of Area in Multistandard Transform Core

An Efficient Reduction of Area in Multistandard Transform Core An Efficient Reduction of Area in Multistandard Transform Core A. Shanmuga Priya 1, Dr. T. K. Shanthi 2 1 PG scholar, Applied Electronics, Department of ECE, 2 Assosiate Professor, Department of ECE Thanthai

More information

Efficient Implementation of Multi Stage SQRT Carry Select Adder

Efficient Implementation of Multi Stage SQRT Carry Select Adder International Journal of Research Studies in Science, Engineering and Technology Volume 2, Issue 8, August 2015, PP 31-36 ISSN 2349-4751 (Print) & ISSN 2349-476X (Online) Efficient Implementation of Multi

More information

Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA

Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA Volume-6, Issue-3, May-June 2016 International Journal of Engineering and Management Research Page Number: 753-757 Implementation and Analysis of Area Efficient Architectures for CSLA by using CLA Anshu

More information

Keywords Xilinx ISE, LUT, FIR System, SDR, Spectrum- Sensing, FPGA, Memory- optimization, A-OMS LUT.

Keywords Xilinx ISE, LUT, FIR System, SDR, Spectrum- Sensing, FPGA, Memory- optimization, A-OMS LUT. An Advanced and Area Optimized L.U.T Design using A.P.C. and O.M.S K.Sreelakshmi, A.Srinivasa Rao Department of Electronics and Communication Engineering Nimra College of Engineering and Technology Krishna

More information

Area-efficient high-throughput parallel scramblers using generalized algorithms

Area-efficient high-throughput parallel scramblers using generalized algorithms LETTER IEICE Electronics Express, Vol.10, No.23, 1 9 Area-efficient high-throughput parallel scramblers using generalized algorithms Yun-Ching Tang 1, 2, JianWei Chen 1, and Hongchin Lin 1a) 1 Department

More information

ISSN (Print) Original Research Article. Coimbatore, Tamil Nadu, India

ISSN (Print) Original Research Article. Coimbatore, Tamil Nadu, India Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 016; 4(1):1-5 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources) www.saspublisher.com

More information

An MFA Binary Counter for Low Power Application

An MFA Binary Counter for Low Power Application Volume 118 No. 20 2018, 4947-4954 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu An MFA Binary Counter for Low Power Application Sneha P Department of ECE PSNA CET, Dindigul, India

More information

/$ IEEE

/$ IEEE 1960 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 56, NO. 9, SEPTEMBER 2009 A Universal VLSI Architecture for Reed Solomon Error-and-Erasure Decoders Hsie-Chia Chang, Member, IEEE,

More information

Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL

Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL Journal From the SelectedWorks of Kirat Pal Singh Summer May 18, 2016 Design of Carry Select Adder using Binary to Excess-3 Converter in VHDL Brijesh Kumar, Vaagdevi college of engg. Pune, Andra Pradesh,

More information

FPGA Hardware Resource Specific Optimal Design for FIR Filters

FPGA Hardware Resource Specific Optimal Design for FIR Filters International Journal of Computer Engineering and Information Technology VOL. 8, NO. 11, November 2016, 203 207 Available online at: www.ijceit.org E-ISSN 2412-8856 (Online) FPGA Hardware Resource Specific

More information

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015 Optimization of Multi-Channel BCH Error Decoding for Common Cases Russell Dill Master's Thesis Defense April 20, 2015 Bose-Chaudhuri-Hocquenghem (BCH) BCH is an Error Correcting Code (ECC) and is used

More information

Keywords- Discrete Wavelet Transform, Lifting Scheme, 5/3 Filter

Keywords- Discrete Wavelet Transform, Lifting Scheme, 5/3 Filter An Efficient Architecture for Multi-Level Lifting 2-D DWT P.Rajesh S.Srikanth V.Muralidharan Assistant Professor Assistant Professor Assistant Professor SNS College of Technology SNS College of Technology

More information

An Lut Adaptive Filter Using DA

An Lut Adaptive Filter Using DA An Lut Adaptive Filter Using DA ISSN: 2321-9939 An Lut Adaptive Filter Using DA 1 k.krishna reddy, 2 ch k prathap kumar m 1 M.Tech Student, 2 Assistant Professor 1 CVSR College of Engineering, Department

More information

A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3

A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3 A Review on Hybrid Adders in VHDL Payal V. Mawale #1, Swapnil Jain *2, Pravin W. Jaronde #3 #1 Electronics & Communication, RTMNU. *2 Electronics & Telecommunication, RTMNU. #3 Electronics & Telecommunication,

More information

An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency

An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency Journal From the SelectedWorks of Journal December, 2014 An optimized implementation of 128 bit carry select adder using binary to excess-one converter for delay reduction and area efficiency P. Manga

More information

Design and Implementation of LUT Optimization DSP Techniques

Design and Implementation of LUT Optimization DSP Techniques Design and Implementation of LUT Optimization DSP Techniques 1 D. Srinivasa rao & 2 C. Amala 1 M.Tech Research Scholar, Priyadarshini Institute of Technology & Science, Chintalapudi 2 Associate Professor,

More information

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application

An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application An Efficient 64-Bit Carry Select Adder With Less Delay And Reduced Area Application K Allipeera, M.Tech Student & S Ahmed Basha, Assitant Professor Department of Electronics & Communication Engineering

More information

A High- Speed LFSR Design by the Application of Sample Period Reduction Technique for BCH Encoder

A High- Speed LFSR Design by the Application of Sample Period Reduction Technique for BCH Encoder IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) ISSN: 239 42, ISBN No. : 239 497 Volume, Issue 5 (Jan. - Feb 23), PP 7-24 A High- Speed LFSR Design by the Application of Sample Period Reduction

More information

Modified Reconfigurable Fir Filter Design Using Look up Table

Modified Reconfigurable Fir Filter Design Using Look up Table Modified Reconfigurable Fir Filter Design Using Look up Table R. Dhayabarani, Assistant Professor. M. Poovitha, PG scholar, V.S.B Engineering College, Karur, Tamil Nadu. Abstract - Memory based structures

More information

Implementation of Area Efficient Memory-Based FIR Digital Filter Using LUT-Multiplier

Implementation of Area Efficient Memory-Based FIR Digital Filter Using LUT-Multiplier Implementation of Area Efficient Memory-Based FIR Digital Filter Using LUT-Multiplier K.Purnima, S.AdiLakshmi, M.Jyothi Department of ECE, K L University Vijayawada, INDIA Abstract Memory based structures

More information

Optimum Composite Field S-Boxes Aimed at AES

Optimum Composite Field S-Boxes Aimed at AES Optimum Composite Field S-Boxes Aimed at AES R.THILLAIKKARASI Assistant professor, Department Of ECE, Salem college of Engineering and technology. Salem, India. K.VAISHNAVI Post Graduate Student M.E Applied

More information

An Efficient High Speed Wallace Tree Multiplier

An Efficient High Speed Wallace Tree Multiplier Chepuri satish,panem charan Arur,G.Kishore Kumar and G.Mamatha 38 An Efficient High Speed Wallace Tree Multiplier Chepuri satish, Panem charan Arur, G.Kishore Kumar and G.Mamatha Abstract: The Wallace

More information

Adaptive decoding of convolutional codes

Adaptive decoding of convolutional codes Adv. Radio Sci., 5, 29 214, 27 www.adv-radio-sci.net/5/29/27/ Author(s) 27. This work is licensed under a Creative Commons License. Advances in Radio Science Adaptive decoding of convolutional codes K.

More information

128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY

128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY 128 BIT CARRY SELECT ADDER USING BINARY TO EXCESS-ONE CONVERTER FOR DELAY REDUCTION AND AREA EFFICIENCY 1 Mrs.K.K. Varalaxmi, M.Tech, Assoc. Professor, ECE Department, 1varuhello@Gmail.Com 2 Shaik Shamshad

More information

Implementation of High Speed Adder using DLATCH

Implementation of High Speed Adder using DLATCH International Journal of Emerging Engineering Research and Technology Volume 3, Issue 12, December 2015, PP 162-172 ISSN 2349-4395 (Print) & ISSN 2349-4409 (Online) Implementation of High Speed Adder using

More information

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 61, NO. 4, FEBRUARY 15,

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 61, NO. 4, FEBRUARY 15, IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 61, NO. 4, FEBRUARY 15, 2013 921 A High-Performance Energy-Efficient Architecture for FIR Adaptive Filter Based on New Distributed Arithmetic Formulation of

More information

Figure 1.LFSR Architecture ( ) Table 1. Shows the operation for x 3 +x+1 polynomial.

Figure 1.LFSR Architecture ( ) Table 1. Shows the operation for x 3 +x+1 polynomial. High-speed Parallel Architecture and Pipelining for LFSR Vinod Mukati PG (M.TECH. VLSI engineering) student, SGVU Jaipur (Rajasthan). Vinodmukati9@gmail.com Abstract Linear feedback shift register plays

More information

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA)

Research Article Design and Implementation of High Speed and Low Power Modified Square Root Carry Select Adder (MSQRTCSLA) Research Journal of Applied Sciences, Engineering and Technology 12(1): 43-51, 2016 DOI:10.19026/rjaset.12.2302 ISSN: 2040-7459; e-issn: 2040-7467 2016 Maxwell Scientific Publication Corp. Submitted: August

More information

N.S.N College of Engineering and Technology, Karur

N.S.N College of Engineering and Technology, Karur Modified Reconfigurable CSD Fir Filter Design Using Look up Table Sivakumar.M 1, Ranjitha.S 2, Vijayabharathi.P 3, Dhivya.G 4 1 Assistant professor, 2,3,4 UG student-final year, Department of Electronics

More information

Adaptive Fir Filter with Optimised Area and Power using Modified Inner-Product Block

Adaptive Fir Filter with Optimised Area and Power using Modified Inner-Product Block Adaptive Fir Filter with Optimised Area and Power using Modified Inner-Product Block Jesmin Joy M. Tech Scholar (VLSI & Embedded Systems), Dept. of ECE, IIET, M. G. University, Kottayam, Kerala, India

More information

Research Article Low Power 256-bit Modified Carry Select Adder

Research Article Low Power 256-bit Modified Carry Select Adder Research Journal of Applied Sciences, Engineering and Technology 8(10): 1212-1216, 2014 DOI:10.19026/rjaset.8.1086 ISSN: 2040-7459; e-issn: 2040-7467 2014 Maxwell Scientific Publication Corp. Submitted:

More information

LFSR Counter Implementation in CMOS VLSI

LFSR Counter Implementation in CMOS VLSI LFSR Counter Implementation in CMOS VLSI Doshi N. A., Dhobale S. B., and Kakade S. R. Abstract As chip manufacturing technology is suddenly on the threshold of major evaluation, which shrinks chip in size

More information

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA

Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA Design And Implimentation Of Modified Sqrt Carry Select Adder On FPGA Ch. Pavan kumar #1, V.Narayana Reddy, *2, R.Sravanthi *3 #Dept. of ECE, PBR VIT, Kavali, A.P, India #2 Associate.Proffesor, Department

More information

IN DIGITAL transmission systems, there are always scramblers

IN DIGITAL transmission systems, there are always scramblers 558 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 53, NO. 7, JULY 2006 Parallel Scrambler for High-Speed Applications Chih-Hsien Lin, Chih-Ning Chen, You-Jiun Wang, Ju-Yuan Hsiao,

More information

THE CAPABILITY to display a large number of gray

THE CAPABILITY to display a large number of gray 292 JOURNAL OF DISPLAY TECHNOLOGY, VOL. 2, NO. 3, SEPTEMBER 2006 Integer Wavelets for Displaying Gray Shades in RMS Responding Displays T. N. Ruckmongathan, U. Manasa, R. Nethravathi, and A. R. Shashidhara

More information

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532

Abstract 1. INTRODUCTION. Cheekati Sirisha, IJECS Volume 05 Issue 10 Oct., 2016 Page No Page 18532 www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 5 Issue 10 Oct. 2016, Page No. 18532-18540 Pulsed Latches Methodology to Attain Reduced Power and Area Based

More information

Design of Polar List Decoder using 2-Bit SC Decoding Algorithm V Priya 1 M Parimaladevi 2

Design of Polar List Decoder using 2-Bit SC Decoding Algorithm V Priya 1 M Parimaladevi 2 IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 03, 2015 ISSN (online): 2321-0613 V Priya 1 M Parimaladevi 2 1 Master of Engineering 2 Assistant Professor 1,2 Department

More information

PAPER A High-Speed Low-Complexity Time-Multiplexing Reed-Solomon-Based FEC Architecture for Optical Communications

PAPER A High-Speed Low-Complexity Time-Multiplexing Reed-Solomon-Based FEC Architecture for Optical Communications 2424 IEICE TRANS. FUNDAMENTALS, VOL.E95 A, NO.12 DECEMBER 2012 PAPER A High-Speed Low-Complexity Time-Multiplexing Reed-Solomon-Based FEC Architecture for Optical Communications Jeong-In PARK, Nonmember

More information

ISSN:

ISSN: 427 AN EFFICIENT 64-BIT CARRY SELECT ADDER WITH REDUCED AREA APPLICATION CH PALLAVI 1, VSWATHI 2 1 II MTech, Chadalawada Ramanamma Engg College, Tirupati 2 Assistant Professor, DeptofECE, CREC, Tirupati

More information

Design and Analysis of Modified Fast Compressors for MAC Unit

Design and Analysis of Modified Fast Compressors for MAC Unit Design and Analysis of Modified Fast Compressors for MAC Unit Anusree T U 1, Bonifus P L 2 1 PG Student & Dept. of ECE & Rajagiri School of Engineering & Technology 2 Assistant Professor & Dept. of ECE

More information

High Performance Carry Chains for FPGAs

High Performance Carry Chains for FPGAs High Performance Carry Chains for FPGAs Matthew M. Hosler Department of Electrical and Computer Engineering Northwestern University Abstract Carry chains are an important consideration for most computations,

More information

VLSI System Testing. BIST Motivation

VLSI System Testing. BIST Motivation ECE 538 VLSI System Testing Krish Chakrabarty Built-In Self-Test (BIST): ECE 538 Krish Chakrabarty BIST Motivation Useful for field test and diagnosis (less expensive than a local automatic test equipment)

More information

Guidance For Scrambling Data Signals For EMC Compliance

Guidance For Scrambling Data Signals For EMC Compliance Guidance For Scrambling Data Signals For EMC Compliance David Norte, PhD. Abstract s can be used to help mitigate the radiated emissions from inherently periodic data signals. A previous paper [1] described

More information

Hardware Implementation of Viterbi Decoder for Wireless Applications

Hardware Implementation of Viterbi Decoder for Wireless Applications Hardware Implementation of Viterbi Decoder for Wireless Applications Bhupendra Singh 1, Sanjeev Agarwal 2 and Tarun Varma 3 Deptt. of Electronics and Communication Engineering, 1 Amity School of Engineering

More information

Novel Correction and Detection for Memory Applications 1 B.Pujita, 2 SK.Sahir

Novel Correction and Detection for Memory Applications 1 B.Pujita, 2 SK.Sahir Novel Correction and Detection for Memory Applications 1 B.Pujita, 2 SK.Sahir 1 M.Tech Research Scholar, Priyadarshini Institute of Technology & Science, Chintalapudi, India 2 HOD, Priyadarshini Institute

More information

Research Article VLSI Architecture Using a Modified SQRT Carry Select Adder in Image Compression

Research Article VLSI Architecture Using a Modified SQRT Carry Select Adder in Image Compression Research Journal of Applied Sciences, Engineering and Technology 11(1): 14-18, 2015 DOI: 10.19026/rjaset.11.1670 ISSN: 2040-7459; e-issn: 2040-7467 2015 Maxwell Scientific Publication Corp. Submitted:

More information

Reconfigurable Fir Digital Filter Realization on FPGA

Reconfigurable Fir Digital Filter Realization on FPGA Reconfigurable Fir Digital Filter Realization on FPGA Atmakuri Vasavi 1 Sita Madhuri Bondila 2 1 PG Student (M.Tech), Dept. of ECE, Gandhiji Institute of Science & Tech., Jaggaiahpeta, AP, India 2 Assistant

More information

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling International Conference on Electronic Design and Signal Processing (ICEDSP) 0 Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling Aditya Acharya Dept. of

More information

Modified128 bit CSLA For Effective Area and Speed

Modified128 bit CSLA For Effective Area and Speed Modified128 bit CSLA For Effective Area and Speed Shaik Bademia Babu, Sada.Ravindar,M.Tech,VLSI, Assistant professor Nimra Inst Of Sci and tech college, jupudi, Ibrahimpatnam,Vijayawada,AP state,india

More information

Operating Bio-Implantable Devices in Ultra-Low Power Error Correction Circuits: using optimized ACS Viterbi decoder

Operating Bio-Implantable Devices in Ultra-Low Power Error Correction Circuits: using optimized ACS Viterbi decoder Operating Bio-Implantable Devices in Ultra-Low Power Error Correction Circuits: using optimized ACS Viterbi decoder Roshini R, Udhaya Kumar C, Muthumani D Abstract Although many different low-power Error

More information

Available online at ScienceDirect. Procedia Technology 24 (2016 )

Available online at   ScienceDirect. Procedia Technology 24 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Technology 24 (2016 ) 1155 1162 International Conference on Emerging Trends in Engineering, Science and Technology (ICETEST 2015) FPGA Implementation

More information

An FPGA Implementation of Shift Register Using Pulsed Latches

An FPGA Implementation of Shift Register Using Pulsed Latches An FPGA Implementation of Shift Register Using Pulsed Latches Shiny Panimalar.S, T.Nisha Priscilla, Associate Professor, Department of ECE, MAMCET, Tiruchirappalli, India PG Scholar, Department of ECE,

More information

Midterm Exam 15 points total. March 28, 2011

Midterm Exam 15 points total. March 28, 2011 Midterm Exam 15 points total March 28, 2011 Part I Analytical Problems 1. (1.5 points) A. Convert to decimal, compare, and arrange in ascending order the following numbers encoded using various binary

More information

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler

Efficient Architecture for Flexible Prescaler Using Multimodulo Prescaler Efficient Architecture for Flexible Using Multimodulo G SWETHA, S YUVARAJ Abstract This paper, An Efficient Architecture for Flexible Using Multimodulo is an architecture which is designed from the proposed

More information

The main design objective in adder design are area, speed and power. Carry Select Adder (CSLA) is one of the fastest

The main design objective in adder design are area, speed and power. Carry Select Adder (CSLA) is one of the fastest ISSN: 0975-766X CODEN: IJPTFI Available Online through Research Article www.ijptonline.com IMPLEMENTATION OF FAST SQUARE ROOT SELECT WITH LOW POWER CONSUMPTION V.Elanangai*, Dr. K.Vasanth Department of

More information

MPEG has been established as an international standard

MPEG has been established as an international standard 1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,

More information

Low Power Area Efficient Parallel Counter Architecture

Low Power Area Efficient Parallel Counter Architecture Low Power Area Efficient Parallel Counter Architecture Lekshmi Aravind M-Tech Student, Dept. of ECE, Mangalam College of Engineering, Kottayam, India Abstract: Counters are specialized registers and is

More information

A Parallel Area Delay Efficient Interpolation Filter Architecture

A Parallel Area Delay Efficient Interpolation Filter Architecture A Parallel Area Delay Efficient Interpolation Filter Architecture [1] Anusha Ajayan, [2] Rafeekha M J [1] PG Student [VLSI & ES] [2] Assistant professor, Department of ECE, TKM Institute of Technology,

More information

Efficient Realization for A Class of Clock-Controlled Sequence Generators

Efficient Realization for A Class of Clock-Controlled Sequence Generators Efficient Realization for A lass of lock-ontrolled Sequence Generators Huapeng Wu and M. A. Hasan epartment of Electrical and omputer Engineering, University of Waterloo Waterloo, Ontario, anada Abstract

More information

Efficient Method for Look-Up-Table Design in Memory Based Fir Filters

Efficient Method for Look-Up-Table Design in Memory Based Fir Filters International Journal of Computer Applications (975 8887) Volume 78 No.6, September Efficient Method for Look-Up-Table Design in Memory Based Fir Filters Md.Zameeruddin M.Tech, DECS, Dept. of ECE, Vardhaman

More information

Designing an Efficient and Secured LUT Approach for Area Based Occupations

Designing an Efficient and Secured LUT Approach for Area Based Occupations Designing an Efficient and Secured LUT Approach for Area Based Occupations 1 D. Jahnavi, 2 Y. Ravikiran varma 1 M.Tech scholar, E.C.E, Sreenivasa institute of technology and management studies, Chittoor

More information

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits

VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits VLSI Technology used in Auto-Scan Delay Testing Design For Bench Mark Circuits N.Brindha, A.Kaleel Rahuman ABSTRACT: Auto scan, a design for testability (DFT) technique for synchronous sequential circuits.

More information

TERRESTRIAL broadcasting of digital television (DTV)

TERRESTRIAL broadcasting of digital television (DTV) IEEE TRANSACTIONS ON BROADCASTING, VOL 51, NO 1, MARCH 2005 133 Fast Initialization of Equalizers for VSB-Based DTV Transceivers in Multipath Channel Jong-Moon Kim and Yong-Hwan Lee Abstract This paper

More information

Comparative Analysis of Stein s. and Euclid s Algorithm with BIST for GCD Computations. 1. Introduction

Comparative Analysis of Stein s. and Euclid s Algorithm with BIST for GCD Computations. 1. Introduction IJCSN International Journal of Computer Science and Network, Vol 2, Issue 1, 2013 97 Comparative Analysis of Stein s and Euclid s Algorithm with BIST for GCD Computations 1 Sachin D.Kohale, 2 Ratnaprabha

More information

Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder

Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder Design and Implementation of High Speed 256-Bit Modified Square Root Carry Select Adder Muralidharan.R [1], Jodhi Mohana Monica [2], Meenakshi.R [3], Lokeshwaran.R [4] B.Tech Student, Department of Electronics

More information

Area and Speed Efficient Implementation of Symmetric FIR Digital Filter through Reduced Parallel LUT Decomposed DA Approach

Area and Speed Efficient Implementation of Symmetric FIR Digital Filter through Reduced Parallel LUT Decomposed DA Approach Circuits and Systems, 216, 7, 1379-1391 Pulished Online June 216 in SciRes. http://www.scirp.org/journal/cs http://dx.doi.org/1.4236/cs.216.78121 Area and Speed Efficient Implementation of Symmetric FIR

More information

Speech Enhancement Through an Optimized Subspace Division Technique

Speech Enhancement Through an Optimized Subspace Division Technique Journal of Computer Engineering 1 (2009) 3-11 Speech Enhancement Through an Optimized Subspace Division Technique Amin Zehtabian Noshirvani University of Technology, Babol, Iran amin_zehtabian@yahoo.com

More information

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Parjoona V. and P. Manimegalai

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Parjoona V. and P. Manimegalai ANALYSIS OF AREA DELAY OPTIMIZATION OF IMPROVED SPARSE CHANNEL ADDER Prajoona Valsalan,2 and P. Manimegalai 2 2 Karpagam University, Coimbatore, Tamil Nadu, India. Dhofar University, Salalah, Sultanate

More information

A Combined Compatible Block Coding and Run Length Coding Techniques for Test Data Compression

A Combined Compatible Block Coding and Run Length Coding Techniques for Test Data Compression World Applied Sciences Journal 32 (11): 2229-2233, 2014 ISSN 1818-4952 IDOSI Publications, 2014 DOI: 10.5829/idosi.wasj.2014.32.11.1325 A Combined Compatible Block Coding and Run Length Coding Techniques

More information

Clock Gating Aware Low Power ALU Design and Implementation on FPGA

Clock Gating Aware Low Power ALU Design and Implementation on FPGA Clock Gating Aware Low ALU Design and Implementation on FPGA Bishwajeet Pandey and Manisha Pattanaik Abstract This paper deals with the design and implementation of a Clock Gating Aware Low Arithmetic

More information

Fault Detection And Correction Using MLD For Memory Applications

Fault Detection And Correction Using MLD For Memory Applications Fault Detection And Correction Using MLD For Memory Applications Jayasanthi Sambbandam & G. Jose ECE Dept. Easwari Engineering College, Ramapuram E-mail : shanthisindia@yahoo.com & josejeyamani@gmail.com

More information

K. Phanindra M.Tech (ES) KITS, Khammam, India

K. Phanindra M.Tech (ES) KITS, Khammam, India Volume 7, Issue 5, May 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com LUT Optimization

More information

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH

EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH EFFICIENT DESIGN OF SHIFT REGISTER FOR AREA AND POWER REDUCTION USING PULSED LATCH 1 Kalaivani.S, 2 Sathyabama.R 1 PG Scholar, 2 Professor/HOD Department of ECE, Government College of Technology Coimbatore,

More information

PHASE-LOCKED loops (PLLs) are widely used in many

PHASE-LOCKED loops (PLLs) are widely used in many IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 5, MAY 2005 233 A Portable Digitally Controlled Oscillator Using Novel Varactors Pao-Lung Chen, Ching-Che Chung, and Chen-Yi Lee

More information

A Fast Constant Coefficient Multiplier for the XC6200

A Fast Constant Coefficient Multiplier for the XC6200 A Fast Constant Coefficient Multiplier for the XC6200 Tom Kean, Bernie New and Bob Slous Xilinx Inc. Abstract. We discuss the design of a high performance constant coefficient multiplier on the Xilinx

More information

Ultra-lightweight 8-bit Multiplicative Inverse Based S-box Using LFSR

Ultra-lightweight 8-bit Multiplicative Inverse Based S-box Using LFSR Ultra-lightweight -bit Multiplicative Inverse Based S-box Using LFSR Sourav Das Alcatel-Lucent India Ltd Email:sourav10101976@gmail.com Abstract. Most of the lightweight block ciphers are nibble-oriented

More information

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE

LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE LUT OPTIMIZATION USING COMBINED APC-OMS TECHNIQUE S.Basi Reddy* 1, K.Sreenivasa Rao 2 1 M.Tech Student, VLSI System Design, Annamacharya Institute of Technology & Sciences (Autonomous), Rajampet (A.P),

More information

Optimizing area of local routing network by reconfiguring look up tables (LUTs)

Optimizing area of local routing network by reconfiguring look up tables (LUTs) Vol.2, Issue.3, May-June 2012 pp-816-823 ISSN: 2249-6645 Optimizing area of local routing network by reconfiguring look up tables (LUTs) Sathyabhama.B 1 and S.Sudha 2 1 M.E-VLSI Design 2 Dept of ECE Easwari

More information

Implementation of BIST Test Generation Scheme based on Single and Programmable Twisted Ring Counters

Implementation of BIST Test Generation Scheme based on Single and Programmable Twisted Ring Counters IOSR Journal of Mechanical and Civil Engineering (IOSR-JMCE) e-issn: 2278-1684, p-issn: 2320-334X Implementation of BIST Test Generation Scheme based on Single and Programmable Twisted Ring Counters N.Dilip

More information

IN A SERIAL-LINK data transmission system, a data clock

IN A SERIAL-LINK data transmission system, a data clock IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 53, NO. 9, SEPTEMBER 2006 827 DC-Balance Low-Jitter Transmission Code for 4-PAM Signaling Hsiao-Yun Chen, Chih-Hsien Lin, and Shyh-Jye

More information

A High-Speed Low-Power Modulo 2 n +1 Multiplier Design Using Carbon-Nanotube Technology

A High-Speed Low-Power Modulo 2 n +1 Multiplier Design Using Carbon-Nanotube Technology A High-Speed Low-Power Modulo 2 n +1 Multiplier Design Using Carbon-Nanotube Technology A Thesis Presented by He Qi to The Department of Electrical and Computer Engineering in partial fulfillment of the

More information

Filterbank Reconstruction of Bandlimited Signals from Nonuniform and Generalized Samples

Filterbank Reconstruction of Bandlimited Signals from Nonuniform and Generalized Samples 2864 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 48, NO. 10, OCTOBER 2000 Filterbank Reconstruction of Bandlimited Signals from Nonuniform and Generalized Samples Yonina C. Eldar, Student Member, IEEE,

More information

WITH the demand of higher video quality, lower bit

WITH the demand of higher video quality, lower bit IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 8, AUGUST 2006 917 A High-Definition H.264/AVC Intra-Frame Codec IP for Digital Video and Still Camera Applications Chun-Wei

More information

Segmented Leap-Ahead LFSR Architecture for Uniform Random Number Generator

Segmented Leap-Ahead LFSR Architecture for Uniform Random Number Generator , pp.233-242 http://dx.doi.org/10.14257/ijseia.2013.7.5.21 Segmented Leap-Ahead LFSR Architecture for Uniform Random Number Generator Je-Hoon Lee 1 and Seong Kun Kim 2 1 Div. of Electronics, Information

More information

VLSI Based Minimized Composite S-Box and Inverse Mix Column for AES Encryption and Decryption

VLSI Based Minimized Composite S-Box and Inverse Mix Column for AES Encryption and Decryption VLSI Based Minimized Composite S-Bo and Inverse Mi Column for AES Encryption and Decryption 1 J. Balamurugan, 2 Dr. E. Logashanmugam 1 Research scholar, 2 Professor and Head, 1 St. Peter s University,

More information