Parallel Evaluation of Arithmetic Circuits
نویسندگان
چکیده
In this paper, a generic algorithm designed for the parallel evaluation of arithmetic circuits is given. This algorithm can be used in the domain of VLSI design, in order to get tight upper bounds on the computing time of a circuit. It can also be used in automatic parallelization of numerical programs, as a guide for the detection of some predeenite schemes such as dot-products or reductions. More generally, the (theoretical) algorithm presented in section 2 evaluates very quickly arithmetic straight-line programs, and its evaluation time serves as a good upper bound. This algorithm generalizes Miller, Ramachandran and Kaltofen's algorithm 18] in the sense it deals with a great variety of algebraic structures: semi-rings, rings or lattices. Our contribution resides on the one hand in a new bound for the evaluation of circuits over lattices, which improves previous results 19], and on the other hand in the uni-ed formulation for the evaluation algorithm. This algorithm runs in O(min(logn + log d) log n; (h a + log n) log n)) parallel time, d being the \algebraic degree" (in an extended sense) of the circuit and h a the maximal number of alternances of and on a path of the circuit if the and operations deene a lattice, with M(n) processors, where M(n) is the number of processors necessary for the multiplication of two n n matrices in the structure in O(log n) parallel time. After presenting this algorithm, its eeciency is shown on particular cases: taking as input a simple and sequential algorithm, it can be used as a \compiler" to produce a sorting circuit as fast as Cole's circuit, with logarithmic depth, or an adder equivalent to Brent and Kung's adder in terms of size and depth. These academic examples connrm the relevance of the algorithm presented here in the area of conception of fast VLSI arithmetic operators.
منابع مشابه
A High-Speed Dual-Bit Parallel Adder based on Carbon Nanotube FET technology for use in arithmetic units
In this paper, a Dual-Bit Parallel Adder (DBPA) based on minority function using Carbon-Nanotube Field-Effect Transistor (CNFET) is proposed. The possibility of having several threshold voltage (Vt) levels by CNFETs leading to wide use of them in designing of digital circuits. The main goal of designing proposed DBPA is to reduce critical path delay in adder circuits. The proposed design positi...
متن کاملOptimization of Quantum Cellular Automata Circuits by Genetic Algorithm
Quantum cellular automata (QCA) enables performing arithmetic and logic operations at the molecular scale. This nanotechnology promises high device density, low power consumption and high computational power. Unlike the CMOS technology where the ON and OFF states of the transistors represent binary information, in QCA, data is represented by the charge configuration. The primary and basic devic...
متن کاملParallel Identity Testing for Skew Circuits with Big Powers and Applications
Powerful skew arithmetic circuits are introduced. These are skew arithmetic circuits with variables, where input gates can be labelled with powers xn for binary encoded numbers n. It is shown that polynomial identity testing for powerful skew arithmetic circuits belongs to coRNC, which generalizes a corresponding result for (standard) skew circuits. Two applications of this result are presented...
متن کاملAn Optimal Parallel Algorithm for Formula Evaluation
A new approach to Buss’s NC algorithm [Bus87] for evaluation of Boolean formulas is presented. This problem is shown to be complete for NC over AC reductions. This approach is then used to solve the more general problem of evaluating arithmetic formulas using arithmetic circuits.
متن کاملNon radial model of dynamic DEA with the parallel network structure
In this article, Non radial method of dynamic DEA with the parallel network structure is presented and is used for calculation of relative efficiency measures when inputs and outputs do not change equally. In this model, DMU divisions under evaluation have been put together in parallel. But its dynamic structure is assumed in series. Since in real applications there are undesirable inputs an...
متن کاملProbabilistic Parallel Prefix Computation
Given inputs ~1,. . . , z,,, which are independent identically distributed random variables over a domain D, and an associative operation o, the probabilistic prej?x computation problem is to compute the product ~1 o x2 o . . . o xn and its n 1 prefixes. Instances of this problem are finite state transductions on random inputs, the addition or subtraction of two random n-bit binary numbers, and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Theor. Comput. Sci.
دوره 162 شماره
صفحات -
تاریخ انتشار 1996