نتایج جستجو برای: spark assisted performance
تعداد نتایج: 1176099 فیلتر نتایج به سال:
In the contemporary world, network security has been of biggest importance and acute worry in both individual institutional wisdom, concurrent with newly emerging technologies. Firewalls, encryption techniques, intrusion detection systems, honeypots are just a few systems technologies that have developed to ensure information security. Systems for safeguarding an organizational environment thro...
The aim of this project is to introduce Spark Clouds, which integrate spark lines into a tag cloud to convey trends between multiple tag clouds. Spark Clouds ability is to show trends compares favorably to the alternative visualizations. In the Existing System, Tag clouds are used to display the relative tag frequency, popularity, or importance by font size. They serve as a visual summary of do...
Decision tree is one of the most widely used classification methods. For massive data processing, MapReduce is a good choice. Whereas, MapReduce is not suitable for iterative algorithms. The programming model of Spark is proposed as a memory-based framework that is fit for iterative algorithms and interactive data mining. In this paper, C4.5 is implemented on both MapReduce and Spark. The resul...
Apache Spark is a popular framework for writing large scale data processing applications. Our long term goal is to develop automatic tools for reasoning about Spark programs. This is challenging because Spark programs combine database-like relational algebraic operations and aggregate operations, corresponding to (nested) loops, with User Defined Functions (UDFs). In this paper, we present a no...
BACKGROUND Structure-based virtual screening is an in-silico method to screen a target receptor against a virtual molecular library. Applying docking-based screening to large molecular libraries can be computationally expensive, however it constitutes a trivially parallelizable task. Most of the available parallel implementations are based on message passing interface, relying on low failure ra...
Free-piston engines are under investigation by a number of research groups worldwide due to potential fuel efficiency and engine emissions advantages. The free-piston engine generator, in which a linear electric generator is fixed to the mover to produce electric power, has been proposed as an alternative prime mover for hybrid-electric vehicles. This paper investigates the performance of a spa...
Entity Resolution is among the hottest topics in the field of Big data. It finds duplicates in datasets, which actually belong to same entity in the real world. Algorithms that perform Entity Resolution are computation intensive and consume a lot of time especially for large datasets. A lot of research has been conducted for improving Entity Resolution solutions. A number of algorithms are deve...
In recent years, a new model of performing data-parallel computations on clusters of unreliable machines (e.g., MapReduce [1], Dryad [2]) has become widely popular. These systems achieve their scalability and fault tolerance by providing a programming model where users create acyclic data flow graphs to pass input data through a set of operators. This allows the underlying system to schedule jo...
Spark is a popular framework for writing large scale data processing applications. Our goal is to develop tools for reasoning about Spark programs. This is challenging because Spark programs combine database-like relational algebraic operations and aggregate operations with User Defined Functions (UDF s). We present the first technique for verifying the equivalence of Spark programs. We model S...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید