What is Apache PigApache Pig is a high-level data flow platform for executing MapReduce programs of Hadoop. The language used for Pig is Pig Latin.The Pig scripts get internally converted to Map Reduce jobs and get executed on data stored in HDFS. Apart from that, Pig can also execute its job in Apache Tez or Apache Spark. Pig can handle any type of data, i.e., structured, semi-structured or unstructured and stores the corresponding results into Hadoop Data File System. Every task which can be achieved using PIG can also be achieved using java used in MapReduce. Features of Apache PigLet's see the various uses of Pig technology.1) Ease of programmingWriting complex java programs for map reduce is quite tough for non-programmers. Pig makes this process easy. In the Pig, the queries are converted to MapReduce internally.2) Optimization opportunitiesIt is how tasks are encoded permits the system to optimize their execution automatically, allowing the user to focus on semantics rather than efficiency.3) ExtensibilityA user-defined function is written in which the user can write their logic to execute over the data set.4) FlexibleIt can easily handle structured as well as unstructured data.5) In-built operatorsIt contains various type of operators such as sort, filter and joins.Differences between Apache MapReduce and PIG
Advantages of Apache Pig
|
Friday, August 16, 2019
What is Apache Pig
Subscribe to:
Post Comments (Atom)
Lab 09: Publish and subscribe to Event Grid events
Microsoft Azure user interface Given the dynamic nature of Microsoft cloud tools, you might experience Azure UI changes that occur after t...
-
Import/export of any file type is the basic need of any development environment whether web development, desktop development or applicatio...
-
PowerShell का उपयोग करके एक Azure डेटा फ़ैक्टरी कैसे बनाएँ? This quickstart describes how to use PowerShell to create an Azure data fact...
-
Zone-redundant storage (ZRS) replicates your data synchronously across three storage clusters in a single region. Each storage cluster is ...
No comments:
Post a Comment