Big Data Analytics for Non-Programmers

Понравилась презентация – покажи это...

Слайд 0

Big Data Analytics for Non-Programmers

Слайд 1

Agenda for the day Can Hadoop be learnt without knowing Java? How Pig can be used in place of MapReduce ? Querying data with HiveQL

Слайд 2

Can Hadoop be learnt without knowing Java?

Слайд 3

YES !! Hadoop can be learnt without knowing Java

Слайд 4

Pig & Hive Tools like Pig and Hive that are built on top of Hadoop, offer high-level languages for working with data If you want to write MapReduce program, then you can use Pig and Pig Latin for which knowledge of Java is not required. If you want to view data in HDFS in a readable form you can use Hive which again does not require any knowledge of Java.

Слайд 5

Why Pig?

Слайд 6

But why Pig? Pig simplifies complex MapReduce programs by using Pig Latin Additionally, If you want to write your own MapReduce code, you can do so in any language (e.g. Perl, Python, Ruby, C, etc.) But the most attractive features of Pig are: 10 lines of PIG = 200 lines of Java Built in operations like: Join Group Filter Sort and more…

Слайд 7

Why Pig? Provides common data operations filters, joins, ordering, etc. and nested data types tuples, bags, and maps missing from MapReduce. It is Open source and is actively supported by a community of developers. An ad-hoc way of creating and executing map-reduce jobs on very large data sets Can take any data Easy to learn, Easy to read and write Extensible by UDF (User Defined Functions) Java not required

Слайд 8

Why Hive?

Слайд 9

Why Hive? Defines SQL-Like Query Language called HiveQL

Слайд 10

Features of Hive You can use HIVE to read and write files on Hadoop and run your reports from a BI tool Predictive Modeling & Hypothesis Testing Document Indexing Customer-facing Business Intelligence Log Processing Data Mining HIVE Applications

Слайд 11


Слайд 12

Thank You Questions/Queries/Feedback Recording and presentation will be made available to you within 24 hours