Big Data Analysis Technologies Traditional Vs Existing

Yojna Arora

Abstract


Analysis of data is important to find the meaningful information contained in it. There are many data storage and manipulation tools. Initially data was stored and analysed using files, tables, databases, data warehouse. However, in the current scenario of Big Data, these traditional methods are not efficient enough to do the analysis. Hadoop, open source software which provides support for distributed processing is implemented. In this paper, a detailed explanation about Hadoop and its components is given. Also, comparison of Hadoop components Pig, Hive and Map Reduce with traditional methods is explained.

Full Text:

PDF

References


Leons Petrazickis and Marius Butuc, “Crunching Big Data with Hadoop and BigInsights in the Cloud”, pg 241-242, Information Management Technologies

Ronald C Taylor, "An overview of the Hadoop / MapReduce /HBase framework and its current applications in bioinformatics", 11th Annual Bioinformatics Open SourceConference (BOSC) 2010, Boston, MA, USA. July 2010

Hadoop: Open-source implementation of MapReduce. http://hadoop.apache.org

Parth Chandarana and M Vijayalakshmi, “Big Data Analytics Framework”, in International Conference on Circuits, System, Communication and Information Technology Applications”,IEEE, 2014

Edmund Kohlwey, Abel Sussman, Jason Trost and Amber Maurer, “Leveraging the cloud for Big data Biometrics”, in World Congress in Services, IEEE,2011

Dean, S Ghemawat, “ Map reduce : A flexible data processing tool”,Communications of the ACM , Vol 53, Number 1, pp 72-77, January 2010

Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Ning Zhang, Suresh Antony, Hao Liu and Raghotham Murthy, “Hive – A Petabyte Scale Data warehouse using Hadoop”, in 26th International Conference on Data Engineering, IEEE, 2010

Julius Bladh and Andreas Palsson, “A performance comparison of Hive & Pig”, 2015.

Gates, A .F , Natkovich, O. Chopra, S Kamath, P Narayanmurty, & Srivastava, “Building a high level data flow system on top of Map Reduce : the Pig experience”, in Proceeding VLDB

E. Laxmi Lydia & Dr. M. Ben Swarup, “ Big data analysis using Hadoop components like Flume, Map Reduce,Pig and Hive”, IJCSET, Vol 5, Issue 11, Nov 2015

Mehul Nalin Vora, “Hadoop HBase fro large scale data”, International Conference on Computer Science & Network Technology, IEEE, 2011

Dorin Carstoiu, Elena Lepadatu, Mihai Gaspar, "Hbase – non SQL Database, Performances Evaluation", International Journalof Advancements in Computing Technology Volume 2, Number 5, December 2010

Mohammed Islam, Angelo K Huang, etal “Oozie : Toward a scalable workflow management system for Hadoop” ACM, 2012

Mr S. S Aravinth & Ms. A Hasenah Begam, “An efficient Hadoop FrameworkSqoop and Ambari for Big Data Prcessing”, International Journal of innovative research and in Sience and Technology, Vol 1, Isssue 10, March 2015




DOI: https://doi.org/10.23956/ijarcsse.v8i11.917

Refbacks

  • There are currently no refbacks.




© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.