123ArticleOnline Logo
Welcome to 123ArticleOnline.com!
ALL >> Education >> View Article

What Is Hadoop In Big Data?

Profile Picture
By Author: tibacademy
Total Articles: 55
Comment this article
Facebook ShareTwitter ShareGoogle+ ShareTwitter Share

What is Hadoop
Hadoop is an open supply framework from Apache and is used to store method and analyze information that is very vast in volume. Hadoop is written in Java and isn't OLAP (online analytical processing). It’s used for batch/offline process. It’s being used by Facebook, Yahoo, Google, Twitter, LinkedIn and lots of additional. Moreover it can be scaled up just by adding nodes in the cluster.
Modules of Hadoop
1. HDFS: Hadoop Distributed filing system. It states that the files are broken into blocks and keep in nodes over the distributed design.
2. Yarn: yet another Resource negotiator is used for job Hadoop training in Bangalore scheduling and manages the cluster.
3. Map Reduce: this is a framework which helps Java programs to do the parallel computation on data using key value pair. The Map task takes input file and converts it into information set which might be computed in Key value try. The output of Map task is consumed by reduce task then the out of reducer offers the specified result.
4. Hadoop Common: These Java libraries are used to begin Hadoop and are used by other Hadoop modules.
Advantages of Hadoop
• Fast: In HDFS the data distributed over the cluster and are mapped which helps in faster retrieval. Even the tools to process the information are usually on similar servers, so reducing the interval. It is able to process terabytes of data in minutes and Peta bytes in hours.
• Scalable: cluster can be extended by just adding nodes in the cluster.
• Cost Effective: Hadoop is open source and uses artifact hardware to store information thus it extremely cost effective as compared to ancient relational database management system.
• Resilient to failure: HDFS has the property with which it can replicate data over the network, thus if one node is down or another network failure happens, then Hadoop takes the opposite copy of data and use it. Normally, information is replicated thrice however the replication issue is configurable.
History of Hadoop
It was started by Doug Cutting and mike Cafarella. Its origin was the Google filing system paper, printed by Google.
Let's target the history of Hadoop within the following steps: -
• In 2002, Doug Cutting and mike Cafarella began to deal with a venture, Apache Nutch. It's an open source web crawler programming framework venture.
• While chipping away at Apache Nutch, they were managing huge information. To store that Big Data Hadoop Training in Bangalore data they need to spend a great deal of costs which turns into the outcome of that venture. This issue ends up one of the significant purposes behind the rise of Hadoop.
• In 2003, Google presented a record framework called GFS (Google document framework). It's a restrictive circulated record framework created to supply effective access to data.
• In 2004, Google discharged a white paper on Map lessen. This strategy improves the information handling on huge bunches.
• In 2005, Doug Cutting and mike Cafarella presented another document framework called NDFS (Nutch Distributed File System). This record framework additionally incorporates Map diminish.
• In 2006, Doug Cutting quit Google and joined Yahoo. Based on the Nutch venture, Dough Cutting presents another task Hadoop with a record framework known as HDFS (Hadoop Distributed File System). Hadoop first form 0.1.0 discharged in this year.
• Doug Cutting gave named his task Hadoop after his child's toy elephant.
• In 2007, Yahoo runs 2 groups of one thousand machines.
• In 2008, Hadoop turned into the speediest framework to sort one terabyte of information on a 900 hub bunch inside 209 seconds.
• In 2013, Hadoop 2.2.
• In 2017, Hadoop 3.0.

Total Views: 80Word Count: 580See All articles From Author

Add Comment

Education Articles

1. Top 50 Business Schools In India
Author: Kishore sharma

2. Are International Schools Really Worth The Fees?
Author: Priya Varma

3. The Most Affordable Medical Universities In The World
Author: admin

4. Best Graphics Designing Courses, Classes In Pune And Pimpri Chinchwad
Author: swara

5. Why Nursery School Studies Are Stress Free?
Author: zoomlanguage

6. The Child And Their Future On Schooling
Author: zoomlanguage

7. How To Enroll For Best Llm Coaching Classes In Delhi
Author: Vishal Sharma

8. Ssc Preparation: English Language & Comprehension
Author: punit shukla

9. 3 Reasons Why Holistic Education Is Essential For Your Toddler
Author: Little Wings Holistic Childhood Center

10. Tally Course | Tally Course In Delhi | Smartech Education
Author: Smartech education

11. Spoken English Course | Learn English Speaking | Smartech Education
Author: Smartech education

12. Web Designing Course In Delhi | Smartech Education
Author: Smartech education

13. Digital Marketing Course In Delhi | Smartech Education
Author: Smartech education

14. Why Should You Invest In Campus 365 Software?
Author: Patrick Jones

15. Study Abroad & Study Visa Consultants
Author: ShubhamThakur

Login To Account
Login Email:
Forgot Password?
New User?
Sign Up Newsletter
Email Address: