This repository includes two versions of Hadoop management tools
Updated Jul 6, 2023
This is my final project for the Data Engineer Expert course at Naya College.
The goal of this programming assignment is to compute the PageRank scores of an input set of hyperlinked Wikipedia documents using Hadoop MapReduce. The PageRank score of a web page serves as an indicator of the page's importance. Many web search engines (e.g., Google) use PageRank scores in some form to rank the results of user-submitted queries. The goals of …
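The PageRank computation described above can be illustrated with a small in-memory sketch. This is not the repository's Hadoop code: the function name `pagerank`, the damping factor of 0.85, the fixed iteration count, and the even redistribution of rank from pages without outgoing links are all assumptions chosen for illustration. The loop body mirrors the shape of one MapReduce round, where each page emits rank shares to its targets and the shares are then summed per page.

```python
from collections import defaultdict


def pagerank(links, damping=0.85, iterations=20):
    """Iteratively compute PageRank for a dict mapping page -> list of linked pages.

    Hypothetical helper for illustration; parameter defaults are assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    ranks = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # "Map" step: each page sends an equal share of its rank to each target.
        contributions = defaultdict(float)
        dangling = 0.0
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = ranks[page] / len(targets)
                for target in targets:
                    contributions[target] += share
            else:
                # Pages with no outgoing links spread their rank over all pages.
                dangling += ranks[page]

        # "Reduce" step: sum the incoming shares and apply the damping factor.
        ranks = {
            page: (1 - damping) / n
            + damping * (contributions[page] + dangling / n)
            for page in pages
        }
    return ranks


example_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
example_ranks = pagerank(example_links)
```

In this toy graph, page C receives links from both A and B, so it ends up with a higher score than B, which is linked only from A. In a real Hadoop job the two steps would run as separate mapper and reducer tasks over files rather than dictionaries.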
Cloudera commands used for Big Data Analytics
Assistant's Qualification to Teach Big Data Processing.
fundamental-hadoop is an introduction to Apache Hadoop and its ecosystem.
Keyword network builder based on TF-IDF, built on the Hadoop platform
This repository contains the TF-IDF score calculation for the documents in the Canterbury dataset for a user-given search query
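As a rough illustration of scoring documents against a search query with TF-IDF, the sketch below uses one common formulation (term frequency normalized by document length, inverse document frequency as the natural log of total documents over documents containing the term). The function name `tf_idf_scores`, the whitespace tokenization, and the sample documents are assumptions for illustration and are not taken from the repository.

```python
import math
from collections import Counter


def tf_idf_scores(documents, query):
    """Score each document (dict of name -> text) against a query string.

    Hypothetical helper; the weighting scheme is one common TF-IDF variant.
    """
    # Naive tokenization: lowercase and split on whitespace.
    tokenized = {name: text.lower().split() for name, text in documents.items()}
    n_docs = len(tokenized)
    terms = query.lower().split()

    # Document frequency: number of documents that contain each query term.
    doc_freq = {
        term: sum(1 for tokens in tokenized.values() if term in tokens)
        for term in terms
    }

    scores = {}
    for name, tokens in tokenized.items():
        counts = Counter(tokens)
        score = 0.0
        for term in terms:
            if doc_freq[term] == 0 or not tokens:
                continue  # term appears nowhere, or the document is empty
            tf = counts[term] / len(tokens)
            idf = math.log(n_docs / doc_freq[term])
            score += tf * idf
        scores[name] = score
    return scores


sample_docs = {
    "a": "hadoop cluster hadoop",
    "b": "spark cluster",
    "c": "hive tables",
}
hadoop_scores = tf_idf_scores(sample_docs, "hadoop")
cluster_scores = tf_idf_scores(sample_docs, "cluster")
```

Only document "a" contains "hadoop", so it is the only one with a nonzero score for that query. For "cluster", document "b" scores higher than "a" because the term makes up a larger fraction of its tokens.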
This repository shows how to perform sentiment analysis on tweets from Twitter using Hive. Collect the tweets from Twitter using Flume. Because the tweets arrive in JSON format, they need to be loaded into Hive using a JSON input format; use the Cloudera Hive JSON SerDe for this purpose.
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
Otto-von-Guericke Universität Magdeburg - Big Data, summer semester (SoSe) 2017
A simple SparkSQL tutorial
A simple Apache Spark tutorial
Learn how Hive works with HBase through a simple example
This project uses the Cloudera platform and Pig queries to analyze and retrieve information on specific baseball performance and statistics problems. By employing big data methods, the analysis offers insights into player performance, game trends, and strategic patterns.
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.