This repository includes two versions of Hadoop management tools.
Updated Jul 6, 2023
Navigator is a data service that prepares the content for travel agencies, ready for exploration in EWNS (East-West-North-South) direction and hence allows them to render content to the end-user based on their desire to travel.
Cloudera commands used for Big Data Analytics
Assistant's Qualification to Teach Big Data Processing.
GCP-hosted product for over 1 million movie investors on HSX.com, aiding online movie trading and box-office investments by leveraging Big Data technologies such as Hive and Hadoop, along with Tableau dashboards.
fundamental-hadoop is an introduction to Apache Hadoop and its ecosystem.
Keywords network builder based on TF-IDF with the use of Hadoop platform
Anticipatory prediction of customer orders after the purchase of item(s).
Running my first PySpark app in CDH5
This project analyses airline datasets to solve the problem statements using Hadoop.
Data processing using Docker containers, Kafka, Spark, and Hadoop
A simple HBase tutorial
This shows how to perform sentiment analysis on tweets from Twitter using Hive. Collect the tweets from Twitter using Flume. Because the incoming tweets are in JSON format, they need to be loaded into Hive using a JSON input format; use the Cloudera Hive JSON SerDe for this purpose.
A simple SparkSQL tutorial
A simple Apache Spark tutorial