Mini spark project

Work on Hortonworks sandbox please

Implement a simple Spark application using Spark Core (not with Spark SQL) to get <state,total_sales> for (year, month, day and hour) granularities.

Two data sets in HDFS are given below.

Customer information <customer_id,name,street,city,state,zip>

Sales information <timestamp,customer_id,sales_price>

You can consider input/output data set in any one of the below format

Text(with any DELIMITER)



Consider timestamp in epoch (for example 1500674713)

Consider all possible cases of datasets like number of states are small(finitely known set) OR huge(unknown set) and come with appropriate solutions.

Use any of PYTHON/SCALA/JAVA API’s of your choice

Skills: Java, Python, Hadoop, Spark, Scala

See more: project work databases, mini website project, project work writers, apache spark projects github, apache spark real-time projects github, spark projects for practice, spark real-time projects github, real time spark sql projects, spark projects for beginners, apache spark sample project, spark projects for students, project work translate hindi, kannel project work contract, format text ebook, bangalore contract java project work home, software design project work orders, money saving project work, office project work chennai, project work english teachers, can marketing project work topic internet

About the Employer:
( 1 review ) Sunnyvale, United States

Project ID: #24637080

Awarded to:


hi I can do your project using spark both in SCALA/PYTHON. If you wish I can give code in both. Further I'm good at writing code in both RDD' s as well in dataFrames or SparkSql. As per your requirement I will give in More

$25 USD in 1 day
(4 Reviews)

7 freelancers are bidding on average $59 for this job


Hi there , I have about 15 years of development experience in java and about 4 years in big data using spark, scala, hadoop, hive . Hbase etc. I have reviewed the assignment and can deliver the solution in no time . P More

$35 USD in 1 day
(14 Reviews)

This is very simple and I can deliver very soon. Let’s please connect and discuss more on your requirements.

$35 USD in 2 days
(3 Reviews)

Hey I have been working on Spark and big data technology from past 3 years. I have worked as a software developer on multiple big data projects handling billions of records in companies like Amazon, Samasung (this migh More

$35 USD in 3 days
(0 Reviews)

I worked on spark since 3 years, my master thesis on spark applications , i would like to talk further information of the project.

$35 USD in 3 days
(0 Reviews)

Hi, I understand you need to implement an aggregation solution using Spark Core and I can help you with that. I am a senior BigData developer and worked constantly with Spark and also am a teacher of Data Engineer pro More

$150 USD in 2 days
(0 Reviews)

I am specialized in Bigdata (Apache Hadoop, IBM Biginsights, Apache Spark, Sqoop,Flume,Hive,Pig, Scala, Python,Apache Kudu,core Java, Spark Mlb). As a ML Engineer I work as part of a team responsible for building, desi More

$100 USD in 7 days
(0 Reviews)