Closed

Data analysis

This project received 15 bids from talented freelancers with an average bid price of $193 USD.

Get free quotes for a project like this
Employer working
Project Budget
$30 - $250 USD
Total Bids
15
Project Description

Deadline: Thursday 10/ Nov/2016

Using python, pandas, numpy and scikit learn.

For visualizations, you will not need anything more complex than scatter-plots, histograms or line plots. You will provide a single ipython notebook that contains the code for all the answers. Use a separate tab for each question. For each task, also write your appropriate answers in a .txt, .doc or .pdf and submit this along with your code.

1. I have provided you with a dataset called data1. It contains a train and test dataset. Use a suitable method to predict the “Value” given the features (there are 100 features) (there are a number of redundancies in the features). Evaluate and present your results using an appropriate error measure.

2. I have provided you with two datasets in data2.zip. For each dataset:

a. Analyze the data using an appropriate visualization

b. Use an appropriate method to cluster similar data-points together. Justify why you

picked the specific method for each dataset.

c. Output the clustered points using an appropriate visualization.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online