Bot for Crawling a Flash website and Scraping data to XML file - II
€30-120 EUR
Cancelled
Posted over 11 years ago
€30-120 EUR
Paid on delivery
Second attempt -- Please just bid if you read what I'm writing and you really think you can do it, don't waist your time and my time bidding without even reading. I'm ok to give a chance to someone that is starting. Please be as professional with me as you want that I will be with you, because I'm going to be professional with you. Please don't bid in the maximum value and just afgterwards read, because I will ward project also watching the final price. Ok, having that clear, let's start.
As my previous worker just took the milestone money and no work delivered and now I have to repost the job and maybe wait for the money to get back to me, I decide that I don't need any program, just the result for testing while you're working (the XML file). In the end you upload the program we test it and full payement is done, if you don't accept this, please don't bid. All communication is done is freelancer.com website, if you prefer other mean (like skype) the conversationwill be copied and pasted in one message.
If your bid is choosen, before starting the work for you to clearly know what is expected to do, we will have everything specified in the requirements documents that both of us have to accept. I will write a draft of the requirements and you can adjust, but both of us have to agree before the project is awarded.
The main objective is to create one independant windows application that can be compiled and them used in any windows computer witout the need of any interperter/compiler or other software.
The program is going to crawl part of a website mainly build in flash, scrap and collect specific data and store in a XML file.
I need that to be done recuservely, so you will need to create a bot. The objective is to read the website and constantly keep updating the XML. Entries that are older than 120 minutes should be deleted.
Just for having a general idea, another program (not from this project) will read the XML and use the data for some purpose.
I will create a XML and XSD for you to use, I'm open to listen your ideas and if you propose some other structure that can store all the data in a easy to index way and if I accept it, that can also be changed.
As data from the website is constanly changing is expected that the all process to take just a few seconds to crawl , scrape and store the data, otherwise data will not be up-to-date.
The time between each access of the bot to the website should be randomize between two values.
The start of the crawl is one fixed webpage mainly build in flash, each access is supossed to crawl just the dynamic links situated in a column of that flash webpage, all crawled webpages are also mainly done in flash and all pages have a very similar structure.
One access when you dont have so many pages should crawl 22 pages and scrape around 200 registers per page.
More specific information like the webpage, the data required will be given to the bidders.
Dear Sir,
I can deliver you the results as per your wish, and I just need the site that you are after. for more information please check your private message box.
Thanks and regards