Two data scrapers for Italian websites
$30-250 USD
Paid on delivery
I need two data scrapers for the following sites:
www dot aziende dot it
login dot cercaziende dot it
The scrapers need to collect the following information:
- category (e.g. plumbers)
- Business Name
- description (id="textDescriptor")
- All phone & fax numbers
- Address
- website address
- email address
A business may have more than one phone number; the numbers should be split into the following fields:
- Ph
- Other
- Fax
- Mobile
- AH Contact
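One way the phone fields above could be filled is sketched below. It assumes each number is scraped together with a text label such as "Tel" or "Fax"; the label keywords and the `classify_phones` helper are hypothetical and would need to be checked against the real pages. No label is mapped to "AH Contact" here, so that column stays empty until a suitable label is identified.

```python
# Hypothetical label keywords; the labels actually used on the sites are not known.
LABEL_TO_FIELD = {
    "tel": "Ph",
    "telefono": "Ph",
    "fax": "Fax",
    "cell": "Mobile",
    "cellulare": "Mobile",
    "mobile": "Mobile",
}
FIELDS = ["Ph", "Other", "Fax", "Mobile", "AH Contact"]


def classify_phones(labelled_numbers):
    """Map (label, number) pairs onto the five phone columns.

    The first number seen for a recognised label takes that column.
    Unrecognised or surplus numbers are joined into "Other".
    """
    row = {field: "" for field in FIELDS}
    overflow = []
    for label, number in labelled_numbers:
        key = label.strip().lower().rstrip(".:")
        field = LABEL_TO_FIELD.get(key)
        if field and not row[field]:
            row[field] = number.strip()
        else:
            overflow.append(number.strip())
    row["Other"] = "; ".join(overflow)
    return row
```

A second "Tel" number for the same business ends up in "Other", so no number is dropped even when a business lists several.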
I also need the address broken into separate fields:
- Street number and name
- Suburb
- State
- postcode
- Country
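The address split could look roughly like the sketch below. It assumes addresses appear in a form such as "Via Roma 12, 20121 Milano (MI)", with the town mapped to Suburb and the two-letter province code mapped to State; that layout, the `split_address` name and the default country are assumptions, not something confirmed for either site.

```python
import re

# Assumed layout: "<street>, <5-digit postcode> <town> (<province code>)".
ADDRESS_RE = re.compile(
    r"^\s*(?P<street>.+?),\s*(?P<postcode>\d{5})\s+(?P<suburb>.+?)"
    r"\s*(?:\((?P<state>[A-Za-z]{2})\))?\s*$"
)


def split_address(raw, country="Italy"):
    """Split a one-line address into the five requested fields.

    If the text does not fit the assumed layout, the whole string is
    kept in "Street" so that nothing is lost.
    """
    match = ADDRESS_RE.match(raw)
    if not match:
        return {"Street": raw.strip(), "Suburb": "", "State": "",
                "Postcode": "", "Country": country}
    return {
        "Street": match.group("street").strip(),
        "Suburb": match.group("suburb").strip(),
        "State": (match.group("state") or "").upper(),
        "Postcode": match.group("postcode"),
        "Country": country,
    }
```

Addresses that do not match fall back to a single "Street" value, which makes them easy to spot and fix by hand in the exported file.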
The script must be able to:
- use a list of proxy servers in round-robin fashion, rotating to the next proxy every 20 or 50 requests
- take as input a file containing the list of URLs
- export the data to a CSV file
- automatically follow the continuation pages (2, 3, 4 and onwards) to extract the full data
- let me specify the maximum number of records to retrieve and the retrieval speed (delay between requests)
A simple interface should let me start and stop the script and provide basic progress feedback.
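The round-robin proxy requirement can be illustrated with a small sketch. The `ProxyRotator` class and its parameter names are hypothetical; it only decides which proxy to use next and leaves the actual HTTP request to whatever library the scraper uses.

```python
from itertools import cycle


class ProxyRotator:
    """Hand out proxies round-robin, switching after a fixed number of requests."""

    def __init__(self, proxies, requests_per_proxy=20):
        if not proxies:
            raise ValueError("proxy list is empty")
        self._pool = cycle(proxies)       # endless round-robin iterator
        self._limit = requests_per_proxy  # e.g. 20 or 50
        self._used = 0
        self._current = next(self._pool)

    def next_proxy(self):
        """Return the proxy to use for the next request."""
        if self._used >= self._limit:
            self._current = next(self._pool)
            self._used = 0
        self._used += 1
        return self._current
```

Calling `next_proxy()` once per request keeps the same proxy for `requests_per_proxy` calls and then moves to the next one, wrapping around at the end of the list. With the `requests` library, for example, the returned value could be passed as `proxies={"http": proxy, "https": proxy}`.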
For the first website, the URLs that contain the links to the information look like:
http://www dot aziende dot it/abbigliamento/[login to view URL]
http://www dot aziende dot it/casa-e-giardino/[login to view URL]
and so on.
For the second website, the URLs look like:
http://login dot cercaziende dot it/category/abbigliamento
http://login dot cercaziende dot it/category/auto-e-moto
and so on.
For this site all the information is on the listing page itself, so there is no need to follow any links other than the paging links.
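The paging, record-limit and delay requirements could be combined along the lines of the sketch below. The `crawl_category` function and the `fetch_page` callback are hypothetical: `fetch_page` stands in for the site-specific code that downloads one page and parses its records, and how each site encodes the page number in its URLs is not assumed here.

```python
import time


def crawl_category(start_url, fetch_page, max_records=None, delay=1.0,
                   sleep=time.sleep):
    """Collect records from a category URL and its continuation pages.

    fetch_page(url, page_number) is supplied by the caller and returns
    the list of records found on that page, or an empty list when there
    are no further pages.
    """
    records = []
    page = 1
    while max_records is None or len(records) < max_records:
        batch = fetch_page(start_url, page)
        if not batch:
            break                 # ran past the last page
        records.extend(batch)
        page += 1
        sleep(delay)              # throttle between page requests
    # Trim any surplus records collected from the final page.
    return records if max_records is None else records[:max_records]
```

Passing `sleep` as a parameter is only a convenience for trying the loop out without real waiting; in normal use the default `time.sleep` applies the configured delay after every page.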
Project ID: #263685
13 freelancers are bidding on average $166 for this job
Dear sir, I am very interested in your project, Please see PMB for more details. Thanks. Best Regards.