AppleScript BBEdit HTML to CSV for Froogle

$100-250 USD

Completed
Posted over 18 years ago

Paid on delivery
I need to convert 800+ pages of products, currently displayed in HTML files, into CSV files for Froogle. The site and its pages are generated by ColdFusion, and I have no access to the raw data, only the rendered HTML pages. My suppliers (thousands of them) update their own products, and these products change all the time. Right now, downloading the raw HTML files takes approx. 2 minutes per file; some pages have 2 products, others have 2000. Items are displayed 3 across with image, title, short description, price and minimum quantity.

I need a script that I can run locally to strip everything else out of the code and leave me with a CSV file that I can then upload to Google. If we can take that CSV and remove duplicate items, that'd be a bonus; I have FileMaker if that helps. The script has to run locally because the data changes constantly.

I'll integrate the script you provide into the script I've already written, so that the whole process will:

1) View the page of products in Safari and set a variable to the source code of that page.
2) Dump that source code into BBEdit 7.
3) Use your script to process the file, stripping out all the crap.
4) Leave me with a nice clean CSV file for that page of products, with image URLs and product-specific page URLs.

Then I'll be able to merge several of these together for multiple Froogle feeds.

**Note - I've made several attempts to download the complete page of products using CURL and URL Access Scripting, but there's a JavaScript cookie that needs to be set on the site prior to downloading, and these two alternatives don't seem to support that.

A sample of the downloaded HTML source file is attached.

UPDATE: I've uploaded an Excel spreadsheet of the required information that needs to be included.
Fields should be mapped out as:

product_url - is in the DETAILS link
name - first line of desc
description - 2nd line of desc
image_url - URL of image
price
offer_id - left blank

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others, including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

The final deliverable will need to run on Mac OS X 10.3.9 with BBEdit 7.1. Ideally it will also use Safari to get the original code. However, if there's a better method using the Terminal and CURL, or another browser, I'm open to suggestions.
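
To make the field mapping and the duplicate-removal bonus concrete, here is a minimal sketch of the extraction step. It is written in present-day Python rather than the AppleScript/BBEdit setup requested above, so it illustrates the logic only and is not the deliverable. The attached sample HTML is not reproduced in this posting, so every tag and class name below (`td class="item"`, `p class="desc"`, `span class="price"`) is a hypothetical stand-in for the real markup, and treating two rows with the same `product_url` as duplicates is likewise an assumption.

```python
import csv
import io
from html.parser import HTMLParser

# Column order taken from the field mapping in the posting.
FIELDS = ["product_url", "name", "description", "image_url", "price", "offer_id"]


class ProductParser(HTMLParser):
    """Collects one dict per product cell. All tag/class names are hypothetical."""

    def __init__(self):
        super().__init__()
        self.products = []
        self.current = None   # product currently being built, or None
        self.target = None    # text bucket we are inside: "desc", "price", "link"
        self.desc_lines = []
        self.link_href = ""
        self.link_text = ""

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "td" and attrs.get("class") == "item":
            self.current = dict.fromkeys(FIELDS, "")  # offer_id stays blank
            self.desc_lines = [""]
        elif self.current is None:
            return  # ignore everything outside a product cell
        elif tag == "img":
            self.current["image_url"] = attrs.get("src") or ""
        elif tag == "p" and attrs.get("class") == "desc":
            self.target = "desc"
        elif tag == "br" and self.target == "desc":
            self.desc_lines.append("")  # <br> starts the next description line
        elif tag == "span" and attrs.get("class") == "price":
            self.target = "price"
        elif tag == "a":
            self.target = "link"
            self.link_href = attrs.get("href") or ""
            self.link_text = ""

    def handle_data(self, data):
        if self.current is None:
            return
        if self.target == "desc":
            self.desc_lines[-1] += data
        elif self.target == "price":
            self.current["price"] += data.strip()
        elif self.target == "link":
            self.link_text += data

    def handle_endtag(self, tag):
        if self.current is None:
            return
        if tag == "a" and self.target == "link":
            # Only the link labelled DETAILS carries the product URL.
            if self.link_text.strip().upper() == "DETAILS":
                self.current["product_url"] = self.link_href
            self.target = None
        elif tag in ("p", "span"):
            self.target = None
        elif tag == "td":
            lines = [line.strip() for line in self.desc_lines if line.strip()]
            self.current["name"] = lines[0] if lines else ""
            self.current["description"] = lines[1] if len(lines) > 1 else ""
            self.products.append(self.current)
            self.current = None


def html_to_rows(html):
    parser = ProductParser()
    parser.feed(html)
    parser.close()
    return parser.products


def dedupe(rows):
    """Keep the first row for each product_url (assumed to identify an item)."""
    seen, unique = set(), []
    for row in rows:
        if row["product_url"] not in seen:
            seen.add(row["product_url"])
            unique.append(row)
    return unique


def to_csv(rows):
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Made-up page fragment: three cells, the third repeating the first.
SAMPLE = """
<table><tr>
<td class="item"><img src="/img/a.jpg"><p class="desc">Blue Widget<br>Sturdy steel widget</p>
<span class="price">$4.50</span><a href="/detail.cfm?id=1">DETAILS</a></td>
<td class="item"><img src="/img/b.jpg"><p class="desc">Red Widget<br>Light plastic widget</p>
<span class="price">$3.25</span><a href="/detail.cfm?id=2">DETAILS</a></td>
<td class="item"><img src="/img/a.jpg"><p class="desc">Blue Widget<br>Sturdy steel widget</p>
<span class="price">$4.50</span><a href="/detail.cfm?id=1">DETAILS</a></td>
</tr></table>
"""

if __name__ == "__main__":
    print(to_csv(dedupe(html_to_rows(SAMPLE))))
```

Because the rows are plain dictionaries keyed by the field names, CSV output from several pages can be merged by concatenating the row lists before calling `dedupe`, which matches the plan to combine pages into multiple Froogle feeds. The parser would need its tag checks adjusted once the real page structure is known.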
Project ID: 3118966

About the project

2 proposals
Remote project
Active 18 yrs ago

Awarded to:
$212.50 USD in 3 days
5.0 (3 reviews)
3.1
2 freelancers are bidding on average $208 USD for this job
$204 USD in 3 days
4.9 (6 reviews)
2.7

About the client

Norwalk, United States
5.0
5
Member since Aug 13, 2005
