Hi, I'm looking for someone to build me a way of parsing CVs, preferably using Textract, though other options such as Apache Tika will be considered. It has to be able to parse out specific information related to artists. I'm open to page-scraping technology as well. You need to be experienced with AWS, and either way you will need to be AWS certified, because this has to go onto my production server at some point in the future.
1. If you are interested let me know
2. I will send you the briefing document.
3. Send me your solution. It isn't really helpful to just say "I can do it". I want to know how you will complete this job, what technology you think will give the best results, how many hours you estimate it will take to work out the parsing, how you will use the AWS environment, and what you will need to set up.
4. If you quote above the range then you probably won't get the gig unless you can show how your solution is neat and brilliant.
The required output is the exhibition details for the given artist. You need to extract them in the following format:
<YYYY> - <exhibition title>, <exhibition venue>, <exhibition city>
Start with 6 example pages that I will supply to you. You use your own infrastructure and just send me the results as a CSV. If that looks good, we can then go to the next stage.
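To make the target format concrete, here is a minimal Python sketch of the last step only: turning an already-extracted line of text into the `<YYYY> - <exhibition title>, <exhibition venue>, <exhibition city>` form and into CSV rows. It assumes, hypothetically, that each exhibition sits on one line that starts with a four-digit year and ends with comma-separated venue and city; the real layout depends on the example pages and the briefing document. The names `Exhibition`, `parse_line`, `format_exhibition` and `to_csv` are illustrative, and no Textract or Tika call is shown.

```python
import csv
import io
import re
from typing import Iterable, NamedTuple, Optional


class Exhibition(NamedTuple):
    year: str
    title: str
    venue: str
    city: str


# Assumed input layout (hypothetical): "<YYYY> [-|:] <title>, <venue>, <city>"
LINE_RE = re.compile(r"^\s*(?P<year>(?:19|20)\d{2})\s*[-:]?\s*(?P<rest>.+)$")


def parse_line(line: str) -> Optional[Exhibition]:
    """Return an Exhibition, or None if the line does not fit the assumed layout."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    parts = [part.strip() for part in match.group("rest").split(",")]
    if len(parts) < 3:
        return None
    # The last two fields are taken as venue and city; anything before
    # them is rejoined so that titles containing commas survive.
    title = ", ".join(parts[:-2])
    return Exhibition(match.group("year"), title, parts[-2], parts[-1])


def format_exhibition(exhibition: Exhibition) -> str:
    """Render one record in the format requested in the posting."""
    return (
        f"{exhibition.year} - {exhibition.title}, "
        f"{exhibition.venue}, {exhibition.city}"
    )


def to_csv(exhibitions: Iterable[Exhibition]) -> str:
    """Serialise records as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["year", "title", "venue", "city"])
    writer.writerows(exhibitions)
    return buffer.getvalue()
```

Lines that do not match, such as section headings, come back as `None`, so they can be logged and reviewed by hand rather than silently dropped.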
Thanks for your interest.
25 freelancers are bidding on average $532 for this job
Unfortunately, I would have submitted a well-thought-out proposal if you had a decent budget. FYI, here's a Textract project I did a couple of months back: [login to view URL]
I am experienced with AWS and building services with Python tools. Judging by your description, to be honest, I am not sure about the final setup of the module you are targeting, but this can be clarified.
Hi, I am an expert in Python Tika and Textract. I have read your specification details. I can give you good results using my Python code. I am ready to start working for you. Thanks.
Hi, I have 10 years of web scraping experience. I use the Guzzle module and a proxy server for scraping. I would like to discuss this with you via chat. Regards, Rory....
This sounds interesting and I am keen to set this up for you. Can you provide a sample data set that I can use on my end to generate the data format mentioned? Reach out to get started.
I can extract the data from a PDF, a Word document or a web page within the AWS environment. Please have a look at my GitHub profile for AWS work: [login to view URL]
Hello. I have 5+ years of experience in AWS project development. In particular, I am a web scraping expert with a lot of experience. I can help you. Let's discuss in detail. Thanks.