
Extract Data from PDF Files to XML

$8-15 USD / hour

In Progress
Posted over 7 years ago

Dear Freelancers!

I am looking for substantial help building software or a system that can capture certain information from PDF files or directly from a scanner. We receive information on hard-copy paper, and some of the information on the paper should be extracted and stored in a database as text. I have attached a couple of example PDFs to this project to show what the documents look like.

The data I want to extract from the PDFs is:

- AWB (11 or 12 digits in the format xxxxxxxxxxx, xxx-xxxxxxxx or xxx-xxxx xxxx)
- MRN (18 mixed digits and letters, always beginning with the year and DK00, e.g. 16DK00xxxxxxxxxxxx)

That is all the data needed from the various PDF files. It should be stored in a software or browser-based system, if possible like this:

AWB          MRN                 STATUS
23526491776  16DK0056002CEE7F10  OK
61522544077  16DK00560034234FD2  OK
11755688874  16DK005600JGFKFG7   OK
23565658794  16DK005600SJDGH45   OK
21746464646  16DK00560045345DSF  ERROR
81045454570  16DK00560034254DFS  OK
23554545788  16DK005600DSFLJHL3  OK
23526491776  16DK0056005354DSFD  OK

Please note that each AWB can have more than one MRN on each PDF page!

The goal, once this project is finished, is to be able to work further with the data in this software or browser-based system. The plan is to be able to export the data again as an .xml file.

I have no idea whether anyone is able to assist with designing this piece of software. I know that we would have to design it "on the fly" and that it could require a lot of communication both ways to achieve the final result.

Please let me know if you are able to do this project for me, and do not hesitate to ask any questions you might have.

Thank you!
Martin Brandt
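For anyone weighing up the brief, the matching and export steps could be sketched roughly as follows in Python. This is only an illustration under several assumptions that are not in the posting: the text has already been pulled out of the PDF or scan (OCR is not shown), an AWB is treated as 11 digits with an optional hyphen and space, each MRN is paired with the nearest AWB that precedes it in the text, and the XML element names (`shipments`, `entry`, `awb`, `mrn`) are invented for the example.

```python
import re
import xml.etree.ElementTree as ET

# AWB: 11 digits written as xxxxxxxxxxx, xxx-xxxxxxxx or xxx-xxxx xxxx.
# The lookarounds stop the pattern from matching digits inside an MRN.
AWB_RE = re.compile(r"(?<![0-9A-Za-z])(\d{3})-?(\d{4}) ?(\d{4})(?![0-9A-Za-z])")

# MRN: 18 characters, two-digit year + "DK00" + 12 digits/uppercase letters.
MRN_RE = re.compile(r"(?<![0-9A-Za-z])\d{2}DK00[0-9A-Z]{12}(?![0-9A-Za-z])")


def extract_rows(text):
    """Return (awb, mrn) pairs found in already-extracted page text.

    Assumption: an MRN belongs to the closest AWB that appears before it.
    """
    tokens = []
    for m in AWB_RE.finditer(text):
        # Normalise every AWB variant to plain 11 digits.
        tokens.append((m.start(), "awb", "".join(m.groups())))
    for m in MRN_RE.finditer(text):
        tokens.append((m.start(), "mrn", m.group(0)))
    tokens.sort()  # process matches in reading order

    rows = []
    current_awb = None
    for _, kind, value in tokens:
        if kind == "awb":
            current_awb = value
        elif current_awb is not None:
            rows.append((current_awb, value))
    return rows


def rows_to_xml(rows):
    """Serialise the pairs to an XML string (element names are hypothetical)."""
    root = ET.Element("shipments")
    for awb, mrn in rows:
        entry = ET.SubElement(root, "entry")
        ET.SubElement(entry, "awb").text = awb
        ET.SubElement(entry, "mrn").text = mrn
    return ET.tostring(root, encoding="unicode")
```

Calling `extract_rows(page_text)` on the text of one page and passing the result to `rows_to_xml` would give one `entry` per AWB/MRN pair, so an AWB with several MRNs simply appears in several entries. A STATUS column like the one in the example table would need a validation rule that the brief does not spell out, so it is left out here.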
Project ID: 11416500

About the project

9 proposals
Remote project
Active 8 yrs ago

Awarded to:
Sir, I've worked specifically on batch optical character recognition, and can use very advanced libraries to recognize the text from the scanned PDFs. Many thanks. ITGold.
$13 USD in 10 days
4.6 (3 reviews)
1.1
9 freelancers are bidding on average $14 USD/hour for this job
Hey, this is pretty much the same project I did two projects ago for someone on Freelancer. He needed a system where he could drop PDFs into a folder, and my script would then kick into action, OCR the text from the scanned PDFs and store it in a text file. I could probably do this, but I'd have to take a closer look at the project. If you're interested in working together, just send me a message.
$22 USD in 10 days
5.0 (1 review)
3.8

I can do this if you provide more specific information. I have worked on data scraper software.
$14 USD in 40 days
0.0 (0 reviews)
0.0

Hello sir, thanks for the opportunity and for taking the time to review our bid. I have read your job offer and am very interested in doing the job for you. My name is Faiz, from Malaysia. You can count on this job being done perfectly within the time we agree on. I will give you 100% commitment and will be available whenever you need me. I am flexible and will stick to your budget. We can discuss it further. Please consider me for this job. I am waiting for your response and eager to start on your job immediately. Thank you, have a nice day and keep smiling.
$16 USD in 12 days
0.0 (0 reviews)
0.0

I have extensive experience extracting data from sources such as PDF, TXT and CSV and transforming it into other formats such as XML, CSV and Excel. You can call me a data service engineer, which is what I am here in Nepal. Thanks, Prashiddha
$12 USD in 10 days
0.0 (0 reviews)
0.0

A proposal has not yet been provided
$10 USD in 14 days
0.0 (0 reviews)
0.0

About the client

Aabenraa, Denmark
5.0
13
Payment method verified
Member since Aug 14, 2014
