Develop resume parser for a specialized type of resumes

$30-250 USD

In Progress
Posted almost 9 years ago

Paid on delivery
I have thousands of resumes in PDF format that I need converted to XML. All of the resumes follow a similar layout, describe the same type of candidate, and are written in English. My parsing requirements are specific. A typical HR resume parser concentrates on work experience and pays little attention to related areas such as academic awards and hobbies. I need the opposite emphasis: the parser should extract the person's hobbies and academic qualifications, and estimate the person's age, gender, and similar attributes. Work experience still matters, but less than the other information. I have attached sample files, taken from publicly available resumes, that resemble the resumes to be parsed and should give you a better idea of what is needed. Further details will be provided upon request.
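As a rough illustration of the XML output described above, here is a minimal Python sketch. The posting does not specify a schema, so the element names (`resume`, `hobbies`, `academic_qualifications`, `work_experience`, `estimated_age`, `estimated_gender`) and the dict-based input are assumptions made purely for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical list-valued fields; the real schema would come from the client.
LIST_FIELDS = ("hobbies", "academic_qualifications", "work_experience")
# Age and gender are guesses, so they are labeled as estimates in the output.
ESTIMATE_FIELDS = ("estimated_age", "estimated_gender")


def resume_to_xml(record):
    """Serialize one parsed resume (a plain dict) to an XML string."""
    root = ET.Element("resume")
    for field in LIST_FIELDS:
        parent = ET.SubElement(root, field)
        for item in record.get(field, []):
            ET.SubElement(parent, "item").text = item
    for field in ESTIMATE_FIELDS:
        value = record.get(field)
        if value is not None:  # omit estimates that could not be made
            ET.SubElement(root, field).text = str(value)
    return ET.tostring(root, encoding="unicode")


sample = {
    "hobbies": ["chess", "rowing"],
    "academic_qualifications": ["B.A. in History"],
    "estimated_age": 28,
}
print(resume_to_xml(sample))
```

Keeping the estimated fields separate from the extracted ones makes it easy for downstream consumers to tell guesses apart from text taken directly from the resume.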
Project ID: 7916389

About the project

8 proposals
Remote project
Active 9 yrs ago

Awarded to:
I've done a lot of work with Python and data parsing. After some research, I found that the most reliable way to extract the text from a PDF is the xpdf package, which includes a binary that converts PDF to plain text. All that remains is to load the text into Python and find a way to infer the information you want. For age, the graduation years from school would be a good starting point, with adjustments based on other factors such as the vocabulary used. For gender, we can use the degree type and work experience to compute probabilities for the likely gender. Any other classification can also be done once all the text is in the Python script.
$155 USD in 2 days
0.0 (0 reviews)
0.0
0.0
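The approach in the awarded proposal could be sketched roughly as follows. This assumes the `pdftotext` binary from xpdf is installed and on the PATH. The helper names, the education keywords, and the default graduation age of 22 are illustrative assumptions, not details from the proposal, and the age estimate is a crude heuristic.

```python
import re
import subprocess
from datetime import date

YEAR_RE = re.compile(r"\b(19[5-9]\d|20\d{2})\b")
# Assumed keywords that mark a line as education-related.
EDU_HINTS = ("bachelor", "university", "college", "b.s.", "b.a.")


def pdf_to_text(pdf_path):
    """Extract text with pdftotext; '-' sends the output to stdout."""
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def estimate_age(text, today=None, age_at_graduation=22):
    """Guess age from the earliest graduation year on education lines."""
    today = today or date.today()
    grad_years = []
    for line in text.splitlines():
        if any(hint in line.lower() for hint in EDU_HINTS):
            years = [int(y) for y in YEAR_RE.findall(line)]
            years = [y for y in years if y <= today.year]
            if years:
                # In a range such as "2006 - 2010", the last year is graduation.
                grad_years.append(max(years))
    if not grad_years:
        return None  # no usable evidence; do not guess
    return today.year - min(grad_years) + age_at_graduation


sample_text = (
    "Education\n"
    "B.A. in History, Example University, 2006 - 2010\n"
    "Experience\n"
    "Analyst, 2011 - 2015\n"
)
print(estimate_age(sample_text, today=date(2016, 1, 1)))  # 28
```

For a real file, the text would come from `pdf_to_text("resume.pdf")` (a hypothetical path) rather than an inline string. Returning `None` when no graduation year is found keeps missing evidence distinguishable from an actual estimate.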
8 freelancers are bidding on average $191 USD for this job
Hi! I have solid experience in Python programming and data parsing. I think I can help you with this task.
$200 USD in 3 days
5.0 (173 reviews)
5.9
5.9
A proposal has not yet been provided
$200 USD in 3 days
4.9 (6 reviews)
3.8
3.8
Hello, how are you? I saw your description and the sample PDFs. I think the main task is to extract the text from the PDFs and convert it to XML format. I can complete this well. I would like to discuss it with you, so please contact me. I look forward to your reply. Bye, Huang.
$189 USD in 3 days
5.0 (8 reviews)
3.5
3.5
I am a student pursuing my degree, so I have free time to work, and I am also working on a Python-based project. I am familiar with Python's regular expressions module.
$166 USD in 3 days
0.0 (0 reviews)
0.0
0.0
I have experience parsing file formats. I have developed parsers for DOC, XLS, PPT, PDF, and RTF files.
$250 USD in 3 days
0.0 (0 reviews)
0.0
0.0

About the client

Cambridge, United States
4.9
6
Payment method verified
Member since Jun 22, 2015