
Open
Posted
•
Ends in 1 day
My Excel workbook already contains a VBA macro that opens a PDF, extracts targetted numeric value from certain columns, aggregates them, and drops the results straight into specific cells. Functionally it works, yet it generates errors when it comes accross different number formats. I need a fast, tidy rewrite (or smart port) that does the same three core steps—read, parse & aggregate, write—within roughly three hours of coding time. You can choose the approach that lets you move fastest: streamline the existing VBA, replace it entirely with a Python routine built around pdfplumber, or create a hybrid where Python performs the heavy lifting and VBA simply updates the sheet. I’m comfortable with any of those paths as long as the final workbook remains a one-click solution. Key goals • Recognise both European “1.234,56” and North-American “1,234.56” formats without confusion • Add solid error handling so corrupted PDFs or missing fields never crash the workbook • Deliver noticeably better performance on a 30-page PDF than the current macro • Produce a concise execution log (text file or hidden sheet) capturing start/end time, pages processed, and anomalies Deliverables 1. Refactored, fully commented code dropped into the workbook (and, if Python is used, the .py file) 2. A brief README covering setup and use 3. Proof of success on my sample PDF (screenshot or short clip) I’ll supply the original workbook and a representative PDF as soon as we start. Please respond only if you can tackle and complete the work in a single session of less than three hours—speed and accuracy are both critical on this one.
Project ID: 40375618
16 proposals
Open for bidding
Remote project
Active 6 hours ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
16 freelancers are bidding on average ₹1,053 INR/hour for this job

With a multifaceted skill set that includes Automation, Data Analysis, Data Entry and Excel, I’m confident in offering you a straightforward, dependable solution for your PDF to Excel script revamp. At Zayer Tech, we value efficiency and believe in minimizing disruptions to your workflow which is why our approach to this project will rely on maximizing your existing macro while minimizing any complications. Having prior experience with both VBA and Python libraries like pdfplumber, we can amalgamate the speed and precision of Python with the convenience of VBA-based updates, if that’s what you prefer. In conclusion, by leveraging my extensive experience in Automation and Data Analysis as well as my proficiency with Excel and its VBA environment matched with the limitless capabilities of Python libraries such as pdfplumber, I can provide a smooth transition from your current solution to a revamped one that not only delivers better performance but also produces detailed logs for end-to-end traceability—all within your preferable timeframe of three hours. Remember, Zayer Tech has been instrumental in providing unique AI-driven solutions tailored to specific needs – let's make this project our latest success together!
₹1,000 INR in 40 days
6.8
6.8

Your current macro is failing because VBA's string-to-number conversion chokes when it encounters mixed locale formats—a 30-page PDF with inconsistent delimiters will throw Type Mismatch errors every time. This isn't a code quality issue; it's a fundamental limitation of VBA's parsing logic that no amount of error trapping will fix cleanly. Before I propose the architecture, I need clarity on two constraints: What's your Python environment? If I build a pdfplumber solution, do you already have Python installed, or do I need to bundle it as a standalone .exe that VBA shells out to? What triggers the macro today? Is it a button click, workbook open event, or manual run? I ask because if we go hybrid (Python extraction + VBA write), the handoff mechanism changes depending on how you currently invoke the process. Here's the execution path: - PDFPLUMBER + REGEX: Extract numeric strings with a pattern that captures both "1.234,56" and "1,234.56", then normalize to float before aggregation. This eliminates locale guesswork and handles edge cases like "1 234,56" (space-delimited thousands). - PYTHON LOGGING MODULE: Write a timestamped log to a .txt file in the workbook directory, capturing page count, parse failures, and runtime. VBA reads this log post-execution and optionally dumps it to a hidden sheet. - XLWINGS OR OPENPYXL: Write aggregated results directly to named ranges in your workbook without opening Excel's UI. This is 10x faster than VBA's cell-by-cell writes and prevents screen flicker. - VBA WRAPPER: A single subroutine that shells the Python script, waits for completion, then refreshes the target sheet. You keep your one-click button; the user never sees Python running. I've rebuilt similar PDF-to-Excel pipelines for three accounting firms where mixed European/US invoices caused daily failures. The Python approach cut processing time from 90 seconds to under 8 seconds on 40-page documents and eliminated all format-related crashes. I don't take three-hour projects where the existing codebase is undocumented. Send me the workbook and sample PDF first—I need to see the current VBA structure and confirm the PDF isn't image-based before committing to the timeline.
₹900 INR in 30 days
5.5
5.5

Hi, I can handle this PDF data extraction task using Python and pdfplumber to ensure speed and accuracy. I have significant experience with web scraping and automation projects involving complex document parsing. My plan is to develop a Python script that uses regex patterns to identify and normalize both European and North-American number formats before aggregation. I will wrap this in a simple executable or call it via a VBA shell command to maintain your one-click workflow. The script will include a robust try-except block to log anomalies to a hidden sheet without crashing. This approach will significantly outperform standard VBA for multi-page PDF processing. I am available to start immediately and can complete the rewrite within your three-hour window. Please reach out to discuss the specifics. cheers Nehal
₹1,400 INR in 2 days
3.5
3.5

As an experienced software developer, my proficiency lies precisely in turning ideas, like yours, into reliable and efficient code solutions. Your project aligns perfectly with my skills. Having extensive knowledge in Excel automation and scripting with VBA and Python, I'm confident I can deliver the revamped PDF to Excel script you require with speed and accuracy within your specified time frame. My approach to the task of recognising different number formats without confusion is to leverage Python's pdfplumber library for precision. I can also incorporate solid error handling protocols in the script to eliminate any chances of workbook crashes in events like corrupted PDFs or missing data fields. Additionally, a critical goal for me would be delivering noticeably better performance on a 30-page PDF than your current macro, enhancing the overall efficiency of your worksheet. In terms of deliverables, I commit to providing refactored code with comprehensive comments for easy understanding and maintenance. A README file will also be provided covering the setup and usage process. For validation purposes, I'll present proof of success on your sample PDF in form of a screenshot or short clip. Finally, I assure you of my commitment to communication, clear implementation of agreed-upon objectives, and long-term maintainability of the solution—qualities that have been crucial to my career as a sought-after developer.
₹1,000 INR in 40 days
2.6
2.6

**PROPOSAL: PDF-to-Excel VBA Revamp** Hi there, I specialize in automating data extraction workflows, and I understand the challenge—VBA macros for PDF processing are tricky, especially when dealing with finicky PDF structures and maintaining Excel integration. **What I can deliver:** I'll diagnose and revamp your existing macro to reliably extract numeric values from PDFs with improved error handling and performance. This includes: - Debugging current code and identifying failure points - Optimizing PDF parsing logic (likely leveraging VBA's native capabilities or lightweight Python integration if needed) - Testing against your actual PDFs to ensure robust extraction - Clear documentation for future maintenance **Why I'm a fit:** I've built dozens of automation pipelines—from data extraction and API integrations to multi-step workflows. I understand the balance between VBA's limitations and what's actually achievable within Excel, and I deliver clean, maintainable solutions. **Timeline & Scope:** Given the $750 budget and the VBA/PDF complexity, I'd estimate **5–7 business days** for a solid revamp. This covers diagnosis, optimization, and testing. However, I notice your description is truncated—could you clarify: - Current macro issues (errors, slowness, inconsistency)? - PDF complexity (layout, OCR needed, or simple text extraction)? This ensures I scope correctly and deliver exactly what you need—no surprises. **Let's talk specifics and get this working reliably.**
₹750 INR in 7 days
1.8
1.8

Hello, I can handle this in a single focused session and quickly refactor your PDF-to-Excel automation so it stays one-click while becoming more reliable and faster. I can either clean up the existing VBA, port the parsing to Python with pdfplumber, or build a hybrid solution depending on which path gives the best speed and stability within your three-hour window. A key part of my fix will be robust number parsing so both European and US formats are recognized correctly, along with solid error handling for damaged PDFs, missing fields, and unexpected layouts. I will also add a concise execution log showing run time, processed pages, and any anomalies, while keeping the code fully commented and easy to maintain. You will receive the updated workbook, any supporting Python file if used, and a short README explaining setup and usage. Once you share the workbook and sample PDF, I can move straight into the rewrite and deliver proof that it works on your sample file.
₹1,000 INR in 40 days
0.0
0.0

Hey! The core issue isn't the extraction logic, it's solid error handling when PDFs have inconsistent structures or missing columns. And I've debugged and refactored dozens of VBA macros for PDF parsing in Excel workbooks over the last 15 years. I'll add proper error trapping - validate column existence before extraction, and ensure the macro gracefully handles edge cases without breaking. This is a quick fix, likely 2-3 hours. I'm offering $30/hr (symbolic rate) because I need strong reviews to build momentum on Freelancer. You get fast, reliable work, I get a solid review. Check my profile for 135+ WordPress projects and automation tools, I work extremely fast with amplu scripting tasks. Ready to start immediately. Artur
₹2,801.38 INR in 40 days
0.0
0.0

Hi, I read your post—you already have a working VBA macro that extracts values from PDFs into Excel, but it breaks with mixed number formats and needs a fast, reliable rewrite that handles parsing, aggregation, and writing cleanly in one run. I’ve worked on similar Excel automation tasks where stability and data parsing were the main issues. In one project, I replaced a fragile VBA parser with a Python-based pipeline (pdfplumber + pandas) that handled inconsistent number formats, improved speed, and removed random runtime failures completely. We kept Excel as the final output layer so the user still had a one-click workflow. Here’s how I’d approach yours: – Parse PDF using Python (pdfplumber for structured + fallback text extraction) – Normalize number formats (handle both EU and US formats safely, no ambiguity) – Aggregate values in a clean pandas layer – Write results back to Excel (openpyxl or xlwings for direct cell mapping) – Add a simple execution log (pages processed, errors, timing, anomalies) – Wrap everything so Excel VBA triggers a single Python call (one-click flow preserved) This avoids patching fragile VBA logic and makes the system stable across different PDFs. Quick question—are your PDFs consistently structured (same layout every time), or do they vary between sources? I can complete this in a single focused session as requested. Best, Mihailo
₹1,000 INR in 40 days
0.0
0.0

Hello, I have read your project details and I get what you need. I am a skilled freelancer with 4 years of experience in Python, Automation. Check my profile for portfolio and reviews. Looking forward to your reply. Best regards, Syeda Tahreem
₹750 INR in 40 days
0.0
0.0

Hi, I can revamp your PDF→Excel script with robust handling for inconsistent table structures and numeric format edge cases. Milestone 1 (24–48h): 1) inspect current script + failure samples 2) harden parsing/normalization logic 3) preserve exact output mapping 4) provide validation run + error log format If you share 2–3 sample files (working + failing), I can start immediately. Best, Stoyan
₹1,100 INR in 2 days
0.0
0.0

Dear Sir/Madam, I am an experienced Python Developer with strong expertise in building scalable backend systems, APIs, automation tools, and full-stack applications. I specialize in delivering clean, efficient, and production-ready solutions. I have successfully developed and deployed multiple live applications including healthcare platforms, legal service apps, school management systems, fintech apps, and real-time communication systems. My Core Python Expertise ✔ Django & Django REST Framework ✔ FastAPI (High-performance APIs) ✔ Flask ✔ SQLModel / SQLAlchemy ✔ PostgreSQL / MySQL / MongoDB ✔ Supabase Integration ✔ Authentication (JWT, OAuth) ✔ Payment Gateway Integration (PhonePe, Razorpay, Stripe) ✔ Web Scraping (BeautifulSoup, Selenium) ✔ Automation Scripts ✔ WebSocket & Real-time Systems ✔ Docker Deployment ✔ AWS / VPS Deployment ✔ REST API Design & Optimization What I Can Build For You Secure REST APIs SaaS backend architecture Admin dashboards Real-time chat systems Payment systems Data processing systems Microservices architecture AI/ML API integration Custom business logic systems Recent Project Experience Healthcare booking & wallet system Legal consultation backend platform School ERP & management API Fintech wallet & transaction management Real-time chat application (WebSocket + MQTT) Location-based services & geo APIs
₹750 INR in 40 days
0.0
0.0

Automated PDF Data Pipeline: Format-Agnostic Extraction for Excel Hi, I will implement a high-performance automated data extraction system to replace your current VBA macro. As an Automation Architect, my focus is on building a robust "set-and-forget" solution that handles formatting chaos and eliminates manual intervention. How I will solve your problem: The Engine: I will use a Python-based extraction core (pdfplumber) integrated into your workbook. This tool is specifically designed for complex PDF structures and is 10-20x faster than native VBA. Smart Logic (The "Brain"): Instead of simple text replacement, I will implement a normalization layer that automatically detects decimal separators (EU vs. US) by analyzing the number structure. This prevents errors regardless of the locale. Execution Architecture: One-Click Experience: You keep your single-button Excel workflow. VBA will trigger the background process and update the cells. Resilience: I’ll add a dedicated Execution Log (hidden sheet or file) to capture start/end times and flag anomalies or missing data without crashing the system. Fast Delivery: I will complete the entire build, including the error-handling layer and README, in a single session of less than 3 hours. I focus on scalable, practical logic rather than just writing lines of code. Ready to start as soon as you provide the sample PDF. Best regards, Alexey
₹800 INR in 7 days
0.0
0.0

Hi, I am ready to start right now and can comfortably complete this entire rewrite in a single session under your 3-hour limit. I am a Senior Automation Engineer. I strongly recommend completely replacing the fragile VBA parsing with a pure Python routine using pdfplumber. Python is exceptionally fast at this and won't crash your Excel workbook. How I will hit all your key goals: Number Formats: I will use Python Regular Expressions (Regex) to flawlessly identify and standardize both EU (1.234,56) and NA (1,234.56) formats on the fly before aggregation. Stability: I will wrap the extraction logic in robust try-except blocks. Corrupted pages will be bypassed cleanly and logged, never crashing the script. Logging & One-Click: I will implement Python's logging module to generate the exact execution log you need. To keep it a "one-click" solution, I will provide a tiny VBA shell command that silently triggers the Python script from your workbook. As a new freelancer here, I am highly motivated to execute this perfectly to earn my first 5-star review. Please send over the representative PDF and the workbook, and I will jump right into the code.
₹800 INR in 3 days
0.0
0.0

Amravati, India
Member since Mar 29, 2023
$250-750 USD
$15-25 USD / hour
₹12500-37500 INR
$250-750 USD
$14-20 NZD
₹100-400 INR / hour
₹100-400 INR / hour
$10-30 USD
$750-1500 AUD
₹12500-37500 INR
$30-250 USD
₹400-750 INR / hour
₹100-400 INR / hour
$250-750 USD
$750-1500 AUD
₹750-1250 INR / hour
₹750-1250 INR / hour
$30-250 USD
$10-30 USD
$8-15 USD / hour