
Closed
German Long Document Sourcing - (AI Training Project)

Summary
We are seeking detail-oriented freelancers to support a large-scale data sourcing project focused on training advanced AI systems. This project involves sourcing high-quality long-form documents in German across multiple domains and categories.

Project Scope
Total Documents Required: 140
Coverage: 17 domains and 140 fine-grained categories
Requirement: 1 document per category
Document Length: Minimum 40 pages, Maximum 100 pages

Key Responsibilities
Ensure all documents are real-world data only (no synthetic or AI-generated content), created within the last 10 years, and relevant to the assigned domain and category.
Maintain high-quality structure, layout, and formatting, and strictly follow all provided sourcing guidelines.

Mandatory Requirements
No duplicate templates: each of the 140 documents must follow a unique structure/template.
Documents must not be sourced from public benchmark datasets.
Only genuine, real-world documents will be accepted.

Compensation & Candidate Profile
Each approved submission will be paid at a fixed rate of $40 per document.
Candidates familiar with German document formats and structures are preferred.
Prior experience in data sourcing, data entry, document annotation, or AI training datasets is a plus but not mandatory.

Additional Information
This is a recurring opportunity, with ongoing batches available based on the quality and consistency of submissions. Only guideline-compliant submissions will be approved.
Project ID: 40417111
1 proposal
Remote project
Active 5 days ago
1 freelancer is bidding on average ₹750 INR/hour for this job

Hello,

I am ready to start sourcing the 140 unique, long-form German documents for your AI project immediately. With B1 German proficiency and 6 years of professional data research experience, I can efficiently navigate the 17 specified domains to find authentic, high-quality documents that meet your strict 40-100 page requirement.

What I will deliver:
Real-World Authenticity: Sourcing strictly genuine, human-created documents published within the last 10 years (zero AI or benchmark datasets).
100% Template Diversity: Guaranteeing that every single document features a completely unique structure and layout, strictly avoiding duplicates.
Language & Context Accuracy: Leveraging my German skills to verify the exact domain, fine-grained category matches, and overall document density.
Flawless Organization: Delivering clean, guideline-compliant batches ready for your AI training systems.

My Platform Goal: While I have extensive corporate experience in data management, I am currently building my profile on this platform. Earning a 5-star review is my top priority, which means I am highly motivated to over-deliver and ensure every document passes your quality checks without needing rework.

I am available to start sourcing the first batch across different domains today.

Best regards,
Ahmed
₹750 INR in 30 days

Mathura, India
Member since Jul 18, 2013