
Open
Posted
•
Ends in 5 hours
Paid on delivery
We are currently looking for pre-recorded call center conversation datasets with the following requirements: • English – 500 hours • Hindi – 500 hours • Spanish – 500 hours Requirements: * Agent-side audio only * No background noise * No long silence segments * Audio format: WAV * 16 kHz, 16-bit or higher * Unidirectional audio * Transcription text required Additionally, please confirm the following details: 1. Is transcription text available? * If yes, is it AI-generated or human-annotated? * If AI-generated, can it be manually refined? 2. If transcripts are available, please share details regarding: * Speaker tags * Timestamp information * Accuracy rates for: • Speaker diarization • WER (Word Error Rate) • Timestamp alignment 3. Please confirm the audio sampling rate specifications. 4. Has personal information within the recordings been de-identified? * If yes, please explain the de-identification process. 5. Scope of Data Use: * The dataset will only be used for internal model training purposes.
Project ID: 40474277
6 proposals
Open for bidding
Remote project
Active 6 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
6 freelancers are bidding on average $177,618 USD for this job

I have extensive experience working on large-scale speech and call-center data projects, including multilingual audio collection, transcription alignment, audio cleaning, segmentation, and dataset preparation for AI/ASR model training. I have previously handled similar datasets involving English, Hindi, and multilingual customer-service recordings for machine learning workflows. I can support your requirements for: • Agent-side audio only • WAV format (16kHz / 16-bit+) • Clean unidirectional recordings • Noise reduction and silence filtering • Transcript preparation and refinement • Metadata and timestamp alignment • QA validation for training readiness Regarding transcripts: • Transcriptions can be provided • AI-generated transcripts can be manually refined • Speaker tags and timestamps can be included • WER, diarization quality, and timestamp alignment reports can be shared where applicable I also understand the importance of privacy and compliance. Sensitive information within recordings can be de-identified using masking/redaction workflows before delivery. My experience includes: • ASR/NLP dataset preparation • Audio preprocessing and annotation • Hindi, English, and multilingual transcription projects • Large-volume dataset handling for internal AI training I can discuss available volume, language coverage, delivery structure, and timeline immediately. Ready to start as soon as requirements are finalized.
$2,000 USD in 7 days
4.3
4.3

Hello, I’m Purity K., a 5.0-rated freelancer specializing in high-quality multilingual call center datasets. I can deliver exactly what you need: • 500 hours each of clean agent-side only audio in English, Hindi & Spanish • WAV format (16kHz, 16-bit), no background noise, minimal silence • Accurate transcriptions with speaker tags & timestamps All data is properly de-identified and ready for AI training. I provide full quality reports (WER, diarization accuracy, etc.). I have successfully supplied similar datasets before and can deliver in clear milestones with fast turnaround. Let me know if you’d like to see samples and detailed quality specs. I’m ready to start immediately. Best regards, Purity K.
$2,505 USD in 7 days
3.0
3.0

I have access to curated call center audio archives across all three languages you've specified and can supply agent side, unidirectional WAV recordings at 16 kHz 16-bit or higher with no background noise and minimal silence segments. To address your checklist directly: transcription text is available in human annotated format with the option for AI assisted refinement on request. Files include speaker tags, timestamp alignment, and accuracy benchmarks covering WER, diarization, and timestamp precision. English and Spanish datasets carry WER rates consistently below 5% and Hindi is verified at under 8%. Timestamp alignment tolerance sits within 200ms across all three corpora. All recordings have been through a structured de-identification process. Personal details including names, account numbers, phone numbers, and addresses have been replaced with standardised placeholders at both the audio and transcript level using a combination of automated NER tagging and human review. The dataset is cleared for internal model training use and I can provide documentation confirming data provenance, licensing scope, and de-identification methodology on request. I can share sample files across all three languages for your technical evaluation before any commitment. Prices are negotiable.
$1,050,000 USD in 270 days
0.0
0.0

⭐ I handled a similar project ⭐, Happy to show you what works before you commit. I developed a comprehensive call center audio dataset matching your specifications. Aligned with your project needs, the dataset ensures smooth model training. Understanding the project's nuances, I emphasize performance, security, and user experience. Specializing in such projects, I guarantee top-notch quality and efficiency. Worst case, you walk away with a free consultation and a clearer understanding of your project. Kind regards, Curtley
$3,700 USD in 7 days
0.0
0.0

Zhob, Pakistan
Payment method verified
Member since Apr 4, 2022
$30-250 USD
$10-15000 USD
$30-250 USD
$10-30 USD
$750-1500 USD
$30-250 USD
$10-30 USD
$30-250 USD
$10-30 USD
$2-8 USD / hour
$30-250 USD
$30-250 USD
$10-30 USD
$30-250 USD
$10-30 USD
$42 USD
$30-250 USD
min ₹2500 INR / hour
$30-250 USD
$30-60 NZD / hour
$30-250 USD
₹600-1500 INR
₹600-1500 INR
$10-50 USD
₹1500-12500 INR