
I am seeking skilled freelancers to support an AI training initiative aimed at improving Kazakh speech recognition models. In this role, you will produce precise, verbatim transcriptions and review audio recordings in Kazakh. Your contributions will directly help train and enhance next-generation AI speech technologies.

Responsibilities
✅ Listen to audio recordings and produce accurate, verbatim transcriptions in Kazakh.
✅ Capture not only what is said, but how it is said (including filler words, repetitions, and false starts).
✅ Add speaker labels for multi-speaker recordings (2–8 participants).
✅ Insert precise timestamps for text segments (within 500ms accuracy).
✅ Annotate non-speech events such as laughter, background noise, or unclear audio.
✅ Follow detailed project instructions and maintain high quality standards.
✅ Maintain steady productivity and reliability throughout the project.

Workload & Commitment
✅ Work volume is flexible, with approximately 4 hours of work available per day.
✅ Expected project duration: 1 to 2 months.
✅ We are flexible, but ideally contributors can dedicate at least 20 hours per week.
✅ Work can be completed at any time of day as long as weekly hours and quality levels are met.

Requirements
✅ Native-level fluency in Kazakh, including familiarity with regional accents and informal speech.
✅ Strong listening comprehension and attention to detail.
✅ Ability to produce verbatim transcriptions, including stutters, repetitions, and filler words.
✅ Experience in transcription, annotation, or linguistics is an advantage.
✅ Comfortable using browser-based tools (training will be provided).
✅ Reliable internet connection.
✅ Access to a laptop or desktop computer (mobile devices are not supported).
✅ Reliable availability for 20+ hours per week.

Work Assignment
This project focuses on transcription tasks involving both single-speaker and multi-speaker audio recordings. Tasks may vary in complexity depending on audio quality and number of speakers. Clear instructions and guidelines will be provided, and ongoing feedback may be given to ensure quality standards are met.

Task Summary (What to Expect)
You will listen to real audio recordings and produce precise, verbatim transcripts, capturing not just what was said, but how it was said. That means every “uh,” every repeated word, and every trailing sentence matters.

Your work will include:
• Verbatim transcription with a target of 98%+ accuracy
• Timestamping segments with less than 500ms deviation
• Speaker labeling for conversations with 2 to 8 participants
• Event annotation for non-speech elements (e.g., laughter, background noise, unintelligible audio)

You will be asked to complete a calibration test which will determine whether you will proceed to production tasks.

Transcription Payment
✅ Payment Note: The project focuses on transcription.
✅ The compensation rate is $30 per audio hour, based on expected productivity and quality standards.
✅ Payment is tied to meeting quality benchmarks and maintaining consistent output.

Please note that if your work quality falls below the client's standards or if your productivity significantly exceeds or falls short of expected levels, you may be removed from the project. Payment may not be provided for submitted work that does not meet requirements.

Project ID: 40376874
1 proposal
Remote project
Active 5 days ago

Bacolod City, Philippines
Payment method verified
Member since Apr 16, 2026