
In Progress
Paid on delivery
I have a short “Live Photo”-style clip and I need clear, automated insight into the people captured in it. The goal is to identify every person in frame and extract their pose information, then summarise what is happening overall so I can understand “what’s in the video” at a glance.

Please work with the original file I will supply (standard MP4 from an iPhone). I’m happy for you to use tools such as OpenCV, MediaPipe, or any preferred deep-learning framework, as long as the final results are accurate and reproducible on my end.

Deliverables
• JSON or CSV with per-frame pose key-points for each detected person
• A short text summary describing the scene and actions
• The processed video or image sequence with skeletal overlays for quick visual review

Acceptance criteria
• Every visible person is detected; missed detections flagged if confidence drops below your model’s threshold
• Pose key-points align correctly with limbs in at least 95% of frames
• Summary clearly states how many people appear, what they are doing, and any notable interactions

Once I confirm the results meet these criteria, we can wrap up. This should be a focused, quick-turnaround task.
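As a rough illustration of the first deliverable and the confidence flag in the acceptance criteria, a per-frame record might be built along the lines of the sketch below. The field names, the 0.5 threshold, and the helpers `frame_record` and `passing_fraction` are assumptions made for illustration, not part of the brief. Key-points are passed in by hand here; in practice they would come from whichever pose model is chosen.

```python
import json
from dataclasses import dataclass, asdict

# Assumed cutoff for illustration; the real value depends on the chosen model.
CONF_THRESHOLD = 0.5


@dataclass
class Keypoint:
    name: str          # e.g. "nose", "left_wrist" (naming scheme is an assumption)
    x: float           # horizontal position, normalised to 0..1
    y: float           # vertical position, normalised to 0..1
    confidence: float  # model-reported score in 0..1


def frame_record(frame_index, people, threshold=CONF_THRESHOLD):
    """Build one JSON-ready record; `people` is a list of key-point lists."""
    persons = []
    for person_id, keypoints in enumerate(people):
        # Mean key-point confidence; an empty detection counts as 0.0.
        if keypoints:
            mean_conf = sum(k.confidence for k in keypoints) / len(keypoints)
        else:
            mean_conf = 0.0
        persons.append({
            # Index within this frame only, not a tracked identity.
            "person_id": person_id,
            "mean_confidence": round(mean_conf, 3),
            "low_confidence": mean_conf < threshold,
            "keypoints": [asdict(k) for k in keypoints],
        })
    return {"frame": frame_index, "people": persons}


def passing_fraction(records):
    """Fraction of frames in which no detected person is flagged."""
    if not records:
        return 0.0
    ok = sum(
        1 for r in records
        if not any(p["low_confidence"] for p in r["people"])
    )
    return ok / len(records)


# Hand-made sample data standing in for real model output.
good = [Keypoint("nose", 0.5, 0.2, 0.9), Keypoint("left_wrist", 0.3, 0.6, 0.8)]
weak = [Keypoint("nose", 0.7, 0.2, 0.4), Keypoint("left_wrist", 0.8, 0.6, 0.2)]

records = [frame_record(0, [good]), frame_record(1, [good, weak])]
print(json.dumps(records[1], indent=2))
print("frames without flags:", passing_fraction(records))
```

The unflagged-frame fraction is only a proxy for the 95% criterion: whether key-points actually sit on the limbs still has to be checked against the overlay video.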
Project ID: 40432112
10 proposals
Remote project
Active 8 days ago
10 freelancers are bidding on average $123 USD for this job

Hello, With over 7 years of experience in Python and OpenCV, I specialize in developing AI systems for image and video analysis. I understand your requirement for automated people pose analysis in a short video clip. I will utilize tools like OpenCV and deep-learning frameworks to accurately extract pose information for each person in the frame. My expertise in building scalable systems for real-time processing makes me well-suited for this project. I have a strong background in full-stack development (Node.js, TypeScript, Python, Go) and AI systems, ensuring efficient and accurate results. Let's discuss further details in the chat so we can align on the project specifics. Thanks.
$140 USD in 7 days
2.9

Hello, How will the pose analysis model handle scenarios where individuals are partially obscured or in complex poses? With over 7 years of experience in Computer Vision, Deep Learning, and Python, I specialize in developing accurate and efficient algorithms for analyzing visual data. For this project, I would approach the task by leveraging a combination of OpenCV and MediaPipe to detect and extract pose key-points from each person in the video. By ensuring robust handling of edge cases and optimizing for accuracy, I will deliver a JSON/CSV file with detailed pose information, a concise scene summary, and visual overlays for easy review. I prioritize clear communication and timely delivery. I am eager to discuss further details and move forward with this engaging project. Best regards, Borys
$140 USD in 7 days
1.7

Hi, this is Abhiram from the UK. I understand the need to analyze poses in a short video clip accurately. Having worked on similar projects, I recognize the technical challenges involved in identifying individuals and extracting pose information reliably. To approach this, I would use tools like OpenCV or MediaPipe for precise per-frame pose key-point extraction and scene summarization. The focus would be on delivering accurate results, aligning key-points correctly, and providing a clear summary of the actions in the video. Let me ask a couple of things so I understand the task better: Q1: Are there specific actions or interactions you are particularly interested in identifying? Q2: Do you have any preferences regarding the format of the final deliverables? Looking forward to discussing this further and ensuring a successful outcome for your project.
$120 USD in 3 days
1.1

Accurate pose estimation and scene summarization hinge on using frameworks like OpenCV and deep learning effectively. To address your specific needs, I propose using MediaPipe to extract reliable pose key-points for each individual detected. A JSON or CSV output will make the data easy to inspect, while a short text summary will capture the key scene dynamics and interactions among subjects. I will ensure that every visible person is detected, flag any detection whose pose confidence drops below the model's threshold, and target correct key-point alignment in at least 95% of frames. Expect the initial deliverables within 7 days. Want to see a quick deliverable preview before committing?
$110 USD in 10 days
0.0

The "Live Photo" format gives us reliable frame data to work with, which makes this cleaner to process than a standard video clip. I would use OpenCV with MediaPipe to extract pose keypoints for each person and output a structured report with timestamps and movement labels. I can start today and have results ready within 48 hours. The bid reflects what is in the description. Final numbers come after I see the actual clip. Want to jump on a quick call?
$150 USD in 5 days
0.0

Kakanj, Bosnia and Herzegovina
Payment method verified
Member since Aug 3, 2025