Back to Daft

Video Object Detection Benchmark

benchmarking/ai/video_object_detection/README.md

0.7.10645 B
Original Source

Video Object Detection Benchmark

Detects objects in 1,000 videos using YOLO11n model. Extracts frames, runs object detection, and crops detected objects across distributed GPU nodes.

Input Dataset: Hollywood2 video dataset (S3 binary files) Output Format: Parquet with object detections, bounding boxes, and cropped images Cluster: 8 worker nodes using g6.xlarge instances Benchmark Date: September 22, 2024 Framework Versions: Daft 0.6.2, Ray Data 2.49.2, AWS EMR Spark 7.10.0

Performance Results

EngineRuntime
Daft11m 46s
Ray Data25m 54s
Spark3h 36m