Set-of-Mark detection pipeline for macOS — Apple Vision, YOLO11, and VLM on MLX. Transforms screenshots into numbered element maps and structured JSON manifests.
macos computer-vision yolo mlx screenshot-analysis apple-silicon ui-detection set-of-mark yolov11 vllm-mlx
-
Updated
Apr 9, 2026 - Python