Near-real time detection of derelict (ghost) crab pots with side-scan sonar.
GhostVision is an open-source Python interface for automatically detecting and mapping ghost (derelict) crab pots from side-scan sonar imagery. GhostVision currently supports multiple packaged object-detection models, including YOLO- and RF-DETR-based exports trained with Roboflow. Detections are then georeferenced with PINGMapper.
Bodine, C.S.; Baxevani, K.; Abbasi, N.;Wierzbicki, J.; Christoph, O.; Hughes, C.; Bagoren, O.; Hines, O.; Greco, J.; Trembanis, A. GhostVision: Democratizing Derelict Gear Detection using Low-Cost Sonar and Artificial Intelligence. (In Review). Submitted to Journal of Marine Science and Engineering.
GhostVision is optimized for running inference (predictions) on the GPU. The processing environment is installed with conda. Any flavor of conda will do, but we recommend Miniforge. Follow the instructions below based on your OS.
Windows does not natively support inference on the GPU. A utility called WSL (Windows Subsystem for Linux) needs to be installed in order to run inference on the GPU.
- Install the latest NVIDIA driver for your system.
- Add CUDA Support for WSL 2.
- Assumes your computer has an NVIDIA GPU.
- Install WSL (Windows Subsystem for Linux) &
- Open the command prompt by launching
Ubuntufrom the Windows Start menu. - You may need to install the NVIDIA Cuda Toolkit with
sudo apt install nvidia-cuda-toolkit.
- In a command prompt, download
Miniforgewith:wget "https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-$(uname)-$(uname -m).sh" - Install
Miniforgewith:bash Miniforge3-$(uname)-$(uname -m).sh
- Install
PINGInstaller:pip install pinginstaller - Install
GhostVision:python -m pinginstaller ghostvision-gpu
An experimental version of GhostVision is available to test inference speeds on the CPU. This has been tested on Windows 11 only.
- Install
Miniforge. - Open the
Miniforgeprompt. - Install
PINGInstaller:pip install pinginstaller - Install
GhostVision.python -m pinginstaller ghostvision
- Open the appropriate command prompt based on your installation above.
- Launch
GhostVision:conda activate ghostvision python -m ghostvision - Select your desired model and processing parameters, then click
Submit.
Bundled release models are downloaded automatically into the local ~/.ghostvision/models cache the first time they are needed. The current packaged aliases exposed by the app include rf-detr_v1, yolo26_v1, and yolo12_v1.
The most useful model-specific operating points currently come from the full GhostVision accuracy assessment workflow used for the companion manuscript. In that analysis, detections were evaluated against manual annotations at a 3 m match radius. When object tracking is enabled, GhostVision combines confidence and temporal persistence using:
S = alpha * conf_avg + (1 - alpha) * pred_cnt / max(pred_cnt)
The combined-score analysis reports the following best-performing alpha values:
YOLOv12:alpha = 0.90YOLOv26:alpha = 0.95RF-DETR:alpha = 0.85
The same workflow reports these best thresholds:
YOLOv12:confidence = 0.148,pred_cnt = 17,combined score = 0.221YOLOv26:confidence = 0.101,pred_cnt = 15,combined score = 0.136RF-DETR:confidence = 0.386,pred_cnt = 18,combined score = 0.345
Peak F1 values from that workflow were:
YOLOv12:F1 = 0.739from confidence alone,0.574from persistence alone,0.716from the combined scoreYOLOv26:F1 = 0.737from confidence alone,0.596from persistence alone,0.707from the combined scoreRF-DETR:F1 = 0.721from confidence alone,0.667from persistence alone,0.727from the combined score
For practical use, this suggests:
- Prefer
YOLOv12as the default packaged model for the best overall operational balance. - Start
YOLOv12nearscore threshold = 0.15,pred_cnt = 17, andalpha = 0.90when you want settings that reproduce the evaluation workflow. - Use
RF-DETRonly when very high recall is more important than false-positive burden.
GhostVision now uses the same pred_cnt / max(pred_cnt) normalization as the evaluation workflow when computing the tracked combined score. These values should be treated as model-specific reference points, not universal defaults. GhostVision's packaged defaults remain general-purpose starting values for field use, while the values above are the best choices when you want to reproduce the evaluation workflow as closely as possible.
GhostVision includes packaged object detection models designed to detect crab pots from side-scan sonar imagery. If you want to use your own compatible Roboflow export instead, you can download a custom model with the included utility.
- Open the appropriate command prompt based on your installation above.
- Launch the Roboflow model download utility:
conda activate ghostvision python -m ghostvision rf-download - Supply your Roboflow API Key.
- Enter the project name (all lowercase).
- Enter the project version.
The model will be downloaded and available to use.
- Dataset: PINGEcosystem/sss-crab-pot-detection-ds
- RF-DETR model card: PINGEcosystem/gv-rf-detr
- YOLO12 model card: PINGEcosystem/gv-yolo12
- YOLO26 model card: PINGEcosystem/gv-yolo26
GhostVision has been made possible through mentorship, partnerships, financial support, open-source software, manuscripts, and documentation linked below.
NOTE: The contents of this repository are those of the author(s) and do not necessarily represent the views of the individuals and organizations specifically mentioned here.
Development Team: Cameron Bodine, Art Trembanis, Kleio Baxevani, Naveed Abbasi, Onur Bagoren, Olivia Hines, Jared Wierzbicki, Ophelia Christoph, Catherine Hughes, Julia Greco.
