Introduction

PROBLEM : Millions of abandoned oil and gas wells are scattered across the world, leaching methane into the atmosphere and toxic compounds into the groundwater. Many of these locations are unknown, preventing the wells from being plugged and their polluting effects averted.

SOLUTION : Leveraging remote sensing and machine learning to detect wells from medium-resolution satellite imagery

Alberta Wells Dataset

We introduce the first large-scale Benchmark dataset for this problem, leveraging high-resolution (3m/px) multi-spectral satellite imagery from Planet Labs with diverse landscape. Our curated Dataset comprises over 213,447 wells (abandoned, suspended, and active) from Alberta, a region with especially high well density, sourced from the Alberta Energy Regulator and verified by domain experts. We evaluate baseline algorithms for well detection and segmentation, showing the promise of computer vision approaches and room for improvement.

The dataset is drawn from the province of Alberta, Canada, a region with the third-largest oil reserves in the world and a substantial number of oil and gas wells, many of which have been present for over a century. The entire province of Alberta (an area larger than the UK and Germany combined) encompasses a diverse range of geographical zones and is highly diverse for a landlocked region, including prairies, lakes, forests, and mountains.

We source well data from the Alberta Energy Regulator (AER), which publishes monthly reports on well locations, operation modes, and product types. However, the data often contains duplicates and inconsistencies, so we work with domain experts to clean and categorize the wells as active, suspended, or abandoned.

We then divide Alberta into non-overlapping patches, each covering 1.1025 sq km, to acquire satellite imagery. We use high-resolution RGB and Near-Infrared (NIR) images from PlanetScope (Planet Labs), which provide 0.3m/px resolution across four bands (RGB + NIR). These images are processed to ensure quality and consistency before annotation. We process the images to ensure quality and consistency. Then, we annotate the image patches for both binary segmentation and object detection tasks, following the COCO format for object detection.

To create balanced training and test sets, we develop a dataset-splitting algorithm that groups wells by geographic proximity. This ensures a representative mix of patches with and without wells, allowing machine learning models to train effectively under real-world conditions for improved well detection and environmental monitoring.

Experiments & Analysis :

We Evaluated models for detection and segmentation task and we found that :


## Qualitative Samples :

We share a few Qualitative Samples of Dataset and Predictions from Model below :

NOTE : Please refer to the pre-print for more samples.

Citing

If you find this useful, please reference in your paper:

Seth, P., Lin, M., Yaw, B.D., Boutot, J., Kang, M., & Rolnick, D. (2024). Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery. ArXiv, abs/2410.09032.

@misc{seth2024albertawellsdatasetpinpointing,
  title={Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery}, 
  author={Pratinav Seth and Michelle Lin and Brefo Dwamena Yaw and Jade Boutot and Mary Kang and David Rolnick},
  year={2024},
  eprint={2410.09032},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2410.09032}, 
}

Authors

Pratinav Seth (*)
Mila, Manipal Institute of Technology
Michelle Lin (*)
Mila, Université de Montréal
Jade Boutot
McGill University
Mary Kang
McGill University
David Rolnick
Mila, McGill University

(* - Denotes co-first authorship) For questions, please contact us at: seth.pratinav@gmail.com, drolnick@cs.mcgill.ca.