My first topic of interest is to improve our understanding of deep learning models. In computer vision, current methods focus on assessing where model focus on input images. However, these approaches are uninformative when it comes to explain why classification models can fail to correctly predict some instances.
My research consists in using tools from classical signal processins theory, in particular the wavelet transform, to meaningfully decompose the input image and then to identify the important parts of this decomposition into the model's prediction. I prefer using the wavelet transform over the Fourier transform as the wavelet decomposition of an image is localized in space and in frequency, which on images enables to assess where and what (scales) models see.
During the PhD thesis, we introduced the wavelet scale attribution method (WCAM), an extension of Fel's et. al. attribution method to the wavelet domain in order to assess what models see on an image. Scales of the wavelet transform correspond to structural components of the image: therefore, we can assess whether the model relies on shapes, textures or other intermediary components to make its prediction.
I believe that the the WCAM can be expanded into several interesting directions:
The photovoltaic (PV) installed capacity grows very quickly, both worldwide and in France. A specificity of PV energy is that a sizezable share of the systems are located on rooftops (20% of the installed capacity in France). These rooftop systems are not well known by authorities or transmission system operators. The main goal of my thesis was to look for methods to improve the knowledge regarding the rooftop PV fleet and ultimately to introduce new methods for accurately estimating the PV power production.
In the thesis, we've introduced a method for constructing a large scale registry of rooftop PV systems and demonstrated that we could estimate the rooftop PV power production with relatively few information on these systems.
The last chapter of the thesis raises a lot of questions for improving current methods for PV power estimation, in particular since the task of fairly comparing the proposed method with the TSO current practices is very challenging. I also have in mind potential applications beyond France, for instance in coutries where the share of small scale PV is higher than in France.
Abstract: Photovoltaic (PV) energy generation plays a crucial role in the energy transition. Small-scale, rooftop PV installations are deployed at an unprecedented pace, and their safe integration into the grid requires up-to-date, high-quality information. Overhead imagery is increasingly being used to improve the knowledge of rooftop PV installations with machine learning models capable of automatically mapping these installations. However, these models cannot be reliably transferred from one region or imagery source to another without incurring a decrease in accuracy. To address this issue, known as distribution shift, and foster the development of PV array mapping pipelines, we propose a dataset containing aerial images, segmentation masks, and installation metadata (i.e., technical characteristics). We provide installation metadata for more than 28000 installations. We supply ground truth segmentation masks for 13000 installations, including 7000 with annotations for two different image providers. Finally, we provide installation metadata that matches the annotation for more than 8000 installations. Dataset applications include end-to-end PV registry construction, robust PV installations mapping, and analysis of crowdsourced datasets. The article can be accessed here and the Zenodo repository containing the dataset by clicking here: .
Abstract: The global photovoltaic (PV) installed capacity, vital for the electric sector’s decarbonation, reached 1552.3 GWp in 2023. In France, the capacity stood at 19.9 GWp in April 2024. The growth of the PV installed capacity over a year was nearly 32% worldwide and 15.7% in France. However, integrating PV electricity into grids is hindered by poor knowledge of rooftop PV systems, constituting 20% of France’s installed capacity, and the lack of measurements of the production stemming from these systems. This problem of lack of measurements of the rooftop PV power production is referred to as the lack of observability. Using ground-truth measurements of individual PV systems, available at an unprecedented temporal and spatial scale, we show that by estimating the PV power production of an individual rooftop system by combining solar irradiance and temperature data, the characteristics of the PV system inferred from remote sensing methods and an irradiation-to-electric power conversion model provides accurate estimations of the PV power production. We report an average estimation error (measured with the pRMSE) of 10% relative to the system size. Our study shows that we can improve rooftop PV observability, and thus its integration into the electric grid, using little information on these systems, a simple model of the PV system, and weather data. More broadly, this study shows that limited information is sufficient to derive a reasonably good estimation of the PV power production of small-scale systems.. The article can be accessed here.
This work in process was presented during the PhD Forum at ECML-PKDD 2022. It it a snapshot of our early attempts to use Fourier theory to explain the (lack of) robustness of a CNN classifier. This work later led to the WCAM. The manuscript is accessible here and the slides of the presentation here.