Semantic segmentation architecture based on Atrous Spatial Pyramid Pooling and convolutional based attention module for mapping of burned areas in satellite images

Ouadou, Anes

Semantic segmentation architecture based on Atrous Spatial Pyramid Pooling and convolutional based attention module for mapping of burned areas in satellite images

Files

OuadouAnesResearch.pdf (20.27 MB)

Authors

Ouadou, Anes

Date

2025

Format

Thesis

Abstract

Computer vision is a rapidly advancing field within artificial intelligence and deep learning. It has found applications in various areas, including climate. Remote sensing data is now widely accessible; medium-resolution satellite images can be obtained for free. A custom dataset was created to train deep learning models to map areas affected by wildfires. Developing such models can significantly aid in post-wildfire assessments and recovery efforts. Mapping burned areas can be framed as a semantic segmentation problem. Existing state-of-the-art convolutional neural network (CNN) architectures, such as U-Net and DeepLab have been trained to semantically segment burned areas. Initially successful in natural language processing tasks, transformer networks have recently been adapted for computer vision applications. These networks utilize self-attention layers, which are crucial to their design. SegFormer is a transformer-based model for semantic segmentation that has shown considerable promise. The attention mechanism is not exclusive to transformer networks; it can also be integrated with convolutional operations to enhance performance. For instance, SegNext is an architecture that effectively employs this combination. Seg- Former and SegNext models have also been trained using the same custom dataset and showed better performance than CNN architecture. However, these architectures have a large number of parameters and require a large amount of data to train. Seg- ConFormer, an architecture that combines the strengths of each of these architectures, has been trained from scratch using the custom dataset only, and it outperformed all other architectures using significantly less number of parameters.

URI

https://hdl.handle.net/10355/110275

DOI

https://doi.org/10.32469/10355/110275

Degree

Ph. D

Thesis Department

Computer Science

Collections

2025 MU Dissertations - Freely available online
Computer Science electronic theses and dissertations (MU)

Full item page

Semantic segmentation architecture based on Atrous Spatial Pyramid Pooling and convolutional based attention module for mapping of burned areas in satellite images

Files

Authors

Meeting name

Sponsors

Date

Journal Title

Format

Subject

Research Projects

Organizational Units

Journal Issue

Abstract

Table of Contents

URI

DOI

PubMed ID

Degree

Thesis Department

Rights

License

Collections