Design and evaluate a data-driven model for mapping subsurface lithology using remote sensing and geophysical data
Table Of Contents
Chapter ONE
INTRODUCTION
- 1.1Introduction to Remote Sensing and Geophysical Data Integration for Lithology Mapping
- 1.2Background of Subsurface Lithology Characterization and Its Importance in Resource Exploration
- 1.3Problem Statement on the Limitations of Traditional Lithology Mapping Methods
- 1.4Aim and Specific Objectives of Developing a Data-Driven Lithology Mapping Model
- 1.5Research Questions Focused on Model Performance and Data Integration Effectiveness
- 1.6Hypotheses Regarding the Accuracy and Reliability of the Proposed Model
- 1.7Significance of a Robust Lithology Mapping Model for Geological and Mining Applications
- 1.8Scope and Delimitations of the Study to Specific Geological and Geographic Contexts
- 1.9Limitations Related to Data Availability and Model Generalizability
- 1.10Organisation and Structure of the Thesis Chapters
- 1.11Definitions of Key Terms: Remote Sensing, Geophysical Data, Lithology, Data-Driven Model, etc.
Chapter TWO
LITERATURE REVIEW
- 2.1Conceptual Overview of Lithology Mapping Using Remote Sensing and Geophysics
- 2.2Theoretical Framework: Geophysical Inversion Theory and Machine Learning Paradigms
- 2.3Empirical Review of Previous Remote Sensing Applications in Lithology Discrimination
- 2.4Empirical Analysis of Geophysical Techniques in Subsurface Lithology Identification
- 2.5Prior Studies on Data-Driven Modeling for Geological Mapping
- 2.6Integration of Remote Sensing and Geophysical Data: Approaches and Challenges
- 2.7Identified Gaps in Existing Literature on Lithology Mapping Models
- 2.8Limitations of Past Methodologies and Opportunities for Improvement
- 2.9Summary of Conceptual Foundations and Empirical Evidence
- 2.10Proposed Conceptual Model for Combining Data Modalities in Lithology Mapping
- 2.11Synthesis of Review Findings and Research Gaps
- 2.12Visual Summary: Conceptual Diagram of the Integrated Modeling Approach
Chapter THREE
RESEARCH METHODOLOGY
- 3.1Research Design: Design Science and Model Development Framework
- 3.2Philosophical Paradigm Underpinning Data-Driven Modeling Approaches
- 3.3Population and Study Area Characteristics for Lithology Data
- 3.4Sample Size Determination and Sampling Technique for Data Collection
- 3.5Data Sources: Remote Sensing Imagery and Geophysical Surveys
- 3.6Instruments and Data Collection Procedures for Geophysical and Remote Sensing Data
- 3.7Ensuring Validity and Reliability of Data Acquisition Instruments
- 3.8Data Processing and Preparation: Preprocessing, Noise Reduction, and Feature Extraction
- 3.9Data Analysis Methods: Machine Learning Algorithms, Model Training, and Validation
- 3.10Model Specification: Architecture, Feature Selection, and Performance Metrics
- 3.11Ethical Considerations in Data Collection, Model Development, and Dissemination
Chapter FOUR
DATA PRESENTATION AND ANALYSIS
- ANALYSIS AND DISCUSSION
- 4.1Presentation of Remote Sensing and Geophysical Data Sets
- 4.2Descriptive Statistics and Data Distribution Analysis
- 4.3Results of Data Preprocessing and Feature Engineering
- 4.4Model Training Results: Accuracy, Precision, Recall, and F1-Score
- 4.5Hypotheses Testing Outcomes: Model Validity and Significance
- 4.6Spatial Accuracy and Confusion Matrices of Lithology Classifications
- 4.7Interpretation of Model Performance in Context of Existing Literature
- 4.8Discussion of Findings, Limitations, and Model Strengths and Weaknesses
Chapter FIVE
SUMMARY, CONCLUSION AND RECOMMENDATIONS
- CONCLUSION AND RECOMMENDATIONS
- 5.1Summary of Key Findings in Lithology Mapping Using Data-Driven Models
- 5.2Conclusions Regarding Model Effectiveness and Data Integration Approaches
- 5.3Contributions of this Study to Geological Mapping and Remote Sensing Literature
- 5.4Practical Recommendations for Implementing Data-Driven Lithology Models
- 5.5Recommendations for Future Research: Enhancing Model Accuracy and Extensibility
- 5.6Final Remarks and Implications for Geological and Resource Management Practices
Thesis Abstract
Advancements in subsurface geological mapping are critical for mineral exploration, groundwater management, and environmental assessment, yet traditional methods relying on point-based sampling and borehole data are often limited by high costs, spatial sparsity, and environmental constraints. This study addresses these issues by proposing a data-driven model that integrates remote sensing and geophysical data to enhance the accuracy and resolution of subsurface lithology mapping. The primary aim is to design, implement, and evaluate a predictive framework capable of classifying subsurface lithologies over extensive areas with minimal invasive procedures. Specific objectives include analyzing the correlation between multispectral satellite imagery, aeromagnetic, and gravity data with subsurface lithological units, developing a machine learning paradigm for lithological prediction, and assessing the model's performance relative to existing geological maps. The research adopts a quantitative, exploratory, and validation-oriented design, utilizing a combination of supervised machine learning algorithms—such as Random Forest and Support Vector Machines—for predictive modeling. The study population comprises lithological data pooled from a regional geological survey covering approximately 10,000 square kilometers characterized by diverse rock types, including igneous, sedimentary, and metamorphic formations. A stratified random sampling technique was employed to select 300 sample locations where existing borehole logs, core samples, and laboratory analyses provided ground-truth lithological labels. Remote sensing data, including multispectral imagery from Sentinel-2 and Landsat 8, were preprocessed for atmospheric correction and spectral enhancement. Geophysical data consisted of high-resolution aeromagnetic and gravity surveys acquired at spatial resolutions of 10 meters and 50 meters, respectively. Data collection involved GIS-based extraction of spectral indices such as NDVI, NDWI, and the Normalized Difference Built-up Index (NDBI), alongside spectral reflectance features. Geophysical measurements were processed through filtering and automated anomaly detection to delineate subsurface features. The primary analytical techniques employed encompass correlation analysis, principal component analysis (PCA) for feature reduction, and machine learning classification models, with model validation conducted via k-fold cross-validation (k=10) and assessment metrics including overall accuracy, precision, recall, and the F1 score. The study also employs ROC curve analysis to compare classifier performance, and feature importance measures to interpret model predictions. Expected findings suggest that the integrated model will significantly outperform traditional mapping approaches, with anticipated classification accuracies exceeding 85%, especially in complex geological terrains. The model is expected to reveal distinct spectral-geophysical signatures associated with specific lithological units, providing detailed lithological maps at a regional scale. Such results will demonstrate the potential of remote sensing combined with geophysical data to serve as a cost-effective and reliable tool for subsurface lithology prediction. This research contributes to the body of knowledge by developing a robust, reproducible framework that combines multiple data sources within a machine learning context for geological mapping. It advances the understanding of spectral-geophysical correlations and their applicability to lithological discrimination, filling existing gaps in remote sensing-based subsurface modeling. Additionally, the study proposes a scalable methodology adaptable to different geological settings, thus supporting resource management and environmental planning. The main conclusion underscores the efficacy of an integrated, data-driven approach in improving the spatial understanding of subsurface lithology. The study recommends further research into deep learning techniques, the integration of additional geophysical methods such as electrical resistivity tomography (ERT), and real-time data processing for dynamic geological monitoring. Overall, the findings advocate for a paradigm shift from traditional, labor-intensive geological surveys toward automated, remote sensing-based predictive modeling in subsurface geology.
Thesis Overview
This research project is focused on developing a new, data-driven method to create detailed maps of what lies beneath the Earth's surface, specifically the different types of rocks and sediments known as lithology. Understanding subsurface lithology is critical for many applications, including mineral exploration, groundwater management, construction planning, and environmental assessment. Traditionally, mapping these underground features requires expensive, time-consuming drilling and sampling, which operate at limited locations. This study aims to improve this process by using remote sensing data from satellites and geophysical measurements such as magnetic, gravity, and electrical resistivity data, which can cover large areas quickly.
The project seeks to fill a knowledge gap where existing models either rely heavily on physical sampling or are not sufficiently accurate across different geological conditions. The main goal is to design a model that can predict lithology types with high reliability using publicly available remote sensing and geophysical datasets combined with advanced data analysis techniques.
To achieve this, the researcher will first collect remote sensing images (such as multispectral or hyperspectral data) and geophysical survey data over a selected study area—likely a mineral-rich or tectonically active region—with a sample size of several hundred data points. These data will then undergo preprocessing steps like noise reduction and feature extraction. The core analysis will involve applying machine learning techniques such as Random Forests, Support Vector Machines, or Neural Networks to establish relationships between the surface data and known subsurface lithologies.
The study will evaluate how accurately the model predicts different lithology types by testing it against known drill results or borehole logs not used in training. The expected contribution is a validated, scalable model that can be used by geologists and resource managers to rapidly produce reliable subsurface maps, reducing the need for costly drilling.
The anticipated outcome is a robust, operational tool for subsurface lithology mapping that enhances current geological practices, with recommendations for its integration into routine geological surveys and future improvements for broader geographic areas.