Enhanced history matching process by incorporation of saturation logs as model selection criteria

APONTE Jesus Manuel; WEBBER Robert; CENTENO Maria Astrid; DHAKAL Hom Nath; SAYED Mohamed Hassan; MALAKOOTI Reza

doi:10.1016/S1876-3804(23)60400-8

Petroleum Exploration and Development >

2023 , Vol. 50 >Issue 2: 450 - 463

DOI: https://doi.org/10.1016/S1876-3804(23)60400-8

Enhanced history matching process by incorporation of saturation logs as model selection criteria

APONTE Jesus Manuel ^,¹^,²^,^* ,
WEBBER Robert ² ,
CENTENO Maria Astrid ³ ,
DHAKAL Hom Nath ¹ ,
SAYED Mohamed Hassan ⁴ ,
MALAKOOTI Reza ⁵

Expand

1. Faculty of Technology, University of Portsmouth, Portsmouth PO1 2UP, United Kingdom
2. CNOOC International Ltd, Uxbridge UB8 1LU, United Kingdom
3. London South Bank University, London SE1 0AA, United Kingdom
4. University of Southampton, Southampton SO17 1BJ, United Kingdom
5. Computer Modelling Group Ltd, London OX10 8BA, United Kingdom

*E-mail: Jesus.Aponte@myport.ac.uk

Received date: 2022-06-25

Revised date: 2023-02-14

Online published: 2023-04-25

Fold

Abstract

This paper proposes a methodology for an alternative history matching process enhanced by the incorporation of a simplified binary interpretation of reservoir saturation logs (RST) as objective function. Incorporating fluids saturation logs during the history matching phase unlocks the possibility to adjust or select models that better represent the near wellbore waterfront movement, which is particularly important for uncertainty mitigation during future well interference assessments in water driven reservoirs. For the purposes of this study, a semi-synthetic open-source reservoir model was used as base case to evaluate the proposed methodology. The reservoir model represents a water driven, highly heterogenous sandstone reservoir from Namorado field in Brazil. To effectively compare the proposed methodology against the conventional methods, a commercial reservoir simulator was used in combination with a state-of-the-art benchmarking workflow based on the Big Loop^TM approach. A well-known group of binary metrics were evaluated to be used as the objective function, and the Matthew correlation coefficient (MCC) has been proved to offer the best results when using binary data from water saturation logs. History matching results obtained with the proposed methodology allowed the selection of a more reliable group of reservoir models, especially for cases with high heterogeneity. The methodology also offers additional information and understanding of sweep behaviour behind the well casing at specific production zones, thus revealing full model potential to define new wells and reservoir development opportunities.

Key words： geological modeling; reservoir model; objective function; binary classification; history matching; saturation logs

Cite this article

APONTE Jesus Manuel , WEBBER Robert , CENTENO Maria Astrid , DHAKAL Hom Nath , SAYED Mohamed Hassan , MALAKOOTI Reza . Enhanced history matching process by incorporation of saturation logs as model selection criteria[J]. Petroleum Exploration and Development, 2023 , 50(2) : 450 -463 . DOI: 10.1016/S1876-3804(23)60400-8

Introduction

Reservoir simulation is traditionally required as a decision-making tool in reservoir management studies, as it provides the physical representation of static characteristics and dynamic behaviour of a hydrocarbon reservoir, while considering a large number of known uncertainties. The history matched model is essential for reservoir management to assess different strategies maximizing hydrocarbon recovery. History matching is the model calibration exercise, which relies on the fact that if a model can accurately reproduce the production history or observed data, it will be useful to predict the future performance of the reservoir. One of the main challenges for reservoir engineers during the history matching process is to reproduce detailed information about the fluid displacement in the porous media in order to identify the time of water breakthrough for each well and each pay zone in a production well. On that topic, Benlacheheb et al. ^[1] highlighted the advantages of incorporating additional monitoring data as part of the history matching process to allow more detailed outcome related to vertical heterogeneity of reservoir properties in the validated model. This research aims to develop a complementary methodology to improve the results obtained from the history matching process by incorporating binary saturations logs as a part of the evaluation parameters. The proposed methodology leads to a more robust and reliable reservoir model.

The novelty of the methodology proposed in this paper lies in its flexibility to include new evaluation parameters independent of the type of data. Besides, the methodology also provides a simplification of the matching responses by transforming the original format into binary output, which opens the possibility to use more complex frameworks to accelerate the matching process such as machine learning.

1. Theoretical basis

1.1 Objective functions

The history matching quality of a model is often expressed in terms of its global objective function. The objective function is a mathematical function that allows a measure of the misfit between simulated and observed data ^[2]. Most of the existing formulations can be written as a function of a point-by-point difference between simulated and historical data.

In a conventional history matching process, all reservoir data is gathered to create a 3D model through the reservoir characterisation process. Then, by using dynamic simulation and the observed data, the model is used to predict historical results. Finally, simulated results are compared to the observed data using the objective function to determine the history matching quality of the model. More holistic approaches have been introduced into geomodelling processes by unlocking the potential of assessing a significant number of possible combinations of inputs used to create the 3D model, and creating a wide range of alternative solutions ^[3-4]. These alternative solutions are commonly described as representative group of models. The final representative group of models are selected from a bigger equally probable ensemble which is derived from the combination of all related uncertainties in an uncertainty analysis. The selection of the representative group of models is based on their history matching quality. The rationale of these original approaches is based on the nature of the history matching as an inverse problem optimisation which means that there is no unique solution to the problem and hence different sets of inputs could lead to almost the same outcome.

The incorporation of the geologist’s interpretation data as part of the whole process provides the results with more information about the most representative reservoir models and uncertainties about static reservoir data^[5-6]. The selection of the different objective functions used for the evaluation of model’s performance and efficiency was reviewed by Mata-Lima ^[2]. Further findings from published data show that root mean squared error (RMSE) as well as mean of the deviations (AE) are the most widely used in history matching, considering the linear nature of the parameters commonly used in the process. However, many of these deviation-based statistics differ from each other in the way that differences between observed and simulated results are evaluated.

1.2. Limitation of conventional objective function calculations

In conventional history matching workflows, the objective function is commonly calculated using well level production data such as production rates (oil, gas, water, or total liquid rates), gauge pressures and production ratios such as well water cut. It is well known that for any model-built process, the more the number of key performance indicators (KPIs) the model managed to represent, the better the quality of the model. Hence, matching a reservoir model using only limited data may not be enough to define a satisfactory representation of the reservoir in order to predict its future performance ^[7]. One of the main challenges of using well production data to validate the models is to accurately capture the correct saturation changes in individual producing zones in commingled production wells. Frequent practice evokes two methods to measure near-wellbore water saturation changes, that is, saturation logs and 4D seismic. The use of 4D seismic technology has positively contributed to a better interpretation of fluid displacement ^[8-9]. However, this technology can lead to some difficulties which require additional algorithms and statistical analysis to predict fluid saturations. Besides, 4D seismic data is not always available and has additional economic implications on the budget. On the other hand, saturation logs are commonly obtained during regular surveillance interventions.

1.3. Classification metrics

Several classification techniques have been applied in different fields of sciences depending on the nature of the problem and the classification output (binary or multi-class) ^[10].

1.3.1 Confusion matrix

The confusion matrix (CM) is defined as a table that allows the user to analyze results and performance of a specific algorithm which classifies data. The confusion matrix is one of the most common tools used to assess binary classifiers. A CM contains information about actual and predicted classifications done by a classification system ^[11]. The performance of such systems is commonly evaluated using the data in the matrix. Fig. 1 shows an example of the confusion matrix for a two-class classifier.

Binary Metric	Key features	Application	Formulae
Confusion Matrix (CM)	CM measures the correlation between the observed and predicted data as quality of a binary response (true/false), (positive/negative).	The CM allows the application of the different metrics to correlate the data.	$CM = TP FN FP TN$
False Positive Rate (FPR)	FPR represents the proportion of positive cases that are incorrectly classified as positive from the total number of negative outcomes.	Also recognized as fallout and false alarm rate. This metric is not affected by imbalanced data.	$FPR = FP FP + TN$
True Negative Rate (TNR)	TNR represents the proportion of negative cases that are properly identified as negative from the total number of negative outcomes.	It is also called specificity or inverse recall. This metric is less affected by imbalanced data.	$TNR = TN FP + TN$
False Negative Rate (FNR)	FNR represents the proportion of negative cases that are incorrectly identified as negative from the total number of negative outcomes.	It is also called miss rate or inverse recall. This metric is less affected by imbalanced data.	$FNR = FN TP + FN$
Precision (P)	Represents the ratio of correct predictions that are relevant. When the prediction is yes, how often is it correct?	It is also called “confidence” metric. It does not consider the number of true negatives.	$P = TP TP + FP$
Recall (R)	Measure the accuracy on the positive class. Thus, when the correct prediction is yes, how often does it predict yes?	The metric is valuable to measure the real positive cases that are predicted. The metric is represented as a rate of discovery of positive classifiers.	$R = TP TP + FN$
F-Measure (FM)	It is the ratio of metrics Precision/Recall. It is the harmonic mean of precision and recall metrics.	It considers the ratio of True Positives to the arithmetic mean of predicted positives and real positives. This metric is sensitive to changes in the class distribution.	$FM = 2TP 2TP + F P + FN$
Accuracy (A)	Represents the ratio between correct predictions to all predictions. The best value is 1 and the worst value is 0.	The metric is not reliable for imbalanced data. It can provide an overoptimistic estimation of the classifier.	$A = TP + TN TP + T N + F P + FN$
Matthew’s correlation coefficient (MCC)	Represents the relation between the observed and predicted classes.	The outcome ranges from +1 to -1, +1 represents a perfect prediction and -1 total disagreement. The metric is sensitive to imbalance data.	$MCC = A − B CDEF$ Where: A=TPTN B=FPFN C=TP+FP D=TP+FN E=TN+FP F=TN+FN

模态框（Modal）标题

Abstract

Cite this article

Introduction

1. Theoretical basis

1.1 Objective functions

1.2. Limitation of conventional objective function calculations

1.3. Classification metrics

1.3.1 Confusion matrix

1.3.2. The Matthews correlation coefficient (MCC)

Table 1. Metrics used for binary classification, adapted from Tharwat [10]

2. Methodology

2.1. Benchmarking the proposed methodology

Fig. 2. Modified “Big Loop” workflow.

2.2. Proposed RST approach methodology

Fig. 3. Methodology proposed for enhanced history matching process using RST logs.

2.2.1. Estimation of observed binary interpretation of reservoir saturation logs

Fig. 4. Illustration of water saturation changes in the reservoir, as identified by cased hole saturation logs and interpreted in “sweep” binary log.

2.2.2. Generating synthetic saturation logs for each equiprobable model

Fig. 5. (a) 2D vertical section of water saturation profile along the well trajectory of a producer and an injector, and (b) the producer synthetic water saturation log.

2.2.3. Transforming saturation logs into binary logs using a threshold

Fig. 6. Swept and un-swept areas of a 2D reservoir model slice highlighting the producer RJS19 and the closest injector at a specific time step.

2.2.4. Comparing match quality between observed vs synthetic logs by generating a confusion matrix log

Fig. 7. Confusion matrix log of Model 1 and its corresponding confusion matrix table.

2.2.5. Assessing history matching quality of individual models using binary confusion matrix derived metrics

2.3. Geological model used

Fig. 8. Injectors and producers well trajectory modification.

Fig. 9. Porosity and permeability diversity of four randomly selected models from the modified case ensemble.

Fig. 10. Empirical threshold estimation using base case water saturation log at the time of water breakthrough.

Fig. 11. Water saturation threshold using Welge method.

3. Method application and discussion

3.1. Assessment and evaluation of classification metrics

3.1.1. Precision

Fig. 12. Precision scores of all 200 models in Group A.

Fig. 13. Precision scores of all 200 models in Group B.

3.1.2. Accuracy, F-Measure and Recall

3.1.3. Matthew correlation coefficient

Fig. 14. MCC scores of all 200 models in Group A.

Fig. 15. MCC scores of all 200 models in Group B.

3.1.4. Binary metrics assessment summary.

Fig. 16. Evaluation of selected model cases for each metric vs. observed binary RST logs.

3.2. Comparing proposed RST methodology versus conventional history matching approach

Fig. 17. Global misfits of 200 cases using proposed method.

Fig. 18. Global misfits of 200 cases using conventional method.

Fig. 19. Binary RST logs of top raked cases for the representative wells.

Fig. 20. Well NA2 water cut.

Fig. 21. Well PROD021V water cut.

Fig. 22. Well NA3D water cut.

Fig. 23. Water cut per zone for wells NA2, NA3D and PROD021V using conventional and proposed methodologies.

4. Conclusions

References

Table 1. Metrics used for binary classification, adapted from Tharwat ^[10]