Regular Article
Well log data super-resolution based on locally linear embedding
College of Physics and Electronic Engineering, Northeast Petroleum University, Daqing 163318, PR China
* Corresponding author: 2645073549@qq.com
Received:
7
February
2021
Accepted:
21
July
2021
Unconventional remaining oil and gas resources such as tight oil, shale oil, and coalbed gas are currently the focus of the exploration and development of major oil fields all over the world. Therefore, to make best understand of target reservoirs, enhancing the vertical resolution of well log data is crucial important. However, in the face of the continuous low-level fluctuations of international oil price, large scale use of expensive high resolution well logging hardware tools has always been unaffordable and unacceptable. In another aspect, traditional well log interpolation methods can always not realize high reliable information enhancement for crucial high frequency components. In this paper, in order to improve the well log data super-resolution performance, we propose for the first time to employ Locally Linear Embedding (LLE) technique to reveal the nonlinear mapping relationship between 2-times-scale-difference well log data. Several super resolution experiments with well log data from a given area of Daqing Oil field, China, were conducted. Experimental results illustrated that the proposed LLE-based method can efficiently achieve more reliable super-resolution results than other state-of-the-art methods.
© J. Han et al., published by IFP Energies nouvelles, 2021
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1 Introduction
In recent years, most old oilfields in the world have entered the middle and late stages of development. However, due to the long-term waterflooding development of oilfields, several unavoidable complex cases happened to the conventional resources, such as frequent river channel reconstruction, complex vertical and horizontal changes of sand body, strong heterogeneity of lithology and lithofacies. Consequently, oil and gas exploration and development are becoming more and more difficult, and unconventional oil and gas resources such as tight sandstone, shale oil/gas, coalbed gas, etc, have become the most important target reservoirs. Obviously, the efficient exploration and development of unconventional oil and gas resources has become the main way to achieve stable production to increase and extend the life of oilfield development [1]. Therefore, in order to enhance the production capacity of the unconventional remaining oil and gas resources, the reservoir modelling resolution should be as high as possible. Meanwhile, the high-resolution logging curves can better provide the possibility to predict the location of lost circulation [2], lithology identifies [3], and predict the productivity of heterogeneous reservoirs [4, 5]. In practice, because the vertical resolution of seismic data is too low and the cost of acquisition of core and other high resolution logging data is too high, it is an inevitable choice to employ conventional well log data to enhance the vertical resolution of the final reservoir models. However, in the conventional oil/gas era, interpolation algorithms based on simple models have been used for enhancing the vertical resolution of well log data for a very long time. Until the last decade or so, with the development of machine learning technology, a few data driven well log super-resolution methods have been proposed in the literature. Furthermore, whether it is a simple model-driven interpolation method or a data-driven super-resolution method, they all try to establish a non-linear mapping function between low resolution and high-resolution data directly in the original amplitude space. Due to some uncontrollable factors, there are unavoidable data deficiencies existing in the raw well data, therefore, the quality of training data severely restricts the effectiveness and robustness of these super-resolution methods. In order to deal with this problem to a certain extent, Locally Linear Embedding (LLE) technique which can explore intrinsic manifold of the raw data with sufficient outlier-resistance and robustness was employed in this paper to perform well super-resolution task. Therefore, in this paper, a novel LLE based well logs super resolution method was proposed in the literature for the first time.
2 Related works
In the actual survey work of the oil field, due to the influence of factors such as well diameter and measurement, the actual logging resolution that can be achieved at present is only 12.5 cm. Faced with complex geological conditions, the thickness of shale oil lamellae is less than 0.3 cm. Therefore, it is not realistic to accurately characterize the target reservoir using original logging data. Obviously, in order to enhance capacity of reservoir characterization accuracy, the resolution of well log data must be improved. However, due to the high cost of high-resolution well logging tools, a series of methods to improve the logging resolution was proposed in the literature. In 1989, Flaum et al. [6] proposed the neutron density α factor method. By assuming that the α factor changes slowly, the high-resolution neutron porosity curve can be calculated by using the counting rate of near detector. In 1991, Nelson, and Mitchell [7] deduced the expression of smoothing filter used for curve matching and proposed the resolution matching technology for high resolution processing of well log data. On this basis, [8] used genetic algorithm to enhance the resolution of well log data. Specifically, they first used genetic algorithm to determine the filter in the frequency domain and then used resolution matching technology to improve the resolution of the input well log data. Conaway [9] discussed the deconvolution technology of natural gamma ray data with point detector. Based on the three-point deconvolution formula, and proposed a method for determining the shape constant α, which appropriately considers the influence of formation dip. Freedman and Minerbo [10] employed the maximum entropy deconvolution method to improve the vertical resolution of well log data. By taking the layer interface information into consideration, they formulated a Lagrange optimal function with maximum entropy constraint for super resolution, and reasonable vertical resolution enhancement can be achieved. Besides to the method processed in spatial domain, transformed domain based well log super-resolution methods can achieve relative better performance. In 2005, Tai and Cao [11] used Walsh transform to improve the resolution of well log data. Furthermore, their method can also ensure that the error between the calculated value and the true value does not exceed the error of the raw well log data itself. However, this method is only suitable for linear response well log data and the response function H (τ) cannot be accurately determined. In 2015, Li et al. [12] proposed to employ window Fourier transform to transform the raw well log data from the spatial domain to the frequency domain. The relationship between high-resolution and low-resolution well log data in the frequency domain can therefore be constructed.
Although the above-mentioned model-driven methods can improve the well log resolution to a certain extent, the original outliers or errors in the raw well log data will be certainly propagated or even enlarged to the corresponding high-resolution data. To deal with similar problems, in the computer audio/vision fields, sparse representation and deep learning techniques have been successfully applied to the image super-resolution task [13–17]. In 2017, Ledig et al. [18] proposed an image super-resolution method based on generative adversarial network. Also in 2017, Volodymyr et al. [19] proposed a multi-layer convolution neural network for high-resolution audio signal processing. In 2018, Lim et al. [20] proposed a novel deep neural network structure to perform audio super-resolution in time-frequency domain. Generally, promising super-resolution results can be achieved in the computer audio/vision fields by employing data-driven super-resolution techniques. However, compared with audio or image/video data, there are much more uncertainties existing in the well log data. Consequently, directly employing data-driven methods in spatial or classical transform domains cannot prevent the propagation of uncertainty to the super-resolution version of data.
Based on the above review and analysis, we propose to employ manifold learning techniques, which can extract the intrinsic manifold of data with variable uncertainties, to enhance the super-resolution performance of well log data. Specifically, the LLE algorithm is used in this paper to exploit the intrinsic manifold information of well log data to conduct the corresponding super-resolution task.
3 Locally Linear Embedding (LLE) based super-resolution
3.1 Locally Linear Embedding
LLE algorithm is originally a nonlinear dimension reduction algorithm, which belongs to the category of manifold learning. Manifold learning is to recover intrinsic low dimensional structure from high-dimensional sampled data, that is, to find low dimensional manifold in high-dimensional space and find the corresponding nonlinear mapping to achieve dimension reduction. Chang et al. [21] first proposed the use of LLE to deal with image super-resolution and stable and promising results were illustrated. Specifically, for the LLE algorithm, suppose there are m n-dimensional samples {X1, X2, X3, ⋯ Xm}, the first step is to select the neighborhood size which is one of the hyperparameters used in the LLE algorithm. Without loss of generality, assuming that the neighborhood size is K. Therefore, for a given data point Xi, we assume that it can be represented by the weighted linear combination of its K nearest neighbors Ni = {Xj}, j = 1, 2, ⋯, K. And the corresponding mean square error is used as the loss function,
It should be noted that the weight coefficient Wij is local normalized (for any data Xi ∉ Ni, the corresponding weight coefficient is zero). It means that the sum of the weight coefficients corresponding to Xi’s neighborhood {Xj}, j = 1, 2, ⋯, K is 1. Therefore, the weight coefficient should meet the following requirements:
With equation (2) as constraint, the weight coefficients in equation (1) can be obtained by Lagrange multiplier method as follows:
Let , there will be:
Then, the optimization objective can be constructed as below:
Next, setting the derivative of L(W) to W is 0, the following results can be obtained:
Let be a constant, then the final weight coefficient vector Wi can be obtained as follows:
3.2 LLE based well log data super-resolution
From Section 3.1, we can see that LLE method can obtain the intrinsic relationship between a given 1 × n-dimensional local data patch and its K nearest neighbor patches. Inspired by the Sparse Representation-based image Super-Resolution (SRSR) method proposed in [12], in which low-resolution and high-resolution training data pairs were employed to share the same representation parameters. Therefore, combining the advantages of LLE and SRSR, two strategies were employed for well log super-resolution task. Firstly, let the low-resolution and high-resolution model data pairs share the same linear representation weights. Secondly, for the test data patches, let’s find their K nearest neighbors in the low-resolution model patches for LLE operation. With the guidance of these two strategies, the workflow diagram of the proposed well log super-resolution method was illustrated in Figure 1.
Fig. 1 Flow chart of locally linear embedding algorithm. |
Specifically, the detail of the proposed well log super-resolution method was given in Table 1.
Illustration of the detailed steps of the proposed method.
4 Experimental results
4.1 Geological background and data selection
In order to evaluate the performance of the proposed well log super resolution method, well log data from 10 wells in Qijia-Gulong depression, Songliao basin, China were selected. Qijia-Gulong depression illustrated in Figure 2 is one of the most important tight sandstone exploration area of Daqing oil field. The average permeability of this area is commonly less than 1 mD, and the average porosity is always less than 13%. Compared with these medium quality physical properties, the thickness of the target reservoir in this area is universally less than 3 m, and there are always too many complex thin interlayers existed [22]. Specifically, without loss of generality, wells, natural Gamma Ray (GR), and Deep Lateral Resistivity (LLD) were selected for conducting the following experiments for all selected wells.
Fig. 2 Illustration of the geological background of the selected well data: the basic phase map of the Qijia-Gulong depression (left), the histogram of the thickness of sand body (right-up), and the histogram of the porosity (right-bottom). |
4.2 Hyper-parameter settings
As mentioned in Section 3, the local patch size and the number of neighborhood patches are important hyper parameters of the method. Therefore, in this experiment, the influence of these two parameters on super-resolution results was discussed. Specifically, without loss of generality, we chose the GR curve as the test data. Firstly, using the same local patch size (r = 4), the performance of the proposed method was tested with neighborhood size varying from 2 to 16. As can be seen from Figure 3a, as the neighborhood size increases, the Peak Signal-to-Noise Ratio (PSNR) value improves rapidly, and when K is set to 10, the result tends to be stable. In addition, it is obvious that when K is 11, the best super-resolution result can be achieved. Meanwhile, by setting K to 11, as shown in Figure 3b, with the increase of the local patch size, the best super-resolution result is found when r is set to 4.
Fig. 3 Comparison of the super-resolution performance of the proposed method under a) different neighborhood sizes and b) different local patch sizes. |
4.3 Super resolution performance test
In this experiment, the standard 0.125 m GR and LLD logging data from 1550 m to 1600 m section of well_1 was selected as the test data, and the corresponding section of well_2 was selected as model data. The corresponding 0.25 m low resolution curve is obtained by down-sampling. In order to verify the superiority of this method, the Bicubic Spline Interpolation method (BSpInterp for short), one-dimensional Convolutional Neural Network (SRCNN for short) method, and the Sparse Representation method (SpR for short) were used for comparison.
Specifically, the comparison was conducted in three manners: (1) direct two times (2X) super-resolution; (2) double two times (indirect 4X) super-resolution; and (3) direct four times (4X) super-resolution. Then, as discussed in Section 4.2, local patch size and the neighborhood size used in the proposed method were setting to 4 and 11, respectively. Experimental results of different comparison manners were illustrated in Figure 4 (2X), Figure 5 (indirect 4X), and Figure 6 (4X), respectively.
Fig. 4 Illustration of 2X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
Fig. 5 Illustration of indirect 4X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
Fig. 6 Illustration of direct 4X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
In order to quantitatively evaluate the objective performance of these comparison methods, the Mean Square Error (MSE), Peak Signal-to-Noise Ratio (PSNR), and Pearson correlation Coefficient (Coeff) of the results acquired by each method were calculated. At the same time, the execution time required for each method is also recorded for comparison. Specifically, the detailed quantitative evaluation results of GR and LLD logging data were given in Tables 2 and 3, respectively. Our proposed method (LLE) achieves state-of-the-art results compared to other methods. The corresponding time-consuming data was given in Table 4.
Comparison of quantitative evaluation results of different super-resolution methods for GR logging data.
Comparison of quantitative evaluation results of different super-resolution methods for LLD logging data.
Time-consuming comparison of different super-resolution methods.
4.4 Super resolution comparison of logging data in different areas
In order to verify the robustness of the proposed method, using well_2 as model well, the GR curves of the other eight wells were selected as test data using the 2X super-resolution manner. Specifically, for each well, 100 m log data was randomly extracted for processing. In this experiment, PSNR values of GR logs super-resolution results of different wells in different areas were compared. The detailed comparison data were given in Table 5. Our method has achieved the best results in 7 out of 8 oil wells. This result verifies the effectiveness of our method in different regions. Taking Well_3 as an example, the final super-resolution results was shown in Figure 7 with lithology information for reference. Among them, the black curve is the original high-resolution curve, and the red curve is the super-resolution result.
Fig. 7 Comprehensive interpretation diagram of Well_3 logging curve. |
Comparison of PSNR values of GR curves of different wells in different areas.
4.5 Discussion
In order to evaluate the performance of the proposed well log super-resolution method, several experiments were conducted. In Section 4.3, the direct visual effect and quantitative evaluation results of different methods were given. Specifically, from Figure 4, for 2X super-resolution, we can see that SpR and SRCNN methods cannot effectively reconstruct the detailed information of well log, and there are relatively large fluctuations. In addition, the result obtained by BSpInterp method looks pretty good, but there are two obvious shortcomings: (1) high frequency information gain is very small; and (2) the problem of peak shift. Compared with the above three methods, the proposed method is relatively stable and rich in preserving the overall contour and details of the curve. Obvious, the intrinsic local structure of the log data can be successfully reconstructed by using the LLE technique. In order to better reflect this result, taking GR log data as example, power spectrums of these methods were calculated and quantitative comparisons with the power spectrum of the original high-resolution GR log data were calculated. Specifically, the corresponding power spectrums were illustrated in Figure 8. Then, each power spectrum was divided into three subbands, namely the low frequency band (1 Hz–100 Hz), the medium frequency band (100 Hz–300 Hz), and the high frequency band (300 Hz–500 Hz). The Pearson coefficient values between super-resolution results of different methods and the original high-resolution GR log data were given in Table 6. From Table 6, we can clearly see that the proposed method can recover more high frequency components than other comparison methods, and the recovered low frequency and medium frequency bands are also excellent. Quantitative evaluation results of different test manners given in Tables 2 and 3 can also verify that the proposed method can achieve the best super-resolution results. In addition, we can see that unavoidable error propagation in the indirect 4X super-resolution manner makes it worse than the 2X and direct 4X manners. And 4X super-resolution results are worse than the results achieve in the 2X manner because of the original information loss existing in the 1/4 scale low-resolution data. Obviously, error propagation will lead to degradation of super-resolution performance. Furthermore, we can see from Table 4 that the proposed LLE-based well log super-resolution method is the fastest one among all the employed super-resolution methods. In experiment shown in Section 4.4, the proposed method is also the best one with stable and robust super-resolution performance. For the shortage of the proposed method, especially compared with BSpInterp method, the requirement of high-resolution model data limits its application value. Fortunately, for a given target exploration area, a variety of high-resolution well logs are commonly collected in some reference wells. Therefore, the shortage of the proposed method can be greatly alleviated.
Fig. 8 Illustration of the power spectrum comparison of different methods: a) Original GR; b) Bicubic Interpolation; c) Sparse representation; d) SRCNN; e) LLE. |
Comparison of the correlation coefficients of different sub-bands of the power spectrums between of original high-resolution data and the super-resolution ones of different methods.
5 Conclusion
In this paper, a novel well log super-resolution technique was proposed. In order to preserve or recovery as much the detailed structure information as possible during the well log super-resolution process, LLE technique was introduced for the super-resolution task. From the experimental results we can see that commonly three kinds of problems occurred for other comparison methods: (1) detailed structure information cannot be recovered satisfactorily; (2) obvious peak shift problem occurred; (3) the error fluctuation cannot be handled well. For the proposed LLE-based super-resolution method, its performance in experiments conducted with all the three test manners described in Section 4.3 is promising. Furthermore, the robustness of the proposed method is also verified in Section 4.4, which shows stable super-resolution results for well log data from different test wells. In general, the results given in this paper illustrate that LLE technique can achieve stable and reliable super-resolution of well log data.
References
- Zou C., Yang Z. (2019) Establishment and practice of unconventional oil and gas geology, Acta Geol. Sin. 93, 01, 12–23. [Google Scholar]
- Su J.L., Zhao Y., He T., Luo P.Y. (2021) Prediction of drilling leakage locations based on optimized neural networks and the standard random forest method, Oil Gas Sci. Technol.-Rev. IFP Energies nouvelles 76, 24. [Google Scholar]
- Deng Y., Guo R., Tian Z.Y., Zhao L.M., Hu D.D., Liu H.Y., Liu Y. (2020) Water saturation modeling using modified J-function constrained by rock typing method in bioclastic limestone, Oil Gas Sci. Technol.-Rev. IFP Energies nouvelles 75, 66. [Google Scholar]
- Shafiabadi M., Kamkar-Rouhani A., Sajadi S.M. (2021) Identification of the fractures of carbonate reservoirs and determination of their dips from FMI image logs using Hough transform algorithm, Oil Gas Sci. Technol.-Rev. IFP Energies nouvelles 76, 37. [Google Scholar]
- Bourbiaux B. (2010) Fractured reservoir simulation: a challenging and rewarding issue, Oil Gas Sci. Technol.-Rev. IFP Energies nouvelles 65, 2, 227–238. [Google Scholar]
- Flaum C., Galford J.E., Hasting A. (1989) Enhanced vertical resolution processing of dual detector gamma–gamma density logs, The Log Analyst 29, 5, 6, 150–157. [Google Scholar]
- Nelson R.J., Mitchell W.K. (1992) Improved vertical resolution of well logs by resolution matching, The Log Analyst 32, 4, 281–301. [Google Scholar]
- Liu Y.M., Zou C.C. (2006) A new method of high-resolution processing of well Logs based on genetic algorithm, Prog. Geophys. 21, 4, 1202–1207. [Google Scholar]
- Conaway J.G. (1980) Exact inverse filters for the deconvolution of gamma-ray logs, Geoexplorations 18, 1–14. [CrossRef] [Google Scholar]
- Freedman R., Minerbo G.N. (June 1991) Maximum entropy inversion of induction-log data, SPE Form. Eval. 6, 02183–200. [CrossRef] [Google Scholar]
- Tai Z.W., Cao S.M. (2005) Application of Walsh transform in improving logging curve resolution, J. Shengli Oilfield Staff Univ. 04, 47–48. [Google Scholar]
- Li X., Zhang Z.H., Su H.C. (2015) Improving the resolution of logging curve based on window Fourier transform, Mod. Chem. Indus. 44, 06, 1406–1407. [Google Scholar]
- Lu G.Y., Wong D.W. (2007) An adaptive inverse-distance weighting spatial interpolation technique, Comput. Geosci. 34, 9, 1044–1055. [Google Scholar]
- Dong C., Loy C.C., He K., Tang X. (2016) Image super-resolution using deep convolutional networks, IEEE Trans. Pattern Anal. Mach. Intell. 38, 2, 295–307. [PubMed] [Google Scholar]
- Yang J., Wright J., Huang T., Ma Y. (2010) Image super-resolution via sparse representation, Image Process. IEEE Trans. 19, 11, 2861–2873. [CrossRef] [Google Scholar]
- Aharon M., Elad M., Bruckstein A. (2006) K-SVD: an algorithm for designing over complete dictionaries for sparse representation, IEEE Trans. Sig. Proc. 54, 11, 4311–4322. [CrossRef] [Google Scholar]
- Jiang B. (2015) Method of improving acoustic well logging curve resolution based on compressed sensing, Petroleum Pipes Instrum. 1, 05, 34–36+41. [Google Scholar]
- Ledig C., Theis L., Huszar F., Caballero J., Cunningham A., Acosta A., Aitken A., Tejani A., Totz J., Wang Z. (2017) Photo-realistic single image super-resolution using a generative adversarial network, IEEE Computer Soc. 2017, 105–114. [Google Scholar]
- Volodymyr K., Zayd E.S., Stefano E. (2017) Audio super resolution using neural networks. [Google Scholar]
- Lim T.Y., Yeh R.A., Xu Y., Do M.N., Hasegawa-Johnson M. (2018) Time-frequency networks for audio super-resolution, in: ICASSP 2018 – 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE. [Google Scholar]
- Chang H., Yeung D.Y., Xiong Y. (2004) Super-resolution through neighbor embedding, in: IEEE Computer Society Conference on Computer Vision & Pattern Recognition, IEEE. [Google Scholar]
- Zhang D.P., Fan C.K., Kuang D.Q. (2019) Impact assessment of interlayers on geological storage of carbon dioxide in Songliao Basin, Oil Gas Sci. Technol.-Rev. IFP Energies nouvelles 74, 85. [CrossRef] [Google Scholar]
All Tables
Comparison of quantitative evaluation results of different super-resolution methods for GR logging data.
Comparison of quantitative evaluation results of different super-resolution methods for LLD logging data.
Comparison of the correlation coefficients of different sub-bands of the power spectrums between of original high-resolution data and the super-resolution ones of different methods.
All Figures
Fig. 1 Flow chart of locally linear embedding algorithm. |
|
In the text |
Fig. 2 Illustration of the geological background of the selected well data: the basic phase map of the Qijia-Gulong depression (left), the histogram of the thickness of sand body (right-up), and the histogram of the porosity (right-bottom). |
|
In the text |
Fig. 3 Comparison of the super-resolution performance of the proposed method under a) different neighborhood sizes and b) different local patch sizes. |
|
In the text |
Fig. 4 Illustration of 2X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
|
In the text |
Fig. 5 Illustration of indirect 4X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
|
In the text |
Fig. 6 Illustration of direct 4X super-resolution results comparison of different methods for a) GR curve and b) LLD curve. |
|
In the text |
Fig. 7 Comprehensive interpretation diagram of Well_3 logging curve. |
|
In the text |
Fig. 8 Illustration of the power spectrum comparison of different methods: a) Original GR; b) Bicubic Interpolation; c) Sparse representation; d) SRCNN; e) LLE. |
|
In the text |