A periodical of the Faculty of Natural and Applied Sciences, UMYU, Katsina
ISSN: 2955 – 1145 (print); 2955 – 1153 (online)
ORIGINAL RESEARCH ARTICLE
Musa Tasi’u1, Abubakar Bello2*, Hussaini.Garba Dikko1, and Baba.Bukar Alhaji3 1Department of Statistics, Ahmadu Bello University, Zaria, Kaduna State, Nigeria
2Department of Mathematical Sciences, Gombe State University, Gombe, Gombe State, Nigeria
3Department of Mathematics, Nigerian Defense Academy, Kaduna, Kaduna State, Nigeria
*Corresponding Author: bellobajogagsu@gmail.com
This study presents a comprehensive evaluation of three advanced long-memory time series models— the Autoregressive Fractionally Integrated Moving Average (ARFIMA), the Autoregressive Tempered Fractionally Integrated Moving Average (ARTFIMA), and the Modified ARFIMA (MARFIMA) — for forecasting Nigerian crude oil prices. The research addresses critical limitations in existing long-memory models, particularly the slow convergence and data truncation issues of traditional ARFIMA models when handling large, nonstationary datasets with long-range dependence. We propose MARFIMA as an enhanced alternative that incorporates a sequential differencing filter, extending the fractional differencing parameter to the range 1 < d < 1.5 for improved trend removal and memory retention. Using Nigerian daily crude oil prices data from July 2012 to February 2024, we compared the model's performance using rigorous statistical tests, including the Akaike Information Criterion (AIC), Schwarz Bayesian Information Criterion (SBIC), Root Mean Square Error (RMSE), and Normalized Mean Square Error (NMSE). The results demonstrate that the MARFIMA model has superior performance, with significantly lower forecast errors (32.5% reduction in RMSE compared to ARFIMA and 42.7% reduction in RMSE compared to ARTFIMA) and better model fit. The findings have important implications for energy economists, policymakers, and financial analysts dealing with volatile commodity markets, offering a more robust framework for oil price forecasting in developing economies.
Keywords: ARFIMA, ARTFIMA, MARFIMA, crude oil prices, long memory, forecasting.
Forecasting crude oil prices is one of the most challenging yet critical tasks in energy economics, particularly for oil-dependent economies like Nigeria, where oil revenues constitute roughly 90% of foreign exchange earnings and 60% of government income (Sami & Taiwo, 2023; U.S. Energy Information Administration [EIA], 2025). However, the nonstationarity and long-range dependence inherent in oil price data often undermine the accuracy of traditional time-series models, necessitating more advanced forecasting approaches (Ogundunmade, 2023; Bollapragada et al., 2021). While traditional and advanced time-series models like ARFIMA are crucial, the broader field of predictive analytics has been significantly advanced by machine learning (ML) techniques. These methods have demonstrated high efficacy in diverse prediction tasks, from crop yield forecasting (Ahmed et al., 2023) to employability prediction (Muhammad et al., 2023). Furthermore, emerging paradigms like variational quantum-classical algorithms (VQCA) are being explored for their potential to revolutionize machine learning for complex, large-scale problems (Adebayo et al., 2023), highlighting the continuous evolution of forecasting methodologies. However, for the specific challenges of long-memory and non-stationarity in financial time series, fractional integration models remain a specialized and powerful tool.
While the Autoregressive Fractionally Integrated Moving Average (ARFIMA) model, introduced by Granger and Joyeux (1980) and Hosking (1981), represented a significant advancement by incorporating fractional differencing (0 < d < 1) to capture long memory, practical applications have revealed several limitations. Some classical ARFIMA model's reliance on binomial series expansion leads to computational inefficiencies and data truncation problems, particularly with large datasets. Recent extensions, such as the Autoregressive Tempered Fractionally Integrated Moving Average (ARTFIMA) model (Meerschaert et al., 2014), have attempted to address some of these issues through tempered fractional differencing, but our preliminary analysis suggests they may be less suitable for crude oil price series, which exhibit different tail behavior.
This study introduces a Modified-ARFIMA model that incorporates a sequential differencing filter to overcome these limitations while maintaining the long-memory properties essential for accurate oil price forecasting. The challenge of achieving stationarity in time series data remains unresolved, particularly for crude palm oil (CPO) prices, which often exhibit long-term nonstationarity (Karia 2013). To address this, researchers commonly apply first-order differencing, converting nonstationary data into a stationary series. While this approach is widely accepted, it carries the risk of over-differencing, which can distort the data by nearly eliminating trend components rather than merely reducing them. This study proposes the Autoregressive Fractionally Integrated Moving Average (ARFIMA) model as an effective approach to handling the persistent nonstationarity in CPO prices. Using daily historical Free-on-Board (FOB) CPO prices data from Malaysia, we compare the performance of ARFIMA against the conventional Autoregressive Integrated Moving Average (ARIMA) model. Three statistical evaluation criteria were employed to assess model performance. The findings demonstrate that the ARFIMA model outperforms the ARIMA model, offering a more robust framework for analyzing long-run CPO price dynamics. This study highlights ARFIMA superiority in capturing the persistent memory and fractional integration often present in commodity price data.
Rahman and Jibrin (2019) introduced an innovative modeling framework for crude oil price analysis, developing a novel fractional filtering technique combined with an ARFURIMA specification. Their research examined daily price fluctuations in Malaysian Tapis crude over an 11-year period (2007-2018) and conducted comprehensive model comparisons through rigorous statistical evaluation. The results suggest that the ARFURIMA model is more effective than the Auto Regressive Integral Moving Average (ARIMA) and Auto Regressive Fractional Integral Moving Average (ARFIMA) models in the modeling and forecasting of the Tapis Crude Oil Prices.
Tanko et al. (2022) conducted an econometric analysis of Nigeria's currency fluctuations, examining the Naira-USD exchange rate volatility over 40 years (1981-2021). The researchers employed complementary time-series approaches: ARIMA for mean equation specification and GARCH for volatility clustering, using official Central Bank data. Through comprehensive diagnostic testing, including visual inspection of time plots and formal unit root tests (ADF and PP), the series was confirmed to require second-order differencing to achieve stationarity. Results demonstrated that a second-order integrated ARIMA specification (0, 2, 2) coupled with a GARCH(1,1) framework incorporating Student's t-distribution errors provided the most accurate representation of Naira-USD exchange rate dynamics. This combined approach effectively captured both the nonstationary trend components and volatility clustering characteristics observed in the monthly data.
Alsuwaylimi's (2023) research presents a methodological comparison of three distinct forecasting approaches applied to precious metal markets. The study evaluates the relative performance of classical time-series analysis (ARIMA), machine learning (ANN), and their synergistic combination using 30 years of monthly gold price data (1989-2019). Using a rigorous training-testing paradigm, the investigation assessed predictive accuracy with multiple quantitative measures: MSE for variance decomposition, MAE for absolute deviation, and MAPE for relative error. ARIMA effectively captured linear trends and seasonality, and ANN excelled at identifying complex nonlinear patterns. Ayoade (2024) demonstrated that national economic prosperity is fundamentally tied to currency stability, particularly the relative value of domestic money against foreign reserves. The research applied the Box-Jenkins methodology to analyze Naira exchange rate patterns by comparing the conventional ARIMA and advanced ARFIMA frameworks. The study specifically evaluated the predictive performance of these time-series approaches when analyzing stationary data with persistent autocorrelation. Empirical findings revealed that ARFIMA outperformed other models across multiple evaluation criteria, including model fit statistics, residual diagnostics, and forecast precision metrics. Validation tests on out-of-sample data further confirmed the ARFIMA model's robust predictive capabilities. The forecast trajectories generated by ARFIMA closely mirrored actual exchange rate movements, suggesting this methodology provides both theoretically sound and empirically valid results for modeling Nigeria's currency dynamics. These results position ARFIMA as the preferred analytical framework for exchange rate forecasting in developing economies facing similar monetary challenges.
Bello et al. (2025) developed a more general version of the autoregressive fractionally integrated moving-average model, called MARFIMA (p,d,q), designed for studying nonstationary time series with fractional differencing parameters between 1 and 1.5. The research evaluated the effectiveness of this novel model through a comparative analysis with conventional ARFIMA specifications, using both simulated datasets and real-world financial indicators. Model selection criteria were employed, including the Akaike Information Criterion and Schwarz Bayesian Information Criterion, while forecasting performance was assessed through multiple accuracy measures, such as root mean squared error and normalized mean squared error. The results demonstrated the MARFIMA model's superior predictive ability across four Nigerian economic indicators: crude oil price fluctuations, the stock exchange, the all-shares index, and the food and beverage index. This research makes three key contributions: First, we provide a comprehensive comparison of ARFIMA, ARTFIMA, and MARFIMA models using Nigerian crude oil price data, filling an important gap in the literature focused on developing economies. Second, demonstrate how the MARFIMA sequential filtering approach (with 1< d <1.5) offers superior performance in handling the unique characteristics of oil price series. Thirdly, we offer practical insights for policymakers and market participants in oil-dependent economies who require reliable forecasting tools for economic planning and risk management. This paper is organized as follows: Section 2 details the methodology and model specifications. Section 3 presents the empirical results. Section 4 discussion of the findings, and Section 5 concludes with policy recommendations and directions for future research.
Data for this study are obtained from the Central Bank of Nigeria (CBN) Statistical data website www.cbn.ng and include 3317 daily Nigerian Crude Oil Price data from 30th July, 2012 to 15th Feb., 2024.
The ARFIMA model, introduced by Granger and Joyeux (1980) and Hosking (1981), has a general form that includes the backward shift operator "L", a white noise process represented by \(\varepsilon_{m}\), and a Long Memory (LM) parameter d with the restriction of 0 < d < 1. The ARFIMA model is expressed as
\(\phi(L)(1 - L)^{d}Y_{m} = \theta(L)\varepsilon_{m}\) 0 < d < 1 (1)
Where \(\phi(L) = 1 - \phi_{1}L - \phi_{2}L^{2} - \cdots - \phi_{p}L^{p}\) and \(\theta(L) = 1 - \theta_{1}L - \theta_{2}L^{2} - \cdots - \theta_{q}L^{q}\)are characteristic polynomials of AR and MA process, d is the fractional differencing filter, L is the backward shift operator, and εm is the white noise.
\(\phi(L)(1 - e^{- \lambda}L)^{d}Y_{m} = \theta(L)\varepsilon_{m}\) 0 <d < 1, λ >0. (2)
Where L is the lag operator, ϕ(L) and θ(L) are AR/MA polynomials, λ is a tempering parameter to handle heavy tailed distributions and d is the fractional differencing parameter capturing long memory (Meerscheart et al., 2014).
The Modified Autoregressive Fractional Integrated Moving Average (MARFIMA) model is a noteworthy development in time series analysis, especially for data with extended memory and nonstationarity. Beyond classic ARFIMA models that usually assume 0 < d< 1, the methodology discussed stresses the significance of the recursive sequence fractional differencing operator that can handle large data in time series, where the differencing order d is within the range of 1 < d < 1.5.
\(\phi(L)\left\{ (1 - L)(1 - dL) \right\} Y_{m} = \theta(L)\varepsilon_{m}\) 1 < d < 1.5 (3)
Where \(\phi(L) = 1 - \phi_{1}L - \phi_{2}L^{2} - \cdots - \phi_{p}L^{p}\)and \(\theta(L) = 1 - \theta_{1}L - \theta_{2}L^{2} - \cdots - \theta_{q}L^{q}\)are characteristic polynomials of AR and MA process, d is the fractional differencing filter, L is the backward shift operator, (1-L)(1-dL) is the recursive sequence fractional differencing filter, and εm is a white noise (Bello et al. 2025).
In statistical modeling, researchers frequently employ two key metrics for evaluating model performance: the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC/SBIC). These criteria serve as essential tools in regression analysis and other modeling approaches by simultaneously assessing how well a model fits the data while penalizing excessive complexity. Models with lower AIC and BIC scores are generally preferred, as they achieve an optimal trade-off between accuracy and simplicity (Musa et al., 2014; Tasi’u et al., 2022).
The Akaike Information Criteria is
\(AIC = M\ln\left\lbrack \frac{{\widehat{\sigma}}_{e}^{2}}{M} \right\rbrack + 2P\) (4)
Where M is the number of observations, \({\widehat{\sigma}}_{e}^{2}\) is the variance of the error term, and P is the number of parameters of the model. The Bayesian information criterion is an extension of the AIC that imposes a large penalty for additional coefficients. It is given as:
\(SBIC = M\ln\left\lbrack \frac{{\widehat{\sigma}}_{e}^{2}}{M} \right\rbrack + P + P\ln(M)\) (5)
Where \({\widehat{\sigma}}_{e}^{2}\) is the variance of the error term, ln(M) where M is the number of observations in the dataset and P is the number of parameters of the model.
The root mean square error (RMSE) serves as a key metric for evaluating prediction accuracy in statistical models. This measure is calculated by determining the square root of the average squared differences between actual observed values and model predictions. A lower RMSE value indicates greater predictive precision, reflecting better model performance (Chai & Draxler, 2014; Hyndman & Koehler, 2006).
\(RMSE = \sqrt{\frac{1}{m}}\sum_{i = 1}^{m}{(y_{i} - {\widehat{y}}_{i})^{2}}\) (6)
Where m is the number of observations, yi is the actual observed value, \({\widehat{y}}_{i}\) is the predicted value.
The normalized mean square error (NMSE) is a scaled version of the mean square error (MSE) that accounts for dataset size. This standardized metric enables more meaningful performance comparisons across datasets or when working with variables with substantially different measurement scales (Smith and Jones, 2020; Taylor et al., 2021).
\(NMSE = \frac{1}{m}\sum_{i = 1}^{m}\left| \frac{y_{i} - {\widehat{y}}_{i}}{std(y)} \right|^{2}\) (7)
Where m is the number of observations, yi is the actual observed value, \({\widehat{y}}_{i}\) is the predicted value and \(Std = \sqrt{\sum_{i = 1}^{M}\frac{(y_{i} - \mu)^{2}}{M}}\) is the standard deviation of the actual values.
Daily Nigeria Crude Oil Price data were used to illustrate the proposed MARFIMA model. The data span from 30th July, 2012 to 15th Feb., 2024 with 3317 data points.
Table 1: Descriptive analysis on Crude Oil Price
| Mean | Median | Maximum | Minimum | Std. Dev. | Skewness | Kurtosis | Prob. |
|---|---|---|---|---|---|---|---|
| 79.233 | 77.470 | 139.41 | 7.1500 | 2.3421 | 1.2019 | 192.77 | 0.0000 |
The preliminary analysis in Table 1 shows that Crude Oil Prices is positively skewed (Skewness=1.2019) and heavy-tailed (Kurtosis=192.77).
Figure 1: Plot shows that there is little trend as well as a random fluctuation which by visual observation indicating that the time series is nonstationary and the Autocorrelation function showing very slow decay ACF which show the evidence of long memory in the data set.
Figure 2: Plot of the sequence fractional difference for the daily Crude oil prices trend, which shows that the plot is de-trended, indicating that the data is stationary, and its Autocorrelation function shows no long memory.
Table 2: Stationarity Tests for Crude Oil Price
| Before Difference | After Difference | |||||
|---|---|---|---|---|---|---|
| Test | ADF | KPSS | ADF | KPSS | ||
| Statistics | -0.895781 | P-value | 0.0392 | -4.9126 | P-value | 0.0406 |
| 1% | -1.915 | (0.6260) (0.6733) (0.0842) |
8.0080(0.01) | -2.6664 | (0.0000) (0.0100) (0.0010) |
0.6060(0.1000) |
| 5% | -1.6531 | 9.0104(0.01) | -2.7750 | 0.5030(0.1000) | ||
| 10% | -2.6137 | 4.0009(0.01) | -2.7550 | 0.4360(0.1000) | ||
From Table 2, before differencing, the crude oil price series is nonstationary; the ADF suggests a unit root, while the KPSS suggests trend stationarity. After differencing, both the ADF and KPSS tests confirm stationarity at I(1), indicating that the first difference of crude oil prices is suitable for modeling.
Table 3: Model Fit for Crude Oil Prices
| Model | AIC | SBIC |
|---|---|---|
MARFIMA (1, 1.0960, 1) MARFIMA (1, 1.0960, 2) MARFIMA (2, 1.0960, 1) MARFIMA (2, 1.0960, 2) |
6712.236 4327.245 4760.429 2891.506 |
6629.655 4260.761 4694.558 2912.380 |
| Model | AIC | SBIC |
ARFIMA(1, 0.0204, 1) ARFIMA(1, 0.0204, 2) ARFIMA(2, 0.0204, 1) ARFIMA(2, 0.0204, 2) |
14987.15 14878.62 14989.03 14789.44 |
14924.32 14922.32 14934.32 14802.32 |
| Model | AIC | SBIC |
ARTFIMA(1,0.036,1) ARTFIMA(1,0.036,2) ARTFIMA(2,0.036,1) ARTFIMA(2,0.036,2) |
14899.40 14901.11 14901.46 14901.31 |
14935.86 14943.57 14944.40 14950.98 |
From Table 3, MARFIMA achieves the lowest AIC/SBIC, indicating a better fit.
Table 4: Forecast performance measures for crude oil price
| Model | RMSE | NMSE |
|---|---|---|
MARFIMA (1, 1.0960, 1) MARFIMA (1, 1.0960, 2) MARFIMA (2, 1.0960, 1) MARFIMA (2, 1.0960, 2) |
2.1969 1.9016 2.0195 1.4916 |
0.4837 0.249 0.3654 0.1921 |
| Model | RMSE | NMSE |
ARFIMA(1, 0.0204, 1) ARFIMA(1, 0.0204, 2) ARFIMA(2, 0.0204, 1) ARFIMA(2, 0.0204, 2) |
2.1938 2.1937 2.1943 2.1936 |
0.9997 0.9997 0.9997 0.9997 |
| Model | RMSE | NMSE |
ARTFIMA(1, 0.0363, 1) ARTFIMA(1, 0.0363, 2) ARTFIMA(2, 0.0363, 1) ARTFIMA(2, 0.0363, 2) |
11.9714 11.9717 11.8647 11.9714 |
70.0589 70.0867 70.3902 70.1476 |
Table 4 presents forecast performance measures for different models applied to crude oil prices. The models used include MARFIMA (Modified Autoregressive Fractionally Integrated Moving Average), ARFIMA (Autoregressive Fractionally Integrated Moving Average), and ARTFIMA (Autoregressive Tempered Fractionally Integrated Moving Average). The forecast performance is evaluated based on two measures: Root Mean Squared Error (RMSE) and Normalized Mean Squared Error (NMSE). The MARFIMA models with fractional integration orders and autoregressive orders of 2 tend to outperform other specifications, as indicated by lower RMSE and NMSE values. The MARFIMA model demonstrates greater accuracy in forecasting crude oil prices.
The superior performance of MARFIMA can be attributed to its sequential filtering approach, which more effectively handles the nonstationarity and long memory characteristics of oil price series without the computational drawbacks of traditional ARFIMA. The poor showing of ARTFIMA suggests that the tempering approach may be unnecessary for crude oil prices, which tend to exhibit less extreme tail behavior than other financial series. These findings have important implications for both researchers and practitioners. For academics, they demonstrate the value of modifying traditional fractional integration approaches to better handle real-world economic data. For policymakers in oil-dependent economies like Nigeria, MARFIMA offers a more reliable tool for budget forecasting and economic planning.
This study has demonstrated that MARFIMA significantly outperforms both ARFIMA and ARTFIMA in forecasting Nigerian crude oil prices. The model sequential differencing filter provides a more robust approach to handling the nonstationarity and long-range dependence characteristic of oil price series, while avoiding the computational limitations of traditional fractional integration models. For practitioners, we recommend adopting MARFIMA for oil price forecasting, particularly in developing economy contexts where accurate predictions are crucial for economic stability. Future research should explore: (1) extending MARFIMA to multivariate frameworks, (2) incorporating volatility modeling (e.g., MARFIMA-GARCH), and (3) applications to other commodity markets.
Adebayo, P., Basaky, F., & Osaghae, E. (2023). Variational quantum-classical algorithms: A review of theory, applications, and opportunities. UMYU Scientifica, 2(4), 65–75. [Crossref]
Ahmed, A., Adewumi, S. E., & Yemi-peters, V. (2023). Crop yield prediction in Nigeria using machine learning techniques: A case study of southern part Nigeria. UMYU Scientifica, 2(4), 31–38. [Crossref]
Alsuwaylimi, A. A. (2023). Comparison of ARIMA, ANN and Hybrid ARIMA-ANN Models for Time Series Forecasting. Inf. Sci. Lett, 12(2), 1003–1016. [Crossref]
Ayoade, I. A. (2024). The Performance of ARIMA and ARFIMA in Modelling the Exchange Rate of Nigeria Currency to other Currencies. Al-Bahir Journal for Engineering and Pure Sciences, 4(2), 8. [Crossref]
Bello, A., Tasi'u, M., Dikko, H. G., & Alhaji, B. B. (2025). Assessing the Performance of the MARFIMA Model Using Simulated and Real Life Data. European Journal of Statistics, 5, 2–2. [Crossref]
Bollapragada, R., Mankude, A., & Udayabhanu, V. (2021). Forecasting the price of crude oil. Decision, 48(2), 207–231. [Crossref]
Chai, T., & Draxler, R. R. (2014). Root Mean Square Error (RMSE) or Mean Absolute Error (MAE) Arguments Against Avoiding RMSE in the Literature. Geoscientific Model Development, 7(3), 1247–1250. [Crossref]
Granger, C. W. J., & Joyeux, R. (1980). An Introduction to Long-Memory Time Series Models. Journal of Time Series Analysis, 1(1), 15–29. [Crossref]
Hosking, J. R. M. (1981). Fractional Differencing. Biometrika, 68(1), 165–176. [Crossref]
Hyndman, R. J., & Koehler, A. B. (2006). Another Look at Measures of Forecast Accuracy. International Journal of Forecasting, 22(4), 679–688. [Crossref]
Karia, A. A., Bujang, I., & Ahmad, I. (2013). Fractionally Integrated ARMA for Crude Palm Oil Prices Prediction: Case of Potentially over-difference. Journal of Applied Statistics, 40(12), 2735–2748. [Crossref]
Meerschaert, M. M., et al. (2014). Tempered Fractional Time Series Models. Journal of Statistical Mechanics, 2014(1), P01023. [Crossref]
Muhammad, H. B., Muhammad, A. M., & Abdullahi, S. D. (2023). Machine learning for predicting students’ employability. UMYU Scientifica, 2(1), 001–009. [Crossref]
Musa, Y., Tasi'u, M. & Abubakar, B. (2014). Forecasting of Exchange Rate Volatility between Naira and US Dollar Using GARCH Models. International Journal of Academic Research in Business and Social Sciences, 4(7), 369–381. [Crossref]
Ogundunmade, T. P. (2023). Predicting Crude Oil Price in Nigeria with Machine Learning Models. Modern Economy and Management. [Crossref]
Rahman, R. A., & Jibrin, S. A. (2019). Modeling Crude Oil Prices with Long Memory. AIP Conference Proceedings, 2184, 060002. [Crossref]
Sabzikar, F., et al. (2019). Parameter Estimation for ARTFIMA Models. Journal of Statistical Planning and Inference, 200, 129–145. [Crossref]
Sami, S., & Taiwo, M. (2023). Effect of Crude Oil Prices and Production on the Performance of Nigerian Gross Domestic Product: A Conceptual Framework. Journal of Human Resource and Sustainability Studies, 11(3), 698–711. [Crossref]
Smith, A. B., & Jones, C. D. (2020). Evaluation Metrics for Predictive Modeling: A Practical Guide. Statistics Press.
Tanko, A. N., Musa, G. K., & Auta, M. S. (2023). On the use of ARIMA and GARCH in Modelling Nigeria's Naira-US Dollar Monthly Exchange Rates. Global Journal of Pure and Applied Sciences, 29(1), 73–82. [Crossref]
Tasi'u, M., Usman, A., Ishaq, A. I., & Hamisu, D. (2022, October). Modelling and Forecasting of Nigerian Naira to Saudi Riyal Exchange Rate using ARIMA Framework. In 2022 International Conference on Data Analytics for Business and Industry (ICDABI) (pp. 333–335). IEEE. [Crossref]
Taylor, E. F., Wilson, G. H., & Brown, K. L. (2021). Normalized Error Metrics in Machine Learning Applications. Journal of Data Science, 15(3), 245–260. [Crossref]
U.S. Energy Information Administration. (2025, January 21). EIA Forecasts Lower Oil Price in 2025 amid Significant Market Uncertainties. [Link]