Forecasting Index of Industrial Production Sub-Series Using Statistical and Deep Learning Approach
Forecasting Index of Industrial Production Sub-Series Using Statistical and Deep Learning Approach
Abstract: The Index of Industrial Production (IIP) is a key economic indicator that tracks manufacturing activity across
various sectors. This paper aims to predict the IIP for three sub-series—Mining, Manufacturing, and Electricity—using
both conventional statistical methods and deep learning approaches, analyzing data from April 2012 to September 2022.
Model performance is evaluated by comparing Mean Square Error (MSE), Mean Absolute Percentage Error (MAPE), and
Root Mean Square Error (RMSE)1. The results show that the RNN model outperforms other models for all three sub-series,
and is used to forecast these sub-series from October 2022 to September 2023.
Keywords: IIP, Mining, Manufacturing, Electricity, SARIMA, FFNN, RNN, LSTM, Forecasting.
How to Cite: P. Preethi; S. A. Jyothi Rani; V. V. Haragopal (2025) Forecasting Index of Industrial Production Sub-Series Using
Statistical and Deep Learning Approach. International Journal of Innovative Science and Research Technology,
10(7), 3539-3547. https://doi.org/10.38124/ijisrt/25jul1699
Rani, SA Jyothi, and N. Chandan Babu (2020): “This III. OBJECTIVES OF THE STUDY
paper presents the forecasting of rice production (in million
tonnes) using Auto Regressive Integrated Moving Averages To fit the model using the ARIMA method
To fit the model using the Deep Learning Method-FFNN, “The results of forecasting are presented using
RNN, and LSTM, using python code. different methods. The methods are compared using Mean
Absolute Error (MAE), Root Mean Squared Error (RMSE),
To Identify the best model by comparing MSE, MAE, and and Mean Absolute Percentage Error (MAPE) which are
RMSE out of all fitted SARIMA and Deep learning given below” [1]:
models.
MSE=1/n ∑ (Yt-Ft)2
To forecast the IIP values for all three sub-series Mining,
Manufacturing, and Electricity for the next 12 months MAE=1/n ∑ |Yt-Ft|
using best-fit model.
Fig 1 Trends of IIP Data from April 2012 to Sept 2022 for all 3 Sectors
“First examined whether the data is stationary or not the results show that the data is not stationary for all the sub-
using the Dickey-fuller Test for IIP’s three data series- series. Therefore, the data is converted into stationary by
Mining, Manufacturing, Electricity as shown in Table 1 and using the first or second-order difference of the datasets.
To prevent overfitting in neural networks, we divided In addition, models are fitted using different
the original dataset into training and testing sets. We tested specifications as described below.
different combinations of these datasets across the following
neural network architectures: Activation function: Rectified Linear Unit;
Validation Generator: Timeseries Generator;
Feed Forward with splits of [70%, 30%], [75%, 25%], Batch and Sequence Size: 1 and 12.
and [80%, 20%]; RNN with splits of [70%, 30%], [75%,
25%], and [80%, 20%]; and LSTM with splits of [70%, 30%], The following three metrics— MSE, MAE, and
[75%, 25%], and [80%, 20%]. These analyses were RMSE, are used to compare the methods.
conducted across three different data series: Mining,
Manufacturing, and Electricity for the training datasets and
the results are presented in Table 2.
RNN Simple RNN 70% 7.95 2.30 2.82 7.58 2.25 2.75 13.90 3.07 3.72
(64)
DL (32)
DL (1)
RNN Simple RNN 75% 3.72 1.53 1.92 15.62 3.20 3.95 6.14 1.88 2.47
(64)
DL (32)
DL (1)
RNN Simple RNN 80% 3.61 1.50 1.90 58.13 4.99 7.62 7.83 2.12 2.79
(64)
DL (32)
DL (1)
LSTM LSTM (50) 70% 24.60 3.86 4.95 13.29 2.86 3.64 16.88 3.36 4.10
LSTM (50)
DL (1)
LSTM LSTM (50) 75% 15.54 3.18 3.94 34.12 4.91 5.84 23.14 4.01 4.81
LSTM (50)
DL (1)
LSTM LSTM (50) 80% 6.21 2.04 2.49 89.04 4.80 9.43 32.23 4.03 5.67
LSTM (50)
DL (1)
LSTM LSTM (50) 70% 16.05 3.13 4.00 14.37 3.15 3.79 33.80 4.77 5.81
LSTM (50)
DL (32)
DL (1)
LSTM LSTM (50) 75% 20.47 3.54 4.52 23.99 3.84 4.89 44.76 5.46 6.69
LSTM (50)
DL (32)
DL (1)
LSTM LSTM (50) 80% 17.22 3.06 4.15 81.02 4.87 9.00 32.04 4.27 5.66
LSTM (50)
DL (32)
DL (1)
Referring to Table 2, it becomes evident that the MSE, models across all series, namely Mining, Manufacturing, and
MAE, and RMSE values are consistently minimized when Electricity. With the adoption of the 75% training dataset,
75% of the training observations are used. Consequently, it SARIMA models were implemented for all three data series
was decided that 75% of the observations would serve as the -Mining, Manufacturing, and Electricity which are shown in
training dataset, while the remaining 25% would be Tables 3, 4, and 5.
designated as the testing dataset for the execution of different
The best models are Mining-SARIMA (1,1,1) (0,1,1) FFNN, RNN, and LSTM are executed using the commands
[12], Manufacturing- SARIMA (0,1,1) (1,1,1) [12], called Sequential, Simple RNN, and LSTM in Python. The
Electricity-SARIMA (1,1,0) (0,1,1) [12]. Actual Vs. Fitted values have been plotted for the Mining
series for all the models with [75%, 25%] split shown in Fig.
After fitting the SARIMA model, the other models 2. The blue trend indicates the mining IIP values, the orange
FFNN, RNN, and LSTM were fitted by considering training trend indicates the predicted training values and the green
and testing split of [75%,25%] using Python code. The trend indicates the predicted testing values.
The above Fig. 2, clearly shows that the SARIMA, the accuracy of projection is also quantified for the testing
FFNN, and RNN model fits well for training datasets whereas dataset by calculating the MSE, MAE, and RMSE of the
RNN and LSTM models fit well for testing data. Therefore, testing dataset of the mining series.
From above Table 6, we observe that MSE, MAE, and Similarly, The Actual Vs. Fitted values have been
RMSE are the least for the RNN model. Therefore, from plotted for the Manufacturing series for all the models with
Table 6 and Table 2, we conclude that RNN (SimpleRNN-64, [75%, 25%] split shown in Fig. 3. The blue trend indicates the
Dense Layer-1) is the best fit for testing as well as training manufacturing IIP values, the orange trend indicates the
datasets of mining data series. Hence, we conclude that RNN predicted training values and the green trend indicates the
(SimpleRNN-64, Dense Layer-1) model outperforms the predicted testing values.
other models and will be used for forecasting the mining data
series.
The above Fig 3 clearly shows that the SARIMA, RNN projection is also quantified for the testing dataset by
model fits well for training datasets whereas only the RNN calculating MSE, MAE, and RMSE to get the accuracy of the
model fits well for testing data. Therefore, the accuracy of testing dataset of the manufacturing series.
From Table 7 we observe that MSE, MAE, and RMSE the manufacturing data series. Similarly, The Actual Vs.
are the least for the RNN model. Therefore, from Table 7 and Fitted values have been plotted for the Electricity series for
Table 2, we conclude that RNN (SimpleRNN-64, Dense all the models with [75%, 25%] split shown in Fig 4. The blue
Layer-1) is the best fit for testing as well as the training trend indicates the electricity IIP values, the orange trend
dataset of the Manufacturing data series. Hence, we conclude indicates the predicted training values and the green trend
that the RNN (SimpleRNN-64, Dense Layer-1) model indicates the predicted testing values.
outperforms the other models and will be used for forecasting
The above Fig. 4 clearly shows that the SARIMA, accuracy of projection is also quantified for the testing dataset
FNN, and RNN model fits well for training datasets whereas by calculating MSE, MAE, and RMSE of the testing dataset
only the RNN model fits well for testing data. Therefore, the of the electricity series.
Table 9 Forecasted Values of IIP from Oct-2020 to Sept-2023 for all three Series
Months Mining Manufacturing Electricity
Oct-22 102.66 127.61 191.08
Nov-22 103.13 127.73 197.19
Dec-22 117.22 127.35 196.83
Jan-23 128.72 129.72 194.27
Feb-23 133.43 124.64 189.90
Mar-23 146.68 132.74 183.75
Apr-23 162.12 124.34 181.36
May-23 171.21 131.89 183.81
Jun-23 183.95 126.58 191.10
Jul-23 177.63 130.28 199.06
Aug-23 172.84 126.90 204.47
Sep-23 160.75 129.01 202.47
REFERENCES