Compressive Strength Prediction of High Strength Concrete using Regression and ANN Models

Compressive Strength Prediction of High Strength Concrete using Regression and ANN Models.

S Mandal1, Shilpa M2, R Rajeshwari3
Department of Civil Engineering
PES University, Bengaluru -560085
[email protected], [email protected], [email protected]: High strength concrete (HSC) is one of the most popular terminologies used in the concrete technology, which is known for benefits like high workable, durable and high ultimate strength. The estimation of the compressive strength (CS) using experimental method is too expensive and time consuming procedure and small error will lead to repetition of the work, to overcome this, alternative methods are used for prediction of the CS of HSC. In the present study, the experimental data on the strength of HSC of various mix designs are collected from authenticated journal papers, which are used to predict the CS using regression analysis (MLR- multi-linear regression) and artificial neural network (ANN) models. The collected data set are divided into two groups, one for training and other for testing. The input parameters used in regression and ANN models are cement content, superplasticizer, coarse aggregate, fly ash, fine aggregate, silica fume, blast furnace slag, water-cement ratio and the CS of HSC at 28 days is the output parameter. The models are developed using training dataset and the developed model is validated using testing dataset. The comparison is made between the CS obtained from the MLR and ANN models. The ANN model yields better correlation between predicted and actual values of the CS (test correlation for MLR-45.48% and ANN-95.03%) and the percentage of error also reduces as compared to that of MLR. From this investigation, it is observed that the ANN model can be used to predict the CS of HSC.Keywords: High strength concrete, artificial neural network, Multi-linear regression.

1. Introduction
Concrete is an excellent building material which carries the compressive stresses from large infrastructure because of this advantage concrete is used for the construction. Concrete is classified as different types based on their strength and performance. The concrete which has properties like high durable, workable and high ultimate-strength is called as HSC. The production of HSC depends upon the good selection and mix proportioning of the ingredient to achieve a high strength of concrete. The use of HSC in construction increases the service life of the structure and suffers less damage which would reduce overall cost.
In construction the estimation of the CS has received a large amount of interest because, the CS is one of the important mechanical properties of concrete. On specific topic several investigations are carried to estimate the CS of HSC with the addition of different types of admixtures. The comparison was made by considering the properties like CS, consumption of cement and economic validation. HSC saves the consumption of cement about 50% and which results in less usage of raw materials and reduction in overall costs (Alves et. al 2003). The performance of HSC is studied by variation of dosage of silicafume (SF) and water-cement ratio (w/c). SF effects negatively on the CS as compared to w/c ratio (Behnood and Ziari 2007). HSC gives the high workability and concrete strength without using the special ingredient except good quality of material and proportions of concrete (Rashid et. al 2008). The CS variation at different temperatures ranges are carried out. The study indicates that the CS decreases with increase in temperatures and structure may damage by fire (Husem 2005). Partial replacement of cement with other admixture increases the strength of concrete as compared to mixture made by 100% port land cement (Ibrahim et. al 2013).
The estimation of the CS by experimental method is too expensive and time consuming procedure and small error will lead to repetition of the work, to overcome this, an alternative method is used for prediction of the CS of HSC. Many attempts are made to develop the suitable mathematical models which have ability of find the strength of concrete at various ages and conditions. Sometimes mathematical model like regression analysis shows poor agreement with experimental data. To deal these drawbacks, a different method of modelling known as ANN is used. ANN is a soft computation method based on the structure and functionality of biological neuron in similar way how human brain works. The massages and information that flow through neural network, depends on the structure of ANN because ANN has the ability to learn on the basis of data provided for the input and output variables, and ANN method is trained and tested.
The present study involves the estimation of the strength of HSC. For this requirement, a total of 106 experimental data on HSC is extracted from authenticated journal papers; these data set are divided into two groups, 70% of data are taken for training and remaining 30% data are taken for testing the models. Finally the performances in estimating the CS of HSC using MLR and ANN are compared.
2. Modelling techniques
2.1 Multi-linear Regression Model
The linear regression model is a general form of regression models, the linear estimator functions are used to model the data, and output parameters are determined from the data. Some times for the regression analysis include more than one input variables which would lead to the formation of “multiple linear regression” function. MLR evaluates the correlation between two or more input variables by ?tting a linear equation between them and involves summarization of data as well as investigation of relationship between variables. Mathematically, a MLR model can be expressed as given below:
y = a? + a? x? +a? x?+…(1)
Where, y is the dependent variable, x?, x? … etc. are the independent input variables to the model, and a?, a? , a?,… are partial regression coefficients.
2.2 Artificial Neural Networks

The artificial neuron mimics the characteristics of the biological neuron that is human brain structure and it basically consists of inputs; each input represents the output of another neuron. The input used to solve the problem is multiplied with corresponding weights and these weighted inputs are summed with bias value. The summation of the inputs are processed in the hidden layer using transfer functions like linear, tan-sigmoid and log-sigmoid transfer functions etc. The processed information by transfer function sends along the output layer as required result. The arrangement of neurons into layers and the connection pattern between other layers is known as network architecture.

Figure 1: Structure of an Artificial Neuron Model.

Figure 1 shows the structure of an Artificial Neuron Model. Feed forward network (FFN) is generally used back propagation for training network where the error obtained at the output layer is moved back to the input and hidden layers for updating weights and decreases the errors to get the best output from ANN. The main aim of the FFN process is to reduce the overall error between the observed and estimated values by adjusting the weights, and these weights are combined and processed through an activation function and released to the output layer.

3. Data
For the present study, experimental data on HSC are obtained from various authenticated journals (Alves et. al 2003; Behnood and Ziari 2007; Husem 2005; Ibrahim et. al 2013; Elahia et. al 2010; Prasad et. al 2009; Biskri et. al 2017; Kumar et. al 2017; Hassan et. al 2000;Ouda 2015;Rashida et. al 2008). A total of 106 datasets collected and the data consists of input and output parameters. The input parameters include cement content, coarse aggregates, fine aggregates, blast furnace slag, silica fume, fly-ash content, super plasticizer and water-cement ratio. Output parameter is the 28 days CS of HSC. All collected data are converted into desirable units. Further, these data sets are normalized using Eq. 2.
Xn=Xi-XminXmax-Xmin (2)
Where, Xn is the normalized value, Xi is the actual value, is Xmax the maximum value, Xmin is the minimum value.

Table 1: Range of experimental variables
Variables Range
Cement(kg/m3) 135 – 912
Coarse aggregate(kg/m3) 814.54 – 1547.06
Fine aggregate(kg/m3) 241.93 – 1164
Fly ash(kg/m3) 0 – 270
Silica fume(kg/m3) 0 – 91
Blast furnace slag(kg/m3) 0 – 340
Super plasticizers(kg/m3) 0 – 32.2
W/C(kg/m3) 0.21 – 1.23
Compressive strength(MPa) 55 – 88.90
Total numbers of data sets collected are 106 and the data is divided into two sets: one is for testing and another for training purpose. Data considered for training consist of varying values covering maximum to minimum range. 70% of the data are taken for training purpose, therefore 75 data sets are used for training the network. 30% of the data are taken for testing purpose; hence 31 data sets are used for testing.

4. Methodology used for Model Development
4.1 MLR model
The MLR model constructed in the current study has eight input parameters namely cement content, superplasticizer (SP), coarse aggregate (CA), fly ash(F),fine aggregate(FA), , silica fume(SF), blast furnace slag(BFS) and w/c to get the 28 days CS as output parameter. Initially, the 75 train data set containing both the input and output parameters are used to get the regression coefficients for each variable and obtained CCtrain, thereafter remaining data are utilized to estimate the CS and obtained CCtest. The estimated and observed CS of testing data sets are compared based on the correlation
between them and performance of MLR in predicting the CS of HSC is assessed based the CC value of testing data set.

4.2 ANN model
Here, the ANN model with the 8- input nodes, 6-hidden nodes and one output node is developed to estimate the CS of HSC. The input information is passed along the input layer no computation are happens in this layer but it pass the information to the next layer called hidden layer in hidden layer inputs are multiplied by weights and the bias value is added to the each input nodes. These weighted inputs are processed by the transfer function of the ANN model, finally the processed information are received from output layer. The ANN model works on the basis of the Levenberg-Marquardt (LM) algorithm.

5. Results and Discussion
Regression (MLR) and ANN approaches have been used to predict the CS of HSC. The capability of these models was assessed using statistical measures like Correlation Coefficient (CC), Root Mean Square Error (RMSE) and Scatter Index (SI), which are defined as,
CC=i=1n(Oi-Oi)(Pi-Pi)/i=1nOi-Oi2Pi-Pi2(3)
RMSE=i=1n(Oi-Pi)2 n X100(4)
SI=RMSEOi(5)
Where Oi and Pi is the observed and predicted CS of HSC respectively. n is the number of data set used. Oi and Pi is the average observed and predicted CS of HSC respectively.

5.1 Regression Analysis:
The MLR model is trained using normalized training data set. After training the model, the MLR coefficients are used to predict the CS of test data set. Both train and test data sets are shown in Figures 2 and 3.
Eq. 6 is developed using training data:
Y = – 0.5009 + (0.7885*x1) + (0.7421*x2) + (0.5846*x3) + (0.1693*x4) + (0.0115*x5) +
(0.6027*x6) – (0.1595*x7) – (0.1018*x8)(6)

Figure 2: Comparison between predicted and observed CS for training and testing by MLR
Figure 2 shows variation of the predicted and observed CS of trained and tested data set with the CC of 0.6757 and 0.4548. This shows that prediction of CS by MLR is poor, as could be observed, when data are scattered and not along or near by the (y=mx) line. If the predicted data are mostly lying along a nearby (y=mx) line, the prediction is fairly, accurate.
5.2 Artificial Neural Network:
The ANN model is trained and tested using LM algorithm for a given input and output parameter. The network is trained for different number of hidden layer nodes; initially 3 hidden nodes are used to model. Further numbers of hidden layer nodes are increased to arrive at better results. The results obtained during training and testing processes showing the CC, RMSE and SI values are shown in Table 2.

Table 2: Statistical results obtained for ANN model (8-X-1).

NETWORK CC RMSE SI
8-3-1 Training
Testing 0.8868
0.8843 11.1774
10.9854 0.2800
0.2759
8-4-1 Training
Testing 0.9220
0.9002 9.3633
10.1574 0.2311
0.2589
8-5-1 Training
Testing 0.9475
0.9293 7.7485
8.8217 0.1913
0.2248
8-6-1 Training
Testing 0.9589
0.9503 6.8668
7.1643 0.1695
0.1826
8-7-1 Training
Testing 0.9505
0.9413 7.5208
7.9991 0.1856
0.2039
8-8-1 Training
Testing 0.9524
0.9393 7.3698
8.2017 0.1819
0.2090
CCs between the observed output and predicted output are calculated using Eq. 3. RMSE and SI between the observed output and predicted output are calculated using Eq. 4 and Eq. 5 respectively.

From Table 2, it is observed that CC obtained for the network 8-6-1 with 15 epochs has achieved the best value. The CC value obtained for trained and tested data is 0.9589 and 0.9503 respectively. The RMSE value is found to be 6.8668 and 7.1643 for training and testing respectively. The SI value is found to be 0.1695 and 0.1826 for training and testing respectively. Hence the ANN model with network 8-6-1(Figure 4) yields the best performance to predict the CS of HSC.
Figure 3: Correlation between predicted and observed CSs for training and testing by ANN (8-6-1).
Figure 3 shows the comparison between predicted and observed values of CS by ANN (8-6-1) with CC of 0.9589 and 0.9503 for trained and tested data respectively.

Figure 4: Structures of ANN (8-6-1).

Once the network is trained, the weight and bias values are fixed for that model. The ANN structure constructed for predicting the CS of HSC is shown in Figure.4. The structure consists of 8- input nodes, 6-hidden layer nodes, and one output node.

5.3 Comparison between MLR and ANN models
Table 3 shows the comparison between CC values for regression (MLR) and ANN models. The test data for CC of 45.48% by MLR shows that it gives poor correlation; whereas CC of 95.03% by ANN shows that the ANN model predicts the CS of HSC with very good correlation.

Table 3: CCs for MLR and ANN Models
Model CC
MLR Training
Testing 0.6757
0.4548
ANN Training
Testing 0.9589
0.9503
6. Conclusion

Regression and ANN models have been trained and tested with about 70:30 of the total data sets. Based on the present study, the following conclusions are drawn:
The regression model yields CC of 0.6757and 0.4548 for training and testing data respectively; this shows that estimation of the CS by MLR is poor.

For better prediction of the CS of HSC, a soft computing model such as ANN is used and the results are compared in terms of statistical measures such as CC, RMSE and SI.

ANN model yields a good correlation between the input parameters and compressive strength of HSC with 15 epochs. The statistical parameters obtained, CC- 0.9589 and 0.9503, RMSE – 6.86 and 7.18and SI- 0.16 and 0.18 for training and testing respectively, demonstrate that the predicted output values are very close to the actual output values.

With comparison to regression model, the performance of ANN models show good results in terms of statistical measures like CC, RMSE and SI for the observed and predicted CS of HSC. Therefore, the ANN model can be used to predict the CS of HSC.

References
Ali Behnood andHasanZiari, E?ects of silica fume addition and water to cement ratio on the properties of high-strength concrete after exposure to high temperatures, Cement & Concrete Composites – Elsevier 30 (2008) 106–112.

Ahmed S. Ouda, Development of high-performance heavy density concrete using different aggregates for gamma-rays shielding, Housing and Building National Research Center- Elsevier, HBRC Journal (2015)11, 328–338.

Ahmed Ibrahim, Hassan El-Chabib, Ahmed Eisa, Ultra strength Foldable Concrete Made with High Volumes of Supplementary Cementitious Materials, Journal of material in civil engineering-ASCE 25(12), (2013) 1830-1839.

A. Camões, B. Aguiar, S. Jalali, Durability of Low Cost High Performance Fly Ash Concrete, International Ash Utilization Symposium, Center for Applied Energy Research, University of Kentucky, Paper #43, 2003.

A. Elahia, P.A.M. Basheerb, S.V. Nanukuttanb, Q.U.Z. Khana, Mechanical and durability properties of high performance concretes containing supplementary cementitious materials, Construction and Building Materials-Elsevier 24 (2010) 292–299.

B.K Raghu Prasad , Hamid Eskandari, V. K VenkataramaRaddy , Prediction of compressive strength of SCC and HPC with high volume fly ash using ANN , Construction and Building Materials- Elsevier, 23 (2009), 117–128.

B.M. Vinay Kumar, H. Ananthan , K.V.A. Balaji, Experimental studies on utilization of recycled coarse and ?ne aggregates in high performance concrete mixes, Alexandria Engineering Journal(2017).

K.E. Hassan, J.G. Cabrera, R.S. Maliehe, The effect of mineral admixtures on the properties of high-performance concrete, Cement & Concrete Composites- Elsevier, 22 (2000), 267-271.

M.F. Alves, R.A. Cremonini, D.C.C. Dal Molin, A comparison of mix proportioning methods forhigh-strength concrete, Cement & Concrete Composites- Elsevier 26 (2004) 613–621.

Mohammad AbdurRashida and Mohammad AbulMansurb, Considerations in producing high strength concrete, Journal of Civil Engineering (IEB), 37(1) (2009) 53-63.

M. Husem, The effect of high temperature on compressive and flexural strengths of ordinary and high performance concrete, Fire Safety Journal – Elsevier, 41 (2006), 155–163.

YasminaBiskri, DjamelAchoura, NourredineChelghoum, Michel Mouret, Mechanical and durability characteristics of High Performance Concrete containing steel slag and crystalized slag as aggregates, Construction and Building Materials- Elsevier, 150 (2017), 167–178.

FaezehossadatKhademi, Mahmoud akbar, SayedMohammadmehdi, Mehdi nikoo, Multiple linear regression, arti?cial neural network, and fuzzy logic prediction of 28 days compressive strength of concrete, Front. Struct. Civ. Eng. 2017, 11(1): 90–99.

FatihAltun ,Ozgur Kisi, KamilAydin, Predicting the compressive strength of steel ?ber added lightweight concrete using neural network, Computational Materials Science- Elsevier , 42 (2008) 259–265.

FatihÖzcana, Cengiz D. Atis, OkanKarahanb, ErdalUncuog ?luc, HarunTanyildizid, Comparison of arti?cial neural network and fuzzy logic models for prediction of long-term compressive strength of silica fume concrete, Advances in Engineering Software- Elsevier, 40 (2009) 856–863.

Jui-Sheng Chou, Chih-Fong Tsai, Concrete compressive strength analysis using a combined classi?cation and regression technique, Automation in Construction Elsevier 24 (2012) 52 –60.

MarekSlonski, A comparison of model selection methods for compressive strength prediction of high-performance concrete using neural networks, Computers and Structures 88 (2010) 12-48.

Mohammad Iqbal Khan, Predicting properties of High Performance Concrete containing composite cementitious materials using Arti?cial Neural Networks, Automation in Construction-Elsevier, 22 (2012) 516–524.

Mustafa Sar?demir, Predicting the compressive strength of mortars containing metkaolin by arti?cial neural networks and fuzzy logic, Advances in Engineering Software-Elsevier, 40 (2009) 920–927.

NeelaDeshpande, ShreenivasLondhe, SushmaKulkarni, Modeling compressive strength of recycled aggregate concrete by Arti?cial Neural Network, Model Tree and Non-linear Regression, International Journal of Sustainable Built Environment (2014) 3, 187–198.

Pinar Akpinara, Adnan Khashmanb, Intelligent classification system for concrete compressive strength, Procedia Computer Science- Elsevier, 120 (2017) 712–718.

R. Parichatprecha, P. Nimityongskul, Analysis of durability of high performance concrete using arti?cial neural networks, Construction and Building Materials- Elsevier, 23 (2009) 910–917.

Seung-Chang Lee, Prediction of concrete strength using arti?cial neural networks, Engineering Structures 25 (2003) 849–857.

Z.H.Duan, S.C.Kou, C.S.Poon , Prediction of compressive strength of recycled aggregate concrete using arti?cial neural networks, Construction and Building Materials- Elsevier, 40 (2013) 1200–1206.