Considering that there exist many factors having highly nonlinear effects on the concentration of the 4-carboxybenzaldehyde (4-CBA), which is the most important intermediate product of the oxidation of the p-xylene (PX) to terephthalic acid (TA), a modified back propagation algorithm embedded with ridge regression (BP-RR) was proposed to develop a soft sensor of the 4-CBA concentration. To overcome the two main flaws of regular multi-layer neural networks, i.e. the tendency of overfitting and the difficulty to determine the optimal number of neurons for the hidden layer, firstly, a three-layer network is selected and the number of the hidden-layer neurons is determined according to the number of the training samples and the number of the neural network parameters. Then, BP is applied to learn from the training samples. In sequel, the ridge regression is employed to remove the multicollincarity among the hidden-layer-node outputs and obtain the optimal weights (and thresholds) between the hidden layer and the output layer to replace the original values obtained by BP. Thus the neural network model with good prediction ability is developed. In addition, the ridge regression uses heuristic differential evolution algorithm to optimize ridge parameter according to the prediction accuracy of the model. The results show that the optimal value of ridge parameter is adaptively determined according to the degree of multicollinearity among the hidden-layer-node output, and then the good prediction ability model with the robust character is obtained by BP-RR. The best and the mean prediction accuracies of the neural network models developed by BP-RR are higher than those of the neural network models trained by BP alone and obtained by pruning algorithms based on principle component analysis.