-
In MMM, the binding energy of the nucleus is expressed in two parts: the macroscopic liquid-drop part and the microscopic shell correction part. The simplest macroscopic expression is derived from the LDM, which can well explain the variation of nuclear binding energy except for a small deviation. According to the Strutinsky energy theory, the fluctuation of nuclear binding energy is provided by shell correction energy. Based on this idea, we also express the binding energy of the nucleus as the sum of the two parts of the energy
$ B(Z, A)_{\rm LDM}^{\rm NN} = E_{\rm LDM}(Z, A)+\delta B_{\rm LDM}^{\rm NN}. $
(1) Unlike normal MMM, the macroscopic part of the energy does not take into account the deformation of the nucleus. Instead, the deformation energy, the shell correction energy and other energies are obtained using the NN approach. The energy of the liquid-drop can be expressed as
$ \begin{aligned}[b] E_{\rm LDM} =& a_{\rm V}(1+k_{\rm V}I^2)A+a_{\rm S}(1+k_{\rm S}I^2)A^{{2}/{3}}\\&+a_{\rm C}\frac{Z^2}{A^{{1}/{3}}}(1-Z^{-{{2}/{3}}}) +c_1\frac{2-|I|}{2+|I|A}I^2A+a_{\rm pair}A^{-{{1}/{3}}}\delta_{np}, \end{aligned} $
(2) where Z and A are the proton and mass numbers, respectively.
$I = (A-2Z)/A$ is the charge-asymmetry parameter of the nucleus. The meanings of the terms in the formula are volume energy, surface energy, Coulomb energy, symmetric energy and average pair energy.$\delta_{np}$ is expressed as [11]$ {\delta _{{np}}} = \left\{ {\begin{array}{l} {2 - |I|,\;{\rm{ for\;even - }}Z,\;{\rm{ even - }}N,\;}\\ \quad\;\; {|I|,\;{\rm{ for\;odd - }}Z,\;{\rm{ odd - }}N,}\\ {1 - |I|,\;{\rm{ for\;odd - }}Z,\;{\rm{ even - }}\;N\;{\rm{ and }}\;N > Z,}\\ {1 - |I|,\;{\rm{ for\;even - }}Z,\;{\rm{ odd - }}N\;{\rm{ and }}\;N < Z,}\\ \quad\;\;\; {1,\;{\rm{ for\;odd - }}Z,\;{\rm{ even - }}N\;{\rm{ and }}\;N < Z,}\\ \quad\;\;\; {1,\;{\rm{ for\;even - }}Z,\;{\rm{ odd - }}N\;{\rm{ and }}\;N > Z.} \end{array}} \right. $
The parameters in Eq. (2) are obtained by fitting the binding energy in AME2016 [31]. We select 2314 nuclei with the number of protons and neutrons greater than or equal to 8 and the error of mass measurement less than or equal to 150 keV [32]. The fitting results are listed in Table 1. It can be seen that the root-mean-square deviation (RMSD) after fitting is 2.385 MeV. In Fig. 1, we use black solid circles to represent the deviation between the binding energy calculated by the LDM and the experimental values. This deviation is small relative to the total binding energy. But for the study of nuclear physics, such as nuclear astrophysics, nuclear reactions, decay energy of the nucleus and so on, such precision is far from enough. Next, we will introduce a NN approach to improve the precision of the LDM.
Parameter Value $ a_{\rm V} $ −15.6670 $k_{\rm V}$ 2.0219 $a_{\rm S}$ 18.4334 $k_{\rm S}$ −3.1785 $a_{\rm C}$ 0.7153 $c_1$ 44.5756 $a_{\rm pair}$ −6.8522 σ/MeV 2.385 Table 1. Model parameters of the LDM.
Figure 1. (color online) Deviation of theoretical calculation of binding energy from experimental values. Black solid circles are the deviation between the LDM result and the experimental value, and red solid squares are the modified results using the NN approach.
The ANN approach is a mathematical model that simulates brain function. A neuron takes one input, performs some mathematical operation, and then produces an output. In our calculation, we use a feedforward neural network. Feedforward neural networks adopt a unidirectional multi-layer structure. Each layer contains several neurons. In this NN model, each neuron can receive the signal from the previous layer and produce output to the next layer.
In an NN approach for nuclear binding energy, there are usually two input neurons, the number of protons Z and the number of nucleons A [22]. As for the LDM, the results can be approximated by the MMM after training. To get more accurate results, more physical information needs to be included in the input terms of the NN approach. In Ref. [23], the distance between the proton and neutron to the nearest magic number is taken into account in the input neurons. In Ref. [24], the pairing and shell effects are added to the input neurons, producing better results. It can be seen from literature that better results can be obtained by adding effective information as input neurons.
In Refs. [23, 24], only the shell contributions closest to the proton and neutron magic numbers are considered in the input neurons. But we think that between the two shells, the previous magic number and the next magic number both contribute to the energy of the nucleus. So in this contribution, the NN model is constructed with an input layer including six input neurons (proton number Z, mass number A, valence nucleus and hole number of neutrons (
$ |N-N_0| $ and$ |N_1-N| $ , and valence nucleus and hole number of protons$ |Z-Z_0| $ and$ |Z_1-Z| $ ), a hidden layer, and an output layer ($ \delta B_{\rm LDM}^{\rm NN} $ ).$ Z_0 $ (8, 20, 50 and 82) and$ Z_1 $ (20, 50, 82 and 126) here represent the number of protons in a given nucleus at the radius of the last magic number and the next magic number.$ N_0 $ (8, 20, 50, 82 and 126) and$ N_1 $ (20, 50, 82, 126 and 184) are the same numbers for neutrons. For example, for$ ^{56}_{26}{\rm Fe}^{}_{30} $ , the inputs are, in turn 26, 56, 10 (30-20), 20 (50-30), 6 (26-20) and 24 (50-26). Since there is no prior indication to determine the number of neurons H in the optimal hidden layer, some considerations are used to determine the optimal choice. After many attempts, the number of neurons in the hidden layer was set at 40. The output layer of the NN model contains a neuron that represents the expected deviation of the nuclear binding energy. The structure of the NN model in this study is shown in Fig. 2.Figure 2. (color online) A feedforward neural network consisting of six input neurons, a hidden layer and an output neuron.
We can use a function that expresses the relationship between the input and output for an ANN with a vector input x (
$ x_1, x_2,..., x_6 $ ) and an output scalar y. It can be written as [22]:$ y = a + \sum\limits_{j = 1}^{H} b_j{\rm tanh}\left(c_j+\sum\limits_{i = 1}^{n} d_{ji}x_i\right). $
(3) where the free parameters are
$ \omega = (a, b_j, c_j, d_{ji}) $ ; H is the number of neurons in the hidden layer; n is the input unit number;$ x_i $ is the ith input unit;$ d_{ji} $ is the weight parameter between the ith input unit and the jth hidden neuron:$ c_j $ is the bias of the jth hidden neuron;$ b_j $ is the weight parameter between the jth hidden neuron and the output neuron; and a is the bias of the output neuron. For the six input variables, the function in Eq. (3) has 1 + (2+n)H parameters.Our goal is to obtain the minimum RMSD, so weights and biases need to be adjusted during training. In the learning stage of this work, we used a back propagation algorithm with Levenberg–Marquardt (LM) to change the connections between neurons so as to obtain the consistency between the neural network output and the expected output [33-35]. The LM algorithm is an iterative algorithm that can be used to solve the least square problem. The weight calculation formula of LM is as follows:
$ \omega_{i+1} = \omega_{i}-(J^T_iJ_i+\mu_iI)^{-1}J^T_ie_i,$
(4) where
$ w_{i+1} $ is a trial solution estimated by the$ i $ th iteration solution. J is the Jacobian matrix of output error. I is the identity matrix. μ is a learning parameter (so called LM parameter). LM parameters have good operability and controllability, making the algorithm more stable and reliable. Now we can train the difference between the binding energy of the LDM and the experimental values. To illustrate the advantages of the six inputs (marked as$ \delta B_{\rm LDM}^{\rm NN-6} $ ), we also compare the results using four input neurons (there are three cases with four inputs: ①$ Z,\; A,\; Z-Z_0, N-N_0 $ , marked as$ \delta B_{\rm LDM}^{\rm NN-4_{\rm LS}} $ ; ②$ Z,\; A,\; Z_1-Z, $ $ N_1-N, $ marked as$ \delta B_{\rm LDM}^{\rm NN-4_{\rm NS}} $ ; ③$Z, \;A, \; {\rm min}(Z-Z_0, Z_1-Z), $ $ \min(N-N_0, N_1-N)$ , marked as$ \delta B_{\rm LDM}^{\rm NN-4} $ ) and using two inputs (Z, A, marked as$ \delta B_{\rm LDM}^{\rm NN-2} $ ), which will be discussed in detail in the next section. -
Based on the previous instructions, we can now calculate the binding energy of the nuclei using the NN approach. Previously, we used 2314 nuclei experimental data from AME2016 to fit the parameters of Eq. (2). Using the NN method to train, the data should be divided into two parts, a learning set and a verification set. We randomly selected 414 experimental data as the verification set, and the remaining 1900 data as the learning set. The effect of hyperparameters can be removed by human adjustment. When the hidden layer neurons are fixed, we can tune the neural network model by adjusting the learning rate and the number of training sessions. For different input neurons, we can always find a suitable set of hyperparameters, but these hyperparameter sets are different. When the minimum root mean square deviation of the validation group occurs during the training process, the result at this point is considered to be the best result, rather than the minimum overall RMSD. This prevents over-fitting results. It should be noted that we have tried random selection several times and found little change. Here's the best one.
After using the NN approach, the RMSD of the LDM decreased significantly, especially when six input neurons were considered. The RMSD decreased from 2.385 to 0.203 MeV. The difference between the experimental binding energy and the corresponding calculated binding energy is shown in Fig. 1. In Table 2, we list the results of the NN method learning for the five variable input sets. From Table 2, we can clearly see that it is not sufficient to consider the effect of only one magic number on the shell correction. In all cases, the results with four inputs are not as good as with six inputs. We also show in Figs. 3–5 for each case how much the NN method results deviated from the experimental values. From Fig. 3 we can see, for nuclei near the magic number and nuclei in the light nucleus area, when only using the proton number and the mass number as input neurons in the NN method, the effect of the correction is poor. The reason is that shell effects are not taken into account. This situation is improved after considering the shell effect. From Fig. 4, we can see that the results have significantly improved. But there are still a lot of highly deviated nuclei in the light nucleus area. This is also a problem that MMM needs to solve. The present results can be significantly improved by using the NN method. Table 3 gives the accuracy of the description of the experimental masses by each of the models for five regions of nuclei: global (
$ Z,\;N\geqslant 8 $ ), light ($ 8\leqslant Z<28,\;N\geqslant 8 $ ), medium-I ($ 28\leqslant Z<50 $ ), medium-II ($ 50\leqslant Z<82 $ ), and heavy ($ Z\geqslant 82 $ ) nuclei.Model $ \sigma_{\rm pre} $ /MeV$ \sigma_{\rm NN} $ /MeVΔσ (%) Learning set $ B_{\rm LDM}^{\rm NN-2} $ 2.388 0.383 83.96 $ B_{\rm LDM}^{\rm NN-4_{\rm LS}} $ 2.388 0.235 90.16 $ B_{\rm LDM}^{\rm NN-4_{\rm NS}} $ 2.388 0.231 90.33 $ B_{\rm LDM}^{\rm NN-4} $ 2.388 0.216 90.95 $ B_{\rm LDM}^{\rm NN-6} $ 2.388 0.196 91.79 Validation set $ B_{\rm LDM}^{\rm NN-2} $ 2.375 0.386 83.75 $ B_{\rm LDM}^{\rm NN-4_{\rm LS}} $ 2.375 0.275 88.42 $ B_{\rm LDM}^{\rm NN-4_{\rm NS}} $ 2.375 0.255 89.26 $ B_{\rm LDM}^{\rm NN-4} $ 2.375 0.246 89.64 $ B_{\rm LDM}^{\rm NN-6} $ 2.375 0.231 90.27 Table 2. RMSD (MeV) of binding energy in light nuclei region calculated by different models.
Model LDM $ B_{\rm LDM}^{\rm NN-2} $ $ B_{\rm LDM}^{\rm NN-4_{\rm LS}} $ $ B_{\rm LDM}^{\rm NN-4_{\rm NS}} $ $ B_{\rm LDM}^{\rm NN-4} $ $ B_{\rm LDM}^{\rm NN-6} $ WS4 [14] FRDM [8] KTUY [9] Global RMSD 2.385 0.384 0.242 0.236 0.221 0.203 0.288 0.584 0.704 Light RMSD 1.58 0.86 0.44 0.42 0.41 0.35 0.42 1.11 0.69 Medium-I RMSD 2.47 0.35 0.25 0.24 0.23 0.22 0.30 0.62 0.78 Medium-II RMSD 2.32 0.21 0.18 0.17 0.15 0.15 0.25 0.37 0.54 Heavy RMSD 2.79 0.25 0.15 0.16 0.15 0.13 0.24 0.37 0.88 Table 3. RMSD (MeV) calculated for the global (
$ Z,N\geqslant 8 $ ), light ($ 8\leqslant Z<28,N\geqslant 8 $ ), medium-I ($ 28\leqslant Z<50 $ ), medium-II ($ 50\leqslant Z<82 $ ), and heavy ($ Z\geqslant 82 $ ) nuclei, with the use of the specified models.One can see in Table 3 that for each region of the nuclear chart, the RMSD changes quite strongly from one model to another. A strong dependence of the accuracy of the description of the mass on the region of nuclei considered is obtained for each model. In all regions considered, the best accuracy is obtained by the
$ B_{\rm LDM}^{NN-6} $ . One also can see similar patterns to the MMM of the shell correction calculated by the Strutinsky method. The improved result of the NN method for heavy nuclei is more obvious. If the shell effect is not considered in the input neurons, the correction results in the light nucleus region are poor. Different considerations in input would bring different results. From Table 3, we can clearly find that the shell closer to the nucleus has a greater impact on the nucleus, which is in line with the physical situation. However, if the input neurons are also considered to be a more distant shell, the results can be improved.Since the correction results of the NN method have such high accuracy for nuclei with known experimental binding energy, we should also verify the reliability of the NN method's extrapolation. Recently, the RIKEN RI Beam Factory used the time-of-flight magnetic rigidity technique to direct mass measurements of neutron-rich scandium, titanium and vanadium isotopes around the neutron number 40. The experimental results show that the two-neutron separation energy near 62Ti is increased compared with adjacent nuclei. In Ref. [30], we can find that the LZU model [36] has a slightly lower description of the two-neutron separation energy near 62Ti.
$ \rm{WS4^{RBF}} $ [14] also gives a similar result to LZU. In WS-type model [11–14, 36], the macroscopic part is expressed as$ E_{LD}\prod(1+b_{k}\beta_{k}^2) $ and the microscopic part is the shell correction calculated by the Strutinsky method. The above parabolic approximation to the change of macroscopic energies with$ \beta_{k} (k = 2,3....) $ is acceptable near the ground state. In Ref. [21], R. Utama used a Bayesian NN approach to refine the Duflo–Zuker (DZ) model and the Hartree–Fock–Bogoliubov (HFB) model. The two-neutron separation energies in 61Ti and 62Ti are also smaller than the experimental values in DZ-BNN and HFB-BNN [21]. To verify the extrapolation ability of the NN method proposed in this paper, in addition to the RMSD, we also calculated the double neutron separation energy measured in Ref. [30]. The result of the calculation is shown in Fig. 6. As we can see in Fig. 6, compared with several other methods, NN-6 has a clear advantage, both in terms of trend and value. We have therefore chosen to compare the results of the two-neutron separation energy calculated by NN-6 with those of the LZU model. The results are plotted in Fig. 7. The experimental results and error bars are given in solid circles. The settlement result of the NN method is a solid square, and the dotted line represents the calculation result of the LZU model. From Fig. 7, we can clearly see that the computational results of the NN method reproduce the experimental values very well, especially in 61Ti and 62Ti. It should be noted that there is a large error in the experimental two neutron separation energy of 60Sc, and our result also has a large deviation from the experimental result. We believe that the shell effect like 62Ti should occur when the neutron number approaches N = 40 and scandium is no exception. Therefore, we believe that more accurate experimental measurement is needed for 60Sc. Other than that, almost all the other data falls within the error bar. We also listed the comparison between the binding energy of Ti in Ref. [30] and the calculation results of different models in Table 4. It can be seen that the trained NN model gives the best results.Figure 6. (color online) Two-neutron separation energy calculated by using the different NN approaches compared with the results in Ref. [30].
Figure 7. (color online) Two-neutron separation energy calculated by using the NN approach compared with the results in Ref. [30]. The solid circle is the experimental value in Ref. [30], the solid square is the calculation result of the work in this paper, and the dashed line is the calculation result of the LZU model. Different colors are used to represent different isotopic chains.
Model Ref. [30] AME2016 $B_{\rm LDM}^{\rm NN-6}$ WS4 LZU $^{62}{\rm Ti}$ 496.51 495.69 496.12 495.03 495.09 $ ^{61}{\rm Ti} $ 491.32 491.48 491.32 491.10 491.27 $ ^{60}{\rm Ti} $ 489.17 489.42 489.07 489.37 489.33 $ ^{59}{\rm Ti} $ 484.20 484.51 484.30 484.99 484.84 $ ^{58}{\rm Ti} $ 481.87 482.04 482.08 482.75 482.44 Table 4. Comparison of Ti isotopic chain binding energies (MeV) calculated by different models.
In addition, the latest quality assessment data, AME2020 [37], has been published. To further verify whether the work in this paper has extrapolation ability, we selected 78 updated nuclear binding energies from AME2020. The selected nuclei are either marked with
$ \# $ (value and uncertainty derived not from purely experimental data, but at least partly from the trends from the mass surface) or have a deviation greater than 150 keV in AME2016. Similar to the previous discussion, we classified these nuclei into global ($ Z,\;N\geqslant 8 $ ), light ($ 8\leqslant Z<28, $ $ \;N\geqslant 8 $ ), medium-I ($ 28\leqslant Z<50 $ ), medium-II ($ 50\leqslant Z<82 $ ), and heavy ($ Z\geqslant 82 $ ) nuclei. For$ B_{\rm LDM}^{\rm NN-6} $ , their RMSD were global nuclei (78) 0.455 MeV, light nuclei (23) 0.607 MeV, medium-I nuclei (11) 0.323 MeV, medium-II nuclei (28) 0.424 MeV and heavy nuclei (16) 0.303 MeV. For$ B_{\rm LDM}^{\rm NN-2} $ ,$ B_{\rm LDM}^{\rm NN-4_{LS}} $ ,$ B_{\rm LDM}^{\rm NN-4_{NS}} $ and$ B_{\rm LDM}^{\rm NN-4} $ , their RMSD of global nuclei (78) were 0.677, 0.513, 0.426 and 0.605 MeV, respectively. Although the accuracy of the prediction is not as high as previous predictions, the results are acceptable. These results prove that the prediction of the modified model by the NN method is relatively reliable in the region near the known nucleus.NN-6 has more parameters. In order to eliminate the influence of parameter number, we increase the number of hidden neurons in NN-4 (H = 54). In this way, NN-6 and NN-4 (H = 54) are close in number of parameters, and the results are calculated and compared (the data in parentheses are the results for NN-6). For NN-4 (H = 54), the total RMSD is 0.209 (0.203) MeV, the RMSD of the training set is 0.201 (0.196) MeV and the RMSD of the validation set is 0.241 (0.231) MeV, 0.39 (0.35) MeV for the light nucleus region, 0.21 (0.22) MeV for Medium-I, 0.15 (0.15) MeV for Medium-II, and 0.14 (0.13) MeV for the heavy nucleus region. For the 78 nuclei added in AME2020, the RMSD is 0.564 (0.455) MeV. The computational results show that increasing the number of hidden layer neurons does reduce the total RMSD. Although the bias between the validation and training sets becomes larger, it is within an acceptable range. The extrapolation results also improved, but the results for the dual neutron separation energy were still poor. Even if the number of parameters is close, NN-6 has better extrapolation results with lower RMSD, which may be the advantage of using more input neurons.
However, if the inferred distance is too far from the known region, new physical effects may be generated, which are hidden in the unknown region and therefore cannot be detected by training the NN model with the known data. The calculation results of α decay energy from superheavy nuclei illustrate this point. In Ref. [38], the experimental values of alpha decay energy for 44 superheavy nuclei are listed. Neither these superheavy parent nuclei nor their decayed daughter nuclei are among the 2314 nuclei we used before. We present the comparison of the calculated results with the experimental values in Table 5. As can be seen from Table 5, the calculated results are relatively accurate when the proton number is less than 112, but when the proton number is greater than or equal to 112, the results show a large deviation. The reason may be that not enought data of superheavy nuclei is included in the training, and the maximum number of protons among the 2314 nuclei involved in the training is 110. Thus, it can be seen that the correction results of the NN method depend on the available experimental data. More data facilitates the training results and allows more accurate prediction of the binding energy of nuclei near known regions.
Z A $ Q_{\alpha} $ (Our work)$ Q_{\alpha} $ (Ref. [38])δ 118 294 12.47 11.82 0.65 117 294 12.06 11.18 0.88 117 293 12.44 11.32 1.12 116 293 11.95 10.71 1.24 116 292 12.36 10.78 1.58 116 291 12.64 10.89 1.75 116 290 12.80 11.00 1.80 115 290 12.41 10.41 2.00 115 289 12.49 10.49 2.00 115 288 12.47 10.65 1.82 115 287 12.38 10.76 1.62 114 289 12.06 9.98 2.08 114 288 12.07 10.07 2.00 114 287 11.97 10.17 1.80 114 286 11.83 10.35 1.48 114 285 11.72 10.56 1.16 113 286 11.37 9.79 1.58 113 285 11.21 10.01 1.20 113 284 11.12 10.12 1.00 113 283 11.10 10.38 0.72 113 282 11.16 10.78 0.38 112 285 10.70 9.32 1.38 112 283 10.49 9.66 0.83 112 281 10.63 10.45 0.18 111 282 9.85 9.16 0.69 111 281 9.92 9.41 0.51 111 280 10.08 10.15 −0.07 111 279 10.28 10.53 −0.25 111 278 10.49 10.85 −0.36 110 281 9.20 8.85 0.35 110 279 9.50 9.85 −0.35 110 277 10.00 10.71 −0.71 109 278 8.91 9.58 −0.67 109 276 9.47 10.10 −0.63 109 275 9.74 10.48 −0.74 109 274 9.99 10.20 −0.21 108 275 8.92 9.45 −0.53 108 273 9.50 9.67 −0.17 107 274 8.37 8.94 −0.57 107 272 8.97 9.21 −0.24 107 271 9.23 9.42 −0.19 107 270 9.46 9.06 0.40 106 271 8.44 8.67 −0.23 106 269 8.95 8.63 0.32 Table 5. Comparison of α decay energy calculated using the NN method with experimental values.
A neural network approach based on more input neurons to predict nuclear mass
- Received Date: 2021-08-23
- Available Online: 2022-04-15
Abstract: The study of nuclear mass is very important, and the neural network (NN) approach can be used to improve the prediction of nuclear mass for various models. Considering the number of valence nucleons of protons and neutrons separately in the input quantity of the NN model, the root-mean-square deviation of binding energy between data from AME2016 and liquid drop model calculations for 2314 nuclei was reduced from 2.385 MeV to 0.203 MeV. In addition, some defects in the Weizsäcker–Skyrme (WS)-type model were repaired, which well reproduced the two-neutron separation energy of the nucleus synthesized recently by RIKEN RI Beam Factory [Phys. Rev. Lett. 125, (2020) 122501]. The masses of some of the new nuclei appearing in the latest atomic mass evaluation (AME2020) are also well reproduced. However, the results of neural network methods for predicting the description of regions far from known atomic nuclei need to be further improved. This study shows that such a statistical model can be a tool for systematic searching of nuclei beyond existing experimental data.