Classifying cosmic-ray proton and light groups in LHAASO-KM2A experiment with graph neural network

Chao Jin; Song-zhan Chen; Hui-hai He; (for the LHAASO Collaboration)

doi:10.1088/1674-1137/44/6/065002

Chinese Physics C> 2020, Vol. 44> Issue(6) : 065002 DOI: 10.1088/1674-1137/44/6/065002

Classifying cosmic-ray proton and light groups in LHAASO-KM2A experiment with graph neural network

1.
Key Laboratory of Particle Astrophysics, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
2.
University of Chinese Academy of Sciences, 19 A Yuquan Rd, Shijingshan District, Beijing 100049, China

Abstract
HTML
Reference
Related

PDF

Abstract：
The precise measurement of cosmic-ray (CR) knees of different primaries is essential to reveal CR acceleration and propagation mechanisms, as well as to explore new physics. However, the classification of CR components is a difficult task, especially for groups with similar atomic numbers. Given that deep learning achieved remarkable breakthroughs in numerous fields, we seek to leverage this technology to improve the classification performance of the CR Proton and Light groups in the LHAASO-KM2A experiment. In this study, we propose a fused graph neural network model for KM2A arrays, where the activated detectors are structured into graphs. We find that the signal and background are effectively discriminated in this model, and its performance outperforms both the traditional physics-based method and the convolutional neural network (CNN)-based model across the entire energy range.
- cosmic ray knee ,
- graph neural network

References

[1]	A. Krizhevsky, I. Sutskever, and G. E. Hinton, Advances in neural information processing systems,1097-1105 (2016)
[2]	J. Redmon, S. Divvala, R. Girshick et al., IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (Las Vegas, NV, 2016), 779-788
[3]	S. Ren, K. He, R. Girshick et al., IEEE Transactions on Pattern Analysis and Machine Intelligence, 39: 1137-1149 (2017)
[4]	M. T. Luong, H. Pham, and C. D. Manning, the Conference on Emprirical Methods in Natural Language Processing, 1412-1421 (2015)
[5]	Y. Wu, M. Schuster, Z. Chem et al., arXiv: 1609.08144
[6]	G. Hinton, L. Deng, D. Yu et al., IEEE Signal processing magazine, 29: 82-97 (2012)
[7]	A. Hannun, C. Case, J. Casper et al., arXiv: 1412.5567
[8]	D. Amodei, S. Ananthanarayanan, R. Anubhai et al., International conference on machine learning,173-182 (2016)
[9]	Y. LeCun and Y. Bengio, Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, 1995, 3361(10): 1995
[10]	K. He, X. Zhang, S. Ren et al., Proceedings of the IEEE conference on computer vision and pattern recognition, 770-778 (2016)
[11]	D. E. Rumerlhar, Nature, 323: 533-536 (1986) doi: 10.1038/323533a0
[12]	S. Hochreiter and J. Schmidhuber, Neural computation, 9(8): 1735-1780 (1997) doi: 10.1162/neco.1997.9.8.1735
[13]	K. Cho, B. Van Merrinboer, C. Gulcehre et al., arXiv: 1406.1078
[14]	R. Berg, T. N. Kipf, and M. Welling, arXiv: 1706.02263
[15]	F. Monti, M. Bronstein, X. Bresson, Advances in Neural Information Processing Systems,3697-3707 (2017)
[16]	J. Gilmer, S. S. Schoenholz, P. F. Riley et al., Proceedings of the 34th International Conference on Machine Learning, 70: 1263-1272 (2017)
[17]	N. Choma, F. Monti, L. Gerhardt, et al., 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), 386-391 (2018)
[18]	M. Abdughani, J. Ren, L. Wu et al., arXiv: 1807.09088
[19]	J. Arjona Martinez, J. R. Vlimant, M. Spiropulu, et al., arXiv: 1810.07988
[20]	G. V. Kulikov and G. B. Khristiansen, Sov. Phys. JETP, 35(8): 441-444 (1959)
[21]	C. Jin, L. Q. Yin, S. Chen et al., Radiation Detection Technology and Methods, 3(3): 19 (2019) doi: 10.1007/s41605-019-0097-z
[22]	B. Bartoli, P. Bernardini, X. J. Bi et al., Physical Review D, 92(9): 092005 (2015) doi: 10.1103/PhysRevD.92.092005
[23]	M. Amenomori, S. Ayabe, D. Chen et al., Physics Letters B, 632(1): 58-64 (2006) doi: 10.1016/j.physletb.2005.10.048
[24]	M. Bertaina, W. D. Apel, J. C. Arteaga-Velazquez et al., Nuclear Physics B-Proceedings Supplements, 256: 149-160 (2014)
[25]	W. D. Apel, J. C. Arteaga-Velazquez, K. Bekk et al., Physical Review Letters, 107(17): 171104 (2011) doi: 10.1103/PhysRevLett.107.171104
[26]	H. H. He, LHAASO collaboration, Radiation Detection Technology and Methods, 2(1): 7 (2018) doi: 10.1007/s41605-018-0037-3
[27]	L. Yin, Z. Cao, S. S. Zhang et al., Accurate Measurement of the Cosmic Ray Proton Spectrum from 100 TeV to 10 PeV with LHAASO, PoS,508 (2017)
[28]	L. Q. Yin, S. S. Zhang, Z. Cao et al., arXiv: 1904.09130
[29]	P. K. Grieder, Extensive air showers, Berlin: Springer, 2010
[30]	A. Haungs, Journal of Physics G: Nuclear and Particle Physics, 29(5): 809 (2003) doi: 10.1088/0954-3899/29/5/303
[31]	Z. Wu, S. Pan, F. Chen et al., arXiv: 1901.00596
[32]	D. I. Shuman, S. K. Narang, P. Frossard et al., arXiv: 1211.0053
[33]	F. R. K. Chung and F. C. Graham, Spectral graph theory, American Mathematical Soc., (1997)
[34]	J. Bruna, W. Zaremba, A. Szlam et al., arXiv: 1312.6203
[35]	M. Defferrard, X. Bresson, and P. Vandergheynst, dvances in neural information processing systems,3844-3852 (2016)
[36]	T. N. Kipf and M. Welling, arXiv: 1609.02907
[37]	J. Masci, D. Boscaini, M. Bronstein et al., Proceedings of the IEEE international conference on computer vision workshops. 37-45 (2015)
[38]	D. Boscaini, J. Masci, E. Rodola et al., Advances in Neural Information Processing Systems,3189-3197 (2016)
[39]	F. Monti, D. Boscaini, J. Masci et al., Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5115-5124 (2017)
[40]	K. Greisen, Annual Review of Nuclear Science, 10(1): 63-108 (1960) doi: 10.1146/annurev.ns.10.120160.000431
[41]	K. Kamata and J. Nishimura, Progress of Theoretical Physics Supplement, 6: 93-155 (1958) doi: 10.1143/PTPS.6.93
[42]	D. Heck, G. Schatz, J. Knapp et al., CORSIKA: a Monte Carlo code to simulate extensive air showers, (1998)
[43]	S. Z. Chen, J. Zhao, Y. Liu et al., Nuclear Electronics & Detection Technology, 37(11): 1101 (2017)
[44]	S. Agostinelli, J. Allison, K. Amako et al., GEANT4: a simulation toolkit, Nuclear instruments and methods in physics research section A: Accelerators, Spectrometers, Detectors and Associated Equipment, 506(3): 250-303 (2003)
[45]	D. P. Kingma and J. Ba, arXiv: 1412.6980
[46]	J. R. Hoerandel, Astroparticle Physics, 19(2): 193-220 (2003) doi: 10.1016/S0927-6505(02)00198-6
[47]	S. Ter-Antonyan, Physical Review D, 89(12): 123003 (2014) doi: 10.1103/PhysRevD.89.123003
[48]	C. Jin, W. Liu, H. B. Hu and Y. Q. Guo, Physical Review D, 97: 123005 (2018) doi: 10.1103/PhysRevD.97.123005
[49]	C. Li, H. H. He, G. Xiao et al., Physical Review D, 98(4): 042001 (2018) doi: 10.1103/PhysRevD.98.042001

[1]	Tian-Lu Chen , Wei Liu , Qi Gao , Mao-Yuan Liu , Hai-Jin Li , Danzengluobu , Ying Shi . Cosmic ray electron spectrum due to the dispersion ofinjection spectrum. Chinese Physics C, 2018, 42(7): 075001. doi: 10.1088/1674-1137/42/7/075001
[2]	Yi-Qing Guo , Qiang Yuan . On the knee of Galactic cosmic rays in light of sub-TeV spectral hardenings. Chinese Physics C, 2018, 42(7): 075103. doi: 10.1088/1674-1137/42/7/075103
[3]	Masoumeh Mohamadian , Hossein Afarideh , Mitra Ghergherehchi . Optimized feed-forward neural-network algorithm trained for cyclotron-cavity modeling. Chinese Physics C, 2017, 41(1): 017003. doi: 10.1088/1674-1137/41/1/017003
[4]	Wen-Hui Lin , Bi-Wen Bao , Ze-Jun Jiang , Li Zhang . A possible explanation of the knee of cosmic light component spectrum from 100 TeV to 3 PeV. Chinese Physics C, 2017, 41(10): 105101. doi: 10.1088/1674-1137/41/10/105101
[5]	Yong-ji Xie , Zhong-hua Qin , Xiao-yan Ma , Jian Zhang , Ling-hui Wu , Wan Xie , Ming-yi Dong , Jing Dong , Xiao-lu Ji , Xiao-shan Jiang , Qun Ou-yang , Ke-jun Zhu , Yuan-bo Chen . Construction and cosmic-ray test of the new inner drift chamber for BESIII. Chinese Physics C, 2016, 40(9): 096003. doi: 10.1088/1674-1137/40/9/096003
[6]	Cai-Xun Zhang , Shin-Ted Lin , Jian-Ling Zhao , Xun-Zhen Yu , Li Wang , Jing-Jun Zhu , Hao-Yang Xing . Discrimination of neutrons and γ-rays in liquid scintillator based on Elman neural network. Chinese Physics C, 2016, 40(8): 086204. doi: 10.1088/1674-1137/40/8/086204
[7]	CHEN Tian-Xiang , LI Cheng , SUN Yong-Jie , CHEN Hong-Fang , SHAO Ming , TANG Ze-Bo , YANG Rong-Xing , ZHOU Yi , ZHANG Yi-Fei . A cosmic ray test platform based on high time resolution MRPC technology. Chinese Physics C, 2015, 39(5): 056003. doi: 10.1088/1674-1137/39/5/056003
[8]	ZHANG Fei , FAN Rui-Rui , PENG Wen-Xi , DONG Yi-Fan , GONG Ke , LIANG Xiao-Hua , LIU Ya-Qing , WANG Huan-Yu . A prototype silicon detector system for space cosmic-ray charge measurement. Chinese Physics C, 2014, 38(6): 066101. doi: 10.1088/1674-1137/38/6/066101
[9]	FU Zai-Wei , QIAN Sen , NING Zhe , LIU Shu-Dong , CHEN Xiao-Hui , HENG Yue-Kun , WANG Yi-Fang , QI Ming , YANG Shuai , SUN Yong-Jie , SHAO Ming , LI Cheng , ZHENG Yang-Heng . Cosmic ray test for a T0 detector. Chinese Physics C, 2011, 35(10): 946-951. doi: 10.1088/1674-1137/35/10/011
[10]	YUE Ke , YU Yu-Hong , XU Hu-Shan , SUN Zhi-Yu , WANG Jian-Song , XIAO Zhi-Gang , HU Zheng-Guo , CHEN Ruo-Fu , ZHANG Xue-Ying , TU Xiao-Lin , CHEN Jun-Ling , ZHAN Wen-Long . Characteristics of the sampling paddle modules of neutron wall from the cosmic ray test. Chinese Physics C, 2010, 34(8): 1111-1115. doi: 10.1088/1674-1137/34/8/014
[11]	WU Zhi , LIU Jian-Bei , QIN Zhong-Hua , WU Ling-Hui , CHEN Chang , CHEN Yuan-Bo , CHEN Ma-Li , CHEN Xi-Hui , DONG Ming-Yi , GUAN Bei-Ju , HUANG Jie , JIANG Xiao-Shan , JIN Yan , LI Fei , LI Ren-Ying , LI Xiao-Nan , LEI Guang-Kun , LIU Rong-Guang , LUO Xiao-Lan , MA Xiao-Yan , SHENG Hua-Yi , SUN Han-Sheng , TANG Xiao , WANG Lan , WANG Liang , XU Mei-Hang , ZHANG Jian , ZHANG Hong-Yu , ZHANG Yin-Hong , ZHAO Yu-Bin , ZHAO Ping-Ping , ZHU Ke-Jun , ZHU Qi-Ming , ZHUANG Bao-An . Tuning of the cosmic-ray test system of the BESⅢ drift chamber. Chinese Physics C, 2010, 34(7): 983-987. doi: 10.1088/1674-1137/34/7/010
[12]	YAN Jie , SUN Sheng-Sen , LI Cheng , HE Kang-Lin . BESⅢ barrel time-of-flight (TOF) calibration using cosmic ray data. Chinese Physics C, 2010, 34(3): 368-373. doi: 10.1088/1674-1137/34/3/012
[13]	LIANG Yu-Tie , MAO Ya-Jun , YOU Zheng-Yun . Study of BESⅢ MUC offline software with cosmic-ray data. Chinese Physics C, 2009, 33(7): 562-566. doi: 10.1088/1674-1137/33/7/011
[14]	WANG Si-Guang , MAO Ya-Jun , YE Hong-Xue . An artificial neural network for proton identification in HERMES data. Chinese Physics C, 2009, 33(3): 217-223. doi: 10.1088/1674-1137/33/3/011
[15]	Liu Shaomin , Ding Linkai , Shi Ce , Zhaxiciren , Zhaxisanzhu , Mu Jun , Wnag Hui , Lu Hong , Feng Zhenyong , Ren Jingru , Yu Guangce , Zhou Wende , Labaciren , Meng Xianru , Meng Lie , Zhang Jilong , Zhang Chunsheng , Zhang Huimin , Shi Zhizheng , Jia Huanyiu , Mei Dongming , Huang Qing , Tan Youheng , Huo Anxiang , Dai Benzhong . Correlation Between Sun Shadows of 10 TeV Cosmic Ray and the Solar Activity. Chinese Physics C, 1997, 21(S4): 11-18.
[16]	Jiang Yinlin . Atmospheric Cherenkov Images of Cosmic Ray Muons. Chinese Physics C, 1995, 19(S1): 5-11.
[17]	Luo Guangxuan , Tan Youheng , Zhang Chunsheng , Dong Yuju , Yuan Peng , Zhang Huimin , Wang Hui , Yuang Yukui , Li Jing . Energy Spectrum of Primary Cosmic Rays at the “Knee” Region Observed with the Huairou Air Shower Array. Chinese Physics C, 1995, 19(S4): 325-331.
[18]	Cao Zhen , Ding Linkai . Generator of QCD Parton Model for the Cosmic Ray Ultrahigh Energy Interaction. Chinese Physics C, 1994, 18(S4): 321-332.
[19]	Lu Suiling , Ren Jingru , Su Shi , Wang Chengrui , He Mao . Variation of the Intensity of High Energy Cosmic Ray Hadrons with Altitude. Chinese Physics C, 1992, 16(S2): 123-125.
[20]	Li Zhibing . (φφ) in the Tree Graph Approximation. Chinese Physics C, 1990, 14(S4): 349-351.

Access

Figures(9) / Tables(3)

Get Citation

Chao Jin, Song-zhan Chen and Hui-hai He. Classifying the Cosmic-Ray Proton and Light Groups on the LHAASO-KM2A Experiment with the Graph Neural Network[J]. Chinese Physics C. doi: 10.1088/1674-1137/44/6/065002

RIS(for EndNote,Reference Manager,ProCite)

BibTex

Txt

Milestone

Received: 2019-11-05
Revised: 2020-02-16

Article Metric

Article Views(3863)
PDF Downloads(57)
Cited by(0)

Policy on re-use

To reuse of subscription content published by CPC, the users need to request permission from CPC, unless the content was published under an Open Access license which automatically permits that type of reuse.

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

4. Experiment

We employ the Monte Carlo simulation to generate event data for training and evaluating KM2A GNN performance. The primary EAS events are generated by the CORSIKA package with the hadronic model QGSJETII [42]. The KM2A detector simulation is performed based on the Geant4 framework [43, 44]. We generate major CR groups including the Proton (P), Helium (He), medium group (CNO), heavy group (MgAlSi), and Iron (Fe). Total events are generated into four energy fragments, including 10 ~ 100 TeV, 100 ~ 1 PeV, 1 ~ 10 PeV, 10 ~ 100 PeV, with the spectral index of –2.7. Reconstructed energies from 100 TeV to 10 PeV are considered, which cover most of the CR knee region. For each task, these groups are divided into independent signal and background groups, where only P belongs to the signal for the P task, and P&He forms the signal for the L task.

After reconstruction of the simulated events [26], we further select events according to their reconstructed locations and directions. The reconstructed shower core spread inside the KM2A array within the distance 200 ~ 500 m from the array center is selected. We ignore the inner circular area (within 200 m) to suppress the disturbance from the WCDA for the KM2A reconstruction. Further, the reconstructed zenith angle below $ 35^{\circ} $ is also required. Consequently, 105732 events remain for the following analysis. We split the selected events into train, test, and evaluation data sets. In consideration for the data balance, the group ratios for each data set are readjusted to maintain roughly $ 1:1 $ signal-to-noise ratio (SNR). The readjusted data sets for each task are listed in Table 1. The dataset ratio between the two major energy fragments, with 100 TeV ~ 1 PeV and 1 ~ 10 PeV, is around $ 2:1 $.

data set	P		L
data set	signal	background	signal	background
train	14635	14595	24358	23733
test	2875	2831	4754	4713
evaluation	24921	22994	24921	22994

Table 1. Number of signal and background events for each dataset.

To train the GNN models, we employ supervised learning techniques with the mean square error (MSE) as the loss function. For each training epoch, the loss is calculated on the test dataset to avoid overfitting. The Adam [45] optimizer is used to optimize the model parameters based on adaptive estimation of low-order moments. The training procedure includes two steps, (i) two independent trainings for the GNN ED and MD models with the learning rate 0.001, and (ii) a subsequent fine-tuning procedure fuses the ED and MD model together with the learning rate 0.0001. It runs over a total of 80 epochs with the model already converged. All code is written in Python using the open-source deep learning framework PyTorch with GPU acceleration. For each model training, four identical candidates with different randomized weights are trained, and the one with the best performance is selected for further processing, which helps suppress the local optimization.

6. Conclusion

Deep learning has contributed extensively to significant progress in numerous fields. Therefore, we leverage this technology to improve classification performance in the LHAASO -KM2A experiment. We propose a fused GNN model, which constructs independent networks for the KM2A ED and MD arrays, and fuse their outputs for classification. This model is demonstrated to be effective, and its performance outperforms the traditional physics-based method as well as the CNN-based method over the entire energy range. Furthermore, we compare the performance of the GNN framework for independent ED and MD arrays. The ED array is found to behave better than the MD array. We attribute this to the higher density configuration of the ED array. Moreover, in comparison with the LHAASO hybrid detection method, our KM2A GNN model exhibits competitive classification performance. Owing to the large area and full duty cycle of the KM2A array, it can acquire statistics on the order of ~ 870× higher than the hybrid detection.

We thank the LHAASO Collaboration for their support on this project.

Reference (49)

	P	L
baseline	0.836	0.904
GNN MD	0.847	0.93
GNN ED	0.861	0.936
GNN ED+MD	0.878	0.959

	Purity (%) (+stat.+sys.)		Aperture (${\rm m^2 \cdot sr}$) (+stat.+sys.)
	P	L	P	L
handcraft (hybrid) [27]	~90	~95	~1.5e3	~4e3
GBDT (hybrid) [28]	~90	~97	~3.6e3	~7.2e3
baseline (KM2A)	73.4±2.5±2.4	93.20.9±1.1	3.2e5±1.3e3±1.0e4	6.3e5±2.7e3±7.6e3
CNN (KM2A)	75.4±2.5±2.4	93.3±0.9±1.1	3.2e5±1.3e3±1.0e4	6.3e5±2.7e3±7.6e3
GNN MD (KM2A)	77.1±2.3±2.5	95.9±0.6±1.2	3.2e5±1.3e3±1.0e4	6.3e5±2.7e3±7.6e3
GNN ED (KM2A)	82.8±1.9±2.6	96.6±0.6±1.2	3.2e5±1.3e3±1.0e4	6.3e5±2.7e3±7.6e3
GNN ED+MD (KM2A)	84 ±1.9±2.7	98.2±0.4±1.2	3.2e5±1.3e3±1.0e4	6.3e5 ±2.7e3±7.6e3

Classifying cosmic-ray proton and light groups in LHAASO-KM2A experiment with graph neural network

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Classifying cosmic-ray proton and light groups in LHAASO-KM2A experiment with graph neural network

Corresponding author: Chao Jin, jinchao@mail.ihep.ac.cn

HTML

3.1. Graph neural network overview

3.2. Graph neural network on LHAASO-KM2A

目录