-
The CEPC detector was designed to accomplish the physics goal that all final states can be identified and reconstructed with high resolution. The baseline detector concept is based on the particle flow approach (PFA) idea [15]. It comprises a precise vertex detector, a Time Projection Chamber (TPC), a silicon tracker, a high granularity Silicon-Tungsten sampling ECAL, and a GRPC-based high granularity hadronic calorimeter (HCAL). The whole system is embedded in a 3 Tesla magnetic field. The outermost part of the detector is a muon chamber. Further details can be found in Ref. [4].
The Higgs production mechanisms at the CEPC are Higgs-strahlung
$ e^{+} e^{-} \to ZH $ ,$W / Z$ fusion$ e^{+} e^{-} \to \nu \bar \nu H $ , and$ e^{+} e^{-} \to e^{+} e^{-} H $ , as illustrated in Fig. 1. In this analysis, Higgs production via ZH process decaying to diphoton final state$ e^{+}e^{-} \to ZH \to f\bar{f}\gamma\gamma $ at$ \sqrt{s}=240 $ GeV is considered the dominant signal. It is further divided into three sub-channels, depending on Z decaying to$ q\bar{q} $ ,$ \mu^{+}\mu^{-} $ , and$ \nu\bar{\nu} $ . The$ Z\to e^{+}e^{-} $ channel is dismissed owing to the well-known extremely large Bhabha background. Likewise, the$ Z\to \tau^{+}\tau^{-} $ channel is dismissed because of the complexity of τ identification. The$W / Z$ fusion process is considered in the ZH,$ Z\to\nu \bar \nu $ sub-channel. The only considered background process is the 2-fermion background$ e^{+} e^{-} \to f\bar {f} $ in CEPC with at least two photons from the initial and final state radiations. The Higgs resonant background, 4-fermion processes, and possible reducible background in the experiments are expected to be negligible. These SM physical processes are generated with Whizard [16] at leading order (LO) interfaced with Pythia 6 [17] for parton showering and hadronization, and parameters based on Large Electron Positron Collider (LEP) [18] data. Initial state radiation (ISR) and final state radiation (FSR) effects are taken into account. The total energy spread caused by beamstrahlung and synchrotron radiation was studied through Monte-Carlo simulation and determined to be 0.1629% at CEPC [19]. Table 1 lists the cross sections of physical processes and MC sample statistics used in the analysis. Event yields were normalized to 5.6 ab–1. Details on the configurations can be found in Ref. [20].Figure 1. Feynman diagrams of the Higgs boson production processes at the CEPC: (a)
$e^{+} e^{-} \to ZH$ , (b)$e^{+} e^{-} \to \nu \bar \nu H $ , and (c)$e^{+} e^{-} \to e^{+} e^{-} H $ .Process σ statistics $q\bar q\gamma \gamma$ sub-channel$ e^{+} e^{-} \to ZH \to q\bar q\gamma \gamma $ 0.31 fb 100 k $ e^{+} e^{-} \to q \bar{q} $ 54.1 pb 20 M ${\mu ^ + }{\mu ^ - }\gamma \gamma$ sub-channel$ e^{+} e^{-} \to ZH \to {\mu ^ + }{\mu ^ - }\gamma \gamma $ 0.15 fb 100 k $ e^{+} e^{-} \to {\mu ^ + }{\mu ^ - } $ 5.3 pb 20 M $\nu \bar \nu \gamma \gamma $ sub-channel$\begin{aligned}e^{+} e^{-} \to ZH \to \nu \bar \nu \gamma \gamma \\ e^{+} e^{-} \to \nu \bar \nu H \to \nu \bar \nu \gamma \gamma\end{aligned} $ 0.11 fb 100 k $ e^{+} e^{-} \to \nu \bar \nu $ 54.1 pb 20 M Table 1. Cross sections and simulated MC sample statistics. In the
$ q\bar q\gamma \gamma$ and$ {\mu ^ + }{\mu ^ - }\gamma \gamma $ channels,$ ZH$ is the only process considered, and in the$ \nu \bar \nu \gamma \gamma $ channel, both ZH$ Z\to inv. $ and$W / Z$ fusion processes are considered.The simulations of the detector configuration and response were conducted with MokkaPlus [21], a GEANT4 [22] based framework. The full detector simulation was performed for signal processing only. The background processes were simulated by smearing the truth particles with the parameterized detector resolution and efficiency to save computing resources.
-
The CEPC follows the PFA scheme for event reconstruction, with a dedicated tookit ARBOR [23, 24]. The tracks are first reconstructed with the hits in the tracking detector by the Clupatra module [25]. Then, ARBOR collects the tracks from Clupatra and hits in the calorimeter, and composes the Particle Flow Objects (PFOs) using clustering and matching modules. These PFOs are identified as charged particles, photons, neutral hadrons, and unassociated fragments. With this approach, a photon is identified in ARBOR with the shower shape variables obtained from the high granularity calorimeter, without any matched tracks. Converted photons are not considered yet; they amount to 5%–10% in the central region and 25% in the forward region [4]. The lepton (
$e^{\pm}, \mu^{\pm}$ ) is defined by a track-matched particle. A likelihood-based algorithm, namely LICH [26], is implemented in ARBOR to separate electrons, muons, and hadrons. Jets are formed from the particles reconstructed by ARBOR with the Durham clustering algorithm [27] after excluding the particles of interest. The jet energy is currently calibrated using MC simulation, but it is foreseen to be re-calibrated with physical events such as$ W\to q \bar{q} $ and/or$ Z\to q \bar{q} $ in CEPC. No flavor tagging approach was used in this analysis for simplicity.The event selections are applied to improve the signal significance and background modeling. Individual strategies are considered in the three sub-channels depending on the topology of the physical process. In the
$ ZH\to \nu \bar \nu \gamma \gamma $ channel, two photons are required inclusively in the final state. In the$ ZH\to {\mu ^ + }{\mu ^ - }\gamma \gamma $ channel, the two leading photons and two muons are exclusively selected, requiring a veto of other particles, with the missing energy$E_{\rm missing}$ and missing mass$M_{\rm missing}$ less than 10 GeV and the invariant mass of the muon pair close to the Z boson mass.In the
$ ZH\to q\bar{q}\gamma\gamma $ channel, two leading photons are first selected, and other particles are reconstructed into two jets using the Durham algorithm. Some dedicated cuts are applied on the kinematic variables of these final state objects as listed in Tables 2, 3, 4, along with the final efficiency and expected event yields.Selections Higgs signal $q\bar q\gamma \gamma$ backgroundExclusive 2 jets and 2 photons 85.56% 69.57% $ E_{\gamma 1}> $ 25 GeV100.00% 2.35 % $ E_{\gamma 2} \in [35,95] $ GeV98.37% 35.33% $ \cos\theta_{\gamma \gamma }> $ –0.9595.20% 68.01% $ \cos\theta_{jj}> $ –0.9590.86% 85.54% $ pT_{\gamma 1}>$ 20 GeV93.42% 56.94% $ pT_{\gamma 2}>$ 30 GeV93.25% 54.54% $ m_{\gamma \gamma } \in [110,140]$ GeV97.50% 21.14% $ E_{\gamma \gamma }> $ 120 GeV99.47% 98.41% $\min|\cos\theta_{\gamma j}| < 0.9$ 71.67% 48.05% Total eff 44.08% 0.01% Yields in 5.6 ab−1 766.64 26849.38 Table 2. Selection criteria and corresponding efficiencies in the
$q\bar q\gamma \gamma$ channel.$ \gamma 1 (\gamma 2) $ is defined as the photon with lower (higher) energy,$ \cos\theta_{\gamma \gamma } (\cos\theta_{jj}) $ is the polar angle of the di-photon (di-jet) system, and$\min|\cos\theta_{\gamma j}|$ is the minimum$ \cos\theta $ of the photon-jet pairs.Selections Higgs signal ${\mu ^ + }{\mu ^ - }\gamma \gamma$ backgroundExclusive 2 muons and 2 photons 70.18% 5.18% $ E_{\gamma}> 35$ GeV99.21% 8.39% $ |\cos\theta_{\gamma}|< $ 0.983.79% 38.14% $ pT_{\gamma 1} \in [10,70]$ GeV99.84% 86.30% $ pT_{\gamma 2} \in [30, 100] $ GeV99.96% 95.59% $ m_{\gamma \gamma } \in [110,140] $ GeV98.08% 37.62% $M_{\gamma \gamma}^{\text {recoil }} \in[85, 105]$ GeV80.12% 21.29% $E_{\gamma \gamma} \in[125, 145]$ GeV99.88% 95.86% Total eff 45.69% 0.01% Yields in 5.6 ab−1 39.32 2662.77 Table 3. Selection criteria and corresponding efficiencies in the
${\mu ^ + }{\mu ^ - }\gamma \gamma$ channel.$ \gamma 1 (\gamma 2) $ is defined as the photon with lower (higher) energy;$M_{\gamma \gamma }^{\rm recoil}$ is the recoil mass of the di-photon system in CEPC$\sqrt{s}=240 \; \mathrm{GeV}:\left(M_{\gamma \gamma}^{\text {recoil }}\right)^2=\left(\sqrt{s}-E_{\gamma \gamma}\right)^2-p_{\gamma \gamma}^2= $ $ s-2 E_{\gamma \gamma} \sqrt{s}+m_{\gamma \gamma}^2$ .Selections Higgs signal $\nu \bar \nu \gamma \gamma $ backgroundInclusive 2 photons 85.51% 0.34% $ E_{\gamma \gamma } > $ 30 GeV99.81% 20.13% $ |\cos\theta_{\gamma}|< $ 0.870.48% 11.56% $ pT_{\gamma}> $ GeV99.97% 99.26% $M_{\rm missing} >$ 60 GeV98.17% 99.71% $ m_{\gamma \gamma} \in[110,140] $ GeV97.51% 22.86% $ E_{\gamma \gamma} \in[120, 150]$ GeV99.16% 99.58% Total eff 57.08% 0.002% Yields in 5.6 ab−1 335.89 3640.20 Table 4. Selection criteria and corresponding efficiencies in the
$\nu \bar \nu \gamma \gamma $ channel.$ M_{\text {missing }}$ is the missing mass calculated from the total visible objects. -
The Multi-Variate Analysis (MVA) method is employed to further suppress the background. It exploits machine learning (ML) techniques to combine the separation power from several variables into a unique variable. In this study, we chose the Gradient Boosted Decision Tree (BDTG) method and TMVA toolkit [28]. For each sub-channel, the ZH and two fermion processes were considered as the signal and background for the BDTG. All events from MC were separated into two sets for 2-fold validation [29] to avoid the risk of overtraining. The following principles were considered while constructing the input variables for BDTG:
● The basic information is the Lorentz vector of the final state particles. This includes the momentum (P), transverse momentum (pT), energy (E), polar angle (
$\cos \theta$ ), and recoil mass for photons, fermions, and systems;$\Delta P,\, \Delta E,\, \Delta \Phi,\, \Delta \cos \theta,\, \Delta R$ for two objects or systems; and the missing mass$M_{\text {missing }}$ .● The separation
$\left\langle S^2\right\rangle$ defined in Eq. (1) is used to quantify the discrimination power between signal and background of a given variable, where y represents the discriminating variable, and$\hat{y}_s(y)$ and$\hat{y}_b(y)$ are the corresponding probability distribution function of the variable for signal and background samples, respectively.$ \left\langle S^2\right\rangle=\frac{1}{2} \int \frac{\left(\hat{y}_s(y)-\hat{y}_b(y)\right)^2}{\hat{y}_s(y)+\hat{y}_b(y)} {\rm d} y .$
(1) ● To ensure the application of the 2D model described in Sec. V, which requires an assumption of independence between the BDTG response and
${m_{\gamma \gamma }}$ , the constructed variable should have a low linear correlation with${m_{\gamma \gamma }}$ :$ |\text{Corr}_{v-{m_{\gamma \gamma }}}|<30\% $ .● To reduce the training redundance, the linear correlation between any two variables should be small:
$ |\text{Corr}_{v1-v2}|<40\% $ . The one with lower separation power is removed.Tables 5–7 lists the selected variables along with their definition and
$ \langle S^{2} \rangle $ for BDTG. Their distributions can be found in Appendix A (Figs. A1, A3, A5). The ROC curves and distributions of the trained BDTG are also shown in Appendix A (Figs. A2, A4, A6).Variable Definition Separation $ pT_{\gamma 1} $ Transverse momentum of the sub-leading photon 0.209 $ \cos\theta _{\gamma 2} $ Polar angle of the leading photon 0.197 $ \Delta\Phi_{\gamma \gamma } $ Azimuthal angle between two photons 0.147 $ \min\Delta R_{\gamma, j} $ Minimum $\Delta R$ between one of the two photons and one of the jets0.054 $ E_{j1} $ Energy of the sub-leading jet 0.041 $ \Delta\Phi_{\gamma \gamma, jj} $ Azimuthal angle between the diphoton and dijet system 0.033 $ pT_{j2} $ Transverse momentum of the leading jet 0.032 $ \cos\theta_{j1} $ Polar angle of the sub-leading jet 0.032 $ \cos\theta_{\gamma \gamma, jj} $ Polar angle difference between diphoton and dijet system, $\cos(\theta_{\gamma \gamma }-\theta_{jj})$ 0.024 $ \cos\theta_{\gamma 1, j1} $ Polar angle difference between sub-leading photon and sub-leading jet, $\cos \left(\theta_{\gamma 1}-\theta_{j 1}\right)$ 0.023 Table 5. Input variables for BDTG in the
$q\bar q\gamma \gamma$ channel.Variable Definition Separation $ \min\Delta R_{\gamma, \mu} $ Minimum $\Delta R$ between one of the two photons and one of the muons0.335 $ E_{\mu\mu} $ Energy of the di-muon system 0.259 $ \cos\theta_{\gamma 1, \mu1} $ Polar angle difference between the sub-leading photon and sub-leading muon 0.189 $ E_{\gamma 2} $ Leading photon energy 0.160 $ \Delta\Phi_{\gamma \gamma } $ Azimuthal angle between two photons 0.090 $ \cos\theta_{\gamma 2} $ Polar angle of the leading photon 0.072 $ \Delta\Phi_{\gamma \gamma, \mu\mu} $ Azimuthal angle between the diphoton and dimuon system 0.034 $ \cos\theta_{\mu 1} $ Polar angle of the sub-leading muon 0.014 Table 6. Input variables for BDTG in the
${\mu ^ + }{\mu ^ - }\gamma \gamma$ channel.Variable Definition Separation $ pT_{\gamma 1} $ Transverse momentum of the sub-leading photon 0.089 $ \cos\theta _{\gamma 2} $ Polar angle of the leading photon 0.079 $ \Delta\Phi_{\gamma \gamma } $ Azimuthal angle between two photons 0.054 $ pTt_{\gamma \gamma } $ Diphoton pT projected perpendicular to the diphoton thrust axis 0.042 $ pT_{\gamma 2} $ Transverse momentum of the leading photon 0.037 Table 7. Input variables for BDTG in the
$ \nu \bar \nu \gamma \gamma $ channel. -
The Higgs signal is extracted by fitting
${m_{\gamma \gamma }}$ and the shape of the BDTG responses. The resonant peak above a smooth${m_{\gamma \gamma }}$ distribution for the background at around the Higgs mass (125 GeV) can be reconstructed through the excellent calorimeter energy resolution in CEPC. The signal${m_{\gamma \gamma }}$ distribution is fitted with a Double Side Crystal Ball (DSCB) function:$ \begin{align} f(t) = N \times \begin{cases} {\rm e}^{-t^{2}/2}, & \text{if }\, -\alpha_{\rm low} \leq t \leq \alpha_{\rm high} \\ \dfrac{ {\rm e}^{-{}^{1}_{2} \alpha_{\rm low}^{2}} } { \left[ \dfrac{1}{R_{\rm low}} \left(R_{\rm low} - \alpha_{\rm low} - t \right) \right]^{n_{\rm low}} }, & \text{if }\, t < -\alpha_{\rm low} \\ \dfrac{ {\rm e}^{-{}^{1}_{2} \alpha_{\rm high}^{2}} } { \left[ \dfrac{1}{R_{\rm high}} \left(R_{\rm high} - \alpha_{\rm high} + t \right) \right]^{n_{\rm high}} }, & \text{if }\, t > \alpha_{\rm high} \\ \end{cases} \end{align} $
(2) where N is a normalization factor and
$ t=({m_{\gamma \gamma }} - \mu_\text{CB}) / \sigma_\text{CB} $ . Figure 2 shows the fitted${m_{\gamma \gamma }}$ signal shape in three channels. They are well described by the DSCB function. The resolution is estimated to be 2.81 / 2.68 / 2.74 GeV in the$q\bar q\gamma \gamma / {\mu ^ + }{\mu ^ - }\gamma \gamma / \nu \bar \nu \gamma \gamma$ channels, respectively.Several smooth functions (Cheybyshev polynomials, and exponential and polynomial families) were tested for background modeling, and the one with the smallest
$ \chi^{2} $ /Ndof value was finally selected. The results are listed in Table 8 and shown in Fig. 3. Details on the fitting conditions for all functions are provided in Appendix A (Table A1 and Fig. A7).Channel Selected function $ \chi_{2} $ /Ndof$q\bar q\gamma \gamma$ 2nd order Chebyshev 0.60 ${\mu ^ + }{\mu ^ - }\gamma \gamma$ 2nd order Chebyshev 1.79 $ \nu \bar \nu \gamma \gamma $ 1st order Chebyshev 3.32 Table 8. Decided background model in the three channels. Tested functions include the exponential, 2nd order exponential polynomial, 1st and 2nd order polynomials, and 1st and 2nd order Chebyshev polynomials.
Figure 3. (color online) Background MC and fitted
${m_{\gamma \gamma }}$ models in the three channels.The histograms from the MC of signal and background were used to build the binned Probability Density Function (PDF), which was in turn used as the model of BDTG distributions.
The strategies employed for constructing BDTG ensured the reasonable independence between the BDTG response and
${m_{\gamma \gamma }}$ . Therefore, a 2-dimensional model resulting from the multiplication of${m_{\gamma \gamma }}$ and BDT models was applied to describe the signal and background. A high correlation can introduce improper modeling of the signal and/or background process. The linear correlation coefficients between${m_{\gamma \gamma }}$ and BDT are −3.45%, −11.6%, 8.33% for the signals in the$q\bar q\gamma \gamma $ ,${\mu ^ + }{\mu ^ - }\gamma \gamma$ , and$ \nu \bar \nu \gamma \gamma $ channels, respectively. The corresponding correlation coefficients for the background are 11.6%, 28.2%, and 28.4%, respectively. -
The number of expected signal events was extracted by combining the fitting in the three channels with the unbinned maximum likelihood fitting method. The likelihood function was built using the models presented in Sec. V and the constraints derived from the systematic uncertainties presented in Sec. VI:
$\begin{aligned}[b] \mathcal{L}(\mu,{\boldsymbol{\theta}};({m_{\gamma \gamma }}, \text{BDT})) & = \prod_{c}\text{Pois}(n_c|N_c(\mu, {\boldsymbol{\theta}}))\cdot \\ & \prod_{i}^{n}f_{c}(({m_{\gamma \gamma }}, \text{BDT})^{i};{\boldsymbol{\theta}}) \cdot \prod_{j} G(\theta_j), \end{aligned} $
(3) where
● μ is the signal strength expressed as
$\mu = $ $ \dfrac{N\ (e^{+} e^{-} \to ZH \to f\bar {f}\gamma \gamma)} {N_{\rm SM}\ (e^{+} e^{-} \to ZH \to f\bar {f}\gamma \gamma)}$ , which is the parameter of interest (POI) in the fitting;●
$ {\boldsymbol{\theta}} $ denotes nuisance parameters defined for systematic terms;●
$ n_c $ is the observed event number in the channel c from the data;●
$N_c(\mu, {\boldsymbol{\theta}})=\mu S_{{\rm SM}, c}({\boldsymbol{\theta_{\rm yield}}}) + B_c$ .$S_{{\rm SM}, c}({\boldsymbol{\theta_{\rm yield}}})$ is the expected signal yield in the channel, including the relevant nuisance parameters.$ B_c $ is the background yield;●
$ f_{c}(({m_{\gamma \gamma }}, \text{BDT})^{i};{\boldsymbol{\theta}}) $ is the probability density function built with the signal and background models presented in Sec. V:$\begin{aligned}[b] f_{c}(({m_{\gamma \gamma }}, \text{BDT})^{i};{\boldsymbol{\theta}}) =& \frac{1}{N_c}\times \Big[ \mu S_{{\rm SM}, c}({\boldsymbol{\theta_{\rm yield}}})f_{c,\rm sig}(({m_{\gamma \gamma }},\text{BDT})^i;{\boldsymbol{\theta}}) \\&+ B_{c} f_{c,\rm bkg}(({m_{\gamma \gamma }},\text{BDT})^i;{\boldsymbol{\theta}}) \Big]. \end{aligned} $
(4) ● The signal yield
$S_{{\rm SM},c}$ , shape peak$\mu_{\rm CB}$ , and width$\sigma_{\rm CB}$ are affected by systematic uncertainties with a response function:$ \begin{aligned}[b] S_{{\rm SM},c}({\boldsymbol{\theta_{\rm yield}}})=S_{{\rm SM},c}\prod\limits_{j}{\rm e}^{\theta_j \sqrt{\ln(1+\delta_j^2)}}, \end{aligned} $
$ \begin{aligned}[b] & \mu_{\rm CB}({\boldsymbol{\theta_{\rm peak}}}) = \mu_{\rm CB}^{\rm nom}\prod\limits_{j}(1+\delta_j \theta_j), \\ & \sigma_{\rm CB}({\boldsymbol{\theta_{\rm width}}}) = \sigma_{\rm CB}^{\rm nom}\prod\limits_{j}{\rm e}^{\theta_j \sqrt{\ln(1+\delta_j^2)}}. \end{aligned} $
(5) ●
$ G(\theta_{j}) $ is the unitary Gaussian constraint PDF for nuisance parameter j with mean 0 and width 1.For the fitting, the signal model parameters were fixed to the values resulting from fitting the signal MC. The background yields, model parameters, and all nuisance parameters were floated, as mentioned in Sec. VI.
In order to mimic real data and avoid statistical fluctuations of the MC samples, a set of Asimov data [30] were generated from the signal + background models and simultaneously fitted to obtain the expected precision and significance. Figure 4 shows the
${m_{\gamma \gamma }}$ and BDTG distributions of the Asimov data and the models in the three channels. A final precision of 7.7% (stat.)$ \pm $ 2.1% (syst.) for the$\sigma\times {\rm Br}$ measurement can be reached in the$ H \to\gamma \gamma $ channel of the CEPC with 5.6 ab−1 data. With the 20 ab−1 data of the updated CEPC operation period, the expected precision is 4.0% (stat.)$ \pm $ 2.1% (syst.). Table 9 lists the contributions from each systematic term. The contribution from background modeling was decoupled from fixing and floating the background parameters in the fitting, and it was included into the statistical precision. Combined results are summarized in Table 10. According to our preliminary assumption, this measurement is still statistically dominant in the CEPC.$q\bar q\gamma \gamma$ ${\mu ^ + }{\mu ^ - }\gamma \gamma$ $\nu \bar \nu \gamma \gamma $ Theo 0.5% 0.005 - - Lumi 0.1% 0.001 0.001 0.001 photon eff 1% 0.019 0.020 0.020 PES 0.05% 0.001 <0.001 0.001 PER 0.05% <0.001 <0.001 <0.001 mH 5.9 MeV <0.001 <0.001 <0.001 BDT 0.006 0.006 0.007 Bkg. modeling 0.029 0.062 0.006 Table 9. Decoupled contributions from considered systematic uncertainties of the
$(\sigma\times {\rm Br}) / (\sigma\times {\rm Br})_{\rm SM}$ measurement in the three channels. The 0.5% theoretical uncertainty was only considered in the$q\bar q\gamma \gamma$ channel.5.6 ab−1 20 ab−1 $\dfrac{\Delta_{\rm tot} }{(\sigma\times \rm Br)_{\rm SM} }$ $\dfrac{\Delta_{\rm stat} }{(\sigma\times \rm Br)_{\rm SM} }$ $\dfrac{\Delta_{\rm tot} }{(\sigma\times\rm Br)_{\rm SM} }$ $\dfrac{\Delta_{\rm stat} }{(\sigma\times\rm Br)_{\rm SM} }$ $q\bar q\gamma \gamma$ 0.101 0.098 0.056 0.052 ${\mu ^ + }{\mu ^ - }\gamma \gamma$ 0.373 0.371 0.202 0.200 $ \nu \bar \nu \gamma \gamma $ 0.130 0.127 0.071 0.067 Combined 0.079 0.077 0.046 0.040 Table 10. Expected precisions on
$\sigma(ZH)\times {\rm Br}(H \to\gamma \gamma)$ from Asimov data fitting in the three channels and their combination. Results in 20 ab−1 were obtained by re-fitting the workspace with the scaled signal and background yields. The statistical precision includes the contribution from background modeling. -
Concerning the fitting of the
${m_{\gamma \gamma }}$ shape, the width of the signal peak is a direct connection between the measurement precision in the$ H\to \gamma\gamma $ channel and the ECAL resolution. Currently, a new detector design for CEPC is under development [12–14] in which the present Si-W sampling ECAL will be replaced by a homogeneous crystal ECAL. This new ECAL is expected to have an energy resolution of$ \sigma_{E}/E = 3\%/\sqrt{E} $ , which is almost five times higher than the sampling Si-W ECAL$ \sigma_{E}/E = 16\%/ \sqrt{E} \oplus 1\% $ [4]. This can facilitate photon detection and neutral meson ($ \pi^{0} $ ) reconstruction, and further contribute to the Higgs study in the$ H\to \gamma\gamma $ channel and flavor physics in the$ \pi^{0}\to \gamma\gamma $ final state, e.g.,$ B^0_{(s)} \to \pi^0 \pi^0 $ [31]. The jet energy resolution may not be significantly improved from this ECAL, given that the detector granularity is the dominant factor in PFA-based jet reconstruction.We performed a rough estimation in the
$q\bar q\gamma \gamma$ channel according to the strategy followed in this work: to study the ECAL resolution impact on the$ H\to \gamma\gamma $ measurement. In the estimation, the selected photon was replaced by the truth photon with a smearing in its energy. Normally, the ECAL energy is approximated as:$ \begin{equation} \frac{\sigma_{E}}{E} = A \oplus \frac{B}{\sqrt{E}} \oplus \frac{C}{E}, \end{equation} $
(6) where A is the constant term, e.g., the energy leakage and readout threshold; B represents the stochastic term from photoelectron statistics and depends on the sensitive material; and C comes from the electronic noise. Presently, the noise term C is expected to be 0, and the constant term A is expected to be at the level of 1%. The photon energy is smeared with the stochastic term B varying from 1% to 35%. Figure 5 shows a comparison between the
${m_{\gamma \gamma }}$ shape from the full simulation and two smearing points, i.e., 3% and 16%. The jet performance is maintained consistent with the baseline Si-W sampling ECAL, assuming there is no impact from the new detector. The same selection criteria as in Sec. III were applied; the BDT was not employed in this simplified study to focus on the photon detection only, which is expected to present a 30% decrease, approximately, compared with the results in Sec. VII. A Gaussian function was used to describe the signal model from energy smearing. The 2-dimensional model was replaced with a 1-dimension${m_{\gamma \gamma }}$ model, and a similar unbinned maximum likelihood fitting was performed to extract the signal strength precision$ \delta\mu/\mu $ without systematic uncertainties. Considering that${m_{\gamma \gamma }}$ and BDT are independent, this simplification was expected to have little impact on the relative improvement. Figure 6 shows the relationship between energy resolution B and fitted precision$ \delta\mu/\mu $ . These points can be fitted with the following function:Figure 5. (color online) Signal shape for the full simulated
$ H \to \gamma\gamma $ sample (blue) and for two samples with smeared photon energy (3% in red and 16% in green). The fitted signal widths were 2.81 GeV, 0.94 GeV, and 1.96 GeV respectively.Figure 6. (color online) Signal strength measurement precision in the
$ ZH \to q\bar q\gamma \gamma $ channel as a function of the stochastic term in ECAL resolution from a fast analysis. The points were fitted using Eq. (7).$ \begin{equation} \frac{\delta \mu}{\mu} = p_{0} \oplus (p_{1}\times B), \end{equation} $
(7) where
$ p_{0} $ and$ p_{1}\times B $ represent the contributions from the constant and stochastic terms, respectively. According to this relation, the homogeneous ECAL achieves a 28% improvement in the statistical precision of signal strength measurement. Moreover, a "critical point" can be defined: the two components in resolution equally contribute to$ \delta\mu/\mu $ , i.e.,$ p_{0}=p_{1}\times B $ . When the constant term A was fixed to 1%, the critical point for B, within this definition, was 14%. This indicates that the constant term in resolution would become the dominant contribution at the new ECAL design point with B = 3%. The scanning of a series of constant terms and the corresponding balanced stochastic terms are shown in Fig. 7. -
Figure A1-1. (color online) Training variables in
$q\bar q\gamma \gamma$ channel. The signal and background yields are normalized.Figure A2. (color online) The ROC curve (left) and output BDTG distribution (right) in
$q\bar q\gamma \gamma$ channel.Figure A3. (color online) Training variables in
${\mu ^ + }{\mu ^ - }\gamma \gamma $ channel. The signal and background yields are normalized.Figure A4. (color online) The ROC curve (left) and output BDTG distribution (right) in
${\mu ^ + }{\mu ^ - }\gamma \gamma $ channel.Figure A5. (color online) Training variables in
$\nu \bar \nu \gamma \gamma$ channel. The signal and background yields are normalized.Figure A6. (color online) The ROC curve (left) and output BDTG distribution (right) in
$\nu \bar \nu \gamma \gamma $ channel.$q\bar q\gamma \gamma$ ${\mu ^ + }{\mu ^ - }\gamma \gamma$ $ \nu \bar \nu \gamma \gamma $ 1st order Exp. 0.941 5.423 3.786 2nd order Exp. 0.610 2.035 3.435 1st order Poly. 0.644 4.321 7.399 2nd order Poly. 0.600 3.758 3.439 1st order Chebyshev 0.644 4.321 3.320 2nd order Chebyshev 0.596 1.789 3.411 Table A1. The
$\chi^{2} $ /Ndof values for 6 considered models in the background modeling in 3 channels, including the first and second order exponential, polynomial and Chebyshev functions.Figure A7. (color online) Tested functions for the background modeling. In All 3 channels the second order Chebyshev function gives the smallest
$\chi^{2}/Ndof$ value. Detailed numbers are listed in Table A1.
Expected measurement precision of the branching ratio of the Higgs boson decaying to the di-photon at the CEPC
- Received Date: 2022-09-13
- Available Online: 2023-04-15
Abstract: This paper presents the prospects of measuring