-
PDF
- Split View
-
Views
-
Cite
Cite
Hyun-Gi Lee, Jungsic Park, Restoring the saturation response of a PMT using pulse shape and artificial neural networks, Progress of Theoretical and Experimental Physics, Volume 2023, Issue 5, May 2023, 053C01, https://doi.org/10.1093/ptep/ptad047
- Share Icon Share
Abstract
The linear response of a photomultiplier tube (PMT) is a required property for photon counting and reconstruction of the neutrino energy. The linearity valid region and the saturation response of a PMT were investigated using a linear-alkylbenzene (LAB)-based liquid scintillator. A correlation was observed between the two different saturation responses, with pulse-shape distortion and pulse-area decrease. The observed pulse shape provides useful information for the estimation of the linearity region relative to the pulse area. This correlation-based diagnosis allows an in situ estimation of the linearity range, which was previously challenging. The measured correlation between the two saturation responses was employed to train an artificial neural network (ANN) to predict the decrease in pulse area from the observed pulse shape. The ANN-predicted pulse-area decrease enables a prediction of the ideal number of photoelectrons regardless of the saturation behavior. This pulse-shape-based machine-learning technique offers a novel method for restoring the saturation response of PMTs.
1. Introduction
A photomultiplier tube (PMT) amplifies the number of photoelectrons (NPE) released from a photocathode at each dynode and discharges them as a pulse of current. The linearity of the gain between the photocathode and the amount of output signal is a required property for counting NPE from the observed currents. It is known that saturation behavior appears when detecting an event of high light intensities [1,2] and the validity of the linearity depends on the environment [3,4]. Hamamatsu’s 10-inch photocathode PMT R7081 is a widely used photon sensor specialized for neutrino experiments [5–9]. Its linearity has been tested up to |$\rm \sim 300\, (\rm 600)\, PE$| at a gain of |$1\times 10^7\, (5\times 10^6)$| [7,10] and the characteristics of the saturation behavior have been investigated [11].
Reconstruction of neutrino events is typically performed from the NPE observed by each PMT installed in the detector [12–14]. When it comes to reconstructing relatively high-energy particles, there are concerns regarding saturation behavior as the number of photoelectrons observed by a PMT (observed NPE) increases. The saturation behavior is a major limitation in the reconstruction of neutrinos produced by kaon decay-at-rest |$\rm (236\, MeV)$| in the JSNS2 experiment [7,15] or passing through muons |$\rm (\sim GeV)$| in the Double Chooz experiment [16]. For each case, a diagnosis of the linearity range or an understanding of the saturation response is required for the reconstruction of the events occurring in these energy regions.
The observed NPE associated with the reconstructed particle energy also depends on the detector size and light-collection method. In very short baseline reactor |$\bar{\nu }_e\, \rm (\sim few\, MeV)$| experiments, several PMTs were installed in each segmented cell with different baselines, and the observed NPE reached |$\rm \sim 500\, PE/MeV$| [17]. These types of experiments prove the existence of sterile neutrinos based on the comparison of the observed |$\bar{\nu }_e$| energy spectra for the segmented region with different baselines [18–20]. Global analyses of available |$\bar{\nu }_e$| disappearance data with the existence of sterile neutrinos have indicated that the best-fitting value of |$(\sin ^{2}{\theta }_{14},\, \Delta m^{2}_{41})$| is (0.009, 1.3) [21]. An identical response for each detector is required to prove the energy-dependent disappearance of |$\bar{\nu }_e$| with such a small mixing angle [12,22]. The saturation behavior of PMTs may spoil the identical energy scale response for each cell, which gives ambiguity in the precision comparison of the prompt energy spectra or reduces the sensitivity in the sterile neutrino search [18,19].
In this study, we investigate a possible diagnosis and restore the saturation response of R7081 up to |$\rm \sim 4000\, PE$| using artificial neural networks (ANNs) and the observed pulse shape. A correlation between two different types of saturation responses, pulse-shape distortion and pulse-area decrease, was observed. The obtained correlation was employed to train the ANN to recover the reduced pulse area from the observed pulse shape, which contains in situ information on the pulse-area decrease. Section 2 describes the experimental setup for the saturation-response measurements. Section 3 elucidates the measurement of gain and saturation response. Section 4 provides details on the structure and training of the ANN. Section 5 presents the training results and restoration of the saturation response from the saturated pulse shape. Finally, Sect. 6 summarizes the study and highlights the major conclusions drawn from the obtained results.
2. Experimental setup
The saturation response of R7081 was investigated using the setups shown in Fig. 1. Two different types of PMTs, 10 inch (Hamamatsu R7081) and 2 inch (Hamamatsu H7195), were used to detect the scintillation events. The 2-inch PMT was designed to detect the scintillation events without the saturation, whereas the 10-inch PMT coincidently detected the same scintillation events, including the saturation response.

Experimental setup for the PMT response measurement. Left: Test setup used for the linearity test of the 2-inch PMT-A. The 2-inch PMT-B receives scintillation light attenuated by a |$\rm 1/4$| neutral-density filter and monitors the linearity of the 2-inch PMT-A. Right: Experimental setup for the saturation-response measurement of the 10-inch PMT. The 2-inch PMT-A (10-inch PMT) observed the scintillation events, without (including) the saturation.
The linearity (saturation response) of the 2-inch PMT (10-inch PMT) was measured using the configuration shown in Fig. 1 (left) (right). A mu-metal was used to shield the 10-inch PMT from the geomagnetic field to improve its collection efficiency [23]. A glass vial contained the liquid scintillator (LS), which yielded the scintillation light from ionization. The γ-rays emitted from the radioactive sources of |$^{137}{\rm Cs}\, (0.66 \rm \, MeV)$| and |$^{35}{\rm Cl}\, (\rm \sim 8\, MeV)$| transferred their energy to the electrons of the LS via Compton scattering and further ionized the scintillator. The emitted scintillation photons were reflected in the cylindrical Teflon tube, and the 10-inch and 2-inch PMTs coincidently detected these photons with different quantum and collection efficiencies. The cylindrical Teflon tube was made of polytetrafluoroethylene (PTFE) and had a length of 7.5 cm with an inner diameter of 2 inches. The gain of each PMT was adjusted to the typical gain provided by the manufacturer |$\, (\sim 3\times 10^6\, \, \rm \, for\, \, H7195,\, \sim 1\times \, 10^7\, \, for\, \, R7081)$| [5], and the observed pulse area was divided into gain to determine the observed NPE.
An LS is a mixture of an organic solvent, a fluor, and a wavelength shifter. Linear alkylbenzene |$(\mathrm{LAB}, \mathrm{C}_{n}\mathrm{H}_{2n+1}$|–|$\mathrm{C}_{6}\mathrm{H}_{5}, n = 10$|–13) is widely used as a solvent for LS owing to its relatively high light yield and environmentally friendly characteristics [24–29]. In this study, 2,5-diphenyloxazole |$(\rm PPO, C_{15}H_{11}NO)$| and 1,4-bis(2-methylstyryl)benzene |$(\rm bis$|-|$\rm MSB, C_{24}H_{22})$| were adopted as the fluor and the wavelength shifter, respectively. The LS was synthesized by dissolving |$3\, \rm g/L$| of PPO and |$30\, \rm mg/L$| of bis-MSB into LAB, which is an aromatic solvent. The primary decay time of the LS is about ∼4 |$\rm ns$| [30].
As described in Sect. 1, the NPE released from the photocathode is multiplied at each dynode and discharged as current pulses. Figure 2 presents a relevant circuit diagram for the 10-inch PMT, which contained an attenuator (Kaizuworks KN320), a flash analog-to-digital converter (FADC), and a high-voltage power supply (ORTEC 556). The discharged pulses were attenuated to adjust the dynamic range of the electronics and digitized by Notice FADC400, which is a 10-bit, 400 MHz/s, |$\rm \pm 1\,V_{pp}$| FADC [31].
![Circuit diagram of R7081. The total resistance between the dynodes is 12.7 $\rm \, M\Omega$, and the distribution ratio of the dynode resistance is (16.8–0.6–3.4–5–3.33–1.67–1–1.2–1.5–2.2–3–2.4) [32].](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/ptep/2023/5/10.1093_ptep_ptad047/1/m_ptad047fig2.jpeg?Expires=1750191746&Signature=Py7D3Kbi02rTb9SL4DvRWIySQRzi0K~mKwx22xa3lCbVdo5HjUiZcv9u2c5~mETZ~qsFFNIxo0xTOiB2cYObgZBE-~udp-gmVebaYG7V5gaWwLcHOw4e2-YNTlp4a7WmcUGm7RSSHl4EJ8-QWMaKrXi-7Nrmtd~0e0DKHOiEW85ODTcgiZUpmHgh~CUWHS2FaSWu5IEQB9-XCggJhETD7kU5lA7l3Ej955It-NjyFievbg7kNVc6M1z-pU5y5OE4nZss4togLgdnsHEfD~XNR7BPzXKiPtTbfrH6W5zZ~hdHCv53Y~6UlSqO~bAuEaHphjZNTLz3FCZZx9UpsSsnQA__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Circuit diagram of R7081. The total resistance between the dynodes is 12.7 |$\rm \, M\Omega$|, and the distribution ratio of the dynode resistance is (16.8–0.6–3.4–5–3.33–1.67–1–1.2–1.5–2.2–3–2.4) [32].
3. Measurements
The gain of each PMT was measured using an attenuated laser light source with an external trigger. For obtaining the single photoelectron (SPE) charge distribution, 440 nm laser light (OPG-NIM-440) was attenuated by a neutral-density filter |$(0.01\%)$|. Figure 3 shows the observed charge distribution from the PMT with the light source. The inset indicates the exponential increase in the SPE as a function of the applied voltage. The adjusted gains are 2.99 × 106 (2-inch-A), 2.78 × 106 (2-inch-B), and 1.02 × 107 (10-inch) with applied voltages of 1600, 1600, and 1365 V, respectively.

Measurement of the gain of each PMT. The charge distribution of the 2-inch-A, 2-inch-B, and 10-inch PMTs are presented on the left, middle, and right, respectively. The insets display the exponential increase in the observed gain with increasing applied voltage.
As described in the previous section, the scintillation events are detected by the coincidence of two PMTs with different efficiencies. The relative difference between the photon detection efficiencies of the two PMTs shown in Fig. 1 was obtained by comparing the observed Compton edges [33]. In Fig. 4, the red data points illustrate the NPE observed by the two PMTs obtained using a |$^{137}{\rm Cs}\, (0.66 \rm \, MeV)$| γ-ray source. A 1D projection of the scatter plot is presented in the inset, and their distribution is fitted with an empirical formula of error + exponential function |$(y= \mathrm {P}_0 \cdot \left[ 1 - \mathrm{erf}\, \left( (x - {\rm P_1})/{\rm P_2} \right) \right] + {\rm P_3} \cdot \exp \left( -{\rm P_4} \cdot x \right))$| to find the edge of P1 [34,35]. The blue dashed line in Fig. 4 indicates the linear model |$(Y\propto X)$|, which has a slope of the ratio of the observed edges between the two PMTs. From the consistency between the data and model in the upper figure, the ratio of the observed edges between the two PMTs is employed as an efficiency correction coefficient. By multiplying the ratio of the obtained edges with the observed NPE of the 2-inch PMT-A, the ideal NPE of the 10-inch PMT regardless of the saturation behavior is estimated.
![Obtained PMT response at the typical gain using the setup shown in Fig. 1. The responses of the 2-inch PMT-A and 2-inch PMT-B (10-inch PMT) are shown in Fig. 4(a) (b). The blue dashed lines indicate the linear model and their slopes are determined from the ratio of the observed Compton edges $\rm (79.5/284.5\, (222.5/286.8)\, [NPE/NPE])$.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/ptep/2023/5/10.1093_ptep_ptad047/1/m_ptad047fig4.jpeg?Expires=1750191746&Signature=UbyHKuyVsDcB1gvVrqLk6Q5UxedSvpOyT7W~QXQAYv5zdCDTTg65NGwtUDoUkd4zEFKBHHGFCwIjOeVHU87~Jl-8j0bMdsUAMlw~n20MopsbykyDcokT8yrgKHBIz3Kpl7eqlJsaHRn0WtEG6sXKRthHb50jg33n9TOC3l8H5~XKil6cdbiJPHiopd0W643vskxon2siqYEUxvlC5JM5892y6cdWaceDpevSpqA9UQPvF-OyXMzWFboPODt8Hmdz4eY6EqzObtsPenpJMwyLfThIQQDDFVZS2uft5mJenziN3WvOPcTQoLxvTzo19LBht0d9kbTDergMOPT6ur3InQ__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Obtained PMT response at the typical gain using the setup shown in Fig. 1. The responses of the 2-inch PMT-A and 2-inch PMT-B (10-inch PMT) are shown in Fig. 4(a) (b). The blue dashed lines indicate the linear model and their slopes are determined from the ratio of the observed Compton edges |$\rm (79.5/284.5\, (222.5/286.8)\, [NPE/NPE])$|.
The responses of the PMTs at a higher scintillation light intensity were obtained using a |$^{35}{\rm Cl}\, (\rm \sim 8\, MeV)$| γ-ray source [36]. In Fig. 4, the black data points show a comparison between the observed NPE of the two PMTs at a larger NPE. Figure 4(a) compares the NPE observed by the 2-inch PMT-A and 2-inch PMT-B from the same scintillation events. To test the linearity of the 2-inch PMT-A, a |$\rm 1/4$| neutral-density filter was installed on the 2-inch PMT-B. No saturation behavior is evident for |$\rm \sim 4000\, PE$| observed by the 2-inch PMT-A. Figure 4(b) compares the response of the 2-inch and 10-inch PMTs. As the NPE increases, the saturation behavior of the 10-inch PMT becomes distinct.
In the operation of a PMT with a high-intensity light pulse, the electron trajectory between the dynodes is distorted by the space charge effect caused by the repulsive force of the electron cloud [1,2], and the saturation behavior of the PMT appears as a distortion of the pulse shape and simultaneous decreases in the pulse area [3,11]. Both the saturation responses were observed in the 10-inch PMT and are presented in Fig. 5. The left side of Fig. 5 shows the accumulated pulse shape of the scintillation events with respect to the observed NPE of each PMT. The measured pulse shape contains a rise, peak, and tail [37]. Figure 5(a) displays the distortion of the pulse shape of the 10-inch PMT, which appears as a decrease in the peak. The distortion starts around |$\rm \sim 300\, PE$|. On the other hand, in Fig. 5(b), a similar pulse shape is observed for the 2-inch PMT irrespective of the observed NPE. Figure 5(c) presents the pulse-area decrease responses for the 10-inch PMT. A deviation from the ideal response of more than 1% appears at |$\rm \sim 300\, PE$|, and this deviation increases as a function of the NPE of the 10-inch PMT. The response is obtained by averaging the ratio between the ideal and observed NPE according to the NPE of the 10-inch PMT.

Observed pulse-shape and pulse-area saturation response. Left: Observed pulse-shape distortion (linearity) of the 10-inch PMT (2-inch PMT-A). Right: Observed decrease in pulse area shown by the 10-inch PMT. The ideal NPE of the 10-inch PMT that is disregards the saturation behavior is obtained based on the NPE of the 2-inch PMT-A. The responses were measured with respect to the observed NPE for each PMT.
An increase in the pulse-shape distortion and the pulse-area decrease was observed as a function of the observed NPE of the 10-inch PMT. Note that the pulse-shape distortion of the 10-inch PMT is observed using only the 10-inch-PMT information, without comparing its response to that of the 2-inch PMT. The correlation observed between the pulse-shape distortion and pulse-area decrease suggests the possibility of diagnosing the linearity range or predicting a decrease in the pulse area from the observed pulse shape.
4. Training of artificial neural networks (ANNs)
ANNs are widely used machine-learning algorithms that can extract correlated features from an input of a higher dimension. A typical ANN consists of repeated layers of perceptron and a non-linear activation function. An output of the ith perceptron in the Nth layer is fed forward to the jth perceptron in the (N + 1)th layer with a connection strength of wij. Higher-dimensional features are extracted as the layer is repeated, and the extracted features are fed forward to the next layer. The ANN has numerous free parameters for the unfixed wij, and multiple sets of data-label pairs are required to train these parameters. Further details on ANNs are provided in Ref. [38].
The ANN was trained using the observed correlation between the pulse-shape distortion and pulse-area decrease. Figure 6(a) shows the structure of the ANN used for predicting the pulse-area decrease from the observed pulse shape. The deep-learning framework PyTorch (A. Paszke, data not published) was adopted to construct the ANN model. The input is provided to 60 nodes, and each node represents a single 2.5 |$\rm ns$| bin of the area-normalized pulse shape. The area-normalized pulse shape is then fed forward to the next layer by the exponential linear unit (ELU), which is a non-linear activation function (D.A. Clevert, T. Unterthiner and S. Hochreiter, data not published), and repeated several times to extract the non-linear features corresponding to the pulse-area decrease. The extracted features are transferred to the linear layers to predict the decreased ratio of the observed pulse area for restoration.

Structure and training of the ANN. Left: Schematic diagram of the ANN used to predict the pulse-area saturation from the observed pulse shape. Right: Decrease in MSE loss of the training and validation sets during the epochs. The decrease of the loss saturates at an epoch of 130.
As demonstrated in Fig. 5, the saturation response appears as a distortion of the pulse shape and a simultaneous decrease in the pulse area. The free parameters in the ANN are trained by the number of input-label pairs obtained from the scintillation events. The input is the pulse-shape distortion response, which is given by the area-normalized pulse shape with 60 bins, while the label is the inverse of the pulse-area decrease response defined by the ideal NPE/observed NPE or Q/(Q − ∆Q). Here, the ideal NPE is denoted by Q, which is the ideal charge from the pulse area regardless of the saturation response. On the other hand, the observed NPE is denoted as Q − ∆Q, where ∆Q is the reduced charge from the pulse area decreased by the saturation response.
To train, validate, and test the ANN model, 2.0 × 105 input-label pairs were selected at NPE larger than 300 for the restoration. For each event, the normalized pulse shape of the 10-inch PMT was used as the input. The corresponding label was determined from the observed pulse-area decrease response in Fig. 5(c) and the observed NPE of the 10-inch PMT. The inverse of the pulse-area decreased ratio was adopted as a label (label = ideal NPE/observed NPE or or Q/(Q − ∆Q)) and used as the pulse-area restoration coefficient for the decreased pulse area. The splitting ratio of the training, validation, and test datasets was 2:1:2. The correction should not be performed in regions where the saturation response is absent, and NPE smaller than 300 have been excluded from the restoration.
Figure 6(b) depicts the training of the ANN over each epoch. The Adam optimizer [39] and the mean squared error (MSE) loss [40] were utilized during the training process. The initial learning rate (5 × 10−4) was gradually decreased at a rate of |$\rm 0.975^{epoch}$| by using the learning rate scheduler. During each epoch, the connection strengths represented as wij between the perceptrons were optimized, leading to a decrease in the loss. The ANN was trained for a total of 130 epochs.
5. Results
The trained ANN was tested using the test dataset. The restoration coefficient corresponding to the decreased pulse area was predicted by the trained ANN from the observed pulse shape. A clear correlation between the ANN prediction and the label is shown in Fig. 7(a). The inset depicts the ratio between the prediction and the label. From the trained ANN, the pulse shapes are classified according to the restoration coefficient as presented in Fig. 7(b). The peak of the area-normalized pulse height decreases with increasing restoration coefficient.

Validation of the trained ANN using the test dataset. The trained ANN predicts the restoration coefficient for the decreased pulse area from each observed pulse shape. Left: Comparison of the labels and ANN predictions. Right: Classified pulse shape by the trained ANN for each prediction.
The ANN-predicted restoration coefficients obtained from the observed pulse shapes were applied to the test dataset. Figure 8 compares the responses of the 10-inch PMT with and without restoration. A distinct saturation behavior appears as the NPE increases without restoration. The ANN-predicted restoration coefficient from the observed pulse shape is multiplied with the NPE observed by the 10-inch PMT to restore the decreased pulse area. The restored response is compared with a linear response model (Y = AX) with the ideal slope (A = 1). The insets in Fig. 8 provide a comparison of the 10-inch PMT response with (black points) and without (red points) restoration. Without restoration, a deviation from the ideal response appears in NPE larger than 300. The biased response was restored by the ANN from the prediction obtained by the pulse-shape classification. Due to the finite classification power, the root-mean-square (RMS) of the ideal/observed NPE ratio increased after the restoration. The RMS differences between the red and black decrease as the NPE increases.

Restoration of the saturation response of the PMT using the ANN model. The observed NPE of the 10-inch PMT without (with) restoration is shown in red (black). The blue dashed line indicates the linear model. The inset compares the bias and resolution of the PMT response with restoration and without restoration. The restoration was performed for NPE larger than 300.
6. Summary
Neutrino events are typically reconstructed from the NPE observed by the PMTs mounted in the detector. Understanding the saturation behavior or a solid diagnosis of the linearity range of the PMT provides useful information for correct reconstruction of the events. Previous studies were primarily focused on the PMT saturation behavior based on the absolute pulse height or absolute pulse area.
In this study, the saturation behavior of a PMT was investigated by focusing on the correlation between two saturation responses, distortion of the pulse shape, and decrease in the pulse area. Comparing the observed pulse shapes with the observed NPE provided useful information for an in situ diagnosis of the pulse-area decrease. The observed correlation between the two saturation responses was employed to train the ANN. The trained ANN predicted the decreased ratio of the pulse area from the observed pulse shape. The predicted restoration coefficient was applied to the observed NPE to restore the ideal case regardless of the pulse-area decrease. The restored linearity facilitates the correct reconstruction of the event in the saturation region for PMT-based detectors.
Acknowledgement
This work was supported by grants from the National Research Foundation (NRF) of the Korean Government (2022R1A2C1006069, 2022R1A5A1030700, 2022R1I1A1A01064311, 2018R1D1A1B07045812, RS-2022-00166269, RS-2023-00212787). We are very grateful for the support provided by the Center for Precision Neutrino Research at Chonnam National University.
Appendix A. Enlarged insets

Enlarged insets from Fig. A1. The exponential increase in the observed gain with increasing applied voltage for each PMT is presented. The gain curves for the 2-inch-A, 2-inch-B, and 10-inch PMTs are displayed on the left, middle, and right, respectively.

Enlarged insets from Fig. A2. 1D projections of the red data points from Figs. A2(a) (upper) and A2(b) (lower), obtained using a |$^{137}{\rm Cs}\, (0.66 \rm \, MeV)$| γ-ray source. The NPE distributions observed by each PMT are fitted with an empirical error + exponential function to determine the Compton edges.

Enlarged inset from Fig. A3. 1D histogram of the ratio between the ANN prediction and the label of the restoration coefficients for the decreased pulse area at the test dataset.

Enlarged inset from Fig. A4. Comparison of the resolution and bias response of the 10-inch PMT with restoration (black points) and without restoration (red points). The resolution and bias response are represented by the observed/ideal ratio and the RMS of the observed/ideal ratio, respectively.