-
PDF
- Split View
-
Views
-
Cite
Cite
Michael J Wilensky, Nichole Barry, Miguel F Morales, Bryna J Hazelton, Ruby Byrne, Quantifying excess power from radio frequency interference in Epoch of Reionization measurements, Monthly Notices of the Royal Astronomical Society, Volume 498, Issue 1, October 2020, Pages 265–275, https://doi.org/10.1093/mnras/staa2442
- Share Icon Share
ABSTRACT
We quantify the effect of radio frequency interference (RFI) on measurements of the 21-cm power spectrum during the Epoch of Reionization (EoR). Specifically, we investigate how the frequency structure of RFI source emission generates contamination in higher order wave modes, which is much more problematic than smooth-spectrum foreground sources. Using a relatively optimistic EoR model, we find that even a single relatively dim RFI source can overwhelm the EoR power spectrum signal of |$\sim 10\, {\rm mK}^2$| for modes |$0.1 \ \lt k \lt 2 \, h\, {\rm Mpc}^{-1}$|. If the total apparent RFI flux density in the final power spectrum integration is kept below 1 mJy, an EoR signal resembling this optimistic model should be detectable for modes |$k \lt 0.9\, h\, {\rm Mpc}^{-1}$|, given no other systematic contaminants and an error tolerance as high as 10 per cent. More pessimistic models will be more restrictive. These results emphasize the need for highly effective RFI mitigation strategies for telescopes used to search for the EoR.
1 INTRODUCTION
The Epoch of Reionization (EoR) is a cosmological period in which the content of the universe transitioned from being mostly neutral to being mostly ionized, as we see it today. Understanding the EoR will inform other areas of cosmology such as structure formation and the expansion history of the universe. For reviews on the study of the EoR, see Furlanetto, Oh & Briggs (2006), Morales & Wyithe (2010), and Liu & Shaw (2019). One way to probe the EoR is to measure the power spectrum of fluctuations in the brightness temperature of 21-cm radiation emitted from the neutral hydrogen hyperfine transition. These types of measurements are performed using radio interferometers such as the Murchison Widefield Array (MWA; Tingay et al. 2013; Wayth et al. 2018), the Hydrogen Epoch of Reionization Array (HERA; DeBoer et al. 2017), the LOw Frequency ARray (LOFAR; van Haarlem et al. 2013), the Precision Array for Probing the Epoch of Reionization (PAPER; Parsons et al. 2010), and the Giant Metrewave Radio Telescope (GMRT; Paciga et al. 2011), as well as the developing Square Kilometer Array (Mellema et al. 2013). Constructed for a given redshift, the 21-cm power spectrum tells us about the scales of neutral hydrogen at that moment in cosmological history.
A common theme that emerges from attempted measurements of the EoR power spectrum is that systematics, particularly those that yield frequency-dependent effects, must be understood with a high dynamic range in order for an EoR detection to be feasible. These systematics come in many forms. For instance, bright radio emission from astrophysical foregrounds is typically four to five orders of magnitude brighter than the expected reionization signal. These foregrounds dominate the EoR signal in a region of power spectrum space known as the foreground wedge (Datta, Bowman & Carilli 2010; Morales et al. 2012; Parsons et al. 2012; Trott, Wayth & Tingay 2012; Thyagarajan et al. 2013). Moreover, errors in calibration that arise from incomplete knowledge of these sources can themselves provide contamination of power spectra (Datta et al. 2010; Barry et al. 2016; Patil et al. 2016, 2017; Trott & Wayth 2016; Ewall-Wice et al. 2017; Li et al. 2018, 2019; Byrne et al. 2019; Joseph et al. 2020; Kern et al. 2020b). Other effects involve subtle instrumental behaviour that would be negligible in other contexts, including, but not limited to, effects arising from baseline layout (Hazelton, Morales & Sullivan 2013; Murray & Trott 2018), internal signal chain reflections (Beardsley et al. 2016; Barry et al. 2019b; Kern et al. 2019, 2020a,b; Li et al. 2019), and complexities of the primary beam (Beardsley et al. 2016; Ewall-Wice et al. 2016; Thyagarajan et al. 2016; Fagnoni et al. 2019; Barry et al. 2019b; Joseph et al. 2020). Another important class of contaminants, which is the focus of this work, are radio signals such as FM radio broadcasts and other anthropogenic transmissions that are referred to in radio astronomy as radio frequency interference (RFI).
Reception of RFI poses problems for many radio astronomy applications. RFI can often be exceptionally bright compared to distant astrophysical radio sources, and there are many different mechanisms by which RFI can enter an array. Such mechanisms include self-generated RFI within a telescope, direct reception of RFI from a nearby transmitter, long-range reception of RFI due to tropospheric ducting (Sokolowski, Wayth & Lewis 2016), broad-band emission from lightning (Sokolowski et al. 2016), direct emission from satellites such as ORBCOMM (Sokolowski et al. 2016), as well as reflections off of aircraft (Wilensky et al. 2019), the moon (McKinley et al. 2013, 2018), meteors (Zhang et al. 2018), and satellites (Zhang et al. 2018). The prolific means by which RFI can contaminate radio data make it a ubiquitous problem, and so there has been much study devoted to mitigation methods. While an exhaustive review of general RFI mitigation techniques employed in different radio contexts would be outside the scope of this work, we note a few post-correlation flagging and filtering techniques that have been employed on EoR data so far, namely aoflagger1 (Offringa, van de Gronde & Roerdink 2012; Offringa et al. 2015), deep learning techniques with convolutional neural networks (Kerrigan et al. 2019), the Watershed RFI algorithm (Roerdink & Meijster 2000; Kerrigan et al. 2019), as well as ssins2 (Wilensky et al. 2019).
While there is a strong consensus that RFI flagging is a highly important step in pre-processing EoR data (Barry et al. 2019b; Li et al. 2019; Mertens et al. 2020), and much effort has been devoted to removing RFI contaminated data from EoR studies, there has been relatively little study to specifically characterize the manner in which undetected RFI systematically affects EoR detection efforts. Interestingly, RFI excision itself can induce chromatic structure in radio data that can systematically bias power spectrum estimates, which is specifically studied in Offringa, Mertens & Koopmans (2019). In general, any given mitigation method is bound to leave undetected RFI at some level. For instance, Offringa et al. (2013) present a statistical estimate for the apparent brightness of undetected RFI in LOFAR data after aoflagger, along with an empirical limit on the contamination levels judged from imaging RFI-flagged data. Another example is Barry et al. (2019b), in which two rounds of RFI flagging were performed on MWA EoR data: first with aoflagger, and then with ssins. In this study, ssins found a substantial number of faint digital television (DTV) interference events remaining after the first round of flagging. Furthermore, removal of any 2-min snapshots containing any DTV remnants helped improve EoR limits, where the improvement was quite substantial in a certain subset of the data. A natural line of inquiry that follows from this is to investigate the level to which undetected RFI occludes EoR detection. The purpose of this work is to provide a quantitative, theoretical expectation for the systematic bias in the power spectrum that results from undetected RFI, and to thereby quantify a tolerance for RFI contamination in EoR data based on the expected brightness of the EoR signal.
In Section 2, we lay the foundation for the theoretical formalism used in this paper and state assumptions that are used. In Section 3, we first calculate the expected power spectrum of several common types of RFI source using this formalism. Using a fully functional imaging and power spectrum pipeline, we then simulate RFI power spectra and compare them to simulated power spectra of sources from the GLEAM sky catalogue (Hurley-Walker et al. 2017) as well as theoretical expectations for the EoR signal. In Section 4, we draw conclusions from this work.
2 THEORETICAL FORMALISM
In this section, we establish the theoretical formalism that is used for the power spectrum calculations in this work. This includes choices of notation, definitions of useful quantities, and assumptions about quantities that affect the calculation.
2.1 The power spectrum estimator
Since the goal of this work is to extract the effect that an RFI source has on the power spectrum, we will make several simplifying assumptions in order to ease theoretical calculations. We will later compare these theoretical calculations to realistic in situ simulations wherein qualities of the instrumental measurement process are included, so that we can be sure that the gross effects predicted from the theoretical calculations are still present along with the nuances that arise from measurement. For analytic descriptions of EoR power spectra that include effects of the instrument and analysis choices, see Liu, Parsons & Trott (2014a,b).
Astrophysical foreground contamination is largely contained within a wedge-shaped region of the cylindrically averaged Fourier domain known as the foreground wedge, e.g. Datta et al. (2010), Morales et al. (2012), Trott et al. (2012), Parsons et al. (2012), and Thyagarajan et al. (2013). Without extremely accurate foreground removal, this contamination excludes lower order line-of-sight modes in power spectrum measurements. The depth of exclusion depends linearly on the length of the perpendicular mode in question, hence the wedge shape. The higher order line-of-sight modes with strongly diminished foreground contamination form the EoR window (e.g. Liu et al. 2014a,b). As is shown in Section 3, the most problematic feature of RFI seen in this analysis is that it provides very strong excess power in the EoR window. This contribution arises strictly from the intrinsic frequency structure of RFI. With this in mind, we will ignore the chromatic point spread function that leads to the foreground wedge in order to simplify the theoretical calculations. Moreover, concentrating on the spectral characteristics of the RFI makes the theoretical derivation equally applicable for all EoR power spectrum analyses (Morales et al. 2019).
We also choose to ignore the fine details of the bandpass response of the instrument, chromaticity of the primary beam, as well as chromatic errors in calibration. These effects are all deeply important in power spectrum estimation and, in general, not negligible (Barry et al. 2016; Beardsley et al. 2016; Patil et al. 2016; Ewall-Wice et al. 2017; Byrne et al. 2019). However, they are beyond the scope of the theoretical calculation. We will include the fact that an instrument possesses a limited field of view and is only sensitive to some range of radiation frequencies. Since it is highly relevant to this analysis and convenient to implement theoretically, we will also include a frequency tapering function for this range of frequencies. A frequency tapering function is commonly used in power spectrum estimation that gracefully limit foreground spillover into the EoR window (Thyagarajan et al. 2013) and ensure that a sufficiently narrow range of frequencies is used to prevent the influence of cosmological evolution (Morales & Hewitt 2004).
2.2 Calculating a power spectrum for point sources
Placing the source at phase tracking centre eliminates the k⊥ dependence in the power spectrum; thus, the cylindrically averaged power spectrum of a source at phase tracking centre will be constant along lines of constant k∥. Simulation results do not show a strong difference between power spectra of centred and off-centre RFI sources, particularly within the EoR window. For this reason, we will only show theoretical power spectra for sources at phase centre.
2.3 Imposing a frequency dependence
In general, RFI sources may take on a great variety of frequency profiles. Many transmissions only occupy a single channel in an EoR analysis, although some can be relatively wide compared to the observing band.
Two types of common and relatively broad RFI signals observed in EoR analyses include DTV and digital audio broadcasting (DAB). These types of signals are well approximated by top-hats in frequency over their allocation.4 The widths and locations of these profiles are usually subject to matters of protocol in their country of origin. For this work, we use the Western Australian 7-MHz-wide allocations, as is frequently observed by the MWA. For example, see Offringa et al. (2015), Sokolowski et al. (2016), and Wilensky et al. (2019).
A sufficiently narrow signal (the resolution of the analysis or finer) can be formally handled using a Dirac delta function frequency dependence. This renders the Fourier transform over frequency analytically straightforward. The result is a constant power on all k-modes.
Due to the relative ease of evaluating power spectra for narrow-band point sources, one may glean information about ensembles of narrow-band point sources using this analytic machinery. We relegate the precise mathematical details to Appendix A and summarize the results here. In principle, any two sources could destructively interfere for some mode in the power spectrum; however, any two sources will interfere constructively for modes perpendicular to the separation vector of the sources. The power of any ensemble for some given mode is bounded from above by the case of total constructive interference; the power is proportional to the square of the sum of the fluxes. The modes for which this can be achieved are intimately related to the angular distribution of the emitters. For a random ensemble drawn from a uniform distribution over the sky and whose true flux distribution is independent of its uniform angular distribution, the expected power (average over realizations) is bounded from below by an incoherent sum of powers; it is proportional to the quadrature sum of the fluxes, i.e. a linear sum of individual powers. For this particular distribution, baselines longer than the inverse width of the primary beam should observe relatively little coherence. Since this is the minimum length of a physical baseline, an angularly uniform distribution of RFI emitters are expected to add power incoherently for any mode in an estimated power spectrum, although specific realizations can defy this expectation. In conclusion, it is expected that adding many uniformly distributed RFI sources should tend to increase contamination relative to a single source in a predictable way. Therefore, if we can understand the strength of a single RFI source on a power spectrum measurement relative to the EoR, we can quantify the degree to which RFI needs to be mitigated to make an EoR detection feasible.
3 RESULTS
In this section, we apply the theoretical formalism in Section 2 to make rough predictions about the general behaviour and contamination levels of RFI in the power spectrum. In order to compare these predictions to realistic power spectra, we also simulate power spectra of RFI sources and sources from the GLEAM sky catalogue using the fhd6/εppsilon7 power spectrum pipeline (Sullivan et al. 2012; Barry et al. 2019a). Ultimately, we find that, in both simulation and theory, RFI provides orders-of-magnitude more contamination than typical point-like foreground sources within the EoR window. We do not estimate power spectra for diffuse foregrounds, which are substantial. Including these may make foregrounds more competitive with RFI on some modes.
3.1 Theoretical results for RFI power spectra
We specifically consider power spectra from sources with the following frequency dependences: flat broad-band, a single DTV channel, two simultaneous frequency-adjacent DTV channels (both received from the same position, e.g. from an aircraft reflection), and a narrow-band source. A flat, broad-band source emulates an astrophysical foreground source so that there is a baseline contamination level to which we can compare that afforded by band-limited RFI. For all sources, we numerically evaluate the integral in equation (4), with and without a Blackman–Harris tapering function. The results for a collection of 1-Jy sources at phase tracking centre are shown in Fig. 1. We used the same observing band as was used in the final limit in Barry et al. (2019b), centred on redshift 7 for the 21-cm line. Of striking significance is that the RFI sources universally provide dramatic contamination in the higher order line-of-sight modes in the power spectrum compared to the flat-spectrum (foreground) source due to sharp spectral cutoffs within the observing band.

Theoretical frequency structures and power spectra along the line of sight for various 1-Jy point sources. The top panels show the windowed (solid, narowband is dashed) and unwindowed (dotted, narrow-band is dash–dotted) frequency structure of each source in Jy, while the bottom panels show the corresponding power spectra of the windowed (bottom left-hand panel) and unwindowed (bottom right-hand panel) sources. The sources are placed at the phase centre of a highly idealized instrument detailed in Section 2, so there is no k⊥ dependence. For the broader sources, the unwindowed spectra are essentially identical to one another except in the lowest ordered k-modes. In the windowed case, we see that the flat-spectrum (foreground) source contamination has a sharp falloff starting at modes greater than ∼0.05 h Mpc−1. This is exactly the desired effect of the Blackman–Harris window function. However, the window function does not have nearly the same effect on the RFI sources, whose contamination remains many orders of magnitude greater than that of the foreground source. The narrow-band source is in the middle of the observing band, so the window function has no effect on the frequency structure. However, the power spectra differ because the effective cosmological volume probed in the frequency-tapered case is smaller than in the non-tapered case.
As mentioned in Section 2, a frequency tapering function is typically employed to reduce power leakage of foregrounds into the EoR window, thus boosting the dynamic range of the measurement. The desired effect is shown in Fig. 1, where the flat-spectrum source, which resembles an astrophysical foreground, displays a sharp falloff at higher k∥ relative to its unwindowed power. This effect occurs because the tapering function smoothly approaches zero at the boundaries of the observing band, eliminating sharp spectral cutoffs for this source (Harris 1978).
On the other hand, the RFI sources are band-limited, and so the tapering function fails to eliminate the sharp spectral cutoffs of the RFI sources. Comparing the power spectra of windowed and unwindowed RFI sources, we observe that the tapering function has little effect on overall contamination levels, in great contrast to the flat-spectrum source. Generically, as the bandwidth of a source approaches the bandwidth of the observing window, and thus more closely resembles a foreground source, the EoR window contamination is more effectively reduced by the frequency tapering function. However, we see that even in the case of the two-channel DTV event, which occupies nearly three-quarters of the band, the window function makes hardly any difference relative to the single-channel DTV event. Additional calculations not shown indicate that an emitter must occupy greater than 95 per cent of the observing band to appreciably close the several order-of-magnitude gap between the RFI and foreground contamination seen in Fig. 1. Some experimentation shows that different tapering functions may close this gap more effectively; however, this typically comes at the expense of the effective and crucial dynamic range boost offered by the Blackman–Harris window.
This suggests an interesting and very simple mitigation strategy for appropriately sized sources such as DTV RFI. If the observing band is set equal to the band allocated for the RFI source, then the tapering function will affect the RFI power spectrum in the same way the flat-spectrum source was affected in Fig. 1. Of course, any narrower RFI source within the same allocation will be relatively unaffected by this choice. While regional radio allocations are set so that particular types of signals are broadcast over certain frequencies, RFI other than the regionally allocated signals can be observed through several mechanisms, e.g. intermodulation products arising from non-linearities in the instrument signal path. Furthermore, the size of the observing band is subject to important constraints, including sensitivity requirements for EoR detection and the need to preserve the assumption of isotropy over the probed cosmological volume. Therefore, this particular strategy is extremely limited in scope, and should not be the primary mitigation strategy. The enormous contamination of RFI in the EoR window suggests that specialized post-excision mitigation strategies are needed if significant RFI sources are missed by excision algorithms. We specifically discuss the degree to which RFI sources need be mitigated in terms of allowable apparent flux density in Section 3.3.
The effectiveness of the tapering function on a narrow-band source is strictly a function of the location of that source within the observing band. The window function multiplicatively adjusts the constant power level provided by the source by the square of the value of the window function at the frequency of the source. Here, we have chosen to keep the narrow-band source at the centre of the window, so that we could see the full effect of an unmitigated narrow-band source.
There is also an interesting difference between the two-channel DTV event and the single-channel DTV event, which is that the single-channel event exhibits a lobed structure as a function of k∥, while the two-channel event does not. This is an interaction between the lobes of the Fourier transforms of the DTV signal and tapering function that depends on the location of the event within the observing band (Harris 1978). This effect is of little practical consequence, since the overall contamination levels are roughly identical regardless of the presence of lobes.
As mentioned in Section 2, these theoretical calculations ignore many important aspects of measuring EoR power spectra with radio interferometers. In order to check these predictions against realistic power spectra made from instrumental visibilities and a fully functioning analysis pipeline, we employed the simulation capabilities of the fhd/εppsilon pipeline. We also simulate the power spectrum of sources from the GLEAM catalogue, so that we can compare simulated RFI power spectra to a power spectrum of a sky full of foregrounds.
3.2 Simulated RFI power spectra
We show cylindrical power spectra for sources from the GLEAM catalogue and two different RFI sources in Fig. 2. In these simulations, we used the MWA Phase I (Tingay et al. 2013) as our example instrument, thus capturing a realistic baseline distribution and beam pattern for an interformeter. The beam is simulated using an average embedded element and mutual coupling model (Sutinjo et al. 2015), chosen at the central frequency of observation (about 178 MHz). This allows us to accurately depict some typical chromatic effects of an instrument and analysis pipeline. To simulate each RFI source, we modeled a single fictitious, phase-centred, flat-spectrum, 1-Jy source, and simulated visibilities using fhd. Using pyuvdata8 (Hazelton et al. 2017), we then modified the frequency structure of the fictitious source to match our desired RFI sources. We then passed these visibilities back through fhd to create the input healpix cubes (Górski et al. 2005) for εppsilon. The simulated GLEAM sources lie in the same observing field as was used in Barry et al. (2019b), which is a region of the sky with minimal bright sources and low sky temperature. As expected from our theoretical work, the narrow-band source has a nearly constant power spectrum, while the DTV source has a lobed structure along k∥ and relatively little structure along k⊥. The power spectrum of the simulated GLEAM catalogue demonstrates how foreground power is largely confined to the foreground wedge, with very little power escaping into the EoR window. Because the RFI sources are at phase centre, they do not exhibit any sort of foreground wedge. While an off-centre RFI source would not be immune to the effects that cause the foreground wedge, the amount of power leakage into higher line-of-sight modes from the wedge (an |$\sim 0.01\!-\!0.1{{\ \rm per\ cent}}$| effect in the foregrounds of Fig. 2a) is subdominant to the spillover that occurs from the band-limited nature of the RFI (an |$\sim 10{{\ \rm per\ cent}}$| effect for the DTV power spectrum in Fig. 2(c)). We can see that this spillover drastically contaminates the EoR window.

Cylindrical power spectra for simulations with GLEAM (left-hand panel), narrow-band (middle panel), and DTV (right) sources. The solid line marks the scales corresponding to the horizon, while the dashed line demarcates the scale of the primary field of view. Smooth-spectrum source contamination is predominantly contained within the foreground wedge, which extends everywhere below the solid line. Above the solid line is the EoR window. The structure seen in these plots generally match our theoretical expectations. The EoR window contamination of a single 1-Jy RFI source is substantially higher than what is included from GLEAM.
The cylindrical power spectra from Fig. 2, averaged over the k⊥-axis, are shown in Fig. 3, along with some additional simulated power spectra for comparison to Fig. 1. We can see that in a realistically simulated power spectrum, there is still a several order-of-magnitude discrepancy between the contamination of astrophysical foregrounds and that of RFI within the EoR window, thus confirming the earlier theoretical predictions. A natural line of investigation is to see what level of RFI is sustainable, given the high dynamic range required to measure the EoR signal in the power spectrum.

Simulated power spectra (averaged over k⊥) analogous to the theoretical spectra in Fig. 1, with an additional line for a simulated power spectrum of sources from the GLEAM catalogue. The RFI spectra show a strong correspondence with the theoretical prediction. The two-channel DTV power spectrum is seemingly smooth compared to the lobed single-channel power spectrum, although the difference in power is more pronounced at higher modes than theoretically predicted. The falloff for the 1-Jy foreground source is similar, although the theoretical lobed structure is not visible. There is still a large difference in contamination between the foreground source and the RFI sources. Moreover, even the contamination from the simulated GLEAM catalogue has several orders of magnitude less contamination than a single 1-Jy DTV or narrow-band source at high k∥.
3.3 How to develop an RFI budget for EoR detection
The final RFI budget for a given analysis depends on several qualities of the experiment. The type of RFI signal observed by the instrument as well as details about the RFI environment of the telescope, such as the number of emitters and their angular distribution, all affect the amount of RFI contamination. Furthermore, the cosmological modes of interest and actual strength of the EoR signal in those modes determines the amount of allowable contamination. To design a budget, the analyst must also decide on a model for the EoR. The final RFI budget designed by the analyst will of course be affected by the choice of model. An RFI power spectrum that is beneath the expected EoR signal by a significant amount for the modes of interest gives a budget for those modes.
In this work, we design some example budgets for different RFI realizations, adopting the fiducial model used in Barry et al. (2019b). This is a relatively optimistic model made using 21cm FAST (Mesinger, Furlanetto & Cen 2011) and an astrophysical parameterization from Park et al. (2019). We focus on modes between 0.1 and 2 |$h\, {\rm Mpc}^{-1}$|, which represents the foreground avoidance strategy employed in MWA EoR analyses. For these modes, the model predicts a signal strength of order |$\Delta ^2\sim 10\, {\rm mK}^2$|, where Δ2 = (k3/2π2)P(k). A foreground removal strategy might endeavour to measure the EoR on lower modes where some models expect a stronger EoR signal, which would affect the overall RFI budget compared to these examples.
3.3.1 Designing a budget for a single source
While these theoretical scaling relations are conceptually helpful, power spectrum measurements involve complicated pipelines that can alter the RFI power spectrum compared to these theoretical predictions. As shown in Section 3.2, there is not a strong discrepancy between the theoretical RFI power spectra and their simulation counterparts. However, for the purposes of drawing an accurate budget, it is better to compare the expected EoR signal to injected RFI signals processed by a 21-cm power spectrum pipeline. To this end, Fig. 4(a) shows simulated power spectra for two different 1-mJy RFI sources alongside the fiducial EoR model in this work. The simulation pipeline emulates the analysis used in Barry et al. (2019b), so we show similar modes as were used in that work. Fig. 4(b) shows the SIR for these two different cases as a function of k, along with a flat line at a hypothetical SIR threshold of 10, exemplifying a hypothetical maximum error tolerance of 10 per cent. Due to the different scaling of the RFI power spectra, one will arrive at different budgets depending on which types of RFI sources contaminate the measurement. For example, assuming errors greater than 10 per cent of the fiducial EoR model are unacceptable, a 1-mJy narrow-band source would be acceptable for modes |$k \lesssim 0.9 \, h\, {\rm Mpc}^{-1}$|, and a two-channel DTV source would pose no problems for the modes shown. On the other hand, if only 1 per cent errors or less are considered acceptable, a narrow-band source would exclude |$k \lesssim 0.5 \, h\, {\rm Mpc}^{-1}$|, and a 1-mJy two-channel DTV source would nearly saturate this error tolerance on all modes shown.

Panel (a): 1-mJy narrow-band (blue) and two-channel DTV (green) RFI power spectra, along with the fiducial EoR model from Barry et al. (2019b, black). We can see that even at an integrated flux density of 1 mJy, a narrow-band RFI source can compete with the EoR signal at higher k-modes. The DTV source has flatter scaling as a function of k, and so is substantially lower then the EoR model on all modes shown. Panel (b): the EoR signal to RFI power ratios (dashed) for the spectra shown in panel (a), along with a level at 10, indicating the minimum allowable signal-to-interference ratio (dotted) for a hypothetical budget. The SIR for the narrow-band source is below the threshold for modes |$k \gtrsim 0.9 \, h\, {\rm Mpc}^{-1}$|. This means that modes beyond this value are excluded by a 1-mJy narrow-band source in a budget drawn from this EoR model. In other words, a budget of a single narrow-band source of integration flux density equal to 1 mJy is sufficient if this EoR model is accurate, 10 per cent errors are considered acceptable, and desired measurements are on modes |$k \lesssim 0.9 \, h\, {\rm Mpc}^{-1}$|. On the other hand, the SIR for the DTV source is greater than 100 for most modes shown, indicating that a budget of a single 1-mJy two-channel DTV source in the final power spectrum integration is sufficient for the modes shown if only 1 per cent errors are acceptable and the EoR model is accurate. Alternatively, a 3-mJy DTV source will be acceptable if the error budget is 10 per cent, but this error budget will be closer to saturation on more modes than in the narrow-band case. We relate the quantities shown in this figure to what can be expected from ensembles of RFI sources and sources that appear intermittently in Section 3.3.2, highlighting the connection between snapshot and integration flux density of sources.
3.3.2 Including integration and ensemble effects
Another important consideration when developing an RFI budget is the nature of ensembles. Given the occupancy study in Wilensky et al. (2019), it is relatively likely that any given snapshot contains some extremely faint RFI of some variety. Therefore, a power spectrum integration consisting of many averaged snapshots probably contains an ensemble of RFI emitters. The results in Appendix A allow us to quantitatively relate the single-source and ensemble power spectra. We repeat some important results here.
With these effects in mind, we develop RFI budgets for some hypothetical ensembles, given the fiducial model shown in Fig. 4. We design the budgets so that they resemble what is shown in Fig. 4. Since RFI flagging is commonly performed on a per-snapshot basis, a per-snapshot budget may be more readily applicable than a full integration budget. To this end, we describe RFI budgets for a given ensemble in terms of an allowed snapshot flux density per source and an allowed integration flux density per source. The example RFI budgets and parameters that determine them are summarized in Table 1.
Number of sources . | Number of snapshots . | Coherence . | Occupancy (per cent) . | Snapshot flux density . | Integration flux density . |
---|---|---|---|---|---|
311 | 1029 | Coherent | 0.1 | 3.3 mJy | 3.2 μJy |
311 | 1029 | Incoherent | 0.1 | 58 mJy | 57 μJy |
1 | 10 000 | Coherent | 0.1 | 1 Jy | 1 mJy |
1 | 10 000 | Incoherent | 1 | 1 Jy | 100 μJy |
Number of sources . | Number of snapshots . | Coherence . | Occupancy (per cent) . | Snapshot flux density . | Integration flux density . |
---|---|---|---|---|---|
311 | 1029 | Coherent | 0.1 | 3.3 mJy | 3.2 μJy |
311 | 1029 | Incoherent | 0.1 | 58 mJy | 57 μJy |
1 | 10 000 | Coherent | 0.1 | 1 Jy | 1 mJy |
1 | 10 000 | Incoherent | 1 | 1 Jy | 100 μJy |
Notes. For narrow-band sources, these give 10 per cent or less fractional excess power relative to the fiducial EoR model for |$k \lesssim 0.9\, h$| Mpc−1. For two-channel DTV sources, these give 1 per cent or less fractional excess power on modes considered in this work. All occupancies and flux densities are expressed per individual RFI source. The integration flux density refers to the flux density of a source in the final integrated spectrum, while the snapshot flux density reflects how bright the source was in its original snapshot. Note that for the final row, 100 incoherent copies of the source appear in the final integration.
Number of sources . | Number of snapshots . | Coherence . | Occupancy (per cent) . | Snapshot flux density . | Integration flux density . |
---|---|---|---|---|---|
311 | 1029 | Coherent | 0.1 | 3.3 mJy | 3.2 μJy |
311 | 1029 | Incoherent | 0.1 | 58 mJy | 57 μJy |
1 | 10 000 | Coherent | 0.1 | 1 Jy | 1 mJy |
1 | 10 000 | Incoherent | 1 | 1 Jy | 100 μJy |
Number of sources . | Number of snapshots . | Coherence . | Occupancy (per cent) . | Snapshot flux density . | Integration flux density . |
---|---|---|---|---|---|
311 | 1029 | Coherent | 0.1 | 3.3 mJy | 3.2 μJy |
311 | 1029 | Incoherent | 0.1 | 58 mJy | 57 μJy |
1 | 10 000 | Coherent | 0.1 | 1 Jy | 1 mJy |
1 | 10 000 | Incoherent | 1 | 1 Jy | 100 μJy |
Notes. For narrow-band sources, these give 10 per cent or less fractional excess power relative to the fiducial EoR model for |$k \lesssim 0.9\, h$| Mpc−1. For two-channel DTV sources, these give 1 per cent or less fractional excess power on modes considered in this work. All occupancies and flux densities are expressed per individual RFI source. The integration flux density refers to the flux density of a source in the final integrated spectrum, while the snapshot flux density reflects how bright the source was in its original snapshot. Note that for the final row, 100 incoherent copies of the source appear in the final integration.
Now consider the data set in the Barry et al. (2019b) limit. Out of the original 1029 snapshots proposed for the integration, 311 snapshots were found to contain residual DTV interference. Assuming one source per contaminated snapshot, we analyse two example realizations of this ensemble below.
For the first example, let us assume this ensemble sums power incoherently. Following similar logic as in the preceding paragraphs (applying equation (15) and demanding an identical SIR), we find that if each emitter in this ensemble has a snapshot flux density as low as 58 mJy, corresponding to an integration flux density of 57 μJy, the ensemble SIR will resemble Fig. 4(b). We note that despite the ensemble power spectrum resembling that of a single 1-mJy source, the total RFI integration flux density in this incoherent ensemble is substantially higher at 17.6 mJy. This demonstrates how incoherent ensembles can allow more total integrated RFI flux density for the same SIR.
Alternatively, consider a situation where sources appear in consistent locations on the sky, such as reflections from regularly scheduled aircraft flying due north–south as in Wilensky et al. (2019). In this particular example, the sources will exhibit coherence for east–west modes. If east–west baselines are highly favored in the analysis, then there will be strong constructive interference in the power spectrum. In the worst-case scenario of perfect constructive interference, the total integration flux density budget of 1 mJy can be split over the 311 coherent sources, as in equation (14).10 This ensemble of sources will resemble Fig. 4(b) if the snapshot flux densities of each source are as low as 3.3 mJy, corresponding to an integration flux density of 3.2 μJy per source.11 In these examples, the total number of contaminated snapshots is quite high, and so the per-source allowable RFI brightness is quite low. Given the relative remoteness of the MWA, we expect comparable overall occupancy or worse in most radio telescopes. This low tolerance emphasizes the need to effectively filter RFI and mitigate its effects within EoR data sets.
The exact dilution factor is strongly dependent on the RFI environment, length of integration, and telescope operations (pointing schedule, etc.). Without knowing the exact source of RFI and its mechanism of arrival, there is a great deal of uncertainty in this dilution factor. Setting an accurate per-snapshot budget requires a detailed study of the RFI environment of an EoR telescope. Imaging of contaminated snapshots to understand RFI propagation mechanisms can help determine the consistency of different RFI emitters. This can help inform modifications to telescope operations and data cuts in order to prevent observation of consistent RFI in long integrations.
4 CONCLUSIONS
We used theoretical calculations and end-to-end simulations to investigate the level to which RFI occludes EoR detection. For simulation, we used the fhd/εppsilon pipeline, with the MWA as our simulated instrument. There was a strong correspondence between the theoretical contamination estimate and the simulated results, verifying the conceptual framework used in this analysis. We conclude that relatively low levels of RFI contamination are sufficient to overwhelm the EoR signal in the 21-cm power spectrum.
Specifically, simulations show that a single narrow-band source of 1-mJy apparent flux density in the final power spectrum integration offers excess Δ2 that is cubic in k, with greater than 10 per cent fractional errors relative to an optimistic EoR model for |$k \gtrsim 0.9 \, h\, {\rm Mpc}^{-1}$|. The contamination of this single-source scales quadratically as its flux density, and so a single narrow-band source of flux density 10 mJy can begin to overwhelm the EoR signal on modes |$k \gtrsim 0.3 \, h\, {\rm Mpc}^{-1}$|. This flux density reflects the brightness of the source after forming a power spectrum integration, rather than its brightness in a single snapshot. Depending on the observing strategy and consistency of the source, this may be a strong dilution relative to its snapshot flux density. However, for long power spectrum integrations, it is unlikely that only a single source will be present in the final integration, and so the average allowed apparent flux density of any one RFI source in the final integration may be substantially lower than this number, depending on how many sources are actually present in the measurement set. Averaged over realizations, a random angularly uniform ensemble of narrow-band sources is expected to add power incoherently. Furthermore, certain ensembles may add power coherently for certain modes. The power spectrum of a single source of a given flux density, e.g. 1 mJy, represents the maximum power spectrum of a coherent ensemble whose total flux density is the given flux density of that single source. Ultimately, this means that if the assumed optimistic EoR model used here is accurate, a total RFI apparent flux density budget of ≲1 mJy in the final power spectrum integration will be sufficient for attempting to measure modes |$k \lesssim 0.9 \, h\, {\rm Mpc}^{-1}$|, assuming a 1–10 per cent error budget depending on the type of RFI that dominates the ensemble.
This work assumes a foreground avoidance strategy, and so does not focus on lower order modes where the EoR signal may be stronger. If foregrounds can be successfully mitigated on these modes, it is possible that a less stringent RFI budget can be deduced. On the other hand, the EoR model used is relatively optimistic, and a more pessimistic EoR model will produce a stricter budget.
The size of contamination relative to the faintness of the offending RFI source helps explain the noticeable general improvement of the 21-cm power spectra made by removing RFI-contaminated observations in Barry et al. (2019b). Furthermore, we expect the results of this analysis to be fairly generic between experiments, particularly predictions for narrow-band RFI contamination and any contamination levels within the EoR window. This strongly motivates quantifying the effectiveness of RFI excision implementations, increasing their sensitivity if necessary, and even possibly implementing post-excision mitigation strategies.
ACKNOWLEDGEMENTS
We thank the anonymous referee for providing very helpful comments during review. We also thank the development teams of pyuvdata, numpy, scipy, matplotlib, fhd, and εppsilon, which enabled this work. This work was directly supported by NSF grants AST-1643011, AST-1613855, OAC-1835421, and AST-1506024. NB is supported by the Australian Research Council Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), through project number CE170100013. This scientific work makes use of the Murchison Radio-astronomy Observatory, operated by CSIRO. We acknowledge the Wajarri Yamatji people as the traditional owners of the Observatory site. Support for the operation of the MWA is provided by the Australian Government (NCRIS), under a contract to Curtin University administered by Astronomy Australia Limited.
DATA AVAILABILITY
No new data were generated or analysed in support of this research.
Footnotes
This lets one equate the transverse and line-of-sight comoving distances. See Hogg (1999).
Examples of technical standards for such signals can be searched for at http://www.etsi.org/standards-search. DTV broadcasts may be referred to as digital video broadcasting (DVB) in such manuals.
This is nearly exact in the theoretical unwindowed case, where a suitable approximation using the sinc function and its derivative describes the Fourier transform of the DTV brightness, implying the leading order term in the power spectrum goes as k−2.
This is mathematically equivalent to considering it as a single source with dilution equal to 311/1029, similar to the very first example ensemble in the preceding paragraphs.
Given that the effect of RFI removal was noticeable at the 104 mK2 level, the brightest sources found by ssins in this data set must have been substantially brighter than these hypothetical figures.
REFERENCES
APPENDIX A:
NARROW-BAND POINT SOURCE ENSEMBLES
Note that in equation (A7), other than the assumption of independence of the angular position, we have made no assumptions about the nature of the flux distribution of the ensemble. In principle, one could inform this distribution using studies such as Offringa et al. (2013) and Sokolowski et al. (2016). This would allow one to specifically relate the average flux squared (incoherent term) to the average flux of the ensemble, and, in turn, set an average allowed apparent flux of individual RFI sources within an integration.