-
PDF
- Split View
-
Views
-
Cite
Cite
N. Bourne, S. J. Maddox, L. Dunne, R. Auld, M. Baes, I. K. Baldry, D. G. Bonfield, A. Cooray, S. M. Croom, A. Dariush, G. de Zotti, S. P. Driver, S. Dye, S. Eales, H. L. Gomez, J. González‐Nuevo, A. M. Hopkins, E. Ibar, M. J. Jarvis, A. Lapi, B. Madore, M. J. Michałowski, M. Pohlen, C. C. Popescu, E. E. Rigby, M. Seibert, D. J. B. Smith, R. J. Tuffs, P. van der Werf, S. Brough, S. Buttiglione, A. Cava, D. L. Clements, C. J. Conselice, J. Fritz, R. Hopwood, R. J. Ivison, D. H. Jones, L. S. Kelvin, J. Liske, J. Loveday, P. Norberg, A. S. G. Robotham, G. Rodighiero, P. Temi, Herschel‐ATLAS/GAMA: a census of dust in optically selected galaxies from stacking at submillimetre wavelengths, Monthly Notices of the Royal Astronomical Society, Volume 421, Issue 4, April 2012, Pages 3027–3059, https://doi.org/10.1111/j.1365-2966.2012.20528.x
- Share Icon Share
Abstract
We use the Herschel‐ATLAS survey to conduct the first large‐scale statistical study of the submillimetre properties of optically selected galaxies. Using ∼80 000 r‐band selected galaxies from 126 deg2 of the GAMA survey, we stack into submillimetre imaging at 250, 350 and 500 μ m to gain unprecedented statistics on the dust emission from galaxies at z < 0.35. We find that low‐redshift galaxies account for 5 per cent of the cosmic 250‐μm background (4 per cent at 350 μ m; 3 per cent at 500 μ m), of which approximately 60 per cent comes from ‘blue’ and 20 per cent from ‘red’ galaxies (rest‐frame g−r). We compare the dust properties of different galaxy populations by dividing the sample into bins of optical luminosity, stellar mass, colour and redshift. In blue galaxies we find that dust temperature and luminosity correlate strongly with stellar mass at a fixed redshift, but red galaxies do not follow these correlations and overall have lower luminosities and temperatures. We make reasonable assumptions to account for the contaminating flux from lensing by red‐sequence galaxies and conclude that galaxies with different optical colours have fundamentally different dust emission properties. Results indicate that while blue galaxies are more luminous than red galaxies due to higher temperatures, the dust masses of the two samples are relatively similar. Dust mass is shown to correlate with stellar mass, although the dust‐to‐stellar mass ratio is much higher for low stellar mass galaxies, consistent with the lowest mass galaxies having the highest specific star formation rates. We stack the 250 μ m‐to‐NUV luminosity ratio, finding results consistent with greater obscuration of star formation at lower stellar mass and higher redshift. Submillimetre luminosities and dust masses of all galaxies are shown to evolve strongly with redshift, indicating a fall in the amount of obscured star formation in ordinary galaxies over the last four billion years.
1 INTRODUCTION
Dust in galaxies represents only a tiny fraction of the mass density of the Universe,1 yet from an observational point of view it can provide some of the most important indications of the history of star formation. This is possible because most stars form in dense clouds of gas and dust. The dust in these regions is heated as it absorbs ultraviolet (UV) radiation from the hot massive stars that form within, and re‐radiates the energy as a modified blackbody (or greybody) with a characteristic temperature generally around 20–30 K. Measurement of the UV radiation from these massive stars is the most direct method of inferring the star formation rate (SFR), although as with all SFR indicators this relies on estimating the rate of massive star formation from the UV and assuming an initial mass function (IMF) to infer the total SFR. Dust poses a problem to this method since the UV attenuation must be accounted for, and this will vary from one star‐forming region to another as well as being dependent on the inclination angle that the galaxy disc is observed at. Observing the thermal emission of dust with far‐infrared (FIR) and submillimetre (submm) telescopes provides a way to trace the UV radiation field in all parts of the interstellar medium (ISM) of a galaxy, since in general the ISM is optically thin to FIR wavelengths. Hence, one can use the UV and FIR in tandem to measure the (massive) SFR in the unobscured and obscured phases, respectively.
A further complication arises from the fact that dust also exists in the large‐scale ISM, in regions not associated with star formation. Indeed this component forms the bulk of the dust mass in a galaxy, and in spiral galaxies can dominate the bolometric output in the FIR (Helou 1986; Dunne & Eales 2001; Sauvage, Tuffs & Popescu 2005; Draine et al. 2007). Because this ISM component is heated by the general interstellar radiation field (ISRF), its FIR emission is not necessarily correlated with the SFR. The extent to which both the UV and the FIR luminosities can be used to trace SFRs can only be understood by studying the full spectral energy distribution (SED) of galaxies from the UV to the FIR and submm.
We know that the mid‐IR (MIR)/FIR luminosity density of the Universe increases towards higher redshifts, as a result of increased star formation activity and increased dust content (Blain et al. 1999; Franceschini et al. 2001; Dunne, Eales & Edmunds 2003; Le Floc’h et al. 2005). Recent analyses of the submm luminosity function (LF; Eales et al. 2009, 2010a; Dye et al. 2010; Dunne et al. 2011, hereafter D11) with the Balloon‐borne Large Aperture Submillimeter Telescope (BLAST; Devlin et al. 2009) and the Herschel Space Observatory (Pilbratt et al. 2010) reveal strong evolution up to a redshift of at least ∼0.5, so there must be a substantial increase in the numbers and/or luminosities of dusty star‐forming galaxies as we look back in cosmic time. In this paper, we ask the question: what are the properties of dust in galaxies that are not selected to be dusty, and is there an evolution in their dust content with redshift equivalent to that seen in Herschel‐selected galaxies?
Galaxies in the Universe comprise an extremely varied population, with a wide range of different properties. The galaxies that we will concentrate on in this paper are the quintessential Hubble tuning fork types, both spirals and ellipticals, that comprise the majority of galaxies selected in optical surveys (e.g. Driver et al. 2006). We make no prior selection with respect to dust content or FIR luminosity, but it may be expected that the typical galaxies sampled are quiescent in nature, and are not undergoing excessive starburst or nuclear activity (as in typical FIR‐selected samples from IRAS or Spitzer). This sample may have more in common with the low‐redshift population in the Herschel‐Astrophysical Terahertz Large Area Survey (H‐ATLAS) sample selected at 250 μ m, which typically consists of optically luminous (Mr≲−20), blue (NUV−r < 4.5) galaxies (Dariush et al. 2011; D11), but unlike H‐ATLAS this sample will not be biased towards dusty galaxies in any way.
Most large statistical studies of the FIR/submm properties of FIR‐faint galaxies selected by their stellar light have focused on high redshift samples selected in the near‐IR (NIR; Zheng et al. 2006; Takagi et al. 2007; Serjeant et al. 2008; Marsden et al. 2009; Greve et al. 2010; Oliver et al. 2010b; Viero et al. 2010; Bourne et al. 2011). Studies of the FIR/submm properties of samples of normal galaxies at low/intermediate redshifts have been restricted to small sample sizes and most have therefore focused more on individual galaxies than populations (e.g. Popescu et al. 2002; Tuffs et al. 2002; Leeuw et al. 2004; Stevens, Amure & Gear 2005; Vlahakis, Dunne & Eales 2005; Cortese et al. 2006; Stickel, Klaas & Lemke 2007; Savoy, Welch & Fich 2009; Temi, Brighenti & Mathews 2009). This is simply because deep submm imaging of large areas of sky is necessary to cover a large enough sample of low‐redshift galaxies for statistical analysis. Until very recently, such data have not been available. Observations in the submm, over the Rayleigh–Jeans tail of the dust SED at >rsim 200 μ m, are crucial for constraining the mass of cold dust in the ISM of galaxies, since FIR studies using IRAS at ≲ 100 μ m were only able to constrain the more luminous but less massive contribution from warm dust in star‐forming regions (Dunne & Eales 2001).
H‐ATLAS (Eales et al. 2010a) is the first truly large area submm sky survey, and as such is ideal for this work. It is the largest open‐time key project on the Herschel Space Observatory and will survey 550 deg2 in five channels centred on 100, 160, 250, 350 and 500 μ m, using the PACS (Poglitsch et al. 2010) and SPIRE (Griffin et al. 2010) instruments. In this study we use SPIRE maps of the three equatorial fields in the Phase 1 Data Release, which cover 135 deg2 centred at RA of 9h, 12h and 14.5h along the celestial equator. We are currently unable to use the H‐ATLAS PACS maps for stacking due to uncertainties in the flux calibration at low fluxes; hence, this will be pursued in a follow‐up paper.
For our galaxy sample we make use of UV, optical and NIR data from the Galaxy and Mass Assembly (GAMA) survey (Driver et al. 2009) which overlaps with the H‐ATLAS equatorial fields at Dec. > −1°.0 in the 9h field and Dec. > −2°.0 in the other fields (Fig. 1). The GAMA survey combines optical data from the Sloan Digital Sky Survey (SDSS DR6; Adelman‐McCarthy et al. 2008), NIR data from the UKIRT Infrared Deep Sky Survey (UKIDSS) Large Area Survey (LAS DR4; Lawrence et al. 2007), and UV from the Galaxy Evolution Explorer (GALEX; Morrissey et al. 2005), with redshifts measured with the Anglo–Australian Telescope and supplemented with existing redshift surveys (see Driver et al. 2011 for further details).

Comparison of the H‐ATLAS (blue) and GAMA (red) coverage in the three equatorial fields. The dotted lines show the H‐ATLAS tiles which are not covered in the current data; these will be included in a future release. The sample used in this work lies within the overlapping area between current H‐ATLAS and GAMA coverage, approximately 126 deg2.
In this paper, we select galaxies in the r band, bin by their stellar properties derived from the UV–NIR, and investigate their dust properties in the submm. We employ a stacking technique to recover median submm fluxes without being limited by detection limits in the H‐ATLAS images. The optical data and sample selection are described in Section 2, and the submm imaging and stacking procedures are described in Section 3, although some more technical aspects of the stacking algorithm are left to Appendix A. In Section 4, we present the results of stacking as a function of redshift, colour, mass and luminosity, and we discuss implications for the sources of the extragalactic background. In this section, we also derive rest‐frame luminosities by inspecting the stacked SEDs, and investigate the effects of alternative binning schemes on our results. Section 5 contains a more in‐depth discussion of some of the implications of the stacking results with respect to dust luminosity, temperature and mass. Final conclusions are summarized in Section 6. Throughout this work, we assume a cosmology of ΩΛ= 0.7, ΩM= 0.3, and H0= 70 km s−1 Mpc−1. All celestial coordinates are expressed with respect to the J2000 epoch.
2 OPTICAL DATA
2.1 Sample selection
We base our selection function on the GAMA ‘Main Survey’ (Baldry et al. 2010), selecting objects from the GAMA catalogue which are classified as galaxies by morphology and optical/NIR colours, and are limited in magnitude to rpetro < 19.8 or (zmodel < 18.2 and rmodel < 20.5) or (Kmodel < 17.6 and rmodel < 20.5).2 In fact only 0.3 per cent of the sample have rpetro > 19.8, so the sample are effectively r selected. To simplify the selection function we use the same selection in all fields, so we go below the GAMA ‘Main Survey’ cut of rpetro < 19.4 in the 9h and 15h fields. For each galaxy we have matched‐aperture Kron photometry in nine bands: ugriz from SDSS and YJHK from UKIDSS‐LAS, plus FUV and NUV photometry from GALEX. More details of the GAMA photometry can be found in Hill et al. (2011). All magnitudes used in this paper are corrected for galactic extinction using the reddening data of Schlegel, Finkbeiner & Davis (1998) and are quoted in the AB system. The Kron magnitudes from Hill et al. (2011) are used for the colours described in this section; however, we use the Petrosian measurements for absolute magnitudes Mr. We purposely do not apply dust corrections based on the UV–NIR SED or optical spectra, because we want to study dust properties as a function of empirical properties, excluding as much as possible any bias or prejudice to the expected dust content.
We use spectroscopic redshifts from GAMA (year 3 data) where they are available and reliable [flagged with Z_QUALITY (nQ) ≥ 3]. These are supplemented with photometric redshifts computed from the optical–NIR photometry using annz (for more details see Smith et al. 2011a). The comparison of photometric and spectroscopic redshifts is shown in Fig. 2. For this work we apply an upper limit in redshift of 0.35, because the number of good spectroscopic and photometric redshifts drops rapidly beyond this point. This means that the redshift bins would have to be made wider to sample the same number of sources (hence diluting any sign of evolution); it also means that we only sample the brightest objects at the highest redshifts and their properties may not be comparable to the typically fainter objects sampled at lower redshift. We use photometric redshifts with relative errors δz/z < 20 per cent only, which excludes most of the poor matches in Fig. 2. In this way we exclude 8 per cent of the whole sample at z < 0.35 (7 per cent of zphot < 0.35), which means that the sample is still almost completely r‐band limited. The limiting redshift error translates to a 20 per cent error on luminosity distance, a 40 per cent error on stellar mass, and an absolute magnitude error of 0.3. The criterion tends to exclude lower redshift objects, leading to a relative paucity of photometric redshifts at z≲ 0.2. This does not pose a problem since there is near complete spectroscopic coverage at these lower redshifts. Overall, 90 per cent of the redshifts we use are spectroscopic, although the photometric fraction does increase with redshift out to z= 0.35. The histograms of spectroscopic and photometric redshifts are shown in Fig. 3. We tested the effect of random photometric redshift errors on our results by perturbing each photometric redshift by a random shift drawn from a Gaussian distribution with width σ=δz. After making these perturbations, we made the same cuts to the sample and repeated all the analysis, and found that all stacked results were robust, changing by no more than the error bars that we show.

Comparison between spectroscopic (nQ≥ 3) and photometric redshifts for the objects in our sample which have both. The black points have photometric redshifts with relative errors <20 per cent, while the orange points have greater errors. Using this limiting error and a limiting redshift of 0.35 (dashed lines) ensures a reliable set of photometric redshifts for our purposes.

Histogram of redshifts available in the catalogue: spectroscopic with nQ≥ 3 (black line); photometric with δz/z≤ 0.2 (red line); and all redshifts combined (grey‐shaded bars). In constructing the grey histogram, we take all the spectroscopic redshifts in the black histogram and add any additional photometric redshifts from the red.
We note that a substantial number of photometric redshifts at z > 0.3 appear to be biased low in Fig. 2. This explains why there appear to be more photometric redshifts than ‘all’ redshifts at 0.3 < z < 0.35 in Fig. 3– i.e. some of those objects have spectroscopic redshifts which are greater than 0.35 and hence do not appear in the same bin in the ‘all redshifts’ histogram. This issue could potentially affect the results in our highest redshift bin (z > 0.3); the effect would be to contaminate that bin with galaxies from a slightly higher redshift, which may complicate any evolutionary trends seen across the z= 0.3 boundary. We have chosen to leave the bin in our analysis because over 70 per cent of its galaxies have reliable spectroscopic redshifts, so the effect of a biased minority of photometric redshifts is considered to be small (and ultimately none of our conclusions hinge on this bin alone).
In total we have a sample of 86 208 optically selected galaxies with good spectroscopic or photometric redshifts within the 126 deg2 overlapping area of the SPIRE masks and the GAMA survey. We calculated k‐corrections for the UV–NIR photometry using kcorrect v4.2 (Blanton & Roweis 2007), with the spectroscopic and photometric redshifts described above. The final component of the input catalogue is the set of stellar masses from Taylor et al. (2011), which were computed by fitting Bruzual & Charlot (2003) stellar population models to the GAMA ugriz photometry, assuming a Chabrier (2003) IMF.3 Altogether we have stellar masses estimated for 90 per cent of the sample. The reason that 10 per cent are missing is that our sample reaches deeper than the GAMA Main Survey in two of the fields: we use the same magnitude limit in all three fields so that we can sample as large a population as possible.
2.2 Colour classifications and binning
A simple way to divide the sample in terms of stellar properties is to make a cut in rest‐frame optical colours. The bands which have good signal‐to‐noise ratio data for the whole sample are the three central SDSS bands; hence, the most reliable and complete optical colours to use are g−r, g−i and r−i. We found very little difference between the distributions of colours in any of these three alternatives; each appears equally effective at defining the populations of galaxy colours. We chose to use g−r since these bands have the best signal‐to‐noise ratio hence greatest depths, and we plot the colour–magnitude diagram (CMD) in Fig. 4.

Two‐dimensional histogram (CMD) of rest‐frame g−r and Mr for the full sample. Contours mark the log10 of the histogram density function, weighted with the 1/Vmax method to account for incompleteness as described in the text. Contours are smoothed by a Gaussian kernel with width equal to 0.8 of the bin width (1/50 of the range in each axis). The heavy black line is the red sequence fit given by equation (1). The two dashed lines mark the boundaries between red/green and green/blue classifications, which are given by equation (2).
In this figure, the colour–magnitude space is sampled by a two‐dimensional histogram in which the number density in each bin is weighted by (Schmidt 1968) to correct for the incompleteness of a flux‐limited sample. To achieve this we weighted each galaxy by the ratio of the volume of the survey (the comoving volume at z= 0.35) to the comoving volume enclosed by the maximal distance at which that galaxy could have been detected and included in the survey. To measure the latter, we use the rpetro limit which is the primary limiting magnitude of the sample (a negligible proportion of sources that were selected by z or K have fainter rpetro magnitudes). Our redshift limit of 0.35 was also considered (so no Vmax is greater than the comoving volume at z= 0.35).


Histograms of rest‐frame g−r colours split into eight Mr bins between −23 and −15 (weighted with the 1/Vmax method). Overlaid are the two Gaussian functions which were simultaneously fitted to each histogram, representing the blue and red populations, as well as the sum of the functions.



Histogram of rest‐frame, slope‐corrected g−r colours across −23 < Mr < −18 (weighted with the 1/Vmax method). Overlaid are the two Gaussian functions fitted to the histogram as well as the sum of the functions. The dashed vertical lines show the boundaries of the colour bins described in the text.
A limitation of optical colours such as g−r is a small separation in colour space between the red and blue populations. This would be improved by using a pair of bands which straddle the 4000 Å spectral break, but the only Sloan band bluewards of this is u, which has poor signal‐to‐noise ratio and therefore a relatively shallow magnitude limit. It has been shown in the literature (e.g. Yi et al. 2005; Wyder et al. 2007; Cortese & Hughes 2009) that a UV–optical colour such as NUV−r (or UV–NIR such as NUV−H) provides greater separation between red and blue and reveals a third population of galaxies in the green valley. A clear delineation of the populations would minimize contamination between the bins and should help to disambiguate trends in stacked results.




Two‐dimensional histogram (CMD) of rest‐frame NUV−r and Mr for the full sample. The contours mark the log10 of the histogram density function, again using the 1/Vmax method to account for incompleteness as described in the text. Contours are smoothed by a Gaussian kernel with width equal to one bin (1/40 of the range in each axis). The noise in the top right‐hand part is a result of a small amount of data close to the NUV detection limit having exceptionally high weights. The heavy black line is the red‐sequence fit to our data given by equation (3). The two dashed black lines mark the boundaries between red/green and green/blue classifications, which are given by equation (4).
In this paper we do not explicitly attempt to distinguish passive red galaxies from obscured, star‐forming red galaxies; rather we focus on the submm properties as a function of observed optical colours. We therefore may expect a somewhat mixed population in the red (and green) bins, even using NUV−r. There are various ways one might attempt to overcome this – applying dust corrections based on UV photometry or spectral line indices, or using the Sérsic index to predetermine the expected galaxy ‘type’– however, we opt to avoid biasing any of our results by any prior assumption about the nature of galaxies in each bin.4 We leave any analysis that accounts for morphology or spectral properties to a follow‐up paper.
3 SUBMILLIMETRE DATA AND STACKING
3.1 Stacking into the SPIRE maps
For the submm imaging, we use SPIRE images at 250, 350 and 500 μ m of the three equatorial GAMA fields in H‐ATLAS, which were made in a similar way to the science demonstration maps described by Pascale et al. (2011). The fields consist of 53.25 deg2 at 9h, 27.37 deg2 at 12h and 53.93 deg2 at 14.5h. Background subtraction was carried out using the nebuliser routine developed by Irwin (2010) which effectively filters out the highly varying cirrus emission present in the H‐ATLAS maps, as well as extended background emission including the Sunyaev–Zel’dovich effect in clusters and unresolved clustered sources at high redshift.
All sources are treated as point sources and flux densities (hereafter ‘fluxes’) are measured in cut‐outs of the map around each optical position, convolved with a point response function (PRF). We account for subpixel scale positioning by interpolating the PRF from the point spread function (PSF)5 at a grid of pixels offset from the centre by the distance between the optical source centre and the nearest pixel centre. This convolution with the PRF is the standard technique to obtain the minimum‐variance estimate of a point source’s flux (Stetson 1987). The PSFs at 250, 350 and 500 μ m have full widths at half‐maximum (FWHMs) equal to 18, 25 and 35 arcsec, respectively.
We then measure fluxes in the maps at the positions in the optical catalogue, and use simplifying assumptions to account for blended sources which would otherwise lead to overestimation of flux in the stacks. The procedure is described in detail in Appendix A. This automatically corrects stacked fluxes for the effect of clustered sources in the bins, with the caveat that sources not in the catalogue (i.e. below the optical detection limits) cannot be deblended. We stack fluxes by dividing the sample into three colour bins (as described in the previous section), then splitting each into five bins of redshift, then six bins of absolute magnitude. Redshift and magnitude bins are designed to each contain an approximately equal number of objects; in this way, we ensure that the sample is evenly divided between the bins to maintain good number statistics in each. All three fields are combined in each stack. We choose to use the median statistic to represent the typical flux, since the median value with a suitable error bar is a robust representation of the distribution of fluxes in a given bin. Unlike the mean, the median is insensitive to bias due to exceptionally bright sources which are outliers from the general population (e.g. White et al. 2007).
We also measure the background in the maps, since although they have been sky subtracted to remove the highly variable cirrus foreground emission, the overall background does not average to zero, and therefore has a significant contribution to stacked fluxes. In each map, we create a sample of 100 000 random positions within the region covered by our optical catalogue, masking around each source in the main stacking catalogue with a circle of radius equal to the beam FWHM in order to avoid including the target positions in the background stack. We then perform an identical stacking analysis at these sky positions, but reject any positions where we measure a flux greater than 5σ. This prevents a bias of our background measurement from sources detected in the SPIRE maps which have not been masked because they are not in the GAMA catalogue. The stacked flux measured in this way is a reasonable estimate of the average background flux in the corresponding map, and is subtracted from our stacked fluxes prior to further analysis. The average background levels are 3.5 ± 0.1, 3.0 ± 0.1 and 4.2 ± 0.2 mJy beam−1 in the 250‐, 350‐ and 500‐μm bands, respectively.
Fluxes measured in the SPIRE maps are calibrated for a flat νSν spectrum (Sν∝ν−1), whereas thermal dust emission longwards of the SED peak will have a slope between ν0 and ν2 depending on how far along the Rayleigh–Jeans tail a given waveband is. The SPIRE Observers’ Manual6 provides colour corrections suitable for various SED slopes, including the ν2 slope appropriate for bands on the Rayleigh‐Jeans tail. This is a suitable description of the SED in each of the SPIRE bands at low redshift, and we therefore modify fluxes by the colour corrections for this slope: 0.9417, 0.9498 and 0.9395 in the three bands, respectively. At increasing redshifts, however, a cold SED can begin to turn over in the observed‐frame 250‐μm band. From inspection of single‐component SEDs fitted to our stacks, we estimate that the SPIRE points in most of our bins fall on the Rayleigh–Jeans tail, although at the highest redshifts slopes can be as flat as ν0 at 250 μ m and ν1.5 at 350 μ m. The corresponding colour corrections are 0.9888 and 0.9630, respectively. We tried applying these corrections to the highest redshift bins and found no discernable difference from any of our stacked results; hence, our results are not dependent on the colour correction assumed.
3.2 Simulations
The stacking procedure that we used was tested on simulated maps to ensure that we could accurately measure faint fluxes when stacking in confused maps with realistic noise and source density. As described in Appendix A, we were able to accurately reproduce median fluxes and correct errors, although fluxes of individual sources could be under‐/over‐estimated if they were blended with a fainter/brighter neighbouring source.
In addition we simulated various distributions of fluxes to test that the median measured flux is unbiased and representative when stacking faint sources close to and below the noise and confusion limits. The results of these simulations indicate that the median can be biased in the presence of noise (see also White et al. 2007). We show details of the simulations in Appendix B. In summary, we assume a realistic distribution of fluxes described by dN/dS∝S−2, Smin < S < Smax; R= log 10(Smax/Smin) = 1.3, and for this we estimate the amount of bias in the measured median as a function of the true median flux, and correct our stacked fluxes for this bias. Correction factors are all in the range 0.6–1.0, and the effect is greatest for low fluxes (≲10 mJy). If we consider the true median to be representative of the typical source in any bin, then relative to this, the bias in the measured median is always less than or equal to the ‘bias’ in the mean resulting from extreme values (as we explain in Appendix B). We tested the sensitivity of the results to assumptions about the flux distribution, and found that although the level of bias does depend on the limits and slope of the distribution, all of our measured trends remain significant and all conclusions remain valid for any reasonable choice of distribution. This is equally true if we make no correction to the measured median.
We also tested the correlations found in the data by simulating flux distributions with various dependencies on redshift, absolute magnitude and colour. We first made simulations in which fluxes were randomized with no built‐in dependencies but with the same scatter as in the real data, and saw that stacked results were equal in every bin (as expected). We tried simulations in which fluxes varied with redshift as a non‐evolving LF (simply varying as the square of the luminosity distance, modified by the k‐correction), with realistic scatter, and also as an evolving LF (log flux increasing linearly with redshift at a realistic rate), also with scatter. We also allowed flux to vary linearly with optical colour and logarithmically with Mr, again including realistic scatter. In all simulations we found that we were able to recover the input trends by stacking.
Finally we simulated fluxes in the three SPIRE bands to produce a randomized distribution of submm colours (S250/S350, etc.) with scatter similar to that in the data but with no correlations built in. Results showed that no artificial correlations were introduced by stacking.
These results indicate that the correlations detected in the real data (described in the following chapter) should be true representations of the intrinsic distributions in the galaxy population, and are not artefacts created by the stacking procedure.
3.3 Errors on SPIRE fluxes
Errors on stacked fluxes are calculated in two different ways. First, we estimate the instrumental and confusion noise on each source and propagate the errors through the stacking procedure. We estimate the instrumental noise by convolving the variance map at the source position with the same PRF used for the flux measurement. To this we add in quadrature a confusion noise term, as in Rigby et al. (2011). Since fluxes are measured by filtering the map with a kernel based on the PSF (see Appendix A), we need to use the confusion noise as measured in the PSF‐filtered map. Pascale et al. (2011) measured confusion noises of 5.3, 6.4 and 6.7 mJy per beam in the PSF‐filtered H‐ATLAS Science Demonstration Phase (SDP) maps. We estimate that the confusion noise is at a similar level in our maps after PSF filtering, by comparing the total variance in random stacks on the background to the average instrumental noise described above. Hence, we combine these values of confusion noise with the measured instrumental noise of each source. In each stack the mean of these measured noises, divided by the square root of the number of objects stacked, and combined with the error on background subtraction, gives the ‘measurement error’ (σN) in Table 1.
Results of stacking in bins of g−r colour, redshift, and absolute magnitude (Mr). Columns are as follows: colour bin C= B (blue), G (green), R (red); median redshift z in bin (approximate z bin boundaries are 0.01, 0.12, 0.17, 0.22, 0.28, 0.35); median Mr; count N in the bin. The following columns for each of the three SPIRE bands: background‐subtracted flux S in mJy; signal‐to‐noise ratio S/σN; measurement error σN (mJy) computed from the mean variance of positions in the stack, divided by , and including the error on background subtraction; statistical error on the median flux (σS, mJy) following Gott et al. (2001); KS probability that the distribution of fluxes in each bin is the same as that at a set of random positions; and median k‐correction K′=K(z)/(1 +z) in the bin. This is a sample of the full table, which is available as Supporting Information with the online version of the article.
C | z | Mr | N | S250 | S/N250 | σN, 250 | σS,250 | KS250 | S350 | S/N350 | σN,350 | σS,350 | KS350 | S500 | S/N500 | σN,500 | σS,500 | KS500 | ![]() | ![]() | ![]() |
B | 0.11 | −21.1 | 1567 | 48.69 | 286.4 | 0.17 | 1.15 | 0 | 23.15 | 178.1 | 0.13 | 0.58 | 0 | 7.11 | 59.3 | 0.12 | 0.41 | 0 | 0.77 | 0.71 | 0.67 |
B | 0.10 | −20.1 | 1568 | 15.72 | 104.8 | 0.15 | 0.46 | 0 | 7.70 | 70.0 | 0.11 | 0.30 | 0 | 3.01 | 30.1 | 0.10 | 0.28 | 0 | 0.77 | 0.71 | 0.68 |
B | 0.10 | −19.6 | 1567 | 8.69 | 62.1 | 0.14 | 0.31 | 0 | 5.02 | 45.6 | 0.11 | 0.31 | 0 | 2.00 | 20.0 | 0.10 | 0.29 | 3E–39 | 0.77 | 0.72 | 0.68 |
B | 0.09 | −19.2 | 1568 | 5.60 | 43.1 | 0.13 | 0.34 | 0 | 3.33 | 33.3 | 0.10 | 0.31 | 0 | 1.63 | 16.3 | 0.10 | 0.30 | 5E–21 | 0.79 | 0.74 | 0.70 |
B | 0.08 | −18.6 | 1567 | 4.24 | 35.3 | 0.12 | 0.24 | 0 | 2.60 | 26.0 | 0.10 | 0.36 | 9E−43 | 1.42 | 14.2 | 0.10 | 0.30 | 1E–17 | 0.82 | 0.78 | 0.75 |
B | 0.04 | −17.5 | 1567 | 2.82 | 23.5 | 0.12 | 0.25 | 0 | 2.16 | 21.6 | 0.10 | 0.28 | 1E−26 | 1.40 | 14.0 | 0.10 | 0.27 | 7E–14 | 0.90 | 0.87 | 0.85 |
C | z | Mr | N | S250 | S/N250 | σN, 250 | σS,250 | KS250 | S350 | S/N350 | σN,350 | σS,350 | KS350 | S500 | S/N500 | σN,500 | σS,500 | KS500 | ![]() | ![]() | ![]() |
B | 0.11 | −21.1 | 1567 | 48.69 | 286.4 | 0.17 | 1.15 | 0 | 23.15 | 178.1 | 0.13 | 0.58 | 0 | 7.11 | 59.3 | 0.12 | 0.41 | 0 | 0.77 | 0.71 | 0.67 |
B | 0.10 | −20.1 | 1568 | 15.72 | 104.8 | 0.15 | 0.46 | 0 | 7.70 | 70.0 | 0.11 | 0.30 | 0 | 3.01 | 30.1 | 0.10 | 0.28 | 0 | 0.77 | 0.71 | 0.68 |
B | 0.10 | −19.6 | 1567 | 8.69 | 62.1 | 0.14 | 0.31 | 0 | 5.02 | 45.6 | 0.11 | 0.31 | 0 | 2.00 | 20.0 | 0.10 | 0.29 | 3E–39 | 0.77 | 0.72 | 0.68 |
B | 0.09 | −19.2 | 1568 | 5.60 | 43.1 | 0.13 | 0.34 | 0 | 3.33 | 33.3 | 0.10 | 0.31 | 0 | 1.63 | 16.3 | 0.10 | 0.30 | 5E–21 | 0.79 | 0.74 | 0.70 |
B | 0.08 | −18.6 | 1567 | 4.24 | 35.3 | 0.12 | 0.24 | 0 | 2.60 | 26.0 | 0.10 | 0.36 | 9E−43 | 1.42 | 14.2 | 0.10 | 0.30 | 1E–17 | 0.82 | 0.78 | 0.75 |
B | 0.04 | −17.5 | 1567 | 2.82 | 23.5 | 0.12 | 0.25 | 0 | 2.16 | 21.6 | 0.10 | 0.28 | 1E−26 | 1.40 | 14.0 | 0.10 | 0.27 | 7E–14 | 0.90 | 0.87 | 0.85 |
Results of stacking in bins of g−r colour, redshift, and absolute magnitude (Mr). Columns are as follows: colour bin C= B (blue), G (green), R (red); median redshift z in bin (approximate z bin boundaries are 0.01, 0.12, 0.17, 0.22, 0.28, 0.35); median Mr; count N in the bin. The following columns for each of the three SPIRE bands: background‐subtracted flux S in mJy; signal‐to‐noise ratio S/σN; measurement error σN (mJy) computed from the mean variance of positions in the stack, divided by , and including the error on background subtraction; statistical error on the median flux (σS, mJy) following Gott et al. (2001); KS probability that the distribution of fluxes in each bin is the same as that at a set of random positions; and median k‐correction K′=K(z)/(1 +z) in the bin. This is a sample of the full table, which is available as Supporting Information with the online version of the article.
C | z | Mr | N | S250 | S/N250 | σN, 250 | σS,250 | KS250 | S350 | S/N350 | σN,350 | σS,350 | KS350 | S500 | S/N500 | σN,500 | σS,500 | KS500 | ![]() | ![]() | ![]() |
B | 0.11 | −21.1 | 1567 | 48.69 | 286.4 | 0.17 | 1.15 | 0 | 23.15 | 178.1 | 0.13 | 0.58 | 0 | 7.11 | 59.3 | 0.12 | 0.41 | 0 | 0.77 | 0.71 | 0.67 |
B | 0.10 | −20.1 | 1568 | 15.72 | 104.8 | 0.15 | 0.46 | 0 | 7.70 | 70.0 | 0.11 | 0.30 | 0 | 3.01 | 30.1 | 0.10 | 0.28 | 0 | 0.77 | 0.71 | 0.68 |
B | 0.10 | −19.6 | 1567 | 8.69 | 62.1 | 0.14 | 0.31 | 0 | 5.02 | 45.6 | 0.11 | 0.31 | 0 | 2.00 | 20.0 | 0.10 | 0.29 | 3E–39 | 0.77 | 0.72 | 0.68 |
B | 0.09 | −19.2 | 1568 | 5.60 | 43.1 | 0.13 | 0.34 | 0 | 3.33 | 33.3 | 0.10 | 0.31 | 0 | 1.63 | 16.3 | 0.10 | 0.30 | 5E–21 | 0.79 | 0.74 | 0.70 |
B | 0.08 | −18.6 | 1567 | 4.24 | 35.3 | 0.12 | 0.24 | 0 | 2.60 | 26.0 | 0.10 | 0.36 | 9E−43 | 1.42 | 14.2 | 0.10 | 0.30 | 1E–17 | 0.82 | 0.78 | 0.75 |
B | 0.04 | −17.5 | 1567 | 2.82 | 23.5 | 0.12 | 0.25 | 0 | 2.16 | 21.6 | 0.10 | 0.28 | 1E−26 | 1.40 | 14.0 | 0.10 | 0.27 | 7E–14 | 0.90 | 0.87 | 0.85 |
C | z | Mr | N | S250 | S/N250 | σN, 250 | σS,250 | KS250 | S350 | S/N350 | σN,350 | σS,350 | KS350 | S500 | S/N500 | σN,500 | σS,500 | KS500 | ![]() | ![]() | ![]() |
B | 0.11 | −21.1 | 1567 | 48.69 | 286.4 | 0.17 | 1.15 | 0 | 23.15 | 178.1 | 0.13 | 0.58 | 0 | 7.11 | 59.3 | 0.12 | 0.41 | 0 | 0.77 | 0.71 | 0.67 |
B | 0.10 | −20.1 | 1568 | 15.72 | 104.8 | 0.15 | 0.46 | 0 | 7.70 | 70.0 | 0.11 | 0.30 | 0 | 3.01 | 30.1 | 0.10 | 0.28 | 0 | 0.77 | 0.71 | 0.68 |
B | 0.10 | −19.6 | 1567 | 8.69 | 62.1 | 0.14 | 0.31 | 0 | 5.02 | 45.6 | 0.11 | 0.31 | 0 | 2.00 | 20.0 | 0.10 | 0.29 | 3E–39 | 0.77 | 0.72 | 0.68 |
B | 0.09 | −19.2 | 1568 | 5.60 | 43.1 | 0.13 | 0.34 | 0 | 3.33 | 33.3 | 0.10 | 0.31 | 0 | 1.63 | 16.3 | 0.10 | 0.30 | 5E–21 | 0.79 | 0.74 | 0.70 |
B | 0.08 | −18.6 | 1567 | 4.24 | 35.3 | 0.12 | 0.24 | 0 | 2.60 | 26.0 | 0.10 | 0.36 | 9E−43 | 1.42 | 14.2 | 0.10 | 0.30 | 1E–17 | 0.82 | 0.78 | 0.75 |
B | 0.04 | −17.5 | 1567 | 2.82 | 23.5 | 0.12 | 0.25 | 0 | 2.16 | 21.6 | 0.10 | 0.28 | 1E−26 | 1.40 | 14.0 | 0.10 | 0.27 | 7E–14 | 0.90 | 0.87 | 0.85 |


4 RESULTS
4.1 Stacked fluxes
The results of stacking SPIRE fluxes as a function of g−r colour, redshift and absolute magnitude Mr are given in Table 1. Secure detections are obtained at 250 and 350 μ m in most bins, although many bins have low signal‐to‐noise ratio at 500 μ m. Note that the signal‐to‐noise ratios in the table are based on the measurement error reduced by (i.e. σN), since this strictly represents the noise level (instrumental plus confusion) which we compare our detections against. When talking about errors in all subsequent analysis we will refer to the statistical uncertainty on the median (σS) because this takes into account both instrumental noise and the distribution of values in the bin, both of which are contributions towards the uncertainty on the median stacked result.
Table 1 also contains the results of Kolmogorov–Smirnov (KS) tests which were carried out on each stack to test the certainty that the stacked flux represents a signal from a sample of real sources and is not simply due to noise. This was done by comparing the distribution of measured fluxes in each bin with the distribution of fluxes in the background sample for the same SPIRE band. These background samples (described in Section 3.1) were placed at a set of random positions in the map, after masking around the positions of input sources. If any of our stacks did not contain a significant signal from real sources then the KS test would return a high probability that the distribution of fluxes is drawn from the same population as the background sample. The great majority of our bins were found to have an extremely small KS probability, meaning that we can be confident that the signals measured are real. The highest probability is 0.04, for a 500‐μm stack in the highest redshift, red‐colour bin. A small sample of the bins are explored in more detail in Appendix C, where we show stacked postage‐stamp images and histograms on which the KS test was carried out.
Stacked fluxes at 250, 350 and 500 μ m in the observed frame are plotted in Fig. 8, showing the dependence on Mr and g−r at different redshifts. The majority of the bins have stacked fluxes well below the 5σ point‐source detection limits shown as horizontal lines in the figure. In all three bands there is a striking difference between the submm fluxes of blue, green and red galaxies, and a strong correlation with Mr in the low‐redshift bins of blue and green galaxies. These trends unsurprisingly indicate that the red galaxies tend to be passive and have lower dust luminosities than blue, and are generally well below the detection threshold in all SPIRE bands. They also show that submm flux varies little across the range of Mr in red galaxies, while in blue galaxies it correlates strongly with Mr such that only the most luminous r‐band sources have fluxes above the 250 μ m detection limit – a point noted by Dariush et al. (2011) and D11. The variation with redshift is also very different between the three optical classes, with the fluxes of blue galaxies diminishing with redshift more rapidly than those of red galaxies. Fluxes in the green bin initially fall more rapidly with increasing redshift than those in the blue, but at z > 0.18 they resemble those in the red bin and evolve very little. This is potentially due to a change in the nature of the galaxies classified as green at different redshifts, which is unsurprising since this bin contains a mixture of different galaxy types in the overlapping region between the blue cloud and red sequence. It is likely that the relative fractions of passive, relatively dust‐free systems and dusty star‐forming systems in this bin will change with redshift, as the star formation density of the Universe evolves over this redshift range (Lilly et al. 1996; Madau et al. 1996, etc.). The evolutionary trends discussed in this section can be better explored by deriving submm luminosities, which first requires a model for k‐correcting the fluxes, as we will discuss in Section 4.4.

Stacked SPIRE fluxes (not k‐corrected) as a function of g−r colour, redshift and absolute magnitude Mr. Top panels: 250 μ m; middle panels: 350 μ m; bottom panels: 500 μ m. Galaxies are binned by optical colour from blue to red (shown in the panels from the left‐hand to right‐hand side) and by redshift (shown by the plot symbols), and stacked fluxes in each bin are plotted against Mr. Error bars are the statistical 1σ errors in the bins as described in Section 3.3, and also include errors due to background subtraction. The horizontal dashed lines at 33.5, 39.5 and 44.0 mJy in 250, 350 and 500 μ m, respectively, represent the 5σ point‐source detection limits as measured in the PSF‐convolved Phase 1 maps.
At this point it is worth considering some potential sources of bias in different bins in case they might impact on the apparent trends. For example, it is reasonable to expect that certain classes of galaxy are more likely than others to inhabit dense environments: in particular redder galaxies, and more massive galaxies, are known to be more clustered (e.g. Zehavi et al. 2011). While we do account for blending in the flux measurements (see Appendix A) this is limited to the blending of sources within the catalogue. As we move to higher redshifts the catalogue becomes more incomplete, and it becomes more likely that the clustered galaxies will be blended with some unseen neighbour. We make no correction for this effect, but we expect the contamination to be small for the following reasons. The input sample is complete down to below the knee in the optical LF at z < 0.3 (see next section); we therefore account for the blending with most of the galaxies in the same redshift range. Contaminating flux would have to come from relatively small galaxies which are not likely to contribute a large amount of flux. Moreover, the blending corrections are on average very small in comparison with the trends that we observe (see Appendix A; Table A1), so a small additional blending correction should not significantly alter our conclusions.
Flux correction factors C= 1/(1 +F) based on equation (A8), compared with the typical/average ratios of deblended to non‐deblended flux using equations (A7) and (A3).
Statistical correction (C) | Mean (median) deblending correction | |
250 μ m | 0.9702 ± 0.0001 | 0.937 (1.000) |
350 μ m | 0.9550 ± 0.0002 | 0.932 (0.987) |
500 μ m | 0.9326 ± 0.0003 | 0.989 (0.902) |
Statistical correction (C) | Mean (median) deblending correction | |
250 μ m | 0.9702 ± 0.0001 | 0.937 (1.000) |
350 μ m | 0.9550 ± 0.0002 | 0.932 (0.987) |
500 μ m | 0.9326 ± 0.0003 | 0.989 (0.902) |
Flux correction factors C= 1/(1 +F) based on equation (A8), compared with the typical/average ratios of deblended to non‐deblended flux using equations (A7) and (A3).
Statistical correction (C) | Mean (median) deblending correction | |
250 μ m | 0.9702 ± 0.0001 | 0.937 (1.000) |
350 μ m | 0.9550 ± 0.0002 | 0.932 (0.987) |
500 μ m | 0.9326 ± 0.0003 | 0.989 (0.902) |
Statistical correction (C) | Mean (median) deblending correction | |
250 μ m | 0.9702 ± 0.0001 | 0.937 (1.000) |
350 μ m | 0.9550 ± 0.0002 | 0.932 (0.987) |
500 μ m | 0.9326 ± 0.0003 | 0.989 (0.902) |
4.2 Contamination from lensing
There is a risk that the submm fluxes of some galaxies in the sample are contaminated by flux from lensed background sources, via galaxy–galaxy lensing. This is especially likely in a submm survey as a result of the negative k‐correction and steep evolution of the LF (Blain 1996; Negrello et al. 2007). These factors conspire to make submm sources detectable up to very high redshifts, therefore providing an increased probability for one or more foreground galaxies to intrude in the line of sight and magnify the flux via strong gravitational lensing. The strong potential for detecting lensed high‐redshift sources in H‐ATLAS was conclusively demonstrated by Negrello et al. (2010). In this study, we target low‐redshift sources selected in the optical, but our sample will inevitably include some of the foreground lenses whose apparent fluxes are likely to be boosted by flux from the lensed background sources. The flux magnification is likely to be greatest for massive spheroidal lenses, as a result of their mass distribution (Negrello et al. 2010). This could pose a problem for our red bins, where spheroids will be mostly concentrated. To make matters worse, measured fluxes are lowest in our red bins which means that even a small lensing contamination of the order of 1 mJy could significantly boost the flux.
We can make an estimate of the lensing contribution to stacked fluxes by considering the predicted number counts of lenses from Lapi et al. (2011), which are based on the amplification distribution of strong lenses (amplification factors ≥ 2) from Negrello et al. (2007). Integrating these counts gives a total of 470 lensed submm sources per square degree, and integrating their fluxes per square degree gives the total surface brightness of lensed sources shown in the first row of Table 2. However, the counts are not broken down by redshift, and only those at z < 0.35 will contribute to our stacks. It is not trivial to predict what fraction of strong lenses are in this redshift range, but recent results from H‐ATLAS can provide us with the best estimate that is currently possible. González‐Nuevo et al. (2012) created a sample of 64 candidate strong lenses from the H‐ATLAS SDP by selecting sources with red SPIRE colours which have no reliable SDSS IDs, or have SDSS IDs with redshifts inconsistent with the submm SED. After matching to NIR sources in the VISTA Kilo‐degree INfrared Galaxy (VIKING; Sutherland et al., in preparation) survey, they reduced this sample to 33 candidates with photometric redshifts for both the lens (using the NIR photometry) and source (using SPIRE and PACS photometry). This sample, the H‐ATLAS Lensed Objects Selection (HALOS), is unique in being selected in the submm, enabling the selection of candidate lenses over a much larger redshift range than other lens samples to date (their lenses had photometric redshifts ≲ 1.8, while other surveys were confined to z < 1). HALOS therefore provides the best observational measurement of the lens redshift distribution.
Total surface brightness of lensed sources from the Lapi et al. (2011) counts model, and estimated contribution from the low‐redshift population of lenses assuming the lens redshift distribution from HALOS (González‐Nuevo et al. 2012). This is compared to the total surface brightness of red galaxies (g−r colour) at z < 0.35 from our stacks. We then estimate the fraction of the flux in each redshift bin of the red sample that comes from lensed background sources.
250 μ m | 350 μ m | 500 μ m | |
Total surface brightness (Jy deg−2) | |||
All lensed flux | 1.09 | 1.34 | 1.22 |
Lenses at z < 0.35 | ![]() | ![]() | ![]() |
Red galaxies | 2.6 ± 0.5 | 1.6 ± 0.2 | 0.8 ± 0.1 |
Lensed flux/red galaxy flux by z bin | |||
0.01 < z < 0.12 | 0.00 ± 0.00 | 0.00 ± 0.00 | 0.00 ± 0.00 |
0.12 < z < 0.17 | 0.06 ± 0.01 | 0.12 ± 0.02 | 0.21 ± 0.04 |
0.17 < z < 0.22 | 0.11 ± 0.02 | 0.20 ± 0.04 | 0.33 ± 0.06 |
0.22 < z < 0.28 | 0.16 ± 0.03 | 0.27 ± 0.05 | 0.44 ± 0.08 |
0.28 < z < 0.35 | 0.20 ± 0.04 | 0.35 ± 0.07 | 0.58 ± 0.11 |
250 μ m | 350 μ m | 500 μ m | |
Total surface brightness (Jy deg−2) | |||
All lensed flux | 1.09 | 1.34 | 1.22 |
Lenses at z < 0.35 | ![]() | ![]() | ![]() |
Red galaxies | 2.6 ± 0.5 | 1.6 ± 0.2 | 0.8 ± 0.1 |
Lensed flux/red galaxy flux by z bin | |||
0.01 < z < 0.12 | 0.00 ± 0.00 | 0.00 ± 0.00 | 0.00 ± 0.00 |
0.12 < z < 0.17 | 0.06 ± 0.01 | 0.12 ± 0.02 | 0.21 ± 0.04 |
0.17 < z < 0.22 | 0.11 ± 0.02 | 0.20 ± 0.04 | 0.33 ± 0.06 |
0.22 < z < 0.28 | 0.16 ± 0.03 | 0.27 ± 0.05 | 0.44 ± 0.08 |
0.28 < z < 0.35 | 0.20 ± 0.04 | 0.35 ± 0.07 | 0.58 ± 0.11 |
Total surface brightness of lensed sources from the Lapi et al. (2011) counts model, and estimated contribution from the low‐redshift population of lenses assuming the lens redshift distribution from HALOS (González‐Nuevo et al. 2012). This is compared to the total surface brightness of red galaxies (g−r colour) at z < 0.35 from our stacks. We then estimate the fraction of the flux in each redshift bin of the red sample that comes from lensed background sources.
250 μ m | 350 μ m | 500 μ m | |
Total surface brightness (Jy deg−2) | |||
All lensed flux | 1.09 | 1.34 | 1.22 |
Lenses at z < 0.35 | ![]() | ![]() | ![]() |
Red galaxies | 2.6 ± 0.5 | 1.6 ± 0.2 | 0.8 ± 0.1 |
Lensed flux/red galaxy flux by z bin | |||
0.01 < z < 0.12 | 0.00 ± 0.00 | 0.00 ± 0.00 | 0.00 ± 0.00 |
0.12 < z < 0.17 | 0.06 ± 0.01 | 0.12 ± 0.02 | 0.21 ± 0.04 |
0.17 < z < 0.22 | 0.11 ± 0.02 | 0.20 ± 0.04 | 0.33 ± 0.06 |
0.22 < z < 0.28 | 0.16 ± 0.03 | 0.27 ± 0.05 | 0.44 ± 0.08 |
0.28 < z < 0.35 | 0.20 ± 0.04 | 0.35 ± 0.07 | 0.58 ± 0.11 |
250 μ m | 350 μ m | 500 μ m | |
Total surface brightness (Jy deg−2) | |||
All lensed flux | 1.09 | 1.34 | 1.22 |
Lenses at z < 0.35 | ![]() | ![]() | ![]() |
Red galaxies | 2.6 ± 0.5 | 1.6 ± 0.2 | 0.8 ± 0.1 |
Lensed flux/red galaxy flux by z bin | |||
0.01 < z < 0.12 | 0.00 ± 0.00 | 0.00 ± 0.00 | 0.00 ± 0.00 |
0.12 < z < 0.17 | 0.06 ± 0.01 | 0.12 ± 0.02 | 0.21 ± 0.04 |
0.17 < z < 0.22 | 0.11 ± 0.02 | 0.20 ± 0.04 | 0.33 ± 0.06 |
0.22 < z < 0.28 | 0.16 ± 0.03 | 0.27 ± 0.05 | 0.44 ± 0.08 |
0.28 < z < 0.35 | 0.20 ± 0.04 | 0.35 ± 0.07 | 0.58 ± 0.11 |
Seven of the 33 HALOS candidates have lens redshifts < 0.35. González‐Nuevo et al. removed two of these from their final sample because the lenses were at z < 0.2, and they considered lenses at such low redshifts to have low probability both on theoretical grounds and on the evidence of previous surveys (Browne et al. 2003; Oguri et al. 2006; Faure et al. 2008). However, we do not want to risk underestimating the number of low‐redshift lenses, so we conservatively include those two in our analysis. The fraction of lenses at z < 0.35 is therefore per cent, using binomial techniques to estimate the 1σ confidence interval (Cameron 2011). Using these results to scale the total lensed flux from all redshifts, we obtain the contribution from lenses at z < 0.35, as shown in Table 2. Assuming that all these low‐redshift lenses fall in the red bin of our sample, we can compare these fluxes to the total stacked flux of our red bins as shown in Table 2, which indicates that about 10, 20 and 30 per cent of the 250‐, 350‐ and 500‐μm fluxes, respectively, comes from high redshift sources lensed by the targets. This may be a slight overestimate since some of the lenses may fall in the other bins; however, Auger et al. (2009) showed that 90 per cent of lenses are massive early‐type galaxies. Any lensing contribution to the blue or green bins would be negligible compared to the fluxes measured in those bins.
The lensed flux is divided between the redshift bins of the red sample in a way that is determined by the product of the lens number distribution nl(z) and the lens efficiency distribution Φ(z). The numbers nl(z) are given by González‐Nuevo et al. (2012), while the efficiency depends on the geometry between source, lens and observer. We estimate Φ(z) from the HALOS source and lens redshift distributions using the formula of Hu (1999), and compute the lens flux distribution from the product of total lensed flux, nl(z) and Φ(z). Comparing this to the total flux of red galaxies in each redshift bin, we compute the fractional contamination from lensed flux as shown in Table 2. Errors on the lensed flux per redshift bin are dominated by the Poisson error on the normalization of nl(z), which is simply the Poisson error on the count of 33 lenses. The relative error on the lensed flux is therefore . The error on the stacked red galaxy fluxes is dominated by the 7 per cent flux calibration error (Pascale et al., in preparation), hence the errors on the fractions in Table 2 are given by the quadrature sum of 7 per cent and 17 per cent, which is 19 per cent. Using the fractions derived above we can remove the estimated lensed contribution to stacked fluxes in each redshift bin of the red sample. The effect of subtracting this fraction from the fluxes of red galaxies is minor in comparison to the trends described in Section 4.1. The effect on other derived results will be discussed later in the paper.
4.3 Resolving the cosmic IR background
A useful outcome of stacking on a well‐defined population of galaxies such as the GAMA sample is that we can easily measure the integrated flux from this population and infer how much it contributes to the cosmic IR background (CIB; Puget et al. 1996; Fixsen et al. 1998). The cosmic background at FIR/submm wavelengths makes up a substantial fraction of the integrated radiative energy in the Universe (Dole et al. 2006), although the sources of this radiation are not fully accounted for. For example, Oliver et al. (2010a) calculated that the HerMES survey resolved only 15 ± 4 per cent of the CIB into sources detected with SPIRE at 250 μ m, down to a flux limit of 19 mJy. A greater fraction can be accounted for using P(D) fluctuation analysis to reach below the detection limit of the map: in HerMES, Glenn et al. (2010) resolved 64 ± 16 per cent of the 250‐μm CIB into SPIRE sources with S250 > 6.2 mJy. Stacking on 24‐μm sources has also proved successful, utilizing the greater depth of 24‐μm maps from Spitzer‐MIPS to determine source catalogues for stacking at longer wavelengths. Stacking into BLAST, Béthermin et al. (2010) resolved 48 ± 27 per cent of the 250‐μm CIB into 24‐μm sources with S24 > 250 μJy, while Marsden et al. (2009) resolved 83 ± 21 per cent into sources with S24 > 15 μJy. However, these BLAST measurements included no corrections for clustering; the authors claimed that the effect was negligible, although this observation may appear to conflict with similar analyses in the literature (Negrello et al. 2005; Serjeant et al. 2008; Serjeant 2010; Bourne et al. 2011).
Similarly, we can stack the GAMA sample to estimate what fraction of the CIB at 250, 350 and 500 μ m is produced by optically detected galaxies at low redshifts. To do this we measure the sum of measured fluxes in each bin and scale by a completeness correction to obtain the total flux of all r < 19.8 galaxies at z < 0.35. The correction accounts for two levels of incompleteness. The first is the completeness of the original magnitude‐limited sample: Baldry et al. (2010) estimate that the GAMA galaxy sample (after star–galaxy separation) is ≳99.9 per cent complete. The second completeness is the fraction of the catalogue for which we have good spectroscopic or photometric redshifts (i.e. spectroscopic z_quality≥3 or photometric δz/z < 0.2; see Section 2.1). This fraction is 91.9 per cent; however, we have only included galaxies with redshifts less than 0.35, which comprise 86.8 per cent of the good redshifts. We cannot be sure of the redshift completeness at z < 0.35 (accounting for both spectroscopic and photometric redshifts) so we simply assume that we have accounted for 91.9 per cent of these, to match the redshift completeness of the full sample.7 Finally, we scale by the fraction of galaxies at z < 0.35 that are within the overlap region between the SPIRE mask and the GAMA survey, which is 72.4 per cent. The combined correction factor is η= 1/(0.999 × 0.919 × 0.724) = 1.504. The corrected flux is converted into a radiative intensity (nW m−2 sr−1) by dividing by the GAMA survey area (0.0439 sr). We compare this to the CIB levels expected in the three SPIRE bands (Glenn et al. 2010) – these are calculated by integrating the CIB fit from Fixsen et al. (1998) over the SPIRE bands. We find that the optical galaxies sampled by GAMA account for ≲5 per cent of the background in the three bands (see Table 3). In the table, we also show the percentage of the CIB produced by galaxies at z < 0.28, since in this range the catalogue is complete down to below the knee of the optical LF at (Petrosian magnitude, h= 0.7; Hill et al. 2011).
Total intensities of rpetro < 19.8 galaxies from stacking at 250, 350 and 500 μ m, in comparison to the corresponding CIB levels from Fixsen et al. (1998). We show the intensity as a percentage of the CIB for the full stack, and for the z < 0.28 subset which is complete in Mr down to (Hill et al. 2011). We also show the contributions of the individual redshift bins and g−r colour bins. All contributions from red galaxies have been corrected for the lensed flux contamination using the fractions in Table 2. All errors include our statistical error bars from stacking, the error on the lensing correction (where applicable) and a 7 per cent flux calibration error (Pascale et al., in preparation).
250 μ m | 350 μ m | 500 μ m | |
Intensity (nW m−2 sr−1) | |||
CIB | 10.2 ± 2.3 | 5.6 ± 1.6 | 2.3 ± 0.6 |
Total stack | 0.508 ± 0.036 | 0.208 ± 0.015 | 0.064 ± 0.005 |
0.01 < z < 0.28 | 0.428 ± 0.030 | 0.173 ± 0.012 | 0.054 ± 0.004 |
Per cent of CIB | |||
Total stack | 4.98 ± 0.39 | 3.71 ± 0.30 | 2.79 ± 0.22 |
0.01 < z < 0.28 | 4.19 ± 0.34 | 3.08 ± 0.26 | 2.33 ± 0.19 |
0.01 < z < 0.12 | 1.57 ± 0.17 | 1.11 ± 0.13 | 0.81 ± 0.09 |
0.12 < z < 0.17 | 0.97 ± 0.10 | 0.71 ± 0.08 | 0.53 ± 0.06 |
0.17 < z < 0.22 | 0.85 ± 0.08 | 0.63 ± 0.07 | 0.49 ± 0.05 |
0.22 < z < 0.28 | 0.80 ± 0.08 | 0.63 ± 0.07 | 0.50 ± 0.05 |
0.28 < z < 0.35 | 0.78 ± 0.08 | 0.63 ± 0.07 | 0.46 ± 0.05 |
Blue | 3.02 ± 0.26 | 2.34 ± 0.21 | 1.67 ± 0.15 |
Green | 1.12 ± 0.10 | 0.84 ± 0.08 | 0.64 ± 0.06 |
Red | 0.83 ± 0.08 | 0.64 ± 0.07 | 0.48 ± 0.05 |
250 μ m | 350 μ m | 500 μ m | |
Intensity (nW m−2 sr−1) | |||
CIB | 10.2 ± 2.3 | 5.6 ± 1.6 | 2.3 ± 0.6 |
Total stack | 0.508 ± 0.036 | 0.208 ± 0.015 | 0.064 ± 0.005 |
0.01 < z < 0.28 | 0.428 ± 0.030 | 0.173 ± 0.012 | 0.054 ± 0.004 |
Per cent of CIB | |||
Total stack | 4.98 ± 0.39 | 3.71 ± 0.30 | 2.79 ± 0.22 |
0.01 < z < 0.28 | 4.19 ± 0.34 | 3.08 ± 0.26 | 2.33 ± 0.19 |
0.01 < z < 0.12 | 1.57 ± 0.17 | 1.11 ± 0.13 | 0.81 ± 0.09 |
0.12 < z < 0.17 | 0.97 ± 0.10 | 0.71 ± 0.08 | 0.53 ± 0.06 |
0.17 < z < 0.22 | 0.85 ± 0.08 | 0.63 ± 0.07 | 0.49 ± 0.05 |
0.22 < z < 0.28 | 0.80 ± 0.08 | 0.63 ± 0.07 | 0.50 ± 0.05 |
0.28 < z < 0.35 | 0.78 ± 0.08 | 0.63 ± 0.07 | 0.46 ± 0.05 |
Blue | 3.02 ± 0.26 | 2.34 ± 0.21 | 1.67 ± 0.15 |
Green | 1.12 ± 0.10 | 0.84 ± 0.08 | 0.64 ± 0.06 |
Red | 0.83 ± 0.08 | 0.64 ± 0.07 | 0.48 ± 0.05 |
Total intensities of rpetro < 19.8 galaxies from stacking at 250, 350 and 500 μ m, in comparison to the corresponding CIB levels from Fixsen et al. (1998). We show the intensity as a percentage of the CIB for the full stack, and for the z < 0.28 subset which is complete in Mr down to (Hill et al. 2011). We also show the contributions of the individual redshift bins and g−r colour bins. All contributions from red galaxies have been corrected for the lensed flux contamination using the fractions in Table 2. All errors include our statistical error bars from stacking, the error on the lensing correction (where applicable) and a 7 per cent flux calibration error (Pascale et al., in preparation).
250 μ m | 350 μ m | 500 μ m | |
Intensity (nW m−2 sr−1) | |||
CIB | 10.2 ± 2.3 | 5.6 ± 1.6 | 2.3 ± 0.6 |
Total stack | 0.508 ± 0.036 | 0.208 ± 0.015 | 0.064 ± 0.005 |
0.01 < z < 0.28 | 0.428 ± 0.030 | 0.173 ± 0.012 | 0.054 ± 0.004 |
Per cent of CIB | |||
Total stack | 4.98 ± 0.39 | 3.71 ± 0.30 | 2.79 ± 0.22 |
0.01 < z < 0.28 | 4.19 ± 0.34 | 3.08 ± 0.26 | 2.33 ± 0.19 |
0.01 < z < 0.12 | 1.57 ± 0.17 | 1.11 ± 0.13 | 0.81 ± 0.09 |
0.12 < z < 0.17 | 0.97 ± 0.10 | 0.71 ± 0.08 | 0.53 ± 0.06 |
0.17 < z < 0.22 | 0.85 ± 0.08 | 0.63 ± 0.07 | 0.49 ± 0.05 |
0.22 < z < 0.28 | 0.80 ± 0.08 | 0.63 ± 0.07 | 0.50 ± 0.05 |
0.28 < z < 0.35 | 0.78 ± 0.08 | 0.63 ± 0.07 | 0.46 ± 0.05 |
Blue | 3.02 ± 0.26 | 2.34 ± 0.21 | 1.67 ± 0.15 |
Green | 1.12 ± 0.10 | 0.84 ± 0.08 | 0.64 ± 0.06 |
Red | 0.83 ± 0.08 | 0.64 ± 0.07 | 0.48 ± 0.05 |
250 μ m | 350 μ m | 500 μ m | |
Intensity (nW m−2 sr−1) | |||
CIB | 10.2 ± 2.3 | 5.6 ± 1.6 | 2.3 ± 0.6 |
Total stack | 0.508 ± 0.036 | 0.208 ± 0.015 | 0.064 ± 0.005 |
0.01 < z < 0.28 | 0.428 ± 0.030 | 0.173 ± 0.012 | 0.054 ± 0.004 |
Per cent of CIB | |||
Total stack | 4.98 ± 0.39 | 3.71 ± 0.30 | 2.79 ± 0.22 |
0.01 < z < 0.28 | 4.19 ± 0.34 | 3.08 ± 0.26 | 2.33 ± 0.19 |
0.01 < z < 0.12 | 1.57 ± 0.17 | 1.11 ± 0.13 | 0.81 ± 0.09 |
0.12 < z < 0.17 | 0.97 ± 0.10 | 0.71 ± 0.08 | 0.53 ± 0.06 |
0.17 < z < 0.22 | 0.85 ± 0.08 | 0.63 ± 0.07 | 0.49 ± 0.05 |
0.22 < z < 0.28 | 0.80 ± 0.08 | 0.63 ± 0.07 | 0.50 ± 0.05 |
0.28 < z < 0.35 | 0.78 ± 0.08 | 0.63 ± 0.07 | 0.46 ± 0.05 |
Blue | 3.02 ± 0.26 | 2.34 ± 0.21 | 1.67 ± 0.15 |
Green | 1.12 ± 0.10 | 0.84 ± 0.08 | 0.64 ± 0.06 |
Red | 0.83 ± 0.08 | 0.64 ± 0.07 | 0.48 ± 0.05 |
4.4 The submm SED and k‐corrections


Since we cannot fit an SED to individual galaxies, we instead examine the ratios between stacked fluxes in each bin. Fluxes in the red bins are first corrected to remove the contribution from lensing as discussed in Section 4.2. The colour–colour diagrams in Fig. 9 show the resulting flux ratios in the observed frame in each of the five redshift bins, alongside a selection of models, which are plotted with a range of temperatures increasing from the left‐hand to right‐hand side. We try both a single greybody and a two‐component model, but there is little to choose between them in these colours, since the SPIRE bands are at long wavelengths at which the SED is dominated by the cold dust, with little contribution from transiently heated small grains or hot dust. We therefore adopt a single component for simplicity. The scatter in the data is large, as are the errors on the 350/500 μ m flux ratio. Moreover, with all our data points on the longward side of the SED peak we are unable to resolve the degeneracy between the dust temperature and the emissivity index (β). This is shown by the close proximity of the β= 1 and 2 models in Fig. 9, which overlap in different temperature regimes. With these limitations we are forced to assume a constant value of β across all our bins. We choose a value of 2.0, which has been shown to be realistic in this frequency range (e.g. Dunne & Eales 2001; James et al. 2002; Popescu et al. 2002; Blain, Barnard & Chapman 2003; Leeuw et al. 2004; Hill et al. 2006; Paradis, Bernard & Mény 2009). For comparison the Planck Collaboration found an average value of β= 1.8 ± 0.1 by fitting SEDs to data at 12 μ m–21 cm from across the Milky Way (Planck Collaboration 2011a, also references therein).8

Colour–colour diagram of the observed‐frame SPIRE fluxes. The plot is divided into five redshift bins, in which the data from that bin are plotted along with the colours expected from various models as they would be observed at the median redshift in the bin. Data are divided into the three g−r colour bins, denoted by symbols and colours, and six Mr bins denoted by the size of the data point (larger = brighter). Data points and error bars in the red bins include the lensing correction and its uncertainty. Three families of models are shown: two consist of isothermal SEDs with either β= 1 or 2, and various dust temperatures; the third is a two‐component SED with β= 2, warm dust temperature Tw= 30 K, with a cold‐to‐hot dust ratio of 100. Each model is given a range of (cold) dust temperatures; the dots along the lines indicate 1 K increments from 10 K (lower left‐hand side) to 30 K. Choosing a single‐component model with β= 2 leads to a range of temperatures between 13 and 22 K.
Under the assumption of a constant β (whatever its value) the dust temperatures implied by Fig. 9 take a wide range of values across the various bins (between 11 and 22 K for β= 2). This is not just random scatter; red galaxies tend towards colder temperatures than blue and green, while blue galaxies in some redshift bins show a trend towards lower temperatures at brighter Mr. For the purposes of k‐corrections we can estimate the temperature more accurately by fitting greybody SED models to the three data points at the emitted frequencies given by the observed frequency scaled by 1 +z, using the median redshift in the bin.9 In general one must be careful when using stacked fluxes in this way to examine the SED, since when stacking many galaxies with different SEDs, the ratios between the stacked fluxes can be unpredictable and not representative of the individual galaxy SEDs. In this case, however, we believe we can be fairly confident of the results because we bin the galaxies in such a way that we should expect the SEDs within each bin to be similar, and so inferred dust temperatures and other derived parameters should be accurate.
The best‐fitting temperatures range between 12 and 28 K, with a median value of 18.5 K. Using β= 1.5 instead, the temperatures are increased by a factor of 1.2–1.6, ranging from 13 to 46 K with a median of 23.0 K. We can compare this median value to temperatures derived from single‐component fits in the literature. For example, Dye et al. (2010) derived a median isothermal temperature of 26 K (β= 1.5) for the detected population in the H‐ATLAS science demonstration data, in agreement with the BLAST sample of Dye et al. (2009). The value of 26 K is within the range of our temperatures using β= 1.5, and only slightly higher than the median. Higher temperatures were found by Hwang et al. (2010) in their PEP/HerMES/SDSS sample of 190 local galaxies: they reported median temperatures rising as function of IR luminosity, from around 26 K at 109 L⊙ to 32 K at 1011 L⊙ and 40 K at 1012 L⊙ (β= 1.5). These temperatures may be higher because Hwang et al. required a detection shortward of the SED peak (i.e. in an AKARI‐FIS or IRAS band) for galaxies to be included in their sample. Fitting a single greybody to an SED which contains both a cold (≲20 K) and a warm (≳30 K) component (Dunne & Eales 2001) may give results that are not comparable to ours, which fit only the cold component. On the other hand, Smith et al. (2011b) fitted greybodies with β= 1.5 to the H‐ATLAS 250 μ m‐selected sample of low‐redshift galaxies matched to SDSS, and found a median temperature of 22.5 ± 5.5 K (similar to our result), and unlike Hwang et al. they found no evidence for a correlation with luminosity.
The Planck Collaboration (2011b) compiled a sample of around 1700 local galaxies by matching the Planck Early Release Compact Source Catalogue and the Imperial IRAS Faint Source Redshift Catalogue, and fitted SEDs to data between 60 and 850 μ m using both single‐component fits with variable β and dual‐component fits with fixed β= 2. In their single‐component fits they found a wide range of temperatures (15–50 K) with median T= 26.3 K and median β= 1.2. This median temperature is consistent with the Herschel and BLAST results, and the low value of β is likely to be due to the inclusion of shorter wavelength data. The authors state that the two‐component fit is statistically favoured in most cases; these fits indicate cold dust temperatures mostly between ∼10 and 22 K, consistent with the range in our data.
In any case we do not necessarily expect to find the same dust temperatures in an optically selected sample as in a submm‐selected sample. For the purposes of k‐corrections, this is relatively unimportant, at least at the low redshifts covered in this work. The choice between cold T/high β and hot T/low β makes very little difference to monochromatic luminosities, as crucially they both fit the data. Likewise the range of temperatures has little effect on k‐corrections: using the median fitted temperature of 18.5 K in all bins gives essentially the same results as the using the temperature fitted to each bin separately. To remove the effect of the variation between models, we carry out all analysis of monochromatic luminosities using the median temperature of Tdust= 18.5 K and β= 2.0 to derive k‐corrections using equation (7) (except where stated otherwise). The implications of the fitted SEDs on the physical properties of galaxies in the sample will be discussed in Section 5. First we will concentrate on the observational results of the stacking which are not dependent on the model used to interpret the submm fluxes.
4.5 Luminosity evolution
To calculate stacked luminosities we apply equation (6) to each measured flux and stack the results. The error on the stacked value is again calculated using the Gott et al. (2001) method. Note that this method is not the same as applying equation (6) to the median flux and median redshift of each bin, since luminosity is a bivariate function of both flux and redshift. Fluxes of sources in the red bin are corrected for the fractional contributions from lensing given in Table 2, as explained in Section 4.2. The 1σ errors on these corrections are included in the luminosity errors. Results in Fig. 10 show a generally strong correlation between luminosities in the r band and all three submm bands, as is the expected trend across such a broad range, but the dependence is not on Mr alone. This becomes obvious when comparing the data points with the grey line, which shows the linear least‐squares fit to the results from the lowest redshift blue galaxies (the line is the same in each panel from the left‐hand to right‐hand side). In the blue galaxies, there may be a slight flattening of the correlation for the brightest galaxies and/or the higher redshifts, but this effect is much stronger in the green galaxies, which are intermediate between the blue and red samples. For the red galaxies the correlation disappears entirely but for the faintest bin at low redshift. The luminosities of the red galaxies all lie below the grey line, showing that red galaxies emit less in the submm than blue or green galaxies of the same Mr, strongly suggesting that they are dominated by a more passive population than green and blue galaxies. These trends are greater than the uncertainties on the lensing correction.

Stacked SPIRE luminosities as a function of g−r colour, redshift and absolute magnitude Mr. Layout as in Fig. 8. Error bars are the statistical 1σ errors in the bins as described in Section 3.3. Fluxes in the red bin have been corrected for the lensing contribution as described in Section 4.2, and error bars include the associated uncertainty. The thick grey line is the same from the left‐hand to right‐hand panel, and is the linear least‐squares fit to the results for the lowest redshift blue galaxies.
Apart from this colour dependence there is also a significant increase in submm luminosity with redshift for green and red galaxies of the same r‐band luminosity. This evolution appears to occur at all Mr, without being particularly stronger for either bright or faint galaxies, but it is especially strong for red galaxies. This may indicate a transition in the make‐up of the red population, with obscured star‐forming galaxies gradually becoming more dominant over the passive population as redshift increases. Such a scenario might be expected as we look back to earlier times towards the peak of the universal star formation history. One problem with this explanation is that we might expect an increase in obscured star formation to be accompanied by an increase in the dust temperatures at higher redshifts, which we found no evidence for in the SPIRE colours (Section 4.4).
Meanwhile the green sample shows similarities with the blue at low redshift and low r‐band luminosity, but at high redshifts and stellar masses the luminosity dependence on Mr is flatter and more similar to that of red galaxies. This could be due to a shift in the dominant population of the green bin, between blue‐cloud‐like galaxies and red‐sequence‐like galaxies at different redshifts and Mr.
4.5.1 UV–optical versus optical colours
Splitting the sample by the NUV−r colour index provides a slightly different sampling regime and reduces contamination between the colour bins because the red and blue populations are better separated (see Section 2.2). It therefore offers a useful test of the robustness of the results of stacking by g−r. Fig. 11 shows that stacked 250 μ m luminosities follow the same trends with colour, redshift and Mr as in the g−r stacks (350 and 500 μ m results are similar). This supports the interpretation that the three colour bins sample intrinsically different populations in terms of the dust properties. The red sample in either NUV−r or g−r appears to be dominated by passive galaxies at low redshifts at least, but the emission from dust increases by a factor of around 10 over the redshift range.

Stacked 250‐μm luminosity as a function of NUV−r colour, redshift and Mr. Error bars are the statistical 1σ errors in the bins as described in Section 3.3. Data and errors in the red bin incorporate the correction for lensing. The thick grey line is the same from the left‐hand to right‐hand panel, and is the linear least‐squares fit to the results for the lowest redshift blue galaxies.
Errors are slightly larger in this sample, particularly in the red bin, because we are limited to the 52 773 galaxies with NUV detections. Since results appear to be independent of the colour index used, we opt to use the more complete r‐limited sample of 86 208 sources with g−r colours for all subsequent analysis.
4.5.2 Stellar mass versus absolute magnitude
An alternative to dividing the sample by Mr is to use the stellar masses which were calculated from the GAMA ugriz photometry by Taylor et al. (2011) assuming a Chabrier (2003) IMF. Stellar mass is a simple physical property of the galaxy so may reveal more about intrinsic dependencies; on the other hand, it depends much more on the models used to fit the optical SED than Mr, which is only subject to a small k‐correction and the assumed cosmology (for a given redshift). Relative errors on stellar masses are dominated by systematics, but are small (Δlog Mstar∼ 0.1; Taylor et al. 2011). We confirmed that our results are robust to these errors by repeating all analysis after making random perturbations to the stellar masses, where the size of each perturbation was drawn from a Gaussian distribution with width σ=Δlog Mstar. No results were systematically affected by these perturbations, and random deviations in stacked values were smaller than the error bars.
Fig. 12 shows that the results of stacking by g−r colour and stellar mass differ slightly from the results of stacking by Mr (Fig. 10). Again we found very little difference from these results when we stacked by NUV−r colour and stellar mass. The results of stacking by mass seem to differ most in the blue bin. Whereas there was little luminosity evolution at fixed Mr, these results show evolution at fixed stellar mass. Furthermore this evolution is dependent on stellar mass, suggesting that smaller galaxies tend to evolve more rapidly. The samples in Figs 10 and 12 are slightly different since stellar masses were only available for 90 per cent of the full sample; however, we know that this is not responsible for the discrepancy since repeating the stacking by Mr with the stellar mass sample gives identical results (the stellar mass incompleteness does not vary between bins). The difference arises because Mr does not directly trace stellar mass, which leads to a mixing of galaxies of different masses within a given Mr bin. This is unavoidable since we must split the sample in three ways (colour, redshift and mass/magnitude) because dust luminosity varies strongly as a function of all of these. We split the sample by colour first, then by redshift and finally divide into mass or magnitude bins, but each bin can still contain a relatively broad range of redshifts. Within each bin there will be a strong degeneracy between redshift and Mr, simply because Mr is a strong function of redshift. In Fig. 13 we plot stellar mass against Mr with the points colour‐coded by redshift, showing that redshifts increase steadily from left to right, with decreasing Mr. A narrow range in Mr would select a narrow range of redshifts, while a similarly narrow range in mass selects a much broader range of redshifts.

Stacked SPIRE luminosities as a function of g−r colour, redshift and stellar mass. Error bars are the statistical 1σ errors in the bins as described in Section 3.3. Data and errors in the red bin incorporate the correction for lensing. The thick grey line is the same from the left‐hand to right‐hand panel, and is the linear least‐squares fit to the results for the lowest‐redshift blue galaxies.

Stellar masses of the sample as a function of absolute magnitude and coloured by redshift, showing the degeneracy between Mr and z resulting from an r‐band selection. To guide the eye, the slope of the grey line indicates direct proportionality between mass and luminosity, i.e. log10Mstar=−0.4Mr+C (for this line C= 1.0). The spread in the data perpendicular to this line reveals the broad range of mass‐to‐light ratios which are responsible for the differences between stacking by Mr and by Mstar.
The effect of this on the blue bins in Figs 10 and 11 is a tendency for the data points of different redshift bins to lie along the same relation of L250 as a function of Mr. The degeneracy is (partially) broken when splitting by stellar mass, thus separating out the trends with redshift and with mass in Fig. 12. This effect is much less noticeable in the red bin simply because the redshift evolution is much stronger while the mass dependency is very weak in the red sample.
5 DISCUSSION
5.1 Dust temperatures and SED fitting
In Section 4.4 we stated that assumptions about the dust temperature and β had negligible effect on the k‐corrections to SPIRE fluxes at these low redshifts. Hence, we chose to use the same SED model to compute monochromatic luminosities, assuming a single‐component greybody with Tdust= 18.5 K and β= 2.0. However, if we want to infer properties of the full IR SED, these considerations are much more important.
We fitted single‐component SEDs with β= 2.0 to the stacked SPIRE fluxes in each bin, shifting the observed wavelengths by (1 +z) using the median redshift in the corresponding bin. Fluxes in the red bin were first corrected for the predicted lensing contamination as described in Section 4.2. The effect of this is to increase the fitted temperatures in the red bin by around 1–3 K, which is small compared with the range of temperatures observed, although the errors on temperatures are significantly increased. The fitting was carried out using the IDL routine mpfitfun,10 which performs Levenberg–Marquardt least‐squares fitting to a general function. Best‐fitting values of the free parameters (temperature, normalization) are returned with formal 1σ errors computed from the covariance matrix. Some examples of the fits are plotted in Appendix C, showing a range of fitted temperatures. The derived temperatures depend on the assumption of a fixed emissivity parameter (β). Varying this as a function of optical colour and/or stellar mass could to some extent account for the variation in submm colours, which we interpret as a temperature variation. However, the variation in β would need to be severe (Δβ > 1) to fully account for the trends in the stacked colours in Fig. 9. It therefore seems likely that the cause for these variations is either temperature alone or a combination of β and temperature.
Fig. 14 shows the results of all the temperature fits as a function of colour, stellar mass and redshift. There are strong deviations in some bins from the value that we have been using, and these show strong dependence on colour and stellar mass. In the blue bin we see that dust temperature is tightly correlated with stellar mass in all redshift bins, with a peak at around 6 × 1010 M⊙, but galaxies of higher masses appear to have colder dust. The temperature distribution in the the green bin is less well correlated and very noisy, but again there is evidence that the warmest galaxies are towards the middle of the mass range. There is no clear evolution with redshift. As with the luminosity results, the red bin appears very different from the other two, and there is no evidence for the temperature to increase with stellar mass. The most important result would seem to be that temperatures are generally much lower in the red than the blue bins. Over the mass range in which the bins overlap (3 × 109 < Mstar < 3 × 1011 M⊙), the mean (standard deviation) of the temperatures in the blue bins is 22.7 (2.9) K, compared with 19.4 (2.8) K in the green and 16.1 (2.9) K in the red. The difference in the means is statistically significant, since an unpaired t‐test gives a probability of 10−12 that the means of the red and blue bins are the same. The t‐test assumes that the errors on each of the measurements are independent, but this is not true since the errors on the red stacks are dominated by the error on the lensing correction. The mean temperature error of the red stacks is 3.1 K, which means that including the lensing uncertainty, the blue and red mean temperatures are only different at the 2.1σ level. The incidence of colder dust in redder galaxies may be explained by the relationship between dust temperature and the intensity of the ISRF. The colour temperature of the ISRF is directly related to the stellar population. Old stars produce less UV flux, and so heat the dust less, which causes galaxies dominated by older stars to have cooler dust. Such a temperature differential is therefore consistent with the notion of red galaxies being passive. Meanwhile, colder dust temperatures in the least massive blue galaxies is consistent with these galaxies having more extended dust discs in comparison with their stellar discs, since the ISRF becomes weaker at greater galactocentric radii. Evidence for an extended dust disc in at least one low mass system has been reported by Holwerda et al. (2009), although larger samples would be needed to judge whether this is a widespread phenomenon.

Results of fitting single‐component greybodies with β= 2.0 to the observed (not k‐corrected) fluxes in each bin to estimate dust temperatures. Fluxes in the red bin were corrected for lensing before fitting. Error bars are the 1σ errors on fitted temperatures computed by mpfitfun.
It is possible when fitting the SPIRE bands that the SED shape could be biased by differential effects between the three bands. In particular, the 500‐μm band is the most affected by confusion and blending, as well as being the noisiest, and is also potentially subject to contamination from other emission mechanisms, including synchrotron from within the galaxies, and extended radiation from the Sunyaev–Zel’dovich effect in galaxy clusters (although we note the latter should have been removed in background subtraction). To check for such bias, we also tried fitting only the 250‐ and 350‐μm data, and found that the derived temperatures and trends were not significantly different.
5.1.1 Bolometric luminosities
It can be useful to consider the total IR (TIR) luminosities LTIR () of galaxies as this allows some comparison between observations at different IR wavelengths and between data and the predictions of models. In all cases this entails making assumptions about the shape of the SED, which must be interpolated – and indeed extrapolated – from the limited photometric data available. In this particular case, we are limited to just three SPIRE bands, all of which lie longwards of the peak in the SED and as such do little to constrain the warmer end of the SED at λ≲ 100 μ m. This is why they are well fitted by single‐component SEDs, representing a single component of cold dust. In contrast, the TIR luminosity is highly sensitive to emission from the hotter components of dust, especially the ≳30 K dust associated with Hii regions, which is heated by UV radiation from hot young stars.
Bearing in mind these limitations, we nevertheless consider it useful to make some attempt at estimating the TIR luminosities representative of our stacked samples. Since our sample is thought to be dominated by normal star‐forming and quiescent galaxies, we need to choose an appropriate IR SED template. A commonly used set of templates is that of Chary & Elbaz (2001, hereafter CE01). These templates are based on libraries of MIR and FIR templates representing a range of SED types (from normal spirals to ULIRGs11) fitted to data on ∼100 local galaxies at 6.7, 12, 15, 25, 60, 100 and 850 μ m.12 From these we select the most appropriate template for each stack by computing chi‐squared between each of the templates and our rest‐frame (lensing corrected) SPIRE luminosities, and assign to each stack the LTIR of the template with the minimum chi‐squared. Results are shown in Fig. 15(a). Errors on LTIR were estimated with Monte Carlo simulations using the 1σ errors on the SPIRE luminosities and refitting the templates 200 times to obtain the 1σ error bar on the template LTIR.

Integrated LTIR (8–1000 μ m) of (a) CE01 and (b) H‐ATLAS (Smith et al. 2011b) templates fitted to the stacked SPIRE luminosities in each bin as described in the text. Error bars are 1σ errors estimated from Monte Carlo simulations using the 1σ errors on the SPIRE luminosities. Fluxes in the red bin have been corrected for the lensing contribution as described in Section 4.2, and error bars include the associated uncertainty. Note that in panels (a) the first point in the blue sample (i.e. lowest mass, lowest redshift bin) had SPIRE luminosities that fall below all of the CE01 templates, and so the luminosity of the faintest template is used as an upper limit.
The CE01 templates are of limited value for our sample because they are fitted to IRAS and SCUBA data for a relatively small sample of local galaxies. The necessity for IRAS detections means that the galaxies in their sample may have been biased towards hotter SEDs, and may not be representative of the larger population sampled in this work. As an alternative we can compare the results of using the CE01 templates with a set of templates modelled on the H‐ATLAS SDP source catalogue (Smith et al. 2011b). There is a danger that the opposite bias is active here, since the templates are based on sources selected at 250 μ m, which are more likely to have cold SEDs. However, by comparing the H‐ATLAS L250 LF from D11 with the range of L250 of optical galaxies (Figs 10–12) we see that the luminosity ranges spanned by the two surveys are remarkably similar, implying that the SEDs of H‐ATLAS sources could provide a reasonable representation of an optically selected sample. We use a single template based on the mean of all H‐ATLAS SED models from Smith et al. (2011b). In Fig. 15(b) we show the results of fitting this template to our stacked SPIRE luminosities, minimizing chi‐squared to obtain the correct normalization and integrating the SED from 8 to 1000 μ m to obtain LTIR (errors were estimated from Monte Carlo simulations using the 1σ errors on the SPIRE luminosities in the same way as for the CE01 templates).
The results of the two sets of templates are strikingly different, with the CE01 models suggesting significantly higher luminosities, reaching the level of ‘luminous IR galaxies’ (LIRGs; LTIR > 1011 L⊙) at z > 0.22 or Mstar≳ 2 × 1010 M⊙. The H‐ATLAS templates are much colder so give much more moderate luminosities, with around five times lower bolometric luminosity for the same L250. With only the SPIRE data to constrain the SED, we cannot conclusively say that either set of templates is better suited to describing the optical sample, although for the reasons outlined above we believe that the H‐ATLAS templates are more likely to be suitable. The addition of data points at shorter wavelengths, from PACS in the FIR and WISE in the MIR, would permit a much more accurate derivation of the bolometric luminosity; we leave this for a future study.
Both parts of Fig. 15 show trends in the bolometric luminosities that are similar to those seen in the monochromatic SPIRE luminosities. That is to say there is a clear evolution with redshift and that this is much stronger in the red than the blue sample. We can quantify this evolution in the form L(z) ∝ (1 +z)α for the mass range in which the bins overlap. We fit this function in log‐space by chi‐squared minimization, using the IDL routine linfit to fit the bins which fall in the mass range . We find that LTIR from the H‐ATLAS SEDs evolves with α= 4.1 ± 0.2 in the blue sample, α= 1.3 ± 0.3 in the green, and α= 6.9 ± 1.0 in the red. The same evolution is also seen in L250(z) in the same mass bins: this is described by α= 4.0 ± 0.2 (blue); α= 1.1 ± 0.4 (green); and α= 7.7 ± 1.6 (red). It appears counterintuitive that the intermediate green bin should show the least evolution. The likely reason for this is the aforementioned possibility for the green sample to probe different populations at different redshifts (see Sections 4.5 and 4.1). Any sign of genuine luminosity evolution would be counteracted by sampling a less luminous population (e.g. with more passive red sequence galaxies in the green bin) at higher redshifts. We must, however, note that any evolution in the submm SED of any of these samples (blue, green or red) would render these single template fits unreliable.
The evolution in LTIR of normal galaxies was also observed by Oliver et al. (2010b) for a large sample selected in the optical and NIR with redshifts in the range 0–2. Dividing their sample by redshift, stellar mass and optical class (each derived from broad‐band SED fitting of Rowan‐Robinson et al. 2008), they stacked into 70‐ and 160‐μm Spitzer images and showed that both ‘blue’ galaxies (with spiral‐like SEDs) and ‘red’ galaxies (with elliptical‐like SEDs) increased in specific IR luminosity (i.e. LTIR/Mstar) as a function of redshift. The evolution in specific IR luminosity of all galaxies in their sample increased as (1 +z)4.4 ± 0.3 (independent of stellar mass), which is nearly identical to our result for blue galaxies. When they split the sample into red and blue colours, they also found that red galaxies evolved more strongly, but only with the index 5.7 ± 2.5 compared with our 6.9 ± 1.0. Their blue subsample evolved with the index 3.4 ± 0.3 compared with our 4.1 ± 0.2. The agreement is not exact but general trends with redshift and colour are certainly compatible between the two samples. Assuming a correlation between IR luminosity and SFR (e.g. Kennicutt 1998), we can also draw parallels with other Spitzer‐stacking (such as Damen et al. 2009a,b; Magnelli et al. 2009) as well as radio‐stacking studies (Dunne et al. 2009a; Pannella et al. 2009; Karim et al. 2011), all of which have shown similar dependence of (specific) SFR on stellar mass and redshift in NIR‐selected massive galaxies covering larger redshift ranges (up to z∼ 3). These studies have variously reported redshift evolution in specific SFR with indices (α) ranging from 3.4 to 5.0, all comparable with the luminosity evolution of our blue sample. It is unsurprising that our blue sample generally agrees with other samples selected by rest‐frame optical light with no regard to colour, since our blue selection is by far the largest of our colour bins, comprising 50 per cent of our sample.
It is well reported in the literature that there is strong evolution in the IR LF out to at least z= 1 (Saunders et al. 1990; Blain et al. 1999; Pozzi et al. 2004; Le Floc’h et al. 2005; Eales et al. 2009, 2010a; Dye et al. 2010; Gruppioni et al. 2010; Rodighiero et al. 2010; D11; Goto et al. 2011; Sedgwick et al. 2011). This requires an increase in the luminosity of the brightest (≳) galaxies, leading to an increase in the numbers of LIRGs and ULIRGs at higher redshifts. Our results show that this evolution at low redshifts also occurs in ordinary galaxies well below the LIRG threshold; these are the galaxies that dominate the number density. Such an evolution in the IR luminosities of all galaxies leads naturally to an evolution in the characteristic luminosity L★. Exactly this sort of evolution in normal (i.e. non‐merging) star‐forming galaxies is predicted by the semi‐analytic model of Hopkins et al. (2010), essentially as a result of an evolving gas fraction and using the Schmidt–Kennicutt law (Kennicutt 1998). D11 also show that an evolving gas fraction is required to explain the luminosity evolution in the H‐ATLAS sample, based on the chemical evolution model of Gomez et al. (in preparation).
5.2 The cosmic spectral energy distribution

We thus calculate the luminosities of the cosmic SED at z= 0 to be (3.9 ± 0.3) × 1033, (1.5 ± 0.1) × 1033 and (4.3 ± 0.3) × 1032 W Mpc−3 h70 at 250, 350 and 500 μ m, respectively. Errors are dominated by the 7 per cent calibration error on the SPIRE fluxes, which is correlated across the three bands. These results agree very closely with the submm luminosities predicted from GAMA data by Driver et al. (in preparation), by calculating the total energy absorbed by dust in the UV–NIR and assuming it is reprocessed as FIR emission with templates from Dale & Helou (2002). They are also close to the pre‐Herschel‐era prediction of Serjeant & Harrison (2005), based on modelling the SEDs of IRAS sources with SCUBA submm measurements. Using equation (7) of that paper, the predicted luminosities at 250, 350 and 500 μ m are 4.52 × 1033, 1.43 × 1033 and 3.48 × 1032 W Mpc−3 h70, respectively. Our measurements are within 3σ of these values, although they arguably suggest that the slope of the cosmic SED may be a little shallower than the prediction. These measurements are independent of the SEDs and temperatures assumed (since k‐corrections are small) and of the lensing assumptions (the lensed flux is negligible at z≲ 0.1).
5.3 Evolution of dust masses
Dust mass is a quantity which we can expect to constrain much more accurately than LTIR using the SPIRE luminosities, because the cold component that they trace is thought to dominate the total dust mass (Dunne & Eales 2001, and references therein). We therefore estimate the mass of the cold dust component described by our greybody fits and use this as a proxy for the total dust mass, assuming that any warmer components have a negligible contribution to the mass.

Dust mass results depend strongly on the temperatures assumed (although not as strongly as the bolometric luminosities). They are therefore subject to our assumption of constant emissivity index (β), as well as the assumption of a constant absorption coefficient (κ). Any variation of κ with redshift, stellar mass, optical colour or indeed with dust temperature or dust mass itself could alter the trends that we see in dust mass.
In Fig. 16 we show the dust masses derived using the fitted temperatures from Fig. 14. The dust mass is seen to range from around 2 × 106 to 8 × 107 M⊙ across the sample. In all bins dust and stellar mass are correlated, but this weakens slightly with increasing redshift and/or stellar mass. There is a definite evolution towards higher dust masses with increasing redshift. Following the evolutionary form Mdust(z) ∝ (1 +z)α we fit data in the range with slopes of α= 3.9 ± 1.7 for the blue sample, α= 3.0 ± 3.3 for the green, and α= 6.8 ± 4.6 for the red. The slopes of the evolution are thus similar to the evolution in luminosities (as might be expected from the lack of evolution in temperature). Given the error bars we cannot claim to detect evolution in the dust masses of red galaxies, despite having a significant detection of evolution of their luminosities. The reason for this is that the uncertainty of the lensing contribution increases the uncertainty in the fitted temperatures. If we assume that dust temperatures of the red sample do not evolve (as they do not for the blue and green samples), then the evolving luminosities must result from dust mass evolution. However, this may not be a valid assumption if the composition of the red sample changes with redshift.

Stacked dust mass as a function of g−r colour, redshift and stellar mass. Dust mass is derived from equation (9) using the fitted dust temperatures from Fig. 14. Error bars include the statistical 1σ errors in the bins as described in Section 3.3, with an additional contribution due to the error on the fitted temperature. The lensing contribution has been removed from the red bins, and error bars include the associated uncertainty. The shaded region shows the range of characteristic dust masses measured by D11, which evolve from 3.8 × 107 M⊙ at z < 0.1 to 2.2 × 108 M⊙ at z∼ 0.35.
If we ignore the highest redshift red bin, which has the largest errors due to lensing, then the results seem to suggest that the difference in dust mass between the red and blue galaxies is weaker than the difference in luminosity, for a given stellar mass and redshift. The dependence of luminosity on colour therefore appears to be driven more by the temperature than the mass of the dust. We note that if the same temperatures were used in deriving the dust mass in every bin then the dust masses would be directly proportional to L250 and would follow the trends seen in Fig. 12. This shows the vital importance of having photometry at multiple points along the SED, without which it would be impossible to constrain the SED shape and inferences about dust mass evolution would have to assume a constant temperature.
Significantly, this analysis suggests that the cold dust masses of red and blue galaxies are not strikingly discrepant in the stellar mass range . Previous studies fitting two‐component dust models to normal spiral galaxies in the local Universe have derived ranges of cold dust masses comparable to our sample: 3 × 105–1 × 108 (Popescu et al. 2002); 2 × 107−01 × 109 (Stevens et al. 2005); and 4 × 106–6 × 107 M⊙ (Vlahakis et al. 2005). Dust masses measured from fits to the cold dust in local ellipticals (which should be akin to our red sample) are often a little lower: 2 × 105–2 × 106 (Leeuw et al. 2004); 4 × 104–5 × 107 (Temi et al. 2004); 2 × 104–2 × 107 (Savoy et al. 2009). However, both Vlahakis et al. (2005) and Stickel et al. (2007) reported little or no significant difference in the typical dust masses of galaxies of different Hubble types (including spheroidals, spirals and irregulars), although Stickel et al. reported that spheroidal and irregular types reached significantly lower dust masses. One particular issue noted by Vlahakis et al. was the possibility of contamination of their 850 μ m fluxes with synchrotron emission, which would lead them to overestimate a few of their dust masses. We note the caveat that in contrast to our unbiased sample, the references in this paragraph were studies of individual galaxies selected variously with IRAS or ISOPHOT in the FIR, or SCUBA in the submm, so the range of dust masses sampled would not have been complete (with the exception of the optically selected sample of Popescu et al. 2002).
Popescu et al. (2002) observed colder dust in later Hubble types (which might be expected to be the bluest galaxies). Their sample of spirals would probably reside entirely within our blue bin so such a trend would not be apparent between our colour bins if it does not extend beyond the blue cloud. However, there is a correlation between Hubble type and mass; later types have lower stellar masses, so our observation of a strong correlation between stellar mass and dust temperature in blue galaxies is entirely consistent with the findings of Popescu et al.
Meanwhile, results from the Herschel Reference Survey (HRS; Boselli et al. 2010b) indicate that early‐type galaxies (E+S0+S0a) detected by Herschel in a volume‐limited sample of the local Universe have dust masses in the range (Smith et al. 2011c) – although they only detected 34 per cent of ellipticals and 61 per cent of S0s. Their sample have NUV−r colours that place them in our ‘red’ bin, yet their dust masses are much lower than the typical dust masses that we find for red galaxies. This is perhaps due in part to the higher derived temperatures of the HRS sample:
K, with a mean of 24 K in comparison with our mean of 16.1 K for the red sample (both assuming β= 2). Smith et al. concluded that the dust masses of S0’s were around 10 times lower than those of the HRS spirals, while those of ellipticals were 10 times lower again (for the same stellar mass), which seems to contrast with our results.
Rowlands et al. (2012) studied early‐type galaxies detected in H‐ATLAS and found dust masses mostly between 2 × 107 and 2 × 108 M⊙, with a mean dust mass similar to that of spirals (5.5 × 107 M⊙). However, the stellar mass distributions of spirals and early types were very different. The NUV−r colours of the Rowlands et al. spiral sample lie mostly within our blue bin, and the early types mostly within our green bin. Their redshift range is similar to ours (z < 0.3), and the mean dust and stellar masses of their spirals/early types lie within the locus of our blue/green bins in Fig. 16. However, derived dust masses depend on the temperature assumed. The cold dust temperatures fitted by Rowlands et al. range from 15 to 25 K, while the temperatures in our blue/green samples for the same stellar mass range (> 1010 M⊙) are between 15 and 28 K. The correspondence is not exact but the ranges are similar so average dust mass results should be comparable. For reference, changing the temperature from 15 to 25 K results in a drop in the derived dust mass by a factor 5, which is similar to the range of dust masses across the redshift range in Fig. 16. Rowlands et al. (2012) compared their H‐ATLAS‐detected early types with a control sample of undetected early‐types selected to have a matching distribution of redshifts and r‐band magnitudes. They concluded that the detected early types were unusually dusty compared with the control sample, and could be undergoing a transition from the blue cloud to the red sequence. It seems likely that those objects do indeed comprise the top end of the dust mass distribution (at a given stellar mass and redshift), although they do not appear to be exceptional outliers when compared with the median dust masses in our sample. This is surprising when we consider that Rowlands et al. found the detected early types to have around 10 times as much dust as typical early types: why are they not also outliers compared to typical red galaxies? The answer may be that typical red galaxies are generally dustier than typical early types, which supports the notion (as discussed in earlier sections) that our red sample is comprised of a mixture of different populations. The red bin is likely to contain most of the early type galaxies in the sample volume, and if these are relatively dust poor then there must be a substantial population of red dusty galaxies boosting the median dust masses in the red bin. This could also explain the discrepancy between our red sample and the HRS early types (Smith et al. 2011c).
There is also the possibility of an environmental factor in the offset between the HRS results and our own. Many of the galaxies in the HRS sample reside in the Virgo cluster, while most of the galaxies in GAMA will be in lower density environments. The lower dust masses of HRS early types compared with GAMA red galaxies could be due to early types in clusters being dominated by passive red‐sequence systems, while red galaxies in lower density environments are more likely to be dusty. Such a division is indeed suggested by the higher detection rate of early types outside Virgo in HRS, compared to those inside (Smith et al. 2011c). It is, however, unclear whether such an effect could be strong enough to fully explain the discrepancy we find.
On the other hand, if the lensing contamination is slightly greater than we have predicted, then the derived dust masses of our red sample could be a lot lower; however, this is only likely to affect the higher redshift bins due to the weak lensing efficiency at lower redshifts. The Rowlands et al. sample is less likely to be biased by strong lensing than our sample is, because they excluded detections. The HRS results are unlikely to be biased by lensing because their fluxes were much higher than the red galaxies in our sample, and the low dust masses and high temperatures they derive (relative to our own) argue against their results being biased by lensing.
5.3.1 Dust‐to‐stellar mass ratios
In Fig. 17 we plot the ratio of dust to stellar mass across the sample, which shows several interesting features. First, there is in general a strong correlation with stellar mass: the more massive galaxies have smaller dust‐to‐stellar mass ratios. The correlation appears steeper for red galaxies of the highest masses in each redshift bin, but this is not so obvious in the blue and green samples which do not reach to quite such high stellar masses. Nevertheless it seems not unreasonable to observe that the most massive red galaxies, many of which will be passively evolving giant ellipticals, have especially low dust‐to‐stellar mass ratios.

Stacked dust mass per unit stellar mass as a function of g−r colour, redshift and stellar mass. Error bars include the statistical 1σ errors in the bins as described in Section 3.3, with an additional contribution due to the error on the fitted temperature. The shaded region shows the range from D11, from 2 × 10−3 at z < 0.1 to 7 × 10−3 at z∼ 0.35.
It is worth pointing out the potential for a negative correlation between Mdust/Mstar and Mstar to be produced artificially in binned data. This can occur if there is a large range of stellar masses with large errors, even when there is no correlation between Mdust and Mstar, since a bin that selects data with low Mstar also selects those with high Mdust/Mstar. In general the slope of the measured correlation could be affected by this artificial phenomenon; however, we can be sure in this case that the trends are real because they can also be discerned in the median Mdust values in Fig. 16, and in any case our stellar mass errors are small (Δlog M★∼ 0.1; Taylor et al. 2011).
The aforementioned redshift evolution is very apparent in Fig. 17, and although we naturally select higher stellar masses at higher redshift, the dust masses in the sample rise more rapidly resulting in an increasing dust‐to‐stellar mass ratio with redshift. Using the (1 +z)α model once again we find that the evolution is consistent with the slopes derived for the dust mass evolution. This evolution in dust mass echoes the results of D11, who found a strongly evolving dust mass function (DMF) in H‐ATLAS sources up to z∼ 0.4, as well as results from other surveys reaching higher redshifts (Eales et al. 2009, 2010a). Using dust masses from SED fitting by Smith et al. (2011b), D11 showed that the characteristic dust mass () of the H‐ATLAS DMF increases from 3.8 × 107 M⊙ at z < 0.1 to around 2.1 × 108 M⊙ at z∼ 0.35 (although they note that this does not measure the true evolution because there is an accompanying fall in the characteristic density φ★). This range is indicated by the shaded region in Fig. 16. In all of the bins, our typical dust masses reach lower than the minimum mass sampled in equivalent redshift slices in the D11 DMF, but the fact that we see evolution indicates that galaxies of a given stellar mass shift up the DMF at increasing redshifts. We see this happening in galaxies of all colours and stellar masses, indicating that the evolution in the DMF is the result of changing dust masses in all galaxies, both passive and star forming. The evolution (around a factor of 3–4) we see is similar to that seen by D11, who fitted two‐component dust masses temperatures using a detailed physically motivated SED model.
It is also interesting to compare Mdust/Mstar in our results with the detected H‐ATLAS galaxies in D11. The H‐ATLAS galaxies were found to have higher dust‐to‐stellar mass ratios than predicted by models, ranging from 2 × 10−3 at z < 0.1 to 7 × 10−3 at z∼ 0.3 (this range is shaded in Fig. 17). Our results are typically lower but the strong dependence on stellar mass means that they span a very wide range; many of the blue galaxies in the lower mass bins have much higher dust/stellar mass ratios than the H‐ATLAS sample for the same redshifts. It is perhaps not surprising that we sample a much wider range than the H‐ATLAS sources since our sample selection criteria are independent of dust content. The dust/stellar mass results have implications for understanding the dust production mechanism, as we can show by comparing the results from a chemical evolution model with the parameters obtained for the H‐ATLAS galaxies (Gomez et al., in preparation). The models (based on the framework in Morgan & Edmunds 2003) show that values of Mdust/Mstar > 10−3 cannot be achieved with a purely stellar source of dust. Even including dust production in supernovae ejecta (e.g. Rho et al. 2008; Dunne et al. 2009b; Gomez et al. 2011; Matsuura et al. 2011), models require the condensation efficiency in the ejecta to be close to 100 per cent to reach the high values of Mdust/Mstar∼ 10−2 seen both in the H‐ATLAS detected sample and in the lowest‐mass blue galaxies in our sample. However, as discussed by D11 and Gomez et al. (in preparation), such high dust yields from supernovae are difficult to produce, leading to the invocation of alternative explanations such as dust grain growth in the ISM (Draine & Salpeter 1979; Dwek & Scalo 1980; Draine 1990; Edmunds 2001) or a top‐heavy IMF (e.g. Harayama, Eisenhauer & Martins 2008). The models also indicate that the low mass galaxies with high Mdust/Mstar are less efficient at turning their gas into stars (compared to high mass sources with low Mdust/Mstar). In other words, low‐mass systems have a longer star formation time‐scale, so although they produce less dust mass per year from stars, there are more metals and dust in the ISM for longer, in comparison to massive galaxies (Gomez et al., in preparation).
Moving on to higher redshifts, Santini et al. (2010) showed that the dust/stellar mass ratios of ordinary low‐redshift galaxies are much lower than those of high‐redshift submm galaxies (SMGs) from the Herschel‐PEP survey. Our Mdust/Mstar values from stacking are consistent with their sample of low‐redshift spirals from the SINGS survey (with typical stellar masses of ∼ 1011 M⊙). Their dust masses were derived by fitting GRASIL models (Silva et al. 1998) to photometry spanning the FIR/submm SED, and are consistent with the value of β= 2 that we have assumed. Our results therefore support the conclusion of Santini et al. that high‐redshift SMGs have much higher dust content (by a factor of ∼30) than local spiral galaxies. This offset is also consistent with the comparison between low‐redshift H‐ATLAS sources in D11 and high‐redshift SCUBA SMGs in Dunne et al. (2003). It has now become clear that the SMGs detected in early submm surveys are exceptionally dusty systems in comparison with the low‐redshift galaxy population.
There is evidence that the dust/stellar mass ratio correlates with specific SFR (da Cunha et al. 2010; Smith et al. 2011b; Gomez et al., in preparation), so these results imply that the least massive galaxies at low redshift are the most actively star forming, and that all galaxies become more active towards higher redshifts. At z∼ 0.3, galaxies with 2 × 1010 M⊙ of stars have the same Mdust/Mstar as galaxies a quarter of that size do at z≲ 0.1. This is consistent with the picture of downsizing, in which the specific SFRs of high‐mass galaxies peak earlier in the Universe than those of low‐mass galaxies (Cowie et al. 1996).
5.4 Obscuration
A further step that we can take towards understanding the nature of the galaxies is to investigate the relative luminosities at UV and submm wavelengths, which can give information about the fraction of star formation that is obscured by dust in the galaxies (e.g. Buat et al. 2010; Wijesinghe et al. 2011). The submm luminosity represents the energy absorbed and re‐radiated by dust. Naively, we might expect that this energy originated as UV radiation from young stars, hence the ratio of submm light to UV light detected is directly related to the fraction of UV light which is obscured by dust. If UV light is assumed to come primarily from star‐forming regions, this is a measure of the ratio of obscured to unobscured star formation. However, both the UV and the submm radiation could also trace populations unrelated to star formation: there are open questions as to how much UV radiation can be produced by evolved stars (Chavez & Bertone 2011, and references therein) as well as how much of the dust probed at submm wavelengths is heated by old stars in the galaxy (Bendo et al. 2010; Boselli et al. 2010a; Law, Gordon & Misselt 2011). Detailed radiative transfer calculations (e.g. Popescu et al. 2011), which have been used to make detailed predictions for a few well‐studied spiral galaxies, can be used in the future to analyse statistical samples of galaxies to address this question in a quantitative way. For the moment, however, the generalization to galaxy populations as a whole is uncertain.
In Fig. 18 we stack L250/LNUV for the NUV‐detected sample. Since we require NUV detections for this, we use the NUV−r colour which is likely to be a cleaner colour separation, and we use L250 instead of LTIR because the monochromatic luminosity is not model‐dependent. Simulations showed that stacking this ratio is robust even for small 250‐μm fluxes with low signal‐to‐noise ratio, since the NUV fluxes all have reasonably high signal‐to‐noise ratio (in contrast, stacking LNUV/L250 was found to be unreliable in simulations since this quantity diverges as the 250‐μm flux approaches zero). As before, we correct 250‐μm fluxes for the expected contribution from lensing as described in Section 4.2.

Stacked 250‐μm/NUV luminosity as a function of NUV−r colour, redshift and stellar mass. This fraction can be used as a proxy for the relative obscuration of star formation, subject to the limitations discussed in the text. Error bars are the statistical 1σ errors in the bins as described in Section 3.3. Data and errors in the red bin incorporate the correction for lensing described in Section 4.2.
Some of the results implied by Fig. 18 are not trivial to explain, and should be treated with caution since there is a strong bias introduced by the UV selection. It appears that the obscuration increases with increasing stellar mass for blue galaxies, but there appears to be a decrease with redshift, at least for stellar masses ≲5 × 1010 M⊙. This contrasts with the increase in 250‐μm luminosity with redshift, which would imply that while the obscured SFR increases with redshift up to z= 0.3, the unobscured SFR (UV luminosity) must increase faster for the relative obscuration to fall. However, it is likely that these observations are affected by selection bias: we only detect the low mass galaxies in the UV if they are relatively unobscured, and as redshift increases we detect fewer and fewer of the obscured ones. This effect could cancel out any intrinsic increase in obscuration with redshift, and cause the observed L250/LNUV to decrease.
In contrast, we detect almost exactly the opposite trends in the red sample, and in the high‐mass end of the green sample, which suggests that the selection bias could hide similar trends in blue galaxies (which generally have much lower stellar mass for the same redshift). Number statistics are poor in the red bin because the selection is naturally biased against red galaxies, and errors are compounded by the uncertainty on the lensing contamination. Nevertheless the observed trends of increasing obscuration with increasing redshift and with decreasing stellar mass cannot be explained by the selection bias. These trends are both perfectly consistent with the trends in Mdust/Mstar in Fig. 17: a higher amount of dust per stellar mass is almost certain to increase the obscuration of UV light. The red galaxy sample therefore appears to contain a larger fraction of obscured star‐forming galaxies at higher redshifts and lower masses. This is consistent with the findings of Zhu et al. (2011) in an analysis of the MIR colours of optically selected galaxies at redshifts between 0.1 and 0.5. It is also in agreement with Tojeiro et al. (2011), who stacked SDSS spectra of luminous red galaxies, and fitted stellar population models to obtain representative star formation histories, metallicity and dust content as a function of colour, luminosity and redshift. Their results showed strong correlations of dust extinction with optical luminosity and redshift which are consistent with our own findings. Such agreement between independent measures of the dust extinction is encouraging.
In any case we must be careful in the interpretation of L250/LNUV as a tracer of obscuration, in particular due to the potential for L250 to be uncorrelated with star formation. We showed in Section 5.1 that the conversion from SPIRE luminosities to LTIR is extremely model‐dependent, and to plot the ratio LTIR/LUV using our SED fits for LTIR would be misleading when LTIR is based only on the SPIRE photometry. In addition, it has been shown by Wijesinghe et al. (2011) that the ratio of LTIR (from fitting SEDs to GAMA/H‐ATLAS data including PACS and SPIRE) to LUV is poorly correlated with other measures of obscuration (the Balmer decrement and UV slope), probably because they trace a different component of the dust in galaxies. We therefore hesitate to take this particular analysis any further without the addition of shorter wavelength data to better constrain the IR SED.
6 CONCLUSIONS
We have conducted the first submm stacking analysis of a large sample of about 80 000 galaxies uniformly selected by optical (r‐band) magnitude. We divided the sample by rest‐frame colour, absolute magnitude/stellar mass and redshift (0.01 ≤z≤ 0.35) and stacked into SPIRE maps covering about 126 deg2 at 250, 350 and 500 μ m. We used a simple (but effective) deblending method to avoid the problem of overestimating the flux of blended sources when stacking in confused images; this ensures that stacked flux ratios are not biased by the increasing level of confusion at longer wavelengths. Our main results are summarized below:
The submm fluxes of all but the most massive optically selected galaxies are below the 5σ limits of H‐ATLAS; yet with the large sample size made possible by the coverage of H‐ATLAS and GAMA we are able to probe more than an order of magnitude below these limits using stacking.
We estimate that the total emission from optically selected galaxies at r < 19.8 and z < 0.35 accounts for only 5.0 ± 0.4 per cent of the CIB at 250 μ m. At z < 0.28, where the sample is complete to below M★, this fraction is 4.2 ± 0.3 per cent. Of this, roughly 60 per cent originates from blue galaxies, and 20 per cent each from the red and green bins of our sample.
We derive the total k‐ and e‐corrected luminosity density of the Universe at z= 0 to be (3.9 ± 0.3) × 1033, (1.5 ± 0.1) × 1033 and (4.3 ± 0.3) × 1032 W Mpc−3 h70 at 250, 350 and 500 μ m, respectively (h70=H0/70 km s−1 Mpc−1).
We show that stacked fluxes of red galaxies can be significantly contaminated by the lensing of background SMGs. Using models for the lensing amplification distribution and observed lens number counts, we estimate that around 10, 20 and 30 per cent (at 250, 350 and 500 μ m, respectively) of the stacked fluxes are likely to result from lensing. We correct our stacked results for this contamination to red galaxy stacks making reasonable assumptions for the redshift distribution of lensed flux, and include the uncertainty from the lens number counts.
We observe a strong dependence of submm luminosity on optical colour (g−r) and stellar mass or Mr, with red galaxies being up to an order of magnitude less luminous than blue galaxies of equal stellar mass. The luminosities of green galaxies are intermediate between the two. The observed trends of SPIRE luminosities are not strongly dependent on the SED model assumed, and cannot be explained by lensing, which implies a fundamental difference between the dust emission properties of red and blue galaxies.
We measure cold dust temperatures that vary strongly as a function of stellar mass in blue galaxies, from ∼11 K at 3 × 108 M⊙ to ∼28 K at 5 × 1010 M⊙. Correcting for the contamination from lensing, red galaxies have dust temperatures ∼ 16 K at all stellar masses between 3 × 109 and 8 × 1010 M⊙ at z < 0.35. The dust temperatures of green galaxies appear to have a greater scatter (with mean T= 19.4 K) but are not correlated with stellar mass as with the blue; this is indicative of a mixed population. Temperature values depend on the assumption of a constant emissivity parameter β= 2.
The temperature variation can account for much of the difference in luminosities between red and blue galaxies; however, it is not responsible for an increase in luminosity with redshift by a factor of around 2 for blue galaxies at a given stellar mass. This appears to be due to an increase in the dust masses of galaxies of all stellar masses by a factor of 3–4 between z∼ 0 and z∼ 0.3. The red sample exhibit a stronger luminosity evolution, for which the likely explanation is also dust mass evolution. Due to the lensing uncertainty we cannot rule out evolution in the temperatures of the red sample, although there is no temperature evolution in the other colour bins.
We fit the evolution in L250 (which is not dependent on the temperature) with the function L(z) ∝ (1 +z)α, and obtain indices for the three colour bins at
. We find α= 4.0 ± 0.2 for blue galaxies, α= 1.1 ± 0.4 for green and α=7.7±1.6 for red (the larger error on the evolution of red galaxies is due to the uncertainty on the lensing correction). Consistent rates of evolution are also derived for the dust masses. The evolution suggests a change in the dominant population of red galaxies, from passive systems at low redshift to obscured star‐forming systems at higher redshift.
The redshift evolution of galaxies classified as green seems to indicate a change in the nature of galaxies selected in this way, from a sample dominated by blue‐cloud‐like galaxies at low redshift and low Mr (or Mstar) to a sample more similar to the red bin at the higher redshift and brighter Mr.
Deriving TIR luminosities is problematic with only the SPIRE data, and we show that the results obtained depend sensitively on the SED model used (therefore the dust temperature). The low temperatures implied by the SPIRE colours indicate that cold models such as the H‐ATLAS SED fits (Smith et al. 2011b) are more appropriate than earlier models based on IRAS and SCUBA data (CE01).
The dust‐to‐stellar mass ratio is strongly anticorrelated with stellar mass, varying by more than an order of magnitude between Mstar∼ 108 and 1011 M⊙. This relationship appears to vary little between different optical colours, although it evolves toward higher values with increasing redshift. These results provide a challenge to dust formation models that rely on a purely stellar source of dust, implying a need for dust formation in supernovae and/or the ISM to reach the high dust masses in galaxies at the lower end of the stellar mass function.
We attempt to explore the obscuration of galaxies in our sample using the L250/LNUV ratio, and see that red and green galaxies may become more obscured at increasing redshift and decreasing stellar mass (results for the blue galaxies are unclear due to selection bias). This conclusion is dependent on the assumption that this ratio is a good tracer of obscuration, but due to uncertainties in the heating mechanism for cold dust this may not be valid. Nevertheless, such trends in obscuration are consistent with the trends of luminosity and dust/stellar mass.
This study is the first of its kind and provides some tantalizing glimpses of the characteristics of emission from dust in normal galaxies. Our understanding of the IR SED of optically selected galaxies and of the obscuration of star formation would be greatly improved by the availability of data covering the peak of the SED. In a future study, we hope to stack data from PACS and WISE in order to make a much more detailed analysis of the full SED.
NB wishes to thank Carlos Hoyos for useful discussions regarding the biases discussed in Appendix B, and Douglas Scott and Dave Clements for providing thoughtful comments on drafts of this paper. This publication makes use of Ned Wright’s Javascript Cosmology Calculator (Wright 2006), as well as the IDL Astronomy Library (Landsman 1993). H‐ATLAS is a project with Herschel, which is an ESA space observatory with science instruments provided by European‐led Principal Investigator consortia and with important participation from NASA. The H‐ATLAS website is http://www.h‐atlas.org/. GAMA is a joint European–Australasian project based around a spectroscopic campaign using the Anglo–Australian Telescope. The GAMA input catalogue is based on data taken from the SDSS and the UKIRT Infrared Deep Sky Survey. Complementary imaging of the GAMA regions is being obtained by a number of independent survey programmes including GALEX‐MIS, VST KIDS, VISTA VIKING, WISE, H‐ATLAS, GMRT and ASKAP providing UV to radio coverage. GAMA is funded by the STFC (UK), the ARC (Australia), the AAO, and the Participating Institutions. The GAMA website is http://www.gama‐survey.org/.
Footnotes
The cosmic dust mass density was estimated to be 0.0083 per cent of the baryon density at redshift 0.0 by Driver et al. (2007).
Model magnitudes are the best fit of an exponential and a de Vaucouleurs fit as described by Baldry et al. (2010).
The NIR photometry was not used in deriving stellar masses due to problems fitting the UKIDSS bands as discussed by Taylor et al. (2011).
The exception to this empirical approach is the assumption of the k‐correction, which is necessary to make fair comparisons between redshift bins. The k‐corrections are very well constrained by photometry in 5–11 bands and uncertainties are small compared with our colour bins.
The important distinction between the PSF and the PRF is that the PRF represents the discrete response function of the detector pixels to the continuous distribution of light (PSF) which reaches the detector from an astronomical point source.
Available from http://herschel.esac.esa.int/Docs/SPIRE/html/spire_om.html
This may be a slight underestimate of the redshift completeness at z < 0.35, in which case we would overestimate the total corrected flux by a maximum of 8.7 per cent.
A further issue with fitting SEDs is that β may vary with frequency. For example, Paradis et al. (2009) analysed data on the Milky Way from 100 μ m to 3.2 mm and showed that β was generally steeper at than
. This is an effect that we cannot take any account of without many more photometric points on the SED, but it could have some effect on our fitted temperatures and therefore luminosities.
The temperature fits and trends mentioned here are discussed further in Section 5.1.
mpfitfun available from Craig Markwardt’s IDL library:
http://cow.physics.wisc.edu/~craigm/idl/idl.html
Ultraluminous IR galaxies; LTIR > 1012 L⊙.
CE01 templates were obtained from http://www.its.caltech.edu/~rchary/
REFERENCES
Appendices
APPENDIX A: ESTIMATING FLUXES OF BLENDED SOURCES BELOW THE NOISE LEVEL
A1 Deblending individual sources
When stacking sources we must be careful not to overcount the flux in blended sources, which would lead to overestimation of stacked fluxes, especially in bins whose galaxies are more clustered, and in the longer wavelength images which have lower resolution. Since many sources are close to or below the noise level in the images, it is impossible to model them from the images themselves, and since submm flux is poorly correlated with optical flux, we have no other prior information to base models on. We must therefore make some simplifying assumptions in order to avoid overcounting.








(a) Two simulated point sources A and B in a one‐dimensional image, modelled with the same PSF but different normalizations, represented by the dashed and triple‐dot–dashed curves, respectively. The total flux in the image as a function of position x is given by equation (A1) and is represented by the solid line. (b) The thin black lines are the PSFs, pA and pB, centred at x= 200 and 300, respectively. Both PSFs have width σ= 50. The thick grey lines are the PSFs weighted for deblending, qA and qB, given by equation (A5). (c) The reconstructed sources given by the image data (solid line in panel a) weighted by the PSFs in the middle panel. The thin black lines are obtained using the unweighted PSFs p in equation (A2), and the thick grey lines using the weighted PSFs q in equation (A6).
On the other hand, the individual fluxes measured using equation (A7) are not exactly correct because blended flux is shared evenly between the two sources, whereas ideally it should be distributed according to the flux ratio of the sources. Hence, in this example the recovered flux of A is too low and that of B too high (this effect is worsened by closer proximity of the sources). However, with no prior information on the true flux ratio this is the best estimate that can be made.
To generalize this method to a two‐dimensional image with multiply‐blended sources, we start with an image array of the same dimensions as the data image, filled with values of zero. To this we add a PRF for every source in the input catalogue, centred on the pixel where the source is located and interpolated from the PSF with a small offset to correctly account for subpixel‐scale positioning. In the region of an isolated source, this image will be identical to the individual PRF, but where sources are blended it is equal to the sum of the PRFs (analogous to the sum of the thin black lines in Fig. A1b). Thus, all multiple blends are automatically accounted for, and the image we have constructed is analogous to the denominator in equation (A4) (i.e. pA+pB+⋅⋅⋅). For each source i we derive the weighting function gi(x, y) as the ratio of the PRF to a cut‐out region of our all‐PRFs image, as in equation (A4). In other words, the weight given to the flux in a pixel (x, y) is the value of the PRF of the target at (x, y) divided by the sum total of the contributions of all PRFs in that pixel. We measure the flux of each source by convolving the data image (Jy pixel−1) with the weighted PRF, as in equation (A6). We tested the method in simulated maps with realistic source densities and using the PRFs of the three SPIRE bands. We found that the correct mean and median fluxes were always recovered when stacking, and that convolving with the PRF without any deblending always led to an overestimate of the median and mean fluxes.
The deblending technique is carried out before any binning, so all catalogue sources in the field are automatically deblended, not just those in the same bin as the target in question. We note that the method is essentially very similar to the ‘global deblending’ technique described by Kurczynski & Gawiser (2010), which was demonstrated in that paper to minimize bias and variance in the stacks.
A2 Comparison to a statistical approach



The two‐point angular correlation function of the GAMA catalogue used in this work, averaged over the G09, G12 and G15 fields. Error bars are the standard errors between the values obtained in the three fields. A power‐law fit by linear regression (with free slope and normalization) gives w(θ) = (0.012 ± 0.001)θ(− 0.76 ± 0.03).
By substituting into equation (A8) the fit to w(θ) and the beam sizes at 250, 350 and 500 μ m (18, 25, and 35 arcsec FWHM), we obtained the fractional contribution to stacked fluxes due to blending. The corrected flux is then obtained by multiplying the stacked flux by C= 1/(1 +F). Results are summarized in Table A1. We see that this average statistical correction factor is broadly similar to the typical correction to individual fluxes using the deblending technique (equation (A7)). The deblending technique has the advantage of correcting the flux of each target individually; thus, however, the sample is binned the appropriate average correction is made. This is not the case using the correlation function of the entire catalogue, since that only gives a single correction factor. It would be possible to calculate separate correction factors for each bin, using the cross‐correlation between the bin and the full sample (as in Bourne et al. 2011), but the uncertainties would be significantly increased since each bin contains a relatively small number of objects.
APPENDIX B: SIMULATIONS OF BIAS IN THE MEDIAN
We simulated power‐law distributions of fluxes with Gaussian noise added, and found that in certain cases the median measured flux (true flux plus noise) was biased high with respect to the median true flux, as a result of noise in the measured values. The amount of bias depends on (i) the flux limits of the distribution; (ii) the 1σ noise level in relation to the flux limits; and (iii) the slope of the power law describing the underlying flux distribution. The bias only becomes apparent when considering distributions with a median signal‐to‐noise ratio less than 5σ.
In order to ascertain the level of bias that could be present in our stacks, we must consider the shape of the underlying (true) flux distribution of sources in the stacks. In order to do so we must look at the distribution of fluxes in the much brighter H‐ATLAS‐detected sample, which are not dominated by noise. The situation is helped considerably by the fact that we bin the sample according to Mr and redshift, meaning that each bin is likely to have a limited range of fluxes with a distribution determined by the LF. To estimate the flux limits in a given bin, we can look at the submm fluxes of the galaxies with the highest optical fluxes. D11 show the distribution of r magnitude and S250 in the SDP ID catalogue. At r < 16 the catalogue contains the full range of submm fluxes, which at a given r spans 1.3 dex in S250. We inspected the Phase 1 reliable IDs with good spectroscopic redshifts (250‐μm sources matched to SDSS data using the same method as Smith et al. 2011a; these will be described by Hoyos et al., in preparation). We found the same range of 1.3 dex in r < 16 sources in Phase 1 as in SDP.
We therefore assume that any bin of Mr and redshift will have fluxes within a limited range. The actual limits of this range, Smax and Smin, will depend on the range of Mr and z sampled by the bin (although the majority of fluxes at all Mr and z < 0.35 will lie within the range 0.1–100 mJy). However, since the range in a bin is determined by the LF, we can safely assume that the logarithmic range R= log 10(Smax/Smin) = 1.3 will be consistent in all bins. Similar ranges of 1.3 dex are expected at all three SPIRE wavelengths (although of course fluxes at different wavelengths will be offset with respect to each other due to the shape of the SED). Within these limits the fluxes are assumed to follow a power‐law distribution: in the Phase 1 IDs this is approximately described by differential number counts dN/dS∝S−2.5, although this is unreliable due to the incompleteness of the ID catalogue. A similar slope is apparent in the 250 μ m number counts from P(D) analysis of the HerMES maps (Glenn et al. 2010), although there is some evidence in that data for a shallower slope at S250≲ 10 mJy. We must, however, remember that the number counts in our bins will not follow the overall submm number counts, since our bins contain only a narrow distribution of Mr and particularly of redshift. In sufficiently narrow redshift bins the distribution of the counts will approach the submm LF: this has a slope of −1.01 at the faint end (L < L★) of the H‐ATLAS LFs derived by D11. We can therefore be confident that in finite redshift bins the slope will be intermediate between −1 and −2.5. Lapi et al. (2011) have modelled the number counts to fit the data from H‐ATLAS, HerMES and BLAST. We split these into redshift bins (δz= 0.05) and found that the faint‐end differential counts at z < 0.35 follow a slope of approximately −2. Our redshift bins have similar widths (0.05 ≲δz≲ 0.11); hence, it is reasonable to assume that the flux distributions in the bins will have a similar slope.
For these reasons we conclude that a simulated flux distribution that spans a range R= 1.3 dex with a power law dN/dS∝S−2 is representative of our bins. We therefore created 16 simulated distributions with these parameters, with a range of lower limits between 0.1 and 30 mJy: the corresponding upper limits are 20 times larger, given by R= 1.3. The resulting distributions have medians in the range 0.3–70 mJy, which is sufficient to cover all the bins in our real data. We added random noise to these true flux distributions, where the noise values were drawn from a Gaussian distribution with zero mean, and σ given by the average total noise level (instrumental plus confusion) in the Phase 1 maps: σ= 6.7, 7.9 and 8.8 mJy beam−1 at 250, 350 and 500 μ m, respectively. We then compared median measured flux (true flux plus noise) to the median true flux, and quantified the bias factor as the ratio of measured to true median flux.
We corrected the measured median fluxes in our stacked data by interpolating the relationship between true median flux and measured median flux from the simulations. This was done separately for each of the three bands (since the bias behaves differently as a result of the different noise levels). All correction factors are in the range 0.6–1.0, which is generally small in comparison to the range of stacked fluxes resulting from true differences between the bins.
For these corrections we have assumed that the slope of differential number counts is −2 and that fluxes in each bin lie in a range given by R= 1.3 dex. However, the bias factor depends on both of these variables, as we show in Fig. B1. To test the sensitivity of our results to these parameters, we tried correcting our results by the bias factors obtained using different values. We tried slopes of dN/dS ranging from −0.5 to −4 and R values between 1 and 2. The level of bias varies, indicating that there is some uncertainty in the correction, but we note that the corrections for slopes between −1.5 and −2 give almost identical corrections (for R= 1.3). We are sure that the slope is between −1 and −2.5, and over this range the bias corrections do not vary by more than 15 per cent. Increasing the range R has a greater impact on the correction; however, the value of 1.3 is well motivated and in narrow bins of Mr and redshift there is good reason to expect the flux range to be limited.

Results of a set of stacking simulations: (a) Comparing median measured flux and median true flux, for distributions in various flux ranges. The range in each simulation is given by R= log 10(Smax/Smin) = 1.3 (the length of the grey horizontal bar), and the simulations were run 16 times with different Smin and Smax values to produce distributions with a range of median (true) fluxes. Each coloured line in the figure connects 16 data points, one point from each simulation, showing how the median measured flux (after adding noise) varies as the median true flux is varied. Noise is drawn from a Gaussian distribution centred on zero, with σ= 6.7, 7.9 and 8.8 mJy beam−1 at 250, 350 and 500 μ m, respectively. Different coloured lines correspond to sets of simulations with different slopes of dN/dS between −1 and −4. (b) Bias factor (ratio of median measured to median true flux) as a function of median true flux, for the same set of simulations, showing how the bias depends on the slope of dN/dS for R= 1.3. Note that bias does not vary monotonically with slope for fixed R, but is greatest for a slope of −1.5 in this case. (c) Bias factor as a function of median true flux, showing the effect of increasing the range of true fluxes. Different lines correspond to sets of simulations with different values of R between 1 and 2; in each of these, the slope of dN/dS is −2.
Crucially, all of our conclusions remain valid, and all trends remain significant, for any combination of slope (−0.5 to −4) and range (1–2). This is equally true if we make no corrections.
An alternative to these corrections would be to use the mean instead of the median when stacking. The mean is not altered by the effects of noise as we found the median to be; however, the mean will be highly biased simply by the skewed shape of the distribution. In fact the bias in the mean at all flux levels is equal to the maximum bias seen in the median at the lowest flux levels. The reason for this is simple: at the lowest flux levels, where noise dominates over the true flux, the distribution of measured fluxes closely resembles the noise distribution – a symmetrical Gaussian – but instead of being centred on zero as the noise is, it has the same mean as the true flux distribution. The mean is not altered by the addition of noise if the mean noise is zero. The measured flux distribution is therefore symmetrical in this case; hence, the median is equal to the mean. Thus, the maximum amount of bias in the median occurs when the shape of the distribution becomes dominated by the noise rather than by the true fluxes.
In conclusion, we choose to use the corrected median estimator rather than the mean, because at all but the lowest fluxes the median is a better descriptor of the underlying (true) flux distribution. At the lowest fluxes, the median becomes biased and approaches the mean of the distribution. However, as a result of our binning scheme we can make a good estimate of the shape of the underlying flux distribution and can be reasonably certain of the correction factors for the bias. The fact that all of our ultimate conclusions remain valid for any reasonable choice of the distribution indicates the robustness of our results to these corrections.
APPENDIX C: POSTAGE STAMPS, HISTOGRAMS AND SPECTRAL ENERGY DISTRIBUTIONS OF STACKS
In Figs C1–C4 we choose some example bins to show the stacked postage‐stamp images, the distribution of measured fluxes, and the SED data and model. The figures below contain the following information.

A stack of galaxies with blue g−r colours, median Mr=−21.1 and median z= 0.11. This stack is one of the brightest in submm flux, and contains 1567 objects. Stacked, PSF‐filtered postage stamps are shown with a logarithmic grey‐scale between 0.0001 and 0.05 Jy beam−1 and signal‐to‐noise ratio contours at 5, 10, 15, 20, 25, 50, 100, 150, 200, 250. Histograms of the measured fluxes in the stack (red) and in a random background stack (blue) are shown with the KS probability that they were drawn from the same distribution. Also plotted is the rest‐frame SED fit with the temperature indicated, assuming β= 2. More details are given in Appendix C.

A stack of galaxies with green g−r colours, median Mr=−21.1 and median z= 0.15. This stack has moderate submm flux, and contains 570 objects. Plots are as in Fig. C1 except that the postage stamps are plotted on a logarithmic grey‐scale between 0.0001 and 0.02 Jy beam−1.

A stack of galaxies with red g−r colours, median Mr=−21.7 and median z= 0.21. This stack has among the faintest submm fluxes, and contains 1111 objects. Plots are as in Fig. C1 except that the postage stamps are plotted on a logarithmic grey‐scale between 0.0001 and 0.005 Jy beam−1.

A stack of galaxies with red g−r colours, median Mr=−22.5 and median z= 0.27. This stack has among the faintest submm fluxes, and contains 1002 objects. Plots are as in Fig. C1 except that the postage stamps are plotted on a logarithmic grey‐scale between 0.0001 and 0.004 Jy beam−1.
Postage stamps of the stack in the three SPIRE bands are shown with contours at signal‐to‐noise ratio levels of 5, 10, 15, 20, 25, 50, 100, 150, 200 and 250. The images are each 41 pixels square, corresponding to 3 arcmin and 25 arcsec at 250 μ m and 6 arcmin and 50 arcsec in the other two bands. These images are illustrative only, and were made by stacking in the PSF‐filtered, background‐subtracted SPIRE maps (we do not measure fluxes in these maps but using the deblended filter method described in Appendix A). The flux and signal‐to‐noise ratio reached in the central pixel of each postage stamp agree with the stacked values in Table 1, although the postage stamps show slightly boosted fluxes due to blending not being accounted for. Similar agreement was found in all stacks, although only a selection are shown here for brevity.
We also show histograms of the measured fluxes in the stack (red) and of a set of fluxes measured at random positions in the background (blue), as described in Section 4. The number shown is the KS probability that these two samples were drawn from the same distribution. Beneath these is plotted the single‐component SED fitted to the three stacked fluxes (plotted in the rest frame), assuming β= 2. The SED is fitted to obtain the temperature which is printed over the SED, as described in Section 5.
SUPPLEMENTARY MATERIAL
Additional Supporting Information may be found in the online version of this article
Table 1. Results of stacking in bins of g − r colour, redshift and absolute magnitude (Mr).
Please note: Wiley-Blackwell are not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.
Author notes
Herschel is an ESA space observatory with science instruments provided by European‐led Principal Investigator consortia and with important participation from NASA.