-
PDF
- Split View
-
Views
-
Cite
Cite
R. Cid Fernandes, G. Stasińska, A. Mateus, N. Vale Asari, A comprehensive classification of galaxies in the Sloan Digital Sky Survey: how to tell true from fake AGN?, Monthly Notices of the Royal Astronomical Society, Volume 413, Issue 3, May 2011, Pages 1687–1699, https://doi.org/10.1111/j.1365-2966.2011.18244.x
- Share Icon Share
Abstract
We use the WHα versus [N ii]/Hα (WHAN) diagram introduced by us in previous work to provide a comprehensive emission-line classification of Sloan Digital Sky Survey galaxies. This classification is able to cope with the large population of weak line galaxies that do not appear in traditional diagrams due to a lack of some of the diagnostic lines. A further advantage of the WHAN diagram is to allow the differentiation between two very distinct classes that overlap in the low-ionization nuclear emission-line region (LINER) region of traditional diagnostic diagrams. These are galaxies hosting a weakly active galactic nucleus (wAGN) and ‘retired galaxies’ (RGs), i.e. galaxies that have stopped forming stars and are ionized by their hot low-mass evolved stars.
A useful criterion to distinguish true from fake AGN (i.e. the RGs) is the value of ξ, which measures the ratio of the extinction-corrected Hα luminosity with respect to the Hα luminosity expected from photoionization by stellar populations older than 108 yr. We find that ξ follows a markedly bimodal distribution, with a ξ≫ 1 population composed by systems undergoing star formation and/or nuclear activity, and a peak at ξ∼ 1 corresponding to the prediction of the RG model. We base our classification scheme not on ξ but on a more readily available and model-independent quantity which provides an excellent observational proxy for ξ: the equivalent width of Hα. Based on the bimodal distribution of WHα, we set the practical division between wAGN and RGs at WHα= 3 Å.
Five classes of galaxies are identified within the WHAN diagram:
pure star-forming galaxies:
and WHα > 3 Å;
strong AGN (i.e. Seyferts):
and WHα > 6 Å;
weak AGN:
and WHα between 3 and 6 Å;
RGs (i.e. fake AGN): WHα < 3 Å;
passive galaxies (actually, lineless galaxies): WHα and W[N ii] < 0.5 Å.
A comparative analysis of star formation histories and of other physical and observational properties in these different classes of galaxies corroborates our proposed differentiation between RGs and wAGN in the LINER-like family. This analysis also shows similarities between strong and weak AGN on the one hand, and retired and passive galaxies on the other.
1 INTRODUCTION
The goal of a classification scheme is to rationally organize a large collection of objects (animals, plants, books, galaxies) into a fixed number of smaller classes without leaving any object aside, in order to help dealing with this large collection. There are obviously many ways to classify objects, depending on what properties one is mostly interested in. For galaxies, one could, for example, be mainly interested in the morphology (Hubble 1936), colours (de Vaucouleurs 1960), spectral features (Morgan & Mayall 1957) etc. It is not only the considered object properties that change from one classification system to another, but also the philosophies underlying the classification process (hierarchical or non-hierarchical, supervised or not supervised,...) all of which have their advantages and drawbacks. Ideally, a classification scheme should be objective and allow easy incorporation of new objects into predefined categories. The last criterion, for instance, is not fulfilled by the classifications using principal component analysis (Sodré & Cuevas 1994; Connolly et al. 1995). Most importantly, the classification scheme must be useful, Hubble’s famous tuning fork being a classical example. Another one is the emission-line classification scheme pioneered by Baldwin, Phillips & Terlevich (1981) (see also Pastoriza 1968), which has been widely used over the last three decades.
The advent of large surveys of galaxies providing extensive collections of homogeneous data sets not easily tractable with conventional methods fostered new classification schemes based on automatic procedures such as neural networks (Folkes, Lahav & Maddox 1996), k-means cluster analysis of entire galaxy spectra (Sánchez Almeida et al. 2010) or on human-eye analysis of galaxy morphology by hundreds of thousands of volunteers, as in the beautiful Galaxy Zoo project (Lintott et al. 2010).
Emission-line classifications of galaxies are among the easiest to carry out (once the emission-line intensities are measured) and allow one to deal with such issues as star formation, chemical composition or nuclear activity. They have, for example, helped to gain insight into the nature of warm infrared galaxies (de Grijp et al. 1992), of luminous infrared galaxies (Smith, Lonsdale & Lonsdale 1998), of galaxies in compact groups (Coziol et al. 1998) or into the connection between active galactic nuclei (AGN) and their host galaxies (Kauffmann et al. 2003). The most famous emission-line diagnostic diagram is that based on the [O iii]λ5007/Hβ versus [N ii]λ6584/Hα line ratios1[dubbed the BPT diagram after Baldwin et al. 1981]. This diagram was built to pin down the main source of ionization in the spectra of extragalactic objects, and became one of the major tools for the classification and analysis of emission-line galaxies (ELGs) in the Sloan Digital Sky Survey (SDSS; York et al. 2000).
As emphasized by Cid Fernandes et al. (2010, hereafter CF10), the BPT diagram leaves unclassified a large proportion of ELGs in the SDSS due to quality requirements on four emission lines. The Kewley et al. (2006) classification is even more demanding, as it requires the intensities of as many as seven lines. To remedy this situation, CF10 have proposed a much more economic diagram that allows one to attain the same objectives using only two lines, Hα and [N ii], which are generally the most prominent in the spectra of galaxies. This is a diagram plotting the Hα equivalent width versus [N ii]/Hα, which was dubbed the EWHαn2 diagram in that paper and which we will from now on refer to as the WHAN diagram, for simplicity.2 The borderlines between different classes of galaxies in the WHAN diagram were defined by optimal transpositions of the Kauffmann et al. (2003), Stasińska et al. (2006, hereafter S06) and Kewley et al. (2006) borderlines on to the WHα versus [N ii]/Hα plane.
The categories of ELGs considered by Kauffmann et al. (2003) and Kewley et al. (2006) comprised star-forming galaxies (SF), Seyferts and LINERs.3 This latter category is supposed to comprise objects hosting low-level nuclear activity. However, as demonstrated by Stasińska et al. (2008, hereafter S08), the LINER region in the BPT diagram also contains galaxies that have stopped forming stars and are actually ionized by the hot low-mass evolved stars (HOLMES) contained in them. This idea is not new. It was already put forward to explain the emission-line spectrum of elliptical galaxies (Binette et al. 1994; Macchetto et al. 1996) and even earlier in different contexts (Hills 1972; Terzian 1974; Lyon 1975; Sokolowski & Bland-Hawthorn 1991). This idea is recently gaining support (Kaviraj 2010; Masters et al. 2010; Schawinski et al. 2010), especially with the detailed study of nearby galaxies (Annibali et al. 2010; Eracleous, Hwang & Flohic 2010; Sarzi et al. 2010). As shown by CF10, when taking into account the ‘forgotten population’ of weak line galaxies, the proportion of such ‘retired galaxies’ (RGs) increases dramatically. It is therefore essential to find ways to distinguish these RGs from truly ‘active’ galaxies, i.e. galaxies which contain a (weak) AGN. This is one of the goals of the present paper.
Our second goal is to link the universe of ELGs with the universe of galaxies without emission lines (often dubbed ‘passive galaxies’, PG). We will see that RGs and PGs are in fact very similar objects.
With our WHAN diagram, we will thus be able to provide a comprehensive classification of all SDSS galaxies, including PGs.
The paper is structured as follows. Section 2 presents the samples of galaxies used to work out our classification and our starlight data base from which the properties of galaxies that we use are extracted. Section 3 lists the initial definition of galaxy classes: SF, Seyferts, LINERs and PGs. Section 4, the core of this paper, sorts out RGs from AGN among LINER-like galaxies and proposes a practical criterion to identify them. Section 5 describes our comprehensive classification of galaxies based on the WHAN diagram and leading to five classes: SF, strong AGN (sAGN), weak AGN (wAGN), RG and PG. Section 6 examines the star formation histories (SFHs) of galaxies within our new classification scheme, placing RGs in the context of the galaxy population in general. Section 7 discusses the distributions of a series of physical and observed properties in the different galaxy classes, to test the pertinence of our new classification. Finally, Section 8 summarizes our main results.
2 SAMPLES AND DATA PRODUCTS
We draw from the nearly 700 thousand galaxies from the main galaxy sample (Strauss et al. 2002) in the seventh data release of the SDSS (Abazajian et al. 2009). Besides the SDSS data, we make use of a suite of starlight-derived products available at http://www.starlight.ufsc.br, like stellar masses, stellar extinctions and the full SFHs (Cid Fernandes et al. 2005). Emission-line properties are measured after subtraction of the underlying stellar component determined with the starlight fit, as described in Mateus et al. (2006).
Two samples are used throughout this work. The ‘full sample’ (hereafter ‘sample F’) is selected by requiring (i) a signal-to-noise ratio (S/N) ≥10 in the continuum around 4750 Å, (ii) no faulty pixels within ±15 Å of Hα and [N ii], (iii) z < 0.17 to eliminate the few objects at higher redshift which survive the previous criterion and (iv) z > 0.04 to minimize apertures effects. In total, sample F contains 379 330 galaxies.
Sample V is built from a volume-limited sample satisfying 0.04 < z < 0.095 and absolute r-band magnitude Mr < −20.43, to which we further apply criteria (i) and (ii) above. These latter cuts imply only a 6 per cent reduction, with little loss of completeness at the faint end, so this sample is effectively limited in volume. Sample V comprises 152 749 objects. By choosing z < 0.095 we eliminate cases where Hα and [N ii] fall in a region of strong sky lines, so that criterion (ii) is fulfilled.
These samples fulfil the basic requirements for this work. Condition (i) ensures meaningful spectral fits, necessary for stellar population analysis and the production of templates to aid emission-line measurements, while (ii) ensures that the detection of Hα and [N ii] (the most important lines in this study) is not hampered by artefacts like bad pixels or sky residuals. At the same time, neither of these samples is biased for or against the presence of emission lines, which allows us to treat systems with strong, weak or no emission lines on an equal footing.
Sample F has the advantage of being more numerous and of extending to lower luminosities, while sample V is more adequate for demographic studies and unbiased comparisons among subsamples. The novelties in this work, namely the criterion to separate true from fake AGN and the resulting revision of conventional spectral taxonomy, are mainly related to the most luminous galaxies, so that, in practise, we could use either sample V or F. In what follows, unless stated otherwise, we use sample V.
3 PRELIMINARY GALAXY CLASSES
We subdivide galaxies into different classes according to their emission-line properties. The first division is among galaxies with and ‘without’ emission lines, denoted ELGs and PGs, respectively.
3.1 Passive galaxies
PGs are defined as those with very weak or undetected emission lines. Galaxies are deemed passive if the equivalent widths (Wλ) of both Hα and [N ii] fall below 0.5 Å, a criterion which applies to 19 (20) per cent of sample V (F). Since these are the two strongest optical lines, this criterion automatically implies that other lines will be even weaker in general (e.g. Brinchmann et al. 2004, CF10).
In the literature one often finds PGs and ELGs defined in terms of limits on the S/Nλ of emission lines (e.g. Miller et al. 2003; Brinchmann et al. 2004; Mateus et al. 2006). We consider a Wλ-based criterion more appropriate, as it is based on a more direct measurement of the line strength and less explicitly dependent on the quality of the data. In addition Wλs have an astrophysical meaning. We emphasize that the adopted limiting value of 0.5 Å is not really meaningful. As will be seen later, none of the discussion presented in this paper would have changed if had we adopted a slightly different limit. We also note in passing that the denomination ‘passive’ for ‘lineless’ systems is adopted just for compatibility with current nomenclature, with no evolutionary connotation as the word would suggest.
3.2 Emission-line galaxies: SF, Seyferts and LINERs
ELGs are defined as everything else, i.e. any source where either WHα or W[N ii] is ≥0.5 Å. Notice that this stretches the definition of an ELG to its limit, since, in extreme cases, only one of Hα or [N ii] is detected! Sample V (F) has 124 410 (303 194) ELGs, 94 (92) per cent of which have both lines stronger than 0.5 Å.
ELGs are normally subdivided into SF and AGN-like classes on the basis of diagnostic diagrams involving at least four lines, like the BPT diagram ([O iii]/Hβ versus [N ii]/Hα). As shown in CF10, the weakness of either or both of Hβ and [O iii] implies that such traditional classification schemes leave large numbers of ELGs unclassified. This problem is particularly severe in the right wing of the BPT diagram, where AGN-like systems reside, but well over half of the galaxies have unusable Hβ and/or [O iii] fluxes. In that same paper we introduced alternative diagnostic diagrams able to cope with galaxies with weak or undetected Hβ and/or [O iii], the most economic of which is WHα versus [N ii]/Hα (WHAN diagram). We base all our classifications on this diagram, which, unlike any other, allows us to classify all galaxies.
The WHAN diagram is shown in Fig. 1. The vertical line at corresponds to the optimal transposition of the S06 BPT-based SF/AGN division, designed to differentiate sources where star formation provides all ionizing photons from those where a harder ionizing spectrum is required. Similarly, the division at WHα = 6 Å represents an optimal transposition of the Kewley et al. (2006) division between Seyferts and LINERs on the basis of seven emission lines. Unlike Kewley et al. (2006) and other studies (e.g. Kauffmann et al. 2003; Mateus et al. 2006), we do not define a ‘SF+AGN composite’ category, and so our Seyfert/LINER borderline goes all the way to the SF/AGN frontier. (The reader is referred to CF10 for a discussion of the pros and cons of this diagram, as well as for a discussion of why we prefer not to define a composite category.) Sample V splits into percentage fractions of 23, 21 and 37 SF, Seyferts and LINERs, respectively. PGs account for the remaining 19 per cent. The corresponding fractions for sample F are very similar: 25, 19, 36 and 20 per cent.
![The WHAN diagram, used to establish emission-line classes. The line labelled S06T represents the optimal transposition of the S06 SF/AGN spectral classification scheme, while the line at WHα = 6 Å represents the transposed version of the Kewley et al. (2006) Seyfert/LINER class division. Dotted lines mark WHα = 0.5 and W[N ii] = 0.5 Å limits, below which line measurements are uncertain. Passive galaxies (WHα and W[N ii] < 0.5 Å; red dots) lie below the dashed line. Dots correspond to sample F and contours to sample V. Contours are drawn at number densities: 0.1, 0.2, 0.4 and 0.6 of the peak value. As in other figures in this paper, only one-tenth of the objects is plotted to avoid overcrowding.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/413/3/10.1111/j.1365-2966.2011.18244.x/2/m_mnras0413-1687-f1.jpeg?Expires=1750237534&Signature=OrYCoJlHWIWAnWpaN-tTMM4VbGEF3AojNujItJMSqA5phTac3H2FjUf-Gnpn9mb4ApArkA14kQ7rSbM-jSziuq6sxDthDop7FNx-xQRUBScWu3HpZjs3jgOd4Th1uDcLwcL72CJdvwKbe86gUYSM~ZQJJfIuWAfIwKc1s4LJK9PtBR8OEwQAq-lQzgqQku5~019X4t1jQlK~7i8zYVHzmEgqADRlUOrTWo6YamQx4ePXLq-Q1PGoDm5AKCjk2onpNsMpzMlpt7rvnfxK8RPt0avkI1TN6qaIArWbnE1-Im3XhgXNLfGqMCJZ6Lj1~XA8~1j5meW9LscDRQkfGa6i-A__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
The WHAN diagram, used to establish emission-line classes. The line labelled S06T represents the optimal transposition of the S06 SF/AGN spectral classification scheme, while the line at WHα = 6 Å represents the transposed version of the Kewley et al. (2006) Seyfert/LINER class division. Dotted lines mark WHα = 0.5 and W[N ii] = 0.5 Å limits, below which line measurements are uncertain. Passive galaxies (WHα and W[N ii] < 0.5 Å; red dots) lie below the dashed line. Dots correspond to sample F and contours to sample V. Contours are drawn at number densities: 0.1, 0.2, 0.4 and 0.6 of the peak value. As in other figures in this paper, only one-tenth of the objects is plotted to avoid overcrowding.
The dotted lines in Fig. 1 correspond to min (WHα, W[N ii]) = 0.5 Å. Points below these lines (painted in orange or red) are thus very uncertain,4 but are nevertheless plotted to remind the reader of the existence of such objects. There is indeed some advantage in including lineless galaxies in an emission-line classification scheme since the absence of emission lines is also an information! The region below the dotted lines includes both ELGs with only one line ≥0.5 Å, as well as PGs (below the dashed lines, in red) for which we have Hα and [N ii] fluxes in our data base.
4 IDENTIFYING RETIRED GALAXIES
Ideally, splitting ELGs into SF and AGN would separate systems whose emission lines are powered exclusively by young stars from those where accretion on to a central supermassive black hole dominates (or at least contributes to) the ionizing radiation field. However, as reviewed in the introduction, this binary scheme does not account for a third phenomenon: photoionization by HOLMES. These RGs are numerous in large-scale surveys (S08), where they occupy LINER-like locations in the commonly used diagnostic diagrams, and are thus erroneously associated to non-stellar activity.
This section investigates ways of identifying RGs. We start with a theoretically motivated definition, but end up with a very simple empirical criterion to diagnose RGs among ELGs.
4.1 Ionizing photons budget: the ξ ratio
By definition, RGs are systems whose old stellar populations suffice to explain the observed emission line properties. Since we already know that nebulae photoionized by HOLMES cover most regions in intensity ratios diagrams (S08), we shall adopt an operationally simple energy-budget criterion and define RGs as galaxies whose old stellar populations are capable of accounting for all of the observed Hα emission.

This way of identifying RGs was previously employed by S08. In what follows we open up the details and discuss the caveats related to the estimation of ξ.
4.2 Computation of the expected Hα luminosity


The starlight fits used in this work employ the same base of SSPs from Bruzual & Charlot (2003, hereafter BC03) described by Mateus et al. (2006). Naturally, the spectral fits are carried out in the optical range, but the BC03 models also provide predictions for the ionizing energies. This is a key element of our analysis, so it is worth to open a parenthesis to examine these predictions more closely.
4.3 SSP models beyond 1 Rydberg
Fig. 2 shows the evolution of SSP properties of interest to this work. Panels on the left (a–d) correspond to BC03 models of six different metallicities used in our analysis. These are discussed first.

Evolution of the ratio between the He i and H i ionizing photon rates (a), H i ionizing photon rate (b; in units of s−1 M−1), Hα continuum (c; in units of L⊙ Å−1 M−1
) and Hα equivalent width (d) for simple stellar populations in the BC03 models of metallicities between Z★ = 0.005 and 2.5 Z⊙. Panels on the right are Z⊙ models from five different sources (including BC03 for comparison). See text for details. The right axis scale in panels (c) and (g) gives the mass-to-light ratio in the Hα continuum: κHα=c−1Hα (in units of M⊙ L−1
Å).
Fig. 2(a) shows the evolution of the hardness of the ionizing spectrum, as measured by the ratio of He i to H i ionizing photons. The difference between the soft ionization field produced by OB stars (t < 107 yr) and that produced by HOLMES is clearly visible. Panel (b) shows the specific ionizing photon rate, qH(t, Z), the most relevant quantity for our analysis. One sees the well-known drop by ∼5 orders of magnitude in qH from the H ii region phase to the time when the HOLMES regime sets in at 108 yr and qH approximately stabilizes at a level of the order of 1041 s−1 M−1. The specific continuum around Hα, cHα (equivalent to the inverse of the mass to light ratio, κHα), however, keeps decreasing with time (panel c) and therefore the predicted WHα (bottom panel) goes through a minimum at 108 yr and grows to 1.5–2.5 Å at t≳ 1010 yr. This is enough to explain a significant proportion of the ELGs in the LINER zone of Fig. 1.
The right-hand column of Fig. 2 shows the predictions obtained for other evolutionary synthesis models at solar metallicity and Chabrier (2003) initial mass function (IMF; the same one used for the BC03 models on the left-hand panels). Red lines show results for a preliminary version of Charlot & Bruzual models. Blue lines are used for Popstar models (Mollá, García-Vargas & Bressan 2009) with the Lejeune, Cuisinier & Buser (1997) library and NLTE models by Rauch (2003) and ‘Padova 1994’ tracks (version updated by Mollá & García-Vargas 2000). Green lines represent Pegase models (Fioc & Rocca-Volmerange 1997) with their 200 Å–10 μm stellar library (see their section 2.2.1 for more details). Two sets of evolutionary tracks are shown for Pegase: light green is used for Bressan et al. (1993) tracks up to thermally pulsing asymptotic giant branch (TP-AGB), dark green shows the set for Schaller et al. (1992) and Charbonnel et al. (1996) tracks. Both sets use Schoenberner (1983) and Bloecker (1995) for the post-asymptotic giant branch (PAGB). For reference, the BC03 predictions for Z⊙ are repeated (black lines).
Despite a general qualitative agreement, there are significant quantitative differences. For ages above 108 yr, the hardness ratio varies from 10 to 20 per cent, while qH differs by up to 1 dex from one model to another. Those differences could be due to differences in stellar tracks, model atmospheres and even interpolation techniques. For WHα, the greatest difference is above 109 yr for the BC03 models, which are ∼0.5 Å larger than the others. We have also compared different IMFs (e.g. Salpeter 1955; Kroupa 2001). For a given metallicity in a model, different IMFs result in variations from 0.2 up to 0.5 dex in qH and cHα, which is typically of the same order of variations for different models with the same IMF.
For the sake of internal consistency with our starlight analysis of SDSS galaxies, in what follows we adopt the predictions from BC03, but this quick compilation illustrates that there are still plenty of uncertainties in model predictions for the Lyman continuum of old stellar populations. Naturally, these uncertainties propagate to our estimate of ξ. Fortunately, there is an empirical alternative to circumvent the uncertainties related to this choice (Section 4.8).
4.4 Extinction correction
Let us now turn to the numerator of equation (1). The caveat in this case is that a nebular extinction is needed to correct LobsHα to LintHα, which in turn requires good measurements of both Hα and Hβ. While this is not a problem for ELGs in the SF and Seyfert categories, most of the LINER-like systems do not comply with this requirement because of their feeble Hβ.
One way to circumvent this difficulty is to estimate Hα/Hβ in terms of something else. To do this, we first select LINERs with reliable Hα/Hβ, and then calibrate the relation between nebular (AnebV) and stellar (A★V) extinctions (as done by Asari et al. 2007 for SF galaxies), such that A★V can be used to estimate AnebV. In practice, we impose WHα and WHβ > 0.5 Å, and Hα/Hβ > 3 (such that AnebV > 0),5 which leaves us with only ∼30 per cent of the full list of LINERs. Nebular and stellar extinctions are indeed correlated for this subsample (Spearman rank correlation coefficient RS = 0.38), albeit with substantial scatter. A fit through the median relation yields AnebV = 1.56 A★V+ 0.53, with typical residuals of ∼0.5 mag. Objects used in this calibration have AnebV = 1.0 ± 0.5 (median ± semi-interquartile range), while for LINERs as a whole the above relation leads to AnebV = 0.8 ± 0.2. This method most likely overestimates AnebV, as there is a trend of increasing AnebV with increasing WHα, and Hα is much weaker in the full sample than in the subset used to calibrate the AnebV(A★V) relation (WHα = 1.5 versus 3.7 Å in the median, respectively).
Another alternative is to apply the extinction correction when the data allow (WHα and WHβ > 0.5 Å, and Hα/Hβ > 3), and assume AnebV = 0 otherwise.
These methods lead to estimates of ξ which should bracket the correct solution. Since AnebV = 1 implies a 0.32 dex increase in LintHα, for the typical inferred AnebV values these two methods should differ by ∼0.2–0.3 dex in ξ.
4.5 Results: the bimodal distribution of ξ
The ξ values derived range from ∼0.1 to 1000. This is illustrated in Fig. 3(a), which shows the distribution of ξ values for all ELGs in sample V. The distribution is strongly bimodal, with a high ξ peak at ∼30 and a lower one centred close to ξ = 1.
![Histograms of the ratio between the intrinsic Hα luminosity LHα to that predicted to originate from photoionization by HOLMES (equation 1) for ELGs in our volume-limited sample, and for two choices of extinction correction for objects with unreliable Hα/Hβ: estimate AnebV from A★V (solid lines), and no correction (dotted). Panel (a) includes all sources, while in panel (b) we select galaxies whose populations younger than 108 yr contribute less than X8 = 10 per cent to the continuum at λ = 4020 Å. Panel (c) shows the ξ distribution which results from requiring convincing detections of Hβ and [O iii], as well as Hα and [N ii].](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/413/3/10.1111/j.1365-2966.2011.18244.x/2/m_mnras0413-1687-f3.jpeg?Expires=1750237534&Signature=dC61HtVEleiSz52YCKLpfXgJSye6p4OVQ1tYumrEuecLPZ7Dl~7~18yL809OZM9JldFYm8t332NjEAywod2C2VK3M8Xu7fRwSWIjwHfcH3O~IzQgp3N66xMfF0i5gr3YKhS3EDNOWpfwT0cuIAd40fsbck9LnHBHZkp3jc9uo5toCbX~dITSxAvkUtW18v7czhsEygri1zpP4KcE~svLEYVSWjNe-3MFxpYg9Ec5qX1vf5B95yRcMsFKjG4bxsZ-UbAy30Sm1W8CwNSSnc2m7yDF09BJ9jfKrT2XdODRmiRfoiqwVRyCfecc2NMFbNtrbGK4LiJM4yJ77rbxnJjo2A__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Histograms of the ratio between the intrinsic Hα luminosity LHα to that predicted to originate from photoionization by HOLMES (equation 1) for ELGs in our volume-limited sample, and for two choices of extinction correction for objects with unreliable Hα/Hβ: estimate AnebV from A★V (solid lines), and no correction (dotted). Panel (a) includes all sources, while in panel (b) we select galaxies whose populations younger than 108 yr contribute less than X8 = 10 per cent to the continuum at λ = 4020 Å. Panel (c) shows the ξ distribution which results from requiring convincing detections of Hβ and [O iii], as well as Hα and [N ii].
The two histograms in Fig. 3(a) differ in the recipe to correct LobsHα for extinction when Hα/Hβ is unreliable. The dotted curve is for no correction, whereas the solid one is for the AnebV(A★V) method discussed above. The curves are identical for ξ > 10 because these are Seyfert and SF galaxies, with Hα and Hβ fluxes good enough to apply the standard correction. In the low ξ peak the two methods differ by just 0.2 dex on the median, as anticipated. Considering the smallness of this difference and that these recipes are likely to bracket the range of possibilities, we henceforth use the average ξ, whose distribution peaks at ξ∼ 1.
Overall, the low ξ population is unbelievably close to the exact prediction for RGs: ξ = 1! However, are these galaxies really retired in the sense of not having formed stars for a long while? To answer this question, Fig. 3(b) shows the histograms obtained selecting only galaxies whose populations younger than 108 yr contributes no more than X8 = 10 per cent of the light at 4020 Å.6 Clearly, the low ξ peak is made up of galaxies which have formed little or no stars in the past 108 yr, in full agreement with the RG scenario of S08.
The width of the RG peak in the ξ distribution (as measured from the one-sided dispersion of its lower half, and assuming that the distribution is symmetric) is σlog ξ∼ 0.24 dex for the full ELG sample or 0.13 dex eliminating those with WHα < 0.5 Å. From the considerations in the above sections, one would hardly expect ξ to be more precise than a factor of 2, so that this observed dispersion can be accounted for by uncertainties in the data and the analysis.
While theory says that RGs should have ξ≤ 1, the observed bimodality in the ξ distribution suggests that ξ < 2 or 3 is a more effective way of separating RGs from the rest of the ELG population (Fig. 3a). About one-third of the full ELG sample has ξ < 2. The reason why this huge population of RGs has received little attention to date is because standard selection criteria tend to exclude them from emission line studies. Fig. 3(c) illustrates how the distribution of ξ changes by imposing a Wλ > 0.5 Å quality control for the four lines in the BPT diagram: Hβ, [O iii], Hα and [N ii]. This reduces the ELG sample as a whole by ∼1/3, but the fraction of ξ < 2 sources decreases by a factor of 3. The bimodality is still present, but the low ξ peak is heavily suppressed, so that RGs are the main victims of this cut. With the WHAN diagram we are able to rescue RGs and weak line galaxies in general, eliminating this bias and thus drawing a much more complete view of ELGs in the local Universe.
4.6 Results: ξ versus [N ii]/Hα
Fig. 4 shows the results of this analysis in a ξ versus [N ii]/Hα diagram, colour coding SF, Seyferts and LINERs according to their location on the WHAN diagram (Fig. 1). The plot shows that SF, Seyferts and LINERs are well separated in terms of ξ and [N ii]/Hα, with SF and Seyferts being responsible for the ξ≫ 1 peak and LINERs for the ξ∼ 1 RG-like population. The plot also illustrates how dangerous it is to define a galaxy as an AGN on the basis of [N ii]/Hα alone. Miller et al. (2003), for instance, define ‘2-lines AGN’ as sources with (similar to the ‘low S/N AGN’ in Brinchmann et al. 2004), while our analysis suggests that most of this population does not require an energetically significant non-stellar source of ionizing photons.
![ELGs in the ξ versus [N ii]/Hα diagram, colour coding PGs, LINERS, Seyferts and SF, as in Fig. 1. Also as in Fig. 1, orange points are sources with either or both of Hα and [N ii] weaker than 0.5 Å. Solid contours are for the full sample V, while the dashed ones are for galaxies with less than X8 = 10 per cent of their flux at λ = 4020 Å due to populations younger than 108 yr (for clarity, only the two outer contours are plotted in this case). Dotted horizontal lines mark a range of ×2 around the prediction of the RG model (ξ = 1).](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/413/3/10.1111/j.1365-2966.2011.18244.x/2/m_mnras0413-1687-f4.jpeg?Expires=1750237534&Signature=dcA9nCmlC0zYjcarLPDWPEF83VJLMhst9iMX1YbqM04ryzb2jhPnjnu49hmzvv6fyQM6aTMPrN3rLemlJ6w1CNOGWscb9RYr-0g~lW3zSNjpsA~9boMBXnEYW2D3SU5C9H5cLjawmNfrI7qAfjDIrykUMH9DjTWpTnACQfpEAHnPsazCZwiGYXOCWuLTKUY4Q0-t7B07i7LcR99qxxEFI-vGaU4EmfLlA2-Pi5upWCosuBnQpwOXs2SensQvlFZMFEaBfXZjPN9KGjMXQjOQXjvGZqZzqG8TvJQPGg1gxmochPeZt77if2b1sasxcB91YOS7Cl8nzw~JjO-pJy-kug__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
ELGs in the ξ versus [N ii]/Hα diagram, colour coding PGs, LINERS, Seyferts and SF, as in Fig. 1. Also as in Fig. 1, orange points are sources with either or both of Hα and [N ii] weaker than 0.5 Å. Solid contours are for the full sample V, while the dashed ones are for galaxies with less than X8 = 10 per cent of their flux at λ = 4020 Å due to populations younger than 108 yr (for clarity, only the two outer contours are plotted in this case). Dotted horizontal lines mark a range of ×2 around the prediction of the RG model (ξ = 1).
The dashed contours in Fig. 4 show the location of ELGs with X8 < 10 per cent. This clarifies that the bulk of the population of large ξ sources among galaxies with a predominantly old stellar population (seen in the middle panel of Fig. 3) is composed of LINERs approaching the Seyferts zone. Further considering that the X8 cut eliminates sources with significant on-going star formation, this strongly indicates that these sources have a non-stellar source powering most of its line emission (i.e. they are AGN in old hosts).
We note in passing a tiny population of galaxies with small ξ (i.e. consistent with our RG model) but located to the left of the S06 line in Fig. 4. These could correspond low-mass, low-metallicity galaxies in a quiescent phase between bursts of star formation, such as the galaxies discussed by Sánchez Almeida et al. (2008).
4.7 ξ versus WHα
An important aspect of Fig. 4 is its striking similarity to the WHAN diagram of Fig. 1. This happens because ξ and WHα are very closely related. The equivalent width of Hα is defined as WHα=LobsHα/CobsHα, where the denominator is the observed continuum luminosity around Hα.7 Hence, extinction corrections aside, WHα and ξ (equation 1) have the same numerator.
Because qH(t, Z) varies little with t or Z after 108 yr (Figs 2b and f), and since most of the mass is always in the old populations (over 99 per cent for our samples), QexpH(t > 108 yr) in equation (3) becomes ∼M★qoldH, where qoldH, the specific ionizing flux due to HOLMES, is of the order of 1041 s−1 M−1. M★ can be written in terms of the intrinsic continuum at Hα and the galaxy’s luminosity weighted mass-to-light ratio: M★=CintHα〈κHα〉L, where κHα=c−1Hα was previously discussed in Section 4.3. While κHα spans nearly three decades for SSPs between 106 and 1010 yr (Fig. 2c), galaxies are not SSPs, and young (low κHα) populations rarely make a substantial contribution to the Hα continuum, so that the luminosity weighted κHα values are of the same order for most sources. For the V sample, for instance, log〈κHα〉L = 4.56 ± 0.14 (median ± semi-interquartile range), while subsamples of SF, Seyferts, LINERs and PGs have 4.35 ± 0.12, 4.47 ± 0.08, 4.65 ± 0.08 and 4.70 ± 0.06, respectively (in units of M⊙ L−1
Å). As a result, LexpHα is directly proportional to the Hα continuum, with little scatter due to variations in the SFHs, and thus the denominators of ξ and WHα essentially measure the same thing.




Fig. 5 shows the observed relation between ξ and WHα. The bottom panel (c) shows the dust corrected ξ value. Points in cyan and orange correspond to galaxies where Hα/Hβ is or is not reliable, respectively. The correlation is very tight, as indicated by the 10 and 90 per cent percentile lines (in dashed). However, the differential extinction correction to ξ creates a hump starting around WHα∼ 2 Å, coinciding with the transition between Hα/Hβ-based corrections and those based on other methods (see Section 4.4).

Relation between ξ and WHα. The bottom panel (c) shows the average ξ value discussed in Section 4.4, while in the middle panel (b) no extinction correction was applied to LHα. Points in cyan (orange) correspond to galaxies where Hα/Hβ is (is not) reliable. The black solid line (panels b and c) shows the median relation, while the dashed lines represent the 10 and 90 per cent percentiles. In all projected histograms the black line represents all ELGs, while the filled region represents those with (i.e. to the right of the S06T line in the WHAN diagram), and the dashed red line indicates PGs for which a (necessarily small and uncertain) Hα flux is available. The arrow in the WHα distribution marks the proposed limit to separate RGs from other ELGs.
Fig. 5(b) shows the same relation assuming that Hα and its continuum are equally affected by dust (AnebV=A★V), so that the last term in equation (4) disappears, and so does the hump in Fig. 5(c). The relation is linear except at high WHα, where it steepens because of the decrease in 〈κHα〉L in galaxies with lots of star formation. A fit through the median curve in its low WHα end yields log ξobs = 1.07log WHα− 0.17. Forcing a linear fit leads to WoldHα = 0.67 Å. For comparison, this value of WoldHα is reached for ages of a few Gyr for the BC03 SSPs, or closer to 10 Gyr for other models (bottom panels in Fig. 2).
4.8 Retired galaxies: an empirical diagnostic
This section started with the promise of finding a diagnostic to identify RGs. The ξ ratio was introduced with this purpose, and we have seen that it is indeed capable of capturing in a single number the essence of RGs. However, there are caveats related to this index. First, uncertainties related to the extinction correction affect ξ at a level of ±0.2 dex. Secondly, and far more important, ξ is explicitly dependent on evolutionary synthesis models, whose predictions for the ionizing fluxes from old stellar populations are still plagued by systematic uncertainties (Section 4.3, see also S08). Hence, while it is clear that RGs should populate the bottom of the ξ-scale, theory alone does not allow us to establish a firm upper limit below which a galaxy can confidently be considered as retired. A last caveat is that the evaluation of ξ requires a stellar population analysis, which is not always available or possible.
Fortunately, there is a simple empirical alternative to circumvent these difficulties.
As seen in Fig. 5(a), and in agreement with previous studies (e.g., Bamford et al. 2008), the distribution of WHα is strongly bimodal, with a low peak centred at WHα = 1 Å, an upper one at 16 Å, and an intermediate minimum in the neighbourhood of 3 Å. This happens both for the sample as a whole and for the subset of AGN-like galaxies, shown by the filled histogram. This bimodality strongly suggests that different mechanisms are at work: photoionization by HOLMES on the low WHα side and by massive stars and/or AGN for large WHα. One can therefore use the observed WHα distribution to set an empirical bound for RGs. Of course, the same can be done for the ξ distribution, but it is clearly advantageous to draw this boundary on the basis of a universally available and model-independent quantity: WHα.
We conducted various experiments to define an optimal boundary to separate the two peaks, all giving results between 3 and 4 Å. We settled for WHα = 3 Å. We thus propose to the following practical definition: RGs are ELGs with WHα < 3 Å.
For WHα < 3 Å, the central black hole may well be active (i.e. accreting), but its ionizing photon output is comparable to or weaker than that produced by HOLMES. For WoldHα values between 1 and 2 Å, which are compatible with current models (bottom panels in Fig. 2), one finds that at our proposed WHα = 3 Å borderline the AGN contributes between ∼1/3 and 2/3 of the ionizing power. Naturally, this fraction decreases even more as WHα decreases, to the point that AGN contribution is negligible for the bulk of the low WHα population. It is therefore not correct to use the emission lines of WHα < 3 Å systems to infer AGN properties.
5 A COMPREHENSIVE CLASSIFICATION OF GALAXIES
We are now able to separate fake AGN (= RGs) from true AGN. In the LINER zone of Fig. 1, true AGN are defined by 3 < WHα < 6 Å. To avoid confusion, we shall call these sources ‘wAGN’. In the interest of a consistent notation, we shall also rename Seyferts as ‘sAGN’. Finally, we remove from the SF category those galaxies consistent with a RG classification, that is to say those having WHα < 3 Å.
Our final classification scheme is shown in Fig. 6. Sample V sources split into percentage fractions of 22, 21, 8, 31 and 18 SF, sAGN, wAGN, RG and PG, respectively. RGs therefore exist in large numbers in the SDSS, in agreement with the basic prediction that such systems are bound to exits as a mere consequence of stellar evolution. AGN, on the other hand, are far less common than one would infer associating all LINERs to non-stellar activity.
![The WHAN diagram with the revised categories: SF, sAGN, wAGN, RGs and PGs. Points with one of [N ii] or Hα weaker than 0.5 Å are plotted in orange, except for PGs, which are plotted in red.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/413/3/10.1111/j.1365-2966.2011.18244.x/2/m_mnras0413-1687-f6.jpeg?Expires=1750237534&Signature=3iY0~OsU3EsVPOtkDGPuMFHPIas5R8iNqMCgZAXLLgGruMLDEb8MizJEeUAXixkBT1vCS3Rp5GAopPoFN8l9W5tFMCL39aEpxwn9orFD24p-bkx7VYZotA9KbWqRkjjM8Q7OWY8DYK4nvbXZuOUSRqFFSqxudAut-ZurlKe3SiHBgFeBm1pUPrBLISPR8yTKu8J24uFn9elLsUMZLqDobGu-btFYiCRSyT5T4gotNqhXV0bwojCkRjh40DoZ~PNyXfEDq8jetp4Mdj6wDpQcMyEgTzFcMtfz5CRn8nIotZIIVxj3Q9KqoD-LegSHlhzYC3exQE5aJA1pi8F9GrnCsg__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
The WHAN diagram with the revised categories: SF, sAGN, wAGN, RGs and PGs. Points with one of [N ii] or Hα weaker than 0.5 Å are plotted in orange, except for PGs, which are plotted in red.
This revised classification scheme has three main virtues.
It identifies the three main ionizing agents in galaxies: young stars, AGN and HOLMES.
It is based on the cheapest and therefore most inclusive diagnostic diagram, the only one able to handle the huge population of weak line galaxies (CF10). In fact, it even allows us to visualize a region occupied by PGs, something which is not possible in conventional intensity ratio diagnostic diagrams (of course, galaxies with no emission lines at all do not appear in this diagram).
The SF/AGN and wAGN/sAGN frontiers are based on optimal transpositions of widely used demarcation lines defined on the basis of traditional diagnostic diagrams, such that the WHAN diagram preserves traditional classification schemes where they work well, while at the same time fixing their inability to deal with weak line galaxies and to distinguish true from fake AGN.
Before exploring the realm of galaxies under the perspective of this new classification scheme, a few clarifications are in order.
First, notice that our SF/AGN borderline corresponds to the transposition of the S06 BPT-based criterion to isolate systems powered exclusively by young stars from those where a harder ionizing source is required. As such, it would have been better to call it a ‘pure-SF’ demarcation line, instead of a ‘SF/AGN’ divisory line. This is all the more true now that we realize that the AGN-like population of ELGs in the BPT is full of RGs whose black holes are not active enough to affect their emission lines.
Secondly, notice that the WHα < 3 Å criterion slightly modifies our idealized definition of RGs as ‘systems whose old stellar populations suffice to explain the observed emission-line properties’ (Section 4.1) to one where HOLMES make at least a substantial contribution to the ionizing field. Such a fuzzy frontier is unavoidable. Our criterion is designed to avoid RGs being misclassified as AGN. On the other hand, some galaxies hosting an AGN, particularly those containing a weakly active nucleus, may drift to the WHα < 3 Å zone if observed within an aperture larger than the SDSS 3 arcsec spectroscopic fibre.
Numerical experiments show that this is indeed likely to happen. Picking the Hα luminosity of a randomly chosen AGN and adding it to the spectrum of a random bona fide RG with WHα within 1 ± 0.5 Å, we find that about one-third of the resulting simulated spectra have WHα < 3 Å, most of which in the 2–3 Å range. We thus expect that a fraction of the systems here classified as RGs turn out to harbour wAGN when looked up more closely. At first sight, this seems to be a drawback of using WHα for spectral classification, but it is not. After all, if an AGN is present and yet WHα < 3 Å, then it is still correct to infer that old stars make a substantial contribution to the observed line emission.
Finally, we note that the division of AGN into strong and weak categories on the basis of WHα is also to some extent aperture dependent, since sAGN may be diluted to wAGN as more stellar continuum is sampled. This, however, is a minor issue, since both classes are clearly AGN-dominated systems, so there is no risk of misidentifying the dominant ionizing agent.
6 THE STAR FORMATION HISTORIES WITHIN OUR CLASSIFICATION SCHEME
A popular approach to digest the flood of information from mega surveys like the SDSS is to compare the properties of galaxies in different classes (SF versus AGN, Seyfert versus LINER, early versus late type, field versus cluster etc.), in search of clues which may help understanding why galaxies are the way they are (see Blanton & Moustakas 2009 for examples of studies following this general strategy). Taxonomy, of course, plays a central role in such comparative studies. In this section we present yet another comparative study, but this time following the new taxonomical framework proposed in the previous section and summarized in Fig. 6. The main novelty in this scheme is the introduction of the RG category, whose members were previously misclassified as AGN. A major goal of this section is to place RGs in the context of the galaxy population in general, which was never done before. On the one hand, we want to compare RGs with other ELGs, especially wAGN, with which RGs were so far confused. It is also interesting to compare wAGN to sAGN, since this division was merely introduced for the sake of ‘backwards compatibility’ with the historical Seyfert/LINER subdivision of AGNs. On the other hand, it is equally important to compare RGs and PGs. According to the RG concept, galaxies without emission lines should not exist, since HOLMES provide a minimum floor ionizing field capable of powering emission lines. Seen from this perspective, the existence of PGs raises questions like: Are they truly lineless systems or is it just an issue of detectability? Perhaps PGs lack the warm/cold ISM to reprocess ionizing photons?
The starlight–SDSS data base offers plenty of material for this comparative study. In what follows we use a suite of physical properties as well as the detailed SFHs to address the issues raised above. Only the volume-limited sample is used from here on.
An instructive way to start this comparative study is to investigate how SFHs vary as a function of the SF/sAGN/wAGN/RG/PG spectral types defined in Fig. 6. This is done in Fig. 7 where we chopped the WHAN diagram in 0.2 × 0.3 dex wide bins in [N ii]/Hα and WHα. In each box we plot the time-dependent median of the SSFR for galaxies in the box, colour coding the curves according to the Fig. 6 emission-line taxonomy. The SSFR(t) functions were smoothed exactly as in Asari et al. (2007), which should be consulted for details on how this and other representations of starlight-based SFHs are derived.

SFHs as a function of spectral class and the location in the WHAN diagram. Each box shows the median SSFR(t) for galaxies of different types, coded by different colours. The scale of these plots is shown in the inset. Curves are only drawn for boxes containing over 200 galaxies. Only galaxies in the volume-limited sample are used in this plot. Contours correspond to 0.1 and 0.6 of the peak density.
Because this is our classification diagram, most boxes contain only one curve, with exceptions at locations straddling the nominal frontier between RGs and wAGN (at WHα = 3 Å) and wAGN/sAGN (WHα = 6 Å). Although there are strong systematic changes in SFHs across this plane, in these borderline cases one sees little difference between the SFHs of two adjacent classes.
Several trends are visible in Fig. 7. For instance, focusing on the column centred at , one sees that both PGs and RGs have SFHs concentrated in the largest ages, while, moving upwards the retrieved SFHs reveal increasing levels of recent star formation. Because of the large apertures covered by the SDSS fibres, most of the Hα emission in these AGN probably comes from star formation rather than the AGN (e.g. Kauffmann et al. 2003), so the increase in recent SSFR with WHα is expected. However, the effect of the AGN is present, as can be inferred by the decreasing levels of recent SSFR towards the right for fixed WHα (best visible in the rows centred at WHα∼ 10 Å).
Before we proceed, a parenthesis must be opened to clarify the meaning of the small upturn in SSFR(t) at t≲ 107 yr for RGs and PGs. This feature is almost certainly an artefact of incomplete modelling of the horizontal branch phase (Koleva et al. 2008; Cid Fernandes & González Delgado 2010; Ocvirk 2010). This effect was quantified by Ocvirk (2010), who estimates that these ‘fake young bursts’ account for up to 12 per cent of the recovered optical light fractions in spectral synthesis of genuinely old stellar populations. In units of the SSFR, this corresponds to a few times 10−12 yr−1, which is precisely the level at which upturns at small ages are seen in Fig. 7. We further find a clear trend for this artefact to be related to the stellar metallicity, with the low t hump increasing for decreasing , in agreement with the fact that bluer horizontal branches occur for lower Z★ (Harris 1996; see also Fig. 8 for indirect evidence of this trend). Finally, if the upturn were really due to young stars, WHα should be much larger than observed. Including these fake young bursts in the predictions for the ionizing flux leads to (in the median) expected WHα values over three times as large as observed for RGs. The overprediction for PGs is even worse, with a median expected WHα of 3.2 Å, while their definition requires WHα < 0.5 Å.

As Fig. 7, but in the plane defined by the mean stellar surface density and stellar mass. μ★ is computed dividing half the stellar mass by the radius which contains half the z-band light.
Credible levels of recent star formation only appear for boxes centred at WHα≥ 3 Å in Fig. 7, coinciding with the RG/wAGN transition. This suggests that all AGN are associated to at least some level of ongoing star formation. While this is well known and documented for sAGN (the so-called ‘starburst–AGN connection’) the situation is much less clear for wAGN (e.g. González Delgado et al. 2004 and references therein). It is difficult to compare this finding with those in previous studies because of varying definitions of wAGN, but we can confidently say that the signatures of star formation in wAGN identified in this work would be wiped out without removing the RGs from the AGN category.
We now investigate how SFHs vary in terms of properties not explicitly involved in the definition of our spectral classes. Fig. 8 shows SFHs in a stellar mass (M★) versus mass density (μ★) diagram. This allows us to compare the SFHs of galaxies which are similar in terms of global properties, but differ in terms of emission-line properties, since this time each box can contain galaxies of different classes. Again, several trends are evident to the eye. One of them is the signature of downsizing, here manifested by the decrease in recent SSFR levels as one moves up the mass scale, seen, for instance, in the curves for SF galaxies in the first two columns of boxes. A similar behaviour is also seen for RGs and PGs,8 but shifted in time to t∼ 109 yr.
For this study, the more relevant trend in Fig. 8 is that in every box the lines corresponding to our different classes appear in the same order, with recent star formation decreasing from SF galaxies to sAGN, wAGN, RGs and PGs. These last two, in fact, have median SFHs which are very similar and sometimes indistinguishable. Both types retired from forming stars over 108 yr ago. The plot also shows that the recent SSFR of wAGN never reaches levels as low as those of RGs and PGs. Despite their generally low levels of star formation compared to sAGN of same M★ and μ★, wAGN are not retired, reinforcing the idea that star formation and AGN activity are connected even in the weakest cases, at least in a statistical sense.
Fig. 9 shows another example. The plot shows the observational colour–magnitude diagram, with the familiar red sequence, blue cloud and green valley pattern. This diagram is meant to provide an empirical representation of the age–mass relation. As expected, systems in the blue cloud are more actively forming stars nowadays than in their past average, while red sequence galaxies (with the exception of AGN) tend to be retired. Again, the SF to PG sequence in levels of recent SSFR and the similarity between RGs and PGs seen in Figs 7 and 8 is present in every box.

In summary, the extended emission-line classification scheme proposed here bears an excellent correspondence with the SFHs of galaxies. RGs and PGs have very similar SFHs for similar global properties. wAGN are more similar to sAGN in terms of SFHs than to RGs or PGs. Traditional classification schemes which ignore RGs, would, on the contrary, find wAGN to be more like PGs.
7 DISTRIBUTIONS OF GLOBAL GALAXY PROPERTIES
Comparing the distributions of global properties of different galaxy classes can provide a further test of the pertinence of our classification scheme. This is done in Fig. 10 that shows normalized histograms of PG, RG, wAGN, sAGN and SF galaxies in terms of a series of properties related to their structure, stellar populations, gas and dust content.

Normalized histograms of properties for different galaxy classes. Colours code the SF, sAGN, wAGN, RG and PG classes as in Fig. 6. Horizontal bars trace the 16 to 84 percentile ranges, with the median marked by a circle, with the vertical shifts ordered as SF (top), sAGN, wAGN, RG and PG (bottom). All galaxies from our volume-limited sample are included, except for panel (l), for which a reliable Hα/Hβ was required, hence excluding all PGs and most RGs.
A striking feature of these histograms is that for most properties PG and RG have similar distributions, reinforcing our interpretation that PG and RG are essentially the same thing, or at least that these classes overlap heavily. On the other hand, the distributions for wAGN are more similar to the ones of sAGN, suggesting that these galaxies belong to the same family. This is a salutary a posteriori validation of our splitting of LINERs into two very different classes: RGs and wAGN.
Except for SF galaxies, all other galaxy types have masses within roughly ±0.5 dex of 1011 M⊙ (Fig. 10a). This is mostly due to the luminosity cut for the V sample, which imposes a mass completeness limit of ∼1010.8 M⊙. In the median, MsAGN★ < MwAGN★∼MPG★ < MRG★. In terms of surface densities and velocity dispersions (Figs 10b and c), PGs and RGs are statistically similar, occupying the high end of the distributions. Despite the substantial overlap, it is clear that wAGN and sAGN tend to have smaller σ★ and μ★ than RGs and PGs. The same is true for the escape velocity, computed from (GM★/R)1/2 with R given by the half-light z-band radius (Fig. 10d). PGs and RGs also differ from AGN in terms of structural properties like the concentration index (CI; Fig. 10e) and the SDSS deVAB_r index (Fig. 10f). Inspection of SDSS images corroborates the similarities between PGs and RGs, most of which are ellipticals, although some of the RGs turn out to be spirals (as can be deduced by the tail towards low CI in Fig. 10e), including an apparently higher than normal rate of edge-on systems. An investigation of this issue is beyond the scope of this paper, but we note that this ‘contaminant’ population tends to have WHα close to the RG/wAGN frontier, and thus may be related to the unavoidable fuzziness of classification schemes.
In terms of luminosity-weighted mean stellar ages (Fig. 10g), sAGN are younger than wAGN, which in turn are younger than RGs and PGs. From Fig. 7 we infer that the tail of RGs towards smaller mean ages is due to systems at the RG/wAGN border. The mean stellar metallicities (Fig. 10h) are similar for all types, with a tail of sAGN reaching lower values.
The distributions of stellar extinctions (Fig. 10i) show a distinct PG < RG < wAGN < sAGN pattern. To double-check this result, Fig. 10(l) shows the distribution of Hα/Hβ, but only for sources where this ratio is reliable, a criterion which excludes PGs and most RGs. Despite its incompleteness, Fig. 10(l) confirms that wAGN have higher extinction than RGs and lower than sAGN. The class with the lowest extinction is the SF class, in spite of the intense star formation which is usually associated with high extinction. The underlying reason for a smaller extinction in SF than the rest of galaxies lies in their lower masses (see Stasińska et al. 2004; Garn & Best 2010). Regarding the PG → RG → wAGN → sAGN extinction sequence, its interpretation is not straightforward, since it involves the dust mass and distribution, as well as its composition (dust grains recently produced by the progenitors of HOLMES and expected to be found in RGs do not have the same extinction properties as grains processed in the interstellar medium and associated with regions of star formation).
Fig. 10(j) shows the distribution of ξ uncorrected for dust. The PG → RG → wAGN → sAGN → SF sequence reflects the central role of WHα in our classification scheme. This sequence is mirrored in the distribution of observed Hα luminosities (Fig. 10k). PGs included in these panels have, by definition, very weak Hα (median WHα = 0.23 Å, and median S/NHα = 2.4 counting only those with an Hα flux in our data base). Still, their locations in these diagrams are consistent with them not being completely lineless.
8 DISCUSSION AND SUMMARY
The goal of this paper was to provide a comprehensive classification of galaxies according to their emission-line properties which would be able to cope with the large population of weak line galaxies that do not appear in traditional diagrams (e.g. BPT) because they lack some of the diagnostic lines. This is possible using the WHAN diagram (WHα versus [N ii]/Hα) that we proposed in earlier work (CF10). An additional problem with the BPT diagram and other similar line-ratio diagrams is that, as shown by S08, the LINER region contains a mix of two completely different families of galaxies: galaxies that contain a wAGN and ‘RGs’, i.e. galaxies that have stopped forming stars and whose emission lines are powered by their HOLMES. A truly comprehensive classification scheme must account for this phenomenon, and the WHAN diagram allows so.
To distinguish true AGN from fake ones (i.e. the RGs), we have shown that a useful criterion is the value of ξ, which measures the intrinsic extinction-corrected Hα luminosity in units of the Hα luminosity expected from stellar populations older than 108 yr uncovered by our stellar population analysis software starlight. In principle, ξ should be equal to 1 if the galaxy absorbs all the ionizing photons provided by its HOLMES, and >1 if any other source of ionization is present. We have shown that this index is bimodally distributed, with a ξ≫ 1 population composed by galaxies undergoing vigorous star formation and/or nuclear activity, and a population of galaxies centred right at the predicted value for RGs. Computing ξ for each galaxy, however, is not a trivial task: it requires a stellar population analysis able to identify the populations of HOLMES and a recipe to obtain the ionizing radiation field from these HOLMES. The latter strongly depends on how the (yet quite uncertain) evolutionary tracks for post-asymptotic giant branch stars are incorporated into the stellar population codes as well as on other issues such as the IMF and the metallicities of the stellar populations.
It is therefore preferable to base the distinction between true and fake AGN on the Hα equivalent width, an easy to obtain observational parameter which provides an excellent proxy for ξ. In view of the strong dichotomy in the observed WHα distribution for galaxies in the right wing of the BPT diagram (i.e. the wing previously supposed to contain only AGN), we propose a practical separation between wAGN and RGs based on a class frontier at WHα = 3 Å. This physically inspired but data-guided separation corresponds to the minimum between the two modes in the WHα distribution.
We thus identify five classes of galaxies in the WHAN diagram:
SF: pure star-forming galaxies: log[N ii]/Hα < −0.4 and WHα > 3 Å;
sAGN: strong AGN (i.e. Seyferts): log[N ii]/Hα > −0.4 and WHα > 6 Å;
wAGN: weak AGN: log[N ii]/Hα > −0.4 and 3 < WHα < 6 Å;
RG: retired galaxies (i.e. fake AGN): WHα < 3 Å;
PG: passive galaxies (actually, lineless galaxies): WHα and W[N ii] < 0.5 Å.
Note that the border between RGs and PGs is somewhat arbitrary, but this is not really a concern.
We have then analysed the SFHs of the different classes of galaxies, as well as the distribution of a dozen of physical or observational properties, such as the galaxy stellar mass M★, surface densities μ★, velocity dispersions σ★ and stellar extinctions A★V obtained through starlight. All these comparisons corroborate our proposed differentiation between RGs and wAGN in the LINER-like family. We also find that wAGN have properties overlapping with those of sAGN, while RGs line up with PGs. In other words, wAGN are the low-activity end of the AGN family, while RGs are PGs with emission lines. As a matter of fact, both RGs and PGs are ‘retired’ or ‘passive’ galaxies, the real spectroscopic difference between them being the presence or absence of emission lines. It is only the inherited custom which imposed the adopted nomenclature.
Now that these emission-line classes have been established, further work could be to understand what is at the root of the strength of the AGN phenomenon or why some PGs present emission lines and others do not.
In the remaining of this paper, [O iii]λ5007 and [N ii]λ6584 will be denoted [O iii] and [N ii], respectively.
Our WHAN diagram is, in a way, very similar to the W[N ii] versus [N ii]/Hα diagram of Coziol et al. (1998). However, the WHAN diagram is easier to interpret, since it concerns two physically independent quantities, WHα measuring the amount of ionizing photons absorbed by the gas relative to the stellar mass, while [N ii]/Hα is a function of the nitrogen abundance, of the ionization state and temperature of the gas.
In those papers, LINERs should have rather been called ‘galaxies with LINER-like spectra’, given that LINER is the abbreviation of ‘low ionization nuclear emission regions’ (as originally introduced by Heckman 1980), and the SDSS spectra cover much more than the nuclear regions of galaxies.
Our line fitting code, described in Mateus et al. (2006), only accepts fits with Wλ≥ 0.1 Å, which produces the empty triangle in the bottom left part of the plot.
For the Cardelli, Clayton & Mathis (1989) law used in our analysis, AnebV = 7.215 log[(Hα/Hβ)obs/(Hα/Hβ)int]. We adopt an intrinsic Balmer decrement of 3, suitable for LINERs (S08).
To have an idea of what this means, X8 < 10 per cent corresponds to specific star formation rates (SSFRs) ≲2.5 × 10−12 yr−1, or, equivalently, to less than 0.025 per cent of the stellar mass having formed in the past 108 yr. In fact, for light levels as low as X8 = 10 per cent the spectral synthesis analysis is affected both by its own limitations and by side-effects of incompleteness in evolutionary synthesis models, such that the true contribution from young stars cannot be reliably estimated (see Section 6).
CHαis not to be confused with the continuum luminosity per unit mass, cHα, introduced in Section 4.3. We use capital letters for extensive quantities (like CHα and QH), and lowercase for specific ones (cHα and qH).
Recall that the ‘mini bursts’ at t≲ 107 yr are not real, but artefacts of spectral fits with an incomplete base (Ocvirk 2010). The fact that these fake bursts correlate with galaxy mass, as seen in Fig. 8, is likely an indirect consequence of the connection between horizontal branch morphology and Z★ coupled to the M★–Z★ relation.
The starlight project is supported by the Brazilian agencies CNPq, CAPES, by the France–Brazil CAPES–COFECUB programme, and by Observatoire de Paris.
Funding for the SDSS and SDSS-II has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, the US Department of Energy, the National Aeronautics and Space Administration, the Japanese Monbukagakusho, the Max Planck Society and the Higher Education Funding Council for England. The SDSS website is http://www.sdss.org/.
The SDSS is managed by the Astrophysical Research Consortium for the Participating Institutions. The Participating Institutions are the American Museum of Natural History, Astrophysical Institute Potsdam, University of Basel, University of Cambridge, Case Western Reserve University, University of Chicago, Drexel University, Fermilab, the Institute for Advanced Study, the Japan Participation Group, Johns Hopkins University, the Joint Institute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysics and Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences (LAMOST), Los Alamos National Laboratory, the Max-Planck-Institute for Astronomy (MPIA), the Max-Planck-Institute for Astrophysics (MPA), New Mexico State University, Ohio State University, University of Pittsburgh, University of Portsmouth, Princeton University, the United States Naval Observatory and the University of Washington.
REFERENCES