Logistic regression to boost exoplanet detection performances

2.2.1 Step (1): generation of the training data set: planet injections and noise maps

A data cube is used to generate a number of SNR maps (see Fig. 1) that contain either only noise or the signals of previously injected fake giant planets of various masses. The objective is to provide guaranteed positive and negative samples for the machine-learning algorithm.

2.2.1.1 Noise maps

There are two ways to create a noise-only map from the data set. The first is to invert the direction of rotation: because ADI-based techniques rely on the field rotation, inverting its direction destroys any astrophysical signal and leaves only noise. The second is to perform a temporal shuffle of the frames.
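The two options can be sketched as follows. This is only an illustration, assuming the cube is a list of frames with one parallactic angle per frame; the function names and the toy values are hypothetical and are not taken from the PACO pipeline.

```python
import random


def reversed_rotation(angles):
    """Negate the parallactic angles so that derotation runs the wrong way."""
    return [-a for a in angles]


def temporal_shuffle(frames, seed=0):
    """Return the frames in a random temporal order (angles keep their order)."""
    rng = random.Random(seed)
    shuffled = list(frames)
    rng.shuffle(shuffled)
    return shuffled


# Toy sequence: four frames and their parallactic angles in degrees.
angles = [0.0, 5.0, 10.0, 15.0]
frames = ["frame0", "frame1", "frame2", "frame3"]

noise_angles = reversed_rotation(angles)
noise_frames = temporal_shuffle(frames, seed=1)
```

In both cases a point source no longer adds up coherently after derotation, which is why the resulting map is treated as noise only.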

2.2.1.2 Injection of fake planets

Fake planets (FPs) with given masses (expressed in Jupiter masses, |${M}_{\text{jup}}$|) are randomly injected in the reduced and centred data cubes provided by the SPHERE data centre (Delorme et al. 2017; Chomez et al. 2023b), taking into account the inverted direction of rotation (see above). The injected masses range from 1 to 5 |${M}_{\text{jup}}$|. A minimum distance between the injections is enforced to avoid any stacked signal. To convert masses into contrasts in the H2 and H3 bands, we use the COND evolution models (Allard et al. 2001) and assume that the planet is coeval with its parent star. Those data cubes are then processed by PACO using ADI and ASDI.
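A minimal sketch of how random injection positions with an enforced minimum separation could be drawn is given below. It uses simple rejection sampling; the field size (1024 pixels), the minimum distance (40 pixels), and the function name are illustrative assumptions, not values from this work, and the conversion of mass into contrast with the COND models is not shown.

```python
import math
import random


def draw_injections(n, size, min_dist, masses=(1, 2, 3, 4, 5), seed=0,
                    max_tries=10000):
    """Draw up to n (x, y, mass) injections separated by at least min_dist pixels."""
    rng = random.Random(seed)
    injections = []
    tries = 0
    while len(injections) < n and tries < max_tries:
        tries += 1
        x, y = rng.uniform(0, size), rng.uniform(0, size)
        # Reject the candidate if it is too close to an accepted injection.
        if all(math.dist((x, y), (px, py)) >= min_dist
               for px, py, _ in injections):
            injections.append((x, y, rng.choice(masses)))
    return injections


injections = draw_injections(n=10, size=1024, min_dist=40)
```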

Each SNR map generated from an original image |$\mathcal {I}$| (made of two SNR maps, in H2 and H3) gives a number of sub-images (stamps) that are known either to contain or not to contain an object (an injected planet/companion). The size of the stamps, 19 × 19 pixels, is large enough to give the context needed to detect an astrophysical point source (field companion or exoplanet).

2.2.2 Step (2): negative stamps from the noise maps

The extraction of the stamps is performed using the H2 SNR map. To reduce the number of stamps and centre them on SNR peaks, we apply edge detection techniques (Sobel’s filter, Duda & Hart 1973) and compute a gradient map of the SNR map. More precisely, each pixel of such a map gives an approximation of the norm of the gradient of the corresponding SNR function. An example of such a gradient map can be seen in Fig. 2.
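As an illustration of the gradient map, the sketch below applies the two 3 × 3 Sobel kernels to a small SNR map stored as a list of lists and returns the norm of the resulting gradient. It is a plain-Python sketch for clarity (border pixels are simply left at zero, which is an assumption); an array library would normally be used on full-size maps.

```python
import math

SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def gradient_map(snr):
    """Approximate the gradient norm of an SNR map with Sobel kernels."""
    h, w = len(snr), len(snr[0])
    grad = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = gy = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    v = snr[y + dy][x + dx]
                    gx += SOBEL_X[dy + 1][dx + 1] * v
                    gy += SOBEL_Y[dy + 1][dx + 1] * v
            grad[y][x] = math.hypot(gx, gy)
    return grad


# Toy SNR map: a single bright pixel in the middle of a flat background.
snr = [[0.0] * 5 for _ in range(5)]
snr[2][2] = 4.0
grad = gradient_map(snr)
```

On this toy map the gradient vanishes on the peak itself and is largest on the pixels surrounding it, which is the ring-like pattern that the edge detection relies on.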

Figure 2.

An SNR map (left panel) and the corresponding gradient map (right panel).

A pixel u of an edge is identified as a pixel with a gradient value |$g_u$| greater than or equal to a threshold (denoted t in Algorithm 1). Consider the graph of edges whose vertices are the pixels u such that |$g_u \ge t$| (edge pixels), two such vertices being connected if they are adjacent pixels in the image. More precisely, consider the graph |$G=(V, E)$| with |$V = \lbrace u | g_u \ge t, u \in \mathcal {I}\rbrace$| and |$E=\lbrace (u,v) | u \in V, v \in V, u \textrm {~and~} v \textrm {~are~adjacent}\rbrace$|. An area of high SNR in the original image is related to a connected component of G. These components might take a variety of shapes and, for the sake of simplicity, we consider the smallest rectangular box containing the component as the relevant area.
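The construction above can be sketched with a breadth-first search over the edge pixels. The sketch assumes that "adjacent" means 4-connectivity (sharing a side), which the text does not specify, and the function name is hypothetical; it returns one bounding box (ymin, xmin, ymax, xmax) per connected component of G.

```python
from collections import deque


def bounding_boxes(grad, t):
    """Bounding boxes of the connected components of pixels with grad >= t."""
    h, w = len(grad), len(grad[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y0 in range(h):
        for x0 in range(w):
            if seen[y0][x0] or grad[y0][x0] < t:
                continue
            # New component: explore it and track its extent.
            seen[y0][x0] = True
            queue = deque([(y0, x0)])
            ymin = ymax = y0
            xmin = xmax = x0
            while queue:
                y, x = queue.popleft()
                ymin, ymax = min(ymin, y), max(ymax, y)
                xmin, xmax = min(xmin, x), max(xmax, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and not seen[ny][nx] and grad[ny][nx] >= t):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((ymin, xmin, ymax, xmax))
    return boxes


# Toy gradient map with two separate groups of edge pixels.
grad = [
    [0, 0, 0, 0, 0, 0],
    [0, 5, 6, 0, 0, 0],
    [0, 0, 7, 0, 0, 9],
    [0, 0, 0, 0, 0, 8],
]
boxes = bounding_boxes(grad, t=5)
```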

Algorithm 1

Stamps generation algorithm.

The value of t controls the number of components found, and Algorithm 1 performs a dichotomic (bisection) search on t to reach a given average size of the boxes identifying the areas of interest. In practice, we set this target size to 12 pixels. Note that the stamps could also be defined from the H3 SNR map, which might contain SNR peaks located differently. The final set of stamps would differ and might be more complete, but this remains to be investigated.
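The search on t can be illustrated by a generic bisection. The sketch below is not Algorithm 1 itself: it assumes that the average box size does not increase when t increases (which need not hold strictly on real maps), and the tolerance, the bounds, and the toy size function are invented for the example.

```python
def tune_threshold(mean_box_size, target, lo, hi, tol=0.5, max_iter=50):
    """Bisection on t, assuming mean_box_size(t) does not increase with t."""
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        size = mean_box_size(mid)
        if abs(size - target) <= tol:
            return mid
        if size > target:
            lo = mid  # boxes too large on average: raise the threshold
        else:
            hi = mid  # boxes too small on average: lower the threshold
    return 0.5 * (lo + hi)


def toy_size(t):
    """Toy stand-in for the average box size measured on a gradient map."""
    return 40.0 / (1.0 + t)


t = tune_threshold(toy_size, target=12.0, lo=0.0, hi=10.0)
```

In a real setting, `mean_box_size` would recompute the connected components of the gradient map for each trial value of t.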

2.2.3 Step (3): training

A simple classifier based on logistic regression is trained on the collection of stamps whose class is known at this stage. We outline two key points. First, the classifier is dedicated to a specific original image: the stamps generated for learning are based on injections into, and randomization of, the original data cube on which the classifier is finally used, so that it can take into account the peculiarities of this data set (e.g. weather conditions). Secondly, the learning is not performed on the whole image. Instead, a limited number of meaningful features are computed for each stamp, an approach typical of classical machine learning as opposed to deep learning. A dozen features are computed; they are detailed in the Appendix. These features relate to high-level descriptive statistics of the stamp, such as the mean SNR value (MeanSnr) or the mean, maximum, and standard deviation of the gradient approximation of the stamp (MeanGra, MaxGra, and StdevGra), but also to physical phenomena. A key feature targets the pattern that an Airy figure can leave on the SNR map (AiryFig). Another one (MeanSpec) tries to quantify whether the nine central pixels of a stamp indicate that the SNR signal is due to a speckle. Note that the features are not real physical models of the phenomena but simple numerical quantities that are correlated with their presence. An analysis of these correlations is presented in Section 3.3. Most of the features are computed on the H2 SNR part of the stamp, but some are also computed on the H3 signal (typically the features dealing with the presence of a speckle). Finally, the logistic regression is trained using cross-validation.
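To make the training step concrete, here is a minimal sketch of a logistic regression fitted by batch gradient descent on two features per stamp. Everything in it is an assumption made for illustration: the feature values are invented and standardized, the learning rate and number of epochs are arbitrary, and cross-validation is omitted. It is not the implementation used in this work.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(w, b, x):
    """Probability that the stamp with feature vector x contains an object."""
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def train_logistic(X, y, lr=0.1, epochs=2000):
    """Fit the weights w and bias b by gradient descent on the log-loss."""
    n, n_feat = len(X), len(X[0])
    w, b = [0.0] * n_feat, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_feat, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi
            for j in range(n_feat):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Invented, standardized values of two features for six stamps.
X = [[1.2, 0.9], [1.6, 1.3], [0.8, 1.1],
     [-1.1, -0.8], [-0.9, -1.4], [-1.4, -1.0]]
y = [1, 1, 1, 0, 0, 0]  # 1 = injected planet, 0 = noise

w, b = train_logistic(X, y)
```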

2.2.4 Step (4): usage

The classifier is then used only on stamps extracted from the original image whose centre is a pixel with an SNR (in H2) greater than 2. We chose this value because, below this threshold in H2, a signal would currently not be considered as a potential candidate by a user (an astrophysicist) in any case. A more elaborate criterion could be used, for instance one based on the SNR values at multiple wavelengths. In principle, any pixel can be considered, and this step is not computationally expensive. The features of the stamp centred on that pixel are computed and, if the value of the logistic regression function of the classifier is above 0.5, the stamp is kept as a candidate: it is considered to belong to the class of objects. The probability threshold of 0.5 can be tuned, as discussed in Section 3.7.
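The decision rule of the usage step can be written in a few lines. The sketch assumes that the classifier is represented by a weight vector and a bias; the weights, the bias, and the two-feature vectors below are invented for the example.

```python
import math


def keep_as_candidate(centre_snr_h2, features, weights, bias,
                      snr_min=2.0, p_min=0.5):
    """Apply the SNR cut, then the logistic regression decision."""
    if centre_snr_h2 <= snr_min:
        return False  # the stamp is not submitted to the classifier
    z = bias + sum(w * f for w, f in zip(weights, features))
    probability = 1.0 / (1.0 + math.exp(-z))
    return probability > p_min


# Invented classifier and stamps.
weights, bias = [1.0, 0.5], -4.0
kept = keep_as_candidate(3.1, [3.0, 2.5], weights, bias)
rejected = keep_as_candidate(2.6, [2.5, 1.0], weights, bias)
skipped = keep_as_candidate(1.8, [3.0, 2.5], weights, bias)
```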

2.2.5 Step (5): usage – clustering of candidates

Since stamps centred on any pixel are potentially submitted to the classifier, adjacent pixels are likely to be classified similarly. As a result, candidates form tight clusters of overlapping stamps, and two closely overlapping stamps most likely correspond to the same object. We therefore present the classification result as clusters of candidates.

The distance between two stamps is defined as the number of distinct (non-shared) pixels: the smaller this distance, the more the stamps overlap. An agglomerative clustering algorithm (Müllner 2011) merges a pair of clusters if the minimum distance between their stamps is lower than a threshold. In practice, this threshold is set to 55.5 per cent of the pixels, so two stamps of the same cluster overlap over at least 55.5 per cent (200 pixels for 19 × 19 stamps).
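A possible reading of this clustering is sketched below. It assumes that the distance between two stamps is the number of pixels of one stamp that are not shared with the other (361 minus the overlap for 19 × 19 stamps) and that two stamps are linked when they share at least 200 pixels; single-linkage merging at a fixed threshold is then equivalent to finding connected groups, done here with a union-find structure rather than with the algorithm of Müllner (2011). The stamp centres are invented.

```python
STAMP = 19  # stamp side in pixels


def overlap(a, b, size=STAMP):
    """Number of pixels shared by two stamps centred on a and b (y, x)."""
    dy, dx = abs(a[0] - b[0]), abs(a[1] - b[1])
    return max(0, size - dy) * max(0, size - dx)


def distance(a, b, size=STAMP):
    """Number of pixels of one stamp that are not shared with the other."""
    return size * size - overlap(a, b, size)


def cluster(centres, min_overlap=200):
    """Group stamps linked by chains of pairs sharing >= min_overlap pixels."""
    parent = list(range(len(centres)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(centres)):
        for j in range(i + 1, len(centres)):
            if overlap(centres[i], centres[j]) >= min_overlap:
                parent[find(i)] = find(j)
    groups = {}
    for i, centre in enumerate(centres):
        groups.setdefault(find(i), []).append(centre)
    return list(groups.values())


centres = [(50, 50), (51, 52), (53, 55), (120, 80)]
clusters = cluster(centres)
```

On this toy input, the first three stamps end up in one cluster and the isolated one forms its own.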

3 TESTS

3.1 Data sets and terminology

We used four IRDIS data sets obtained on HD 108767B, HIP 1993, HIP 12394, and HIP 107345. These stars were chosen because they were also used in a blind test performed within the SPHERE SHINE consortium to compare the merits of different algorithms (see below). They are representative of different atmospheric conditions. Table 1 summarizes the properties of these stars and Table 2 provides the observing conditions.

Table 1.

Targets used for the tests.

Star name	RA (J2000)	Dec. (J2000)	Spectral type	R_mag	H_mag	Age\|$^a$\| (Myr)
HD108767B	12 29 50.8908	−16 31 15.2081	K 1	8.2	6.3	\|$180^{+170}_{-80}$\|
HIP1993	00 25 14.6618	−61 30 48.2527	M0V	10.7	7.9	\|$45^{+5}_{-10}$\|
HIP12394	02 39 35.3612	−68 16 01.0103	B8V	4.1	4.4	\|$45^{+5}_{-10}$\|
HIP107345	21 44 30.1227	−60 58 38.8946	M0	10.5	8.1	\|$45^{+5}_{-10}$\|

|$^a$|Age values extracted from Desidera et al. (2021).

Table 2.

Test targets observation logs.

Star	Observation date	Filter	DIT (s) \|$\times$\| N_frame	\|$\Delta$\|PA (\|$^\circ$\|)\|$^a$\|	Seeing (arcsec)\|$^b$\|	Airmass\|$^b$\|	\|$\tau _0$\| (ms)\|$^{a,b}$\|	Program ID
HD108767B	2018-01-24	DB_H23	64 × 72	94.4	0.59	1.02	8.3	1100.C-0481(D)
HIP1993	2015-11-28	DB_H23	64 × 64	25.8	1.57	1.26	7.2	096.C-0241(B)
HIP12394	2016-09-15	DB_H23	32 × 160	29.1	0.42	1.38	9.2	097.C-0865(D)
HIP107345	2015-07-04	DB_H23	64 × 64	26.0	1.07	1.25	2	095.C-0298(C)

|$^b$|Values extracted from the updated DIMM info and averaged over the sequence.

The four data sets were used to generate true positive (TP) stamps (by planet injection) and true negative (TN) stamps for the learning step (by processing, with Algorithm 1, a map of pure noise generated from the original image). The blind tests consist of performing a number of injections in the original image and submitting the resulting modified data set to the classifier. Additional unknown exoplanets might also be present, since the original image is used for the test.

In the following, we use this notation and terminology: #U is the number of stamps extracted from the original image and considered at the Usage step. The classifier is used only on stamps with an SNR greater than 2 in H2, and this number is reported in columns #U|$_{{\rm SNR}\ge 2}$|. #TP and #TN refer to the numbers of stamps that are known to contain an injected planet or noise, i.e. TPs or TNs. Finally, when considering the results obtained with our algorithm, we denote by #TPF the number of TPs found and by #C the number of remaining candidates proposed by the algorithm. These candidates are likely false positives, but since the usage is done on the original image, they could also be real, so far undetected, exoplanets. Therefore, we refer to them as candidates.

The first columns of Tables 3 and 4 (under Learning) report the numbers of positive (#TP) and negative (#TN) stamps obtained with our methodology and used to train the classifier. The remaining columns give an overview of the number of stamps generated for the four blind tests (BT1, BT2, BT3, and BT4) detailed below. Note that some injections might lead to SNR values below 2 and thus would not be submitted to the classifier. The results are therefore given only for injected planets giving an SNR greater than 2 in H2 (the exact numbers of such injections are reported in the columns titled #TP for each blind test).

Table 3.

Number of positive (planet’s injection) and negative stamps (extracted from a pure noise map generated with Algorithm 1).

Star name	Learning			BT1		BT2		BT3
	#TP	#TN	#TN\|$_{{\rm SNR}\ge 2}$\|	#TP	#U\|$_{{\rm SNR}\ge 2}$\|	#TP	#U\|$_{{\rm SNR}\ge 2}$\|	#TP	#U\|$_{{\rm SNR}\ge 2}$\|
All	1026	77 565	8180	29	14 691	140	19 017	90	15 878
HD108767B	259	15 621	1724	7	3577	26	3777	15	3390
HIP1993	253	15 228	1271	8	2376	37	3479	24	2810
HIP12394	251	15 636	2920	6	6670	39	8226	25	7191
HIP107345	263	31 080	2265	8	2068	38	3535	26	2487

Note. The total number as well as the number of negative stamps of SNR H2 greater than 2 are reported.

Table 4.

Number of positive (planet’s injection) and negative stamps (extracted from a pure noise map generated with Algorithm 1).

Star name	Learning		BT4
	#TP	#TN	#TP	#U\|$_{{\rm SNR}\ge 2}$\|
HIP1993	243	15 076	34	2863
HIP12394-4M	214	15 540	34	8384
HIP12394-3M	214	15 540	34	7707
HIP107345	192	15 791	34	2421

Note. The total number as well as the number of negative stamps of SNR H2 greater than 2 are reported.

3.2 Tests description

We started with a blind test constructed by the SHINE consortium, BT1. Eight fake companions with spectral types between early M and late T were injected for each of the stars considered; their contrasts were chosen to provide a TLOCI signal (Marois et al. 2014) of about 5σ (mean of H2 and H3 contrasts). We note that the objects do not necessarily represent realistic planets. Note also that, in a few cases, the injected planets are not detected with PACO even with a low threshold. This happens when the planets are very close to the stars and their expected contrasts are overestimated. Finally, one of the eight planets around HD108767B and two of the eight planets around HIP12394 have an SNR in H2 lower than 2. They are therefore not considered here.

BT2 and BT3 are blind tests generated using the process described in Section 2.2.1. The planet masses considered are 1, 2, 3, 4, and 5 |${M}_{\text{jup}}$|. The contrasts of the companions with respect to the stars are computed from their masses and the ages of the stars (planets and stars are assumed to be coeval), using the COND models. Unlike in BT1, the FPs correspond to realistic cases. We note that the respective contrasts in BT1 are very different from those in BT2 and BT3. Fig. 3 shows the distribution of the SNR values of the injections in the three blind tests.

Figure 3.

Box plots summarizing the SNR distribution of the injections in H2 and H3 (left and right in each box plot, respectively) for the three blind tests: BT1, BT2, and BT3. Each box plot shows the minimum, first quartile, median, third quartile, maximum (by convention, the third quartile plus at most 1.5 times the interquartile range), and additional points above.

The last test, BT4, is similar to BT2 and BT3, except that we massively injected FPs close to the star, between 0.25 and 1 arcsec. The FPs were chosen so that their contrasts lie along the 5σ contrast curves, to test the capability of the classifier. We chose to use the classifier on HIP12394 with 3 |${M}_{\text{jup}}$| and 4 |${M}_{\text{jup}}$| injected planets because both planets cross the contrast curve between 0 and 1 arcsec.

3.3 Features analysis

We analyse the distribution of six of the features (MeanSnr, MeanGra, MaxGra, MaxMin, AiryFig, and MeanSpec) across TP and TN stamps. We recall that a feature is simply a real number computed on a stamp, and that a good feature is correlated with a class (positive/negative). Fig. 4 provides, for each feature, two box plots showing the distribution of its values for TP (box labelled pos) and TN (box labelled neg) stamps. A box plot summarizes the distribution in five numbers, from bottom to top: minimum, first quartile, median, third quartile, and maximum. The values of the feature for 50 per cent of the stamps lie within the box. The median is the orange horizontal line within the box. Values considered as outliers (more than 1.5 times the interquartile range below the first quartile or above the third quartile; the plotted minimum and maximum stop at these limits) are not shown for the sake of clarity. The correlation coefficient (r value) is given for each feature.
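For illustration, the five-number summary and the correlation coefficient of a feature can be computed as follows. The sketch assumes that r is the Pearson correlation between the feature value and the 0/1 class label (the text does not state how r is computed), uses the inclusive quantile convention, and relies on invented feature values.

```python
import statistics


def five_numbers(values):
    """Minimum, first quartile, median, third quartile, and maximum."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


# Invented values of one feature for positive and negative stamps.
pos = [3.0, 4.0, 5.0, 6.0, 7.0]
neg = [1.0, 2.0, 2.5, 3.0, 3.5]

summary_pos = five_numbers(pos)
r = pearson_r(pos + neg, [1] * len(pos) + [0] * len(neg))
```

A feature whose positive and negative distributions barely overlap yields a value of r close to 1 in absolute value.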

Figure 4.

Box plots summarizing the distribution of six features by showing the minimum, first quartile, median, third quartile, and maximum. The distribution for positive and negative stamps are shown for each feature to help visualize whether the feature discriminates the two classes. The values are computed over stamps of SNR H2 |$\ge 2$| of the four images (in total 2069 positives, 18 074 negatives).

We expect useful features to show distinct distributions for positive and negative stamps, which helps the classifier discriminate between the two. The size of the intersection of the two distributions (for the positive and negative classes) gives an idea of the discriminating power of the feature.

Four features appear decisive (correlation coefficient |$r \ge 0.6$|): the mean SNR intensity (MeanSnr), the gradient features (MeanGra, MaxGra), and the feature related to the presence of an Airy figure (AiryFig).

The feature related to speckles is not strongly correlated with the presence of a companion (|$r = 0.29$|). This is expected, as speckles are only present within the star halo. Restricting the analysis to the halo region, between 30 and 140 pixels (i.e. 370 to 1700 mas), significantly increases the correlation to |$r = 0.61$| (see Fig. 5). It might therefore be appropriate to build two distinct classifiers, one for the star halo that includes MeanSpec and one for the remaining area without it, but this would further complicate the overall process. We decided to keep it simple for the moment and used the feature MeanSpec on the entire field, with an additional 0/1 indicator feature (|$f_9$| in the Appendix) defining whether or not a stamp lies in the star halo. In other words, this indicator feature tells the classifier when MeanSpec is relevant.

Figure 5.

Distribution of the MeanSpec feature for three selected subsets of stamps from left to right, respectively: (1) all stamps of SNR H2 |$\ge 2$| (2069 positives, 18074 negatives), (2) all stamps of SNR H2 |$\ge 2$| located between 30 and 140 pixels (554 positives, 1969 negatives), and (3) all stamps of SNR H2 |$\ge 2.5$| located between 30 and 140 pixels (492 positives, 558 negatives).

3.4 Analysis

The results are presented in Table 5 for BT1–BT4. The results of the classifier (column RegL) are compared to the threshold approach (thresholds of 3 and 5) for two flavours of PACO: ADI and ASDI. In ADI, the SNR is computed separately in the H2 and H3 channels, so a detection is made if either of the SNR values in H2 or H3 is above the threshold. In ASDI, PACO optimally combines both bands (Flasseur et al. 2020). The number of planets found (#TPF) and the number of candidates (#C) are reported for each approach (see Section 3.1 for the definitions of #TPF and #C).

Table 5.

Comparison between the results from the logistic regression approach (Column RegL) and those from threshold detections.

		RegL		ADI 3		ADI 5		ASDI 3		ASDI 5
	#Inj	#TPF	#C	#TPF	#C	#TPF	#C	#TPF	#C	#TPF	#C
	BT1
HD108767B	7	5	11	6	153	3	0	6	717	5	0
HIP1993	8	6	1	8	100	3	0	8	724	7	0
HIP12394	6	5	2	5	397	2	0	6	611	5	0
HIP107345	8	7	2	8	65	2	0	8	694	8	0
Total	29	23	16	27	715	10	0	28	2746	25	0
	BT2
HD108767B	26	22	13	21	2	16	0	23	716	18	1
HIP1993	37	33	1	32	91	27	0	31	627	23	1
HIP12394	39	27	1	28	373	20	0	24	620	22	1
HIP107345	38	38	1	38	77	33	0	36	659	33	1
Total	140	120	16	119	543	96	0	114	2622	96	4
	BT3
HD108767B	15	6	10	8	118	2	0	8	440	3	1
HIP1993	24	20	4	19	65	19	0	23	390	18	0
HIP12394	25	19	2	19	348	16	0	20	386	19	1
HIP107345	26	24	2	24	61	21	0	26	414	24	1
Total	90	69	18	70	592	58	0	77	1630	64	3
	BT4
HIP1993	34	30	7	31	100	19	2	32	956	28	1
HIP12394-4M_Jup	34	33	12	34	392	26	7	33	909	29	0
HIP12394-3M_Jup	34	21	12	25	388	12	7	26	734	13	0
HIP107345	34	27	2	29	75	18	2	30	951	30	0
Total	136	111	33	119	955	75	18	121	3550	100	1

Notes. Two thresholds (3 and 5) and two setups (ADI and ASDI) are considered. Columns #TPF report the number of planets found, whereas columns #C give the number of additional candidates produced. #Inj is the number of injections (hidden planets) in the corresponding blind test.

Overall, ADI 3, ASDI 3, and RegL retrieve most of the planets. Yet the former two also detect a huge number of candidates: typically several hundred candidates are found with ASDI 3 and often about one hundred with ADI 3, while RegL detects a much more limited number. Such large lists of candidates make it impractical to identify the TPs, which justifies the usual choice of a threshold of 5 when using PACO. Using a threshold of 5 removes nearly all candidates but reduces the detection performance (in particular for BT2 but also BT3) compared to our classifier. Hence, RegL gives a better compromise between the number of detections and the number of candidates than ADI or ASDI. We detail the results of each test below.

BT1. RegL finds 23 planets while ADI 3 finds 27. Yet RegL produces 16 additional candidates (likely false positives), to be compared with the 715 produced by ADI 3. ASDI 5 finds 25 planets and avoids any additional candidates. We discuss this result below.

BT2. RegL retrieves 120 of the 140 injected planets, identifying one more planet than ADI 3. Notably, RegL finds only 16 additional candidates (likely false positives), to be compared with the 543 found with ADI 3. ASDI 3 finds 114 planets and produces many more additional candidates (2622) than RegL. RegL finds more planets than ADI 5 and ASDI 5 (which both find 96 planets), with a slightly larger number of candidates (16 instead of 0 and 4, respectively).

BT3. RegL does not perform better than ADI 3 or ASDI 3 regarding the number of planets found, but the number of candidates with RegL (18) is significantly smaller than with ADI 3 (592) or ASDI 3 (1630). It compares well with ADI 5 and ASDI 5, with a larger number of detected planets (69 versus 58 with ADI 5 and 64 with ASDI 5) and a small number of candidates (18 versus 0 and 3, respectively).

BT4. RegL does not perform better than ADI 3 or ASDI 3 regarding the number of planets found, but the number of candidates with RegL is much smaller than with ADI 3 or ASDI 3. Despite the gap between #TPF and the number of injected planets, RegL finds almost all the sources with a central SNR > 2 for HIP1993, HIP12394-4M, and HIP107345, and misses only five planets, with SNR very close to 2, for HIP12394-3M. In fact, only 26 planets out of the 34 have an SNR above the threshold of RegL. This gap arises because we injected along the 5σ contrast curves at constant mass, so some planets close to the star fall below this curve.
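The compromise between detections and candidates can be quantified directly from the totals of Table 5. The short script below computes, for each blind test and method, the fraction of injected planets recovered (#TPF/#Inj) and the share of injected planets among everything the method returns (#TPF/(#TPF + #C)). These two ratios and the function names are our own illustrative choices; the second is only a lower bound on the purity, since some candidates could be real objects.

```python
# Totals copied from Table 5: (planets found #TPF, additional candidates #C).
TOTALS = {
    "BT1": {"inj": 29, "RegL": (23, 16), "ADI 3": (27, 715),
            "ADI 5": (10, 0), "ASDI 3": (28, 2746), "ASDI 5": (25, 0)},
    "BT2": {"inj": 140, "RegL": (120, 16), "ADI 3": (119, 543),
            "ADI 5": (96, 0), "ASDI 3": (114, 2622), "ASDI 5": (96, 4)},
    "BT3": {"inj": 90, "RegL": (69, 18), "ADI 3": (70, 592),
            "ADI 5": (58, 0), "ASDI 3": (77, 1630), "ASDI 5": (64, 3)},
    "BT4": {"inj": 136, "RegL": (111, 33), "ADI 3": (119, 955),
            "ADI 5": (75, 18), "ASDI 3": (121, 3550), "ASDI 5": (100, 1)},
}
METHODS = ("RegL", "ADI 3", "ADI 5", "ASDI 3", "ASDI 5")


def recovered_fraction(test, method):
    """Fraction of the injected planets that the method finds."""
    tpf, _ = TOTALS[test][method]
    return tpf / TOTALS[test]["inj"]


def injected_share(test, method):
    """Share of injected planets among all the detections returned."""
    tpf, c = TOTALS[test][method]
    return tpf / (tpf + c)


for test in TOTALS:
    for method in METHODS:
        print(f"{test} {method:7s} "
              f"recovered={recovered_fraction(test, method):.2f} "
              f"share={injected_share(test, method):.2f}")
```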

Fig. 6 shows the number of planets found (#TPF) for all blind tests as a function of their projected separation from the star (in [0, 1715[ or |$\ge 1715$| mas) and of their SNR value in H2 (in [2, 5[ or |$\ge 5$|). Results for ADI 3 are not reported here since they lead to far too many candidates. As expected, the logistic regression improves over ADI/ASDI for low H2 SNR.¹ It can also be noted that most of the injections of these blind tests have a high SNR and are located outside the star halo.

Figure 6.

Number of detections (#TPF) achieved by the different methods depending on the SNR and separation from the stars of the targets.

Finally, note that the classifier is expected to perform better on BT2/BT3 than on BT1 because the exact same injection process is used at the learning step and at the usage step for BT2/BT3. Conversely, the injections of BT1 were performed independently of this work with different parameters (thus a different opinion on what is most realistic) and can show different patterns that are therefore not learnt by the classifier. In particular, injections of BT1 can have a low SNR in H2 but a high SNR in H3. This pattern is not typical of the injection process used in this work (Section 2.2.1) for the learning step. As a result, the classifier does not learn to consider such patterns and is less efficient on BT1. It remains competitive, but its performance on BT1 could certainly be improved by including, at the learning step, injections that are consistent with those used for the test.

3.5 Image-dedicated classifier versus a single classifier

So far in our study, a classifier is learnt for each image and its usage is dedicated to the original image in which the exoplanets are to be detected. This has the advantage of taking into account peculiarities of the image, such as the weather conditions at the time it was taken. The alternative is to train a single, common classifier using the four data sets together, aiming for better generalization.

The table of Fig. 7 gives the result of such a single classifier (first line) trained over all data sets. The two approaches produce very similar results. The single classifier identifies one additional planet on BT3 (70 versus 69) but misses one on BT1 (22 versus 23), and tends to produce the same number of candidates that are not injected planets. However, these results might depend considerably on the quality and uniformity of the images. We expect that a larger data set, sampling a wider range of weather conditions and image quality, might be needed to investigate this option further.

Figure 7.

Comparing a single classifier trained over the four data sets to a classifier dedicated to each data set.

3.6 Handling speckles with two classifiers

A speckle moves radially away from the centre between the two wavelengths depending on its distance to the centre and the ratio |$r = \frac{\lambda _{H3}}{\lambda _{H2}}$|. One of the proposed features tries to take advantage of this motion to detect speckles. But since the motion depends on the distance to the star, a speckle near the centre might shift by less than a single pixel, which makes the feature irrelevant. Additionally, speckles are not present far from the star, where another regime occurs and the noise tends to be dominated by photon and instrument noise. Overall, the feature is only valid between a minimum distance |$d_{\rm min}$| and a maximum distance |$d_{\rm max}$| from the star, and it is associated with an indicator (another feature) that is set to one when the centre of the stamp is located in the range |$[d_{\rm min}, d_{\rm max}]$| (see Appendix A for details).
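As a rough illustration of why the feature loses its meaning close to the star, the radial shift of a speckle between H2 and H3 can be estimated as (r − 1) times its separation. The sketch below is not part of the pipeline described here; it assumes the wavelengths quoted in Appendix A and a pixel scale of about 12.27 mas per pixel, and the function names are hypothetical.

```python
# Radial shift of a speckle between H2 and H3, expressed in pixels.
# Assumptions: wavelengths from Appendix A, pixel scale of ~12.27 mas/pixel.
# Function names are illustrative only.
LAMBDA_H2_UM = 1.593
LAMBDA_H3_UM = 1.667
PIXEL_SCALE_ARCSEC = 0.01227


def speckle_shift_pixels(separation_arcsec: float) -> float:
    """Outward shift (pixels) of a speckle located at `separation_arcsec` in H2."""
    ratio = LAMBDA_H3_UM / LAMBDA_H2_UM
    separation_pixels = separation_arcsec / PIXEL_SCALE_ARCSEC
    return (ratio - 1.0) * separation_pixels


def min_separation_for_shift(shift_pixels: float = 1.0) -> float:
    """Separation (arcsec) at which the shift reaches `shift_pixels`."""
    ratio = LAMBDA_H3_UM / LAMBDA_H2_UM
    return shift_pixels / (ratio - 1.0) * PIXEL_SCALE_ARCSEC


for sep in (0.1, 0.5, 1.0, 1.7):
    print(f"{sep:.1f} arcsec -> shift of {speckle_shift_pixels(sep):.2f} px")
```

Because the shift grows linearly with the separation, a feature comparing H2 and H3 positions only becomes informative beyond some minimum distance from the star.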

Another approach is to train two distinct classifiers. The first one is trained with stamps included in |$[d_{\rm min}, d_{\rm max}]$| and includes the speckle feature whereas the second one is trained with stamps within |$[0, d_{\rm min}[ \cup ]d_{\rm max}, +\infty [$| and does not use the speckle feature. The results obtained with two classifiers are shown in the last row of Fig. 7 and appear to be very similar or slightly worse. Additional training samples might be required to properly train two classifiers as opposed to one.

3.7 Threshold of the logistic regression

Logistic regression gives a probability of belonging to a class. By default, a threshold of 0.5 is used to decide whether the object belongs to the class or not. However, it is possible to choose another threshold value and thus obtain more or fewer samples classified as true. A classical way to determine this threshold is to use a ROC (Receiver Operating Characteristic) curve, which plots the TP rate against the false positive rate. But, in the case of imbalanced data, the precision-recall curve is often preferred. Precision is the proportion of relevant items among all the proposed items; recall is the proportion of relevant items proposed among all the relevant items.

To determine a threshold from precision and recall, we use the f-score calculated as follows: |$F_\beta = (1 + \beta ^2) \times \frac{\mathrm{precision} \times \mathrm{recall}}{(\beta ^2 \cdot \mathrm{precision})\,+\,\mathrm{recall}}$|⁠.

Increasing the value of |$\beta$| increases the weight of the recall. The threshold that maximizes the f-score on the training data is calculated and then used for classification. As can be seen in Fig. 8, as |$\beta$| increases, the number of candidates proposed can increase, as well as the number of objects found. Comparing it with a threshold set at 0.5 (right panel of Fig. 8), we find a larger number of objects, but also a much greater number of candidates.
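The threshold selection can be sketched in a few lines. The example below is only an illustration on made-up labels and probabilities (not data from this study); the helper names are hypothetical, and the candidate thresholds are simply the distinct predicted probabilities, which is an assumption rather than the procedure used here.

```python
def precision_recall(labels, scores, threshold):
    """Precision and recall when every score >= threshold is classified as true."""
    predicted = [s >= threshold for s in scores]
    tp = sum(1 for y, p in zip(labels, predicted) if y and p)
    fp = sum(1 for y, p in zip(labels, predicted) if not y and p)
    fn = sum(1 for y, p in zip(labels, predicted) if y and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def f_beta(precision, recall, beta):
    """F-score as defined in the text; returns 0 when it is undefined."""
    denom = beta**2 * precision + recall
    return (1 + beta**2) * precision * recall / denom if denom else 0.0


def best_threshold(labels, scores, beta):
    """Threshold (among the observed scores) that maximizes the F-score."""
    candidates = sorted(set(scores))
    return max(
        candidates,
        key=lambda t: f_beta(*precision_recall(labels, scores, t), beta),
    )


# Toy training sample: 1 = injected planet, 0 = noise (illustrative values).
labels = [1, 1, 1, 0, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.55, 0.3, 0.2, 0.1, 0.05]
for beta in (0.5, 2.0):
    t = best_threshold(labels, scores, beta)
    n_candidates = sum(s >= t for s in scores)
    print(f"beta={beta}: threshold={t}, candidates={n_candidates}")
```

On this toy sample, |$\beta = 0.5$| selects a threshold of 0.6 (three candidates) whereas |$\beta = 2$| selects 0.3 (five candidates), which mirrors the trend of Fig. 8: a larger |$\beta$| lowers the threshold and lets more candidates through.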

Figure 8.

Number of objects found (#TPF in blue, left bar) and candidates (#C in black, right bar) according to the value of |$\beta$|. The black line represents the number of injections (29 for BT1 and 140 for BT2).

The classifier can therefore be adjusted to the best compromise (number of TPs versus number of false positives) that suits the user. It depends on how much effort a user can afford to spend checking the candidates by hand in order to increase the chance of a true detection. In the present case, we note that a significant increase in the number of candidates is required to detect only a few more exoplanets.

3.8 Amount of data required.

We evaluated the amount of training data required to reach the performance reported for the single classifier. The injection process can require a non-negligible computational effort in practice. It turns out to be more costly than the learning, clustering, and classification steps. To evaluate the real need for the injections, we ran the training for the following numbers of injected planets: #pos |$\in \lbrace 25,50,100,150,200,250\rbrace$|. We also investigated different numbers of guaranteed noise stamps with #neg |$\in \lbrace 1000, 5000, 10000, 15000\rbrace$| to get a sense of the effect of imbalance. This analysis was restricted to BT2.

Overall, the 120 planets (actual performance of our approach on BT2) are found using only 150 injected planets in the training stage (whereas 250 injections were initially used). A minimum of 10000 negative samples is needed to avoid too many false positives.

4 APPLICATION TO 51 ERI

4.1 51 Eridani

51 Eridani (HIP 21547) is an F0-type star that hosts one 2–4|${M}_{\text{jup}}$| planet imaged in 2014 by the Gemini Planet Imager (Macintosh et al. 2015). We apply the proposed methodology to four data sets taken with IRDIS on 25/12/2015, 15/01/2016, 11/12/2016, and 12/12/2016. The observing log and setup of these four observations can be found in Table 6. Note that the 2016 December data, although obtained under good atmospheric conditions, are affected by the so-called low wind effect, which occurs when the wind is very low and considerably degrades the image quality (Milli et al. 2018).²

Table 6.

51 Eri observation logs.

Star	Observation date	Filter	DIT (s) |$\times$| N_frame	|$\Delta$|PA (|$^\circ$|)|$^a$|	Seeing (arcsec)|$^b$|	Airmass|$^b$|	|$\tau _0$| (ms)|$^{a,b}$|	Program ID
HIP 21547	2015-12-25	DB_H23	16 × 256	37.6	1.18	1.10	1.8	096.C-0241(C)
HIP 21547	2016-01-15	DB_H23	16 × 256	41.8	1.91	1.08	1.3	096.C-0241(G)
HIP 21547	2016-12-11	DB_H23	64 × 54	25.3	1.97	1.12	1.5	198.C-0209(C)
HIP 21547	2016-12-12	DB_H23	64 × 72	45.0	0.84	1.09	5.7	198.C-0209(C)

|$^b$|Values extracted from the updated DIMM info and averaged over the sequence.

We consider the four data sets completely independently, as our approach is not informed of the temporality. Table 7 reports the SNR of the planet in each image with PACO ADI and PACO ASDI (column SNR), whether it was found or not by our method (yes/no of column Found) and the number of candidates proposed (column #C).

Table 7.

Results obtained on 51 Eridani.

25/12/2015				15/01/2016				11/12/2016				12/12/2016
SNR H2	SNR ASDI	Found	#C	SNR H2	SNR ASDI	Found	#C	SNR H2	SNR ASDI	Found	#C	SNR H2	SNR ASDI	Found	#C
4.71	4.3	Yes	3	5.28	6.6	Yes	3	< 2.5	< 4	No	4	2.69	4.1	Yes	2

https://pagesperso.g-scop.grenoble-inp.fr/~catussen/exoplanet/report/

Our approach finds 51 Eri b in three of the four data sets, with an SNR lower than 5 (in the H2 band) in two cases. Moreover, only a very small number of additional candidates are proposed (2 or 3). Unfortunately, none of the additional signals found by the classifier seems to be gravitationally bound to the star (i.e. detected at multiple epochs with a motion compatible with a bound object). We classify them as false positives.

5 CONCLUSIONS

Statistical approaches have proved very efficient at avoiding self-subtraction when searching for exoplanets in ADI high-contrast images, and at providing means to quantify the confidence of a detection. The SNR maps produced still contain too many artefacts related to background noise to simply identify planets with an SNR threshold typically lower than |$5\sigma$|. But planetary signals and noise also leave specific patterns and shapes within the SNR map that can help to discriminate between them. We have proposed a methodology using simple algorithmic techniques (edge detection, regression, and clustering) to help separate noise and planetary signals in these SNR maps. We demonstrated that the proposed methodology can considerably reduce the number of false positives and even improve detection in some cases (see, for instance, the case study of 51 Eridani). Moreover, it is well suited to learning with small data sets (a limited number of training samples compared with the current needs of deep-learning techniques) since it relies on dedicated and informative features of the application domain. This also helps to explain the results because the features have a meaning for the user. We now mainly intend to generalize it to spectroscopic data and test it on a larger scale.

ACKNOWLEDGEMENTS

This project is supported in part by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (COBREX; grant agreement no 885593) as well as the CNRS (Mission pour les Initiatives Transverses et Interdisciplinaire: MITI).

DATA AVAILABILITY STATEMENT

The data as well as detailed results are available on a web page³ and upon request.

Footnotes

A detection can be made by ASDI 5 below an SNR threshold of 5 in H2 because PACO combines both H2 and H3 bands in ASDI. It can also be made in ADI 5 if the SNR in H3 alone is above 5.

This effect has been taken care of afterwards.

REFERENCES

Absil et al., 2013, A&A, 559, L12

Allard, Hauschildt P. H., Alexander D. R., Tamanai, Schweitzer, 2001, ApJ, 556, 357, doi:10.1086/321547

Beuzit J.-L. et al., 2019, A&A, 631, A155

Cantalloube et al., 2015, A&A, 582, A89

Chauvin, Lagrange A.-M., Dumas, Zuckerman, Mouillet, Song, Beuzit J.-L., Lowrance, 2004, A&A, 425, L29

Chomez et al., 2023a, A&A, 675, A205, doi:10.1051/0004-6361/202245723

Chomez et al., 2023b, A&A, 676, L10, doi:10.1051/0004-6361/202347044

Delorme et al., 2017, in Reylé, Di Matteo, Herpin, Lagadec, Lançon, Meliani, Royer, eds, SF2A-2017. French Society of Astronomy and Astrophysics, p. 347

Desidera et al., 2021, A&A, 651, A70, doi:10.1051/0004-6361/202038806

Dohlen et al., 2008, in McLean I. S., Casali M. M., eds, SPIE Conf. Ser. Vol. 7014, Ground-based and Airborne Instrumentation for Astronomy II. Astron. Soc. Pac., San Francisco, p. 70143L

Duda R. O., Hart P. E., 1973, Pattern Classification and Scene Analysis. Wiley, New York

Flasseur, Denis, Thiébaut É., Langlois, 2020, A&A, 637, doi:10.1051/0004-6361/201937239

Flasseur, Denis, Thiébaut É., Langlois, 2018, A&A, 618, A138

Flasseur, Bodrito, Mairal, Ponce, Langlois, Lagrange A.-M., 2024, MNRAS, 527, 1534, doi:10.1093/mnras/stad3143

Gebhard T. D., Bonse M. J., Quanz S. P., Schölkopf, 2020, preprint (arXiv)

Gomez Gonzalez C. A., Absil P.-A., Van Droogenbroeck, Mawet, Surdej, 2016, A&A, 589, A54

Gomez Gonzalez C. A., Absil, Van Droogenbroeck, 2018, A&A, 613, A71

Lagrange A.-M. et al., 2009, A&A, 493, L21

Lagrange A.-M. et al., 2010, Science, 329

Macintosh et al., 2014, Proc. Natl. Acad. Sci., 111, 12661

Macintosh et al., 2015, Science, 350, doi:10.1126/science.aac5891

Marois, Lafreniere, Doyon, Macintosh, Nadeau, 2005, ApJ, 641, 556

Marois, Macintosh, Barman, Zuckerman, Song, Patience, Lafreniere, Doyon, 2008, Science, 322, 1348, doi:10.1126/science.1166585

Marois, Correia, Véran J.-P., Currie, 2013, Proc. International Astronomical Union Symp. 8. Kluwer, Dordrecht, doi:10.1017/S1743921313007813

Marois, Correia, Véran J.-P., Currie, 2014, in Booth, Matthews B. C., Graham J. R., eds, Exploring the Formation and Evolution of Planetary Systems, Vol. 299

Milli et al., 2018, in L. M., Schreiber, Schmidt, eds, SPIE Conf. Ser. Vol. 10703, Adaptive Optics Systems VI. SPIE, Bellingham, p. 107032A

Müllner, 2011, Modern Hierarchical, Agglomerative Clustering Algorithms, preprint (arXiv)

Nielsen E. L. et al., 2019, AJ, 158, doi:10.3847/1538-3881/ab16e9

Soummer, Pueyo, Larkin, 2012, ApJ, 755, L28

Vigan et al., 2021, A&A, 651, A72, doi:10.1051/0004-6361/202038107

Yip K. H. et al., 2020, in Brefeld, Fromont, Hotho, Knobbe, Maathuis, Robardet, eds, Machine Learning and Knowledge Discovery in Databases. Springer International Publishing, Cham, p. 322

APPENDIX A: FEATURES

In the following, we denote the maximum, mean, and standard deviation over a set of real numbers |$X \subset \mathbb {R}$| by |$\max (X)$|, |$\mu (X)$|, and |$\sigma (X)$|.

A stamp S is a small sub-image restricted to a |$D\times D$| area. This sub-image gives two |$D\times D$| matrices of pixels or SNR values extracted from the original image in the two wavelengths |$H2$| and |$H3$|. In practice, we experimented with stamps of size |$29\times 29$| and |$19\times 19$| (|$D = 29$| and 19). For the sake of simplicity, we assume D is always odd so that the centre of the stamp is a pixel (and not in between two pixels). To make it easier to generalize D to other contexts, we recall that we were working with 12.270 milliarcsec per pixel and two wavelengths at |$\lambda _1=1.593\,{\rm \mu m}$| and |$\lambda _2=1.667\,{\rm \mu m}$|. For a pixel |$u = (i,j)$| in the stamp S (|$u \in S$|), we denote by

|$s^{H2}_{u}$| (resp. |$s^{H3}_{u}$|): the SNR value of pixel u in wavelength |$H2$| (resp. |$H3$|).
|$g_{u}$|: the norm of the SNR gradient at pixel u in wavelength |$H2$|.
c: the pixel at the centre of the stamp, i.e. |$c = (\lfloor D/2 \rfloor , \lfloor D/2 \rfloor)$|.
|$\mathcal {C}$|: the set of nine pixels in a |$3\times 3$| area at the centre of the stamp, i.e. |$\mathcal {C} = \lbrace (i,j) | i \in [\![ \lfloor D/2 \rfloor -1,\lfloor D/2 \rfloor +1]\!], j \in [\![ \lfloor D/2 \rfloor -1, \lfloor D/2 \rfloor +1]\!]\rbrace$|.

Note that we refer to pixel |$u=(i,j)$| using its i and j coordinates when needed so that |$s^{H2}_{u}$| can also be written |$s^{H2}_{ij}$|⁠. Fig. A1 gives a summary of the notations.

Figure A1.

Summary of the notations for a stamp.

The features used are the following:

(MeanSnr). Mean SNR value in |$H2$| and |$H3$|⁠:
$$\begin{eqnarray} f^{H2}_1 = \mu (\lbrace s^{H2}_u | u \in S\rbrace), \qquad f^{H3}_1 = \mu (\lbrace s^{H3}_u | u \in S\rbrace). \end{eqnarray}$$
(MaxCenteredSnr). Max SNR value in |$H2$| and |$H3$| in the centre |$\mathcal {C}$|⁠:
$$\begin{eqnarray} f^{H2}_2 = \max (\lbrace s^{H2}_u | u \in \mathcal {C} \rbrace), \qquad f^{H3}_2 = \max (\lbrace s^{H3}_u | u \in \mathcal {C} \rbrace). \end{eqnarray}$$
(MaxGra, MeanGra, StdevGra). Maximum, mean, and standard deviation of the gradient in |$H2$|⁠:
$$\begin{eqnarray} f_3 = \max (\lbrace g_u | u \in S\rbrace), \end{eqnarray}$$
$$\begin{eqnarray} f_4 = \mu (\lbrace g_u | u \in S\rbrace), \end{eqnarray}$$
$$\begin{eqnarray} f_5 = \sigma (\lbrace g_u | u \in S\rbrace). \end{eqnarray}$$
(MaxMin). Consider the image defined for each pixel u by the minimum SNR value between both wavelengths: |$\min (s^{H2}_u,s^{H3}_u)$|. This minimum image only keeps the SNR quantities present at both wavelengths. The idea is that a speckle systematically moves between H2 and H3 whereas real companions can be present at the same position in both H2 and H3. We use the following feature:
$$\begin{eqnarray} f_6 = \max (\lbrace \min (s^{H2}_u,s^{H3}_u) | u \in S\rbrace). \end{eqnarray}$$
(AiryFig). To capture the presence of an Airy figure, we rely on an azimuthal mean of SNR values. More precisely, let |$h_u(d)$| denote the mean SNR value in a ring one pixel wide located at distance d from pixel u. Fig. A2 shows |$h_{\rm c}(d)$|, i.e. the profile computed from the centre c of our example stamp.
An Airy figure at the centre c of the stamp is expected to have a higher standard deviation |$\sigma (c) = \sigma (\lbrace h_{c}(d)| d \in [\![ 1, D/2 [\![ \rbrace)$| than random noise. This, however, assumes that the stamp is precisely centred. We therefore relax this constraint and take the best value among the nine central pixels. We use the following feature:
$$\begin{eqnarray} f_7 = \max (\lbrace \sigma (u) | u \in \mathcal {C}\rbrace). \end{eqnarray}$$
Feature |$f_7$| is computed for each wavelength, which gives, in practice, two distinct features |$f^{H2}_7$| and |$f^{H3}_7$|.
(MeanSpec). A speckle moves radially away from the centre (the star) of the image depending on its distance to the centre and the ratio |$r=\frac{\lambda _{H3}}{\lambda _{H2}}= \frac{1.667}{1.593}$| (see Fig. A3). Assuming the star is the origin of the coordinate system, if pixel |$u=(i,j)$| in |$H2$| is part of a speckle and its centre is located at coordinates |$(x_i,y_j)$| in the star system, the point |$ru = (rx_i,ry_j)$| in |$H3$| is expected to have the same SNR intensity. Since |$(rx_i,ry_j)$| does not necessarily match exactly the centre of another pixel, we compute a weighted average over neighbouring pixels. Let |$N(u)$| be the nine closest pixels to the real point |$ru = (rx_i,ry_j)$|; we compute |$w(u)$| (where |$d^{\prime }(u,v)= \frac{1}{d(u,v)}$| is the inverse of the distance between pixels u and v):
$$\begin{eqnarray} w(u) = \frac{\sum _{v\in N(u)} d^{\prime }(ru,v) \times s^{H3}_{v}}{\sum _{v \in N(u)} d^{\prime }(ru,v)}. \end{eqnarray}$$
The following quantity |$sp(u)$| is used to quantify how much the SNR intensity |$s^{H2}_u$| is found at |$(rx_i,ry_j)$| in |$H3$|⁠, i.e. is close to |$w(u)$|⁠:
$$\begin{eqnarray} sp(u) = \frac{|s^{H2}_{u} - w(u)|}{\max (s^{H2}_{u},w(u))}. \end{eqnarray}$$
Finally, the feature considers the nine central pixels to identify a speckle:
$$\begin{eqnarray} f_8 = \mu (\lbrace sp(u) | u \in \mathcal {C}\rbrace). \end{eqnarray}$$
The previous characterization of a speckle is only valid between a minimum and a maximum distance from the star. We therefore introduce an indicator feature:
$$\begin{eqnarray} f_9 = \left\lbrace \begin{array}{ll}1 & \textrm {~if~}d_{\rm min} \le d(c,{\rm star}) \le d_{\rm max}\\0 & \textrm {~otherwise~}. \end{array}\right. \end{eqnarray}$$
We used the values (in arcseconds) of |$d_{\rm min} = 0.2\,{\rm arcsec}$| and |$d_{\rm max} = 1.7\,{\rm arcsec}$|⁠.
(MeanNoise, StdevNoise). At greater distances from the star (|$\gt d_{\rm max}$|) a companion object can sometimes disappear from |$H2$| to |$H3$|. The quantity |$n(u)$| measures how much of the intensity in |$H2$| at pixel u disappears in |$H3$|:
$$\begin{eqnarray} n(u) = {\rm max}(0,\frac{s^{H2}_{u} -s^{H3}_{u}}{\max (s^{H2}_{u},s^{H3}_{u})}). \end{eqnarray}$$
We use the mean and the standard deviation of this quantity, at the centre of the stamp, as features:
$$\begin{eqnarray} f_{10} = \mu (\lbrace n(u) |u \in \mathcal {C}\rbrace), \qquad f_{11} = \sigma (\lbrace n(u) |u \in \mathcal {C}\rbrace). \end{eqnarray}$$
(Dist). The distance of the centre of the stamp to the star is also included in the features and denoted |$f_{12}$|⁠.
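To make the simplest of these definitions concrete, the sketch below computes MeanSnr, MaxCenteredSnr, MaxMin, MeanNoise, and StdevNoise for a stamp given as two |$D\times D$| nested lists of SNR values. It is an illustrative reading of the formulas, not the code used for this work: the function names are hypothetical, the population standard deviation is assumed for |$\sigma$|, and a guard (not present in the formula) sets |$n(u)$| to zero when the denominator is not positive.

```python
from statistics import mean, pstdev


def centre_pixels(D):
    """The nine (i, j) positions of the 3 x 3 block at the centre of a D x D stamp."""
    c = D // 2
    return [(i, j) for i in range(c - 1, c + 2) for j in range(c - 1, c + 2)]


def basic_features(h2, h3):
    """h2, h3: D x D nested lists of SNR values of the same stamp in H2 and H3."""
    D = len(h2)
    pixels = [(i, j) for i in range(D) for j in range(D)]
    centre = centre_pixels(D)

    def n(i, j):
        # Fraction of the H2 intensity that disappears in H3, clipped at 0.
        # Assumption: return 0 when the denominator is not positive.
        top = max(h2[i][j], h3[i][j])
        return max(0.0, (h2[i][j] - h3[i][j]) / top) if top > 0 else 0.0

    noise = [n(i, j) for i, j in centre]
    return {
        "f1_H2": mean(h2[i][j] for i, j in pixels),   # MeanSnr
        "f1_H3": mean(h3[i][j] for i, j in pixels),
        "f2_H2": max(h2[i][j] for i, j in centre),    # MaxCenteredSnr
        "f2_H3": max(h3[i][j] for i, j in centre),
        "f6": max(min(h2[i][j], h3[i][j]) for i, j in pixels),  # MaxMin
        "f10": mean(noise),                           # MeanNoise
        "f11": pstdev(noise),                         # StdevNoise
    }


# Toy 5 x 5 stamp: a single bright central pixel, fainter in H3 than in H2.
h2 = [[0.0] * 5 for _ in range(5)]
h3 = [[0.0] * 5 for _ in range(5)]
h2[2][2], h3[2][2] = 6.0, 3.0
print(basic_features(h2, h3))
```

The gradient-based features (MaxGra, MeanGra, StdevGra) follow the same pattern once a gradient map is available, and are omitted here.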

Figure A2.

|$h_{\rm c}(d)$| function for the stamp presented in Fig. A1. The first dark ring can be seen around distance 4 whereas the first white ring is around 7.
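The azimuthal profile and the AiryFig feature built on it can be sketched as follows for a single wavelength. This is a minimal illustration under stated assumptions rather than the implementation used here: a pixel is assigned to ring d when its rounded Euclidean distance to u equals d, d runs from 1 to |$\lfloor D/2 \rfloor - 1$| so that every ring lies inside the stamp, the population standard deviation is used, and the function names are hypothetical.

```python
from math import hypot
from statistics import mean, pstdev


def radial_profile(stamp, u):
    """h_u(d) for d = 1 .. D//2 - 1: mean SNR over the ring at rounded distance d from u."""
    D = len(stamp)
    rings = {d: [] for d in range(1, D // 2)}
    for i in range(D):
        for j in range(D):
            d = round(hypot(i - u[0], j - u[1]))
            if d in rings:
                rings[d].append(stamp[i][j])
    return [mean(rings[d]) for d in sorted(rings)]


def airy_feature(stamp):
    """Largest standard deviation of the radial profile over the nine central pixels."""
    c = len(stamp) // 2
    centre = [(i, j) for i in range(c - 1, c + 2) for j in range(c - 1, c + 2)]
    return max(pstdev(radial_profile(stamp, u)) for u in centre)


# Toy 9 x 9 stamps: a bright ring at distance 2 from the centre, and a flat stamp.
ring = [[1.0 if round(hypot(i - 4, j - 4)) == 2 else 0.0 for j in range(9)]
        for i in range(9)]
flat = [[2.0] * 9 for _ in range(9)]
print(radial_profile(ring, (4, 4)))
print(airy_feature(ring), airy_feature(flat))
```

A flat stamp gives a constant profile and therefore a zero feature value, whereas alternating bright and dark rings increase the standard deviation of the profile.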

APPENDIX B: ANNEXE

Figs B1–B3 present the results directly on the SNR maps. The injected exoplanets are shown as little green squares, the results (candidates proposed) of PACO ASDI 5 are shown as little blue circles, whereas the results of our approach regL are displayed as large red circles. For instance, exoplanets found only by PACO ASDI 5 appear as a little square within a little circle. Conversely, a square within a large circle is a target identified by regL. Some targets are not found by any of the methods.

Figure A3.

Motion of a speckle from H2 to H3.

Figure B1.

Visualization of the results on BT1. An injected exoplanet is displayed by a green square, a candidate proposed by PACO ASDI 5 is a little blue circle, a candidate proposed by regL is a large red circle. Top left panel: HD108767B. Top right panel: HIP1993. Bottom left panel: HIP12394. Bottom right panel: HIP107345.

Figure B2.

Visualization of the results on BT2. An injected exoplanet is displayed by a green square, a candidate proposed by PACO ASDI 5 is a little blue circle, a candidate proposed by regL is a large red circle. Top left panel: HD108767B. Top right panel: HIP1993. Bottom left panel: HIP12394. Bottom right panel: HIP107345.

Figure B3.

Visualization of the results on BT3. An injected exoplanet is displayed by a green square, a candidate proposed by PACO ASDI 5 is a little blue circle, a candidate proposed by regL is a large red circle. Top left panel: HD108767B. Top right panel: HIP1993. Bottom left panel: HIP12394. Bottom right panel: HIP107345.