Απόσπασμα:
Αρχική Δημοσίευση από SenseiG
«Το άστρο HD 40307 είναι ένα εξαιρετικά ήσυχο παλαιό άστρο-νάνος, όποτε δεν υπάρχει κάποιος λόγος ώστε ένας τέτοιος πλανήτης να μην μπορεί να διαθέτει «γήινη» ατμόσφαιρά», τόνισε o Anglad-Escude.
Η έρευνα των αστρονόμων δημοσιεύεται εδώ.
|
Καλά που έβαλες και την πηγή,
γιατί διαφορετικά θα αμφισβητούσαμε την γνησιότητα του άρθρου,
του οποίου παραθέτω τον πρόλογο παρακάτω, για κάποιους δύσπιστους ... 
Habitable-zone super-Earth candidate in a six-planet system
around the K2.5V star HD 40307
Mikko Tuomi
⋆1,2, Guillem Anglada-Escud´e⋆⋆3, Enrico Gerlach4, Hugh R. A. Jones1, Ansgar Reiners3,
Eugenio J. Rivera
5, Steven S. Vogt5, and R. Paul Butler6
1
University of Hertfordshire, Centre for Astrophysics Research, Science and Technology Research Institute, College Lane, AL10
9AB, Hatfield, UK
2
University of Turku, Tuorla Observatory, Department of Physics and Astronomy, V¨ais¨al¨antie 20, FI-21500, Piikki¨o, Finland
3
Universit¨at G¨ottingen, Institut f¨ur Astrophysik, Friedrich-Hund-Platz 1, 37077 G¨ottingen, Germany
4
Lohrmann Observatory, Technical University Dresden, D-01062 Dresden, Germany
5
UCO/Lick Observatory, Department of Astronomy and Astrophysics, University of California at Santa Cruz, Santa Cruz, CA 95064,
USA
6
Department of Terrestrial Magnetism, Carnegie Institute of Washington, Washington, DC 20015, USA
Received XX.XX.2012
/ Accepted XX.XX.XXXX
ABSTRACT
Context.
The K2.5 dwarf HD 40307 has been reported to host three super-Earths. The system lacks massive planets and is therefore
a potential candidate for having additional low-mass planetary companions.
Aims.
We re-derive Doppler measurements from public HARPS spectra of HD 40307 to confirm the significance of the reported
signals using independent data analysis methods. We also investigate these measurements for additional low-amplitude signals.
Methods.
We used Bayesian analysis of our radial velocities to estimate the probability densities of different model parameters. We
also estimated the relative probabilities of models with di
ffering numbers of Keplerian signals and verified their significance using
periodogram analyses. We investigated the relation of the detected signals with the chromospheric emission of the star. As previously
reported for other objects, we found that radial velocity signals correlated with the S-index are strongly wavelength dependent.
Results.
We identify two additional clear signals with periods of 34 and 51 days, both corresponding to planet candidates with
minimum masses a few times that of the Earth. An additional sixth candidate is initially found at a period of 320 days. However, this
signal correlates strongly with the chromospheric emission from the star and is also strongly wavelength dependent. When analysing
the red half of the spectra only, the five putative planetary signals are recovered together with a very significant periodicity at about 200
days. This signal has a similar amplitude as the other new signals reported in the current work and corresponds to a planet candidate
with
M sin i ∼ 7 M⊕ (HD 40307 g).
Conclusions.
We show that Doppler measurements can be filtered for activity-induced signals if enough photons and a sufficient
wavelength interval are available. If the signal corresponding to HD 40307 g is a genuine Doppler signal of planetary origin, this
candidate planet might be capable of supporting liquid water on its surface according to the current definition of the liquid water
habitable zone around a star and is not likely to su
ffer from tidal locking. Also, at an angular separation of ∼ 46 mas, HD 40307 g
would be a primary target for a future space-based direct-imaging mission.
Key words.
Methods: Statistical, Numerical – Techniques: Radial velocities – Stars: Individual: HD 40307
1. Introduction
Current high-precision spectrographs, such as the High
Accuracy Radial Velocity Planet Searcher (HARPS; Mayor
et al., 2003) and the High Resolution Echelle Spectrograph
(HIRES; Vogt et al., 1994), enable detections of low-mass planets
orbiting nearby stars. During recent years, radial velocity
(RV) planet searches have revealed several systems of super-
Earths and
/or Neptune-mass planets around nearby stars (e.g.
Mayor et al., 2009a,b; Lovis et al., 2011a; Pepe et al., 2011;
Tuomi, 2012).
The system of three super-Earths orbiting HD 40307 has received
much attention because the planets appear in dynamically
packed orbits close to mean motion resonances (Mayor et al.,
2009a). This has been used as an argument to suggest that lowmass
planets may be found in highly compact multiple systems
⋆
e-mail: guillem.anglada@gmail.com
that are still stable in long-term, e.g. a possibility of having ten
planets with masses of 17 M
⊕ within a distance of 0.26 AU on
stable orbits (Funk et al., 2010). However, the physical nature of
these companions as scaled-up versions of the Earths is not entirely
clear (Barnes et al., 2009a). Their masses, between those
of Earth and Neptune, suggest they are Neptune-like proto-gas
giants that could not accumulate enough gas before it was blown
away by the newly born star. On the other hand, recent transit observations
of hot super-Earths around bright nearby stars (L´eger
et al., 2009; Batalha et al., 2011;Winn et al., 2011) indicate that
a good fraction of these hot super-Earth mass objects can have
rocky compositions.
In this article we re-analyse the 345 HARPS spectra publicly
available through the ESO archive using a newly developed software
tool called HARPS-TERRA (template-enhanced radial velocity
re-analysis application; Anglada-Escud´e& Butler, 2012).
Instead of the classic cross-correlation function method (CCF)
implemented by the standard HARPS-ESO data reduction software
(HARPS-DRS), we derive Dopplermeasurements by leastsquares
matching of each observed spectrum to a high signal-tonoise
ratio template built from the same observations. A description
of the method and the implementation details are given in
Anglada-Escud´e & Butler (2012). In addition to an increase in
precision (especially for K andMdwarfs), this method allows us
to performadditional analyses and tests beyond those enabled by
the CCF data products provided by the HARPS-DRS. As an example,
it allows us to re-obtain the RV measurements using only
a restricted wavelength range. As we show in Section 5, this capability
can be instrumental in ruling out the planetary nature of
prominent signals correlated with stellar activity.
We rely on the Bayesian framework when estimating the orbital
parameters supported by the data, determining the significances
of the signals, and the modelling of the noise in the measurements.
In previous studies, radial velocities received within
an interval of an hour or so have been commonly binned together
in an attempt to reduce the noise caused by stellar- surfacerelated
e
ffects, i.e. stellar oscillations and granulation, and other
factors within this timescale (Dumusque et al., 2010). In principle,
this would enable the detections of planets smaller than
roughly 5 M
⊕ with HARPS over a variety of orbital distances,
even at or near the stellar habitable zone (Dumusque et al., 2010,
2011a). In our approach, and instead of binning, we apply a selfconsistent
scheme to account for and quantify correlated noise
in the Bayesian framework and use Bayesian model probabilities
to show that a solution containing up to six planets is clearly
favoured by the data, especially when the redmost part of the
stellar spectrum is used in the RV analysis. Only the confluence
of refinements in these data analysis methods (re-analysis of the
spectra and Bayesian inference) allows the detection and verification
of these low-amplitude signals.
We start with a brief description of the stellar properties of
HD 40307 (Section 2) and describe the statistical modelling
of the observations and the data analysis techniques we used
(Section 3). In Section 4 we describe the properties of the RV
measurements and perform a detailed Bayesian analysis that
identifies up to three new candidate signals. We discuss the stellar
activity indicators and their possible correlations with the RV
signals in Section 5. In this same section, we find that one of the
candidates is spuriously induced by stellar activity by showing
that the corresponding periodic signal (P
∼ 320 days) is strongly
wavelength dependent. When the RVs obtained on the redmost
part of the spectrum are analysed (Section 6), the 320-day signal
is replaced by a signal of a super-Earth-mass candidate with a
200-day period with a minimum mass of about
∼ 7M⊕ orbiting
within the liquid water habitable zone of HD 40307. The analysis
of the dynamical stability of the system (Section 7) shows
that stable solutions compatible with the data are feasible and the
potential habitability of the candidate at 200 days (HD 40307 g)
is discussed in Section 8.We give some concluding remarks and
discuss the prospects of future work in Section 9.
K2.5 V star is a nearby dwarf with a Hipparcos parallax of 77.95
2. Stellar properties of HD 40307
We list the basic stellar properties of HD 40307 in Table 1. This
??
0.53 mas, which implies a distance of 12.83 ?? 0.09 pc. It is
somewhat smaller (M
⋆ = 0.77 ?? 0.05 M⊕; Sousa et al., 2008)
and less luminous (log L
star/L⊙ = −0.639 ?? 0.060; Ghezzi et
al., 2010) than the Sun. The star is quiescent (log
R′
HK
< −4.99;
Mayor et al., 2009a) and relatively metal-poor with [Fe
/H] = -
0.31
??0.03 (Sousa et al., 2008). It also lacks massive planetary
companions, which makes it an ideal target for high-precision
Table 1.
Stellar properties of HD 40307.
Parameter Estimate Reference
Spectral Type K2.5 V Gray et al. (2006)
log
R′
HK
-4.99 Mayor et al. (2009a)
π
[mas] 77.95??0.53 van Leeuwen (2007)
log L
star/L⊙ -0.639??0.060 Ghezzi et al. (2010)
log
g 4.47??0.16 Sousa et al. (2008)
M
star [M⊙] 0.77??0.05 Sousa et al. (2008)
T
eff [K] 4956??50 Ghezzi et al. (2010)
[Fe
/H] -0.31??0.03 Sousa et al. (2008)
v
sin i [kms−1] <1 Mayor et al. (2009a)
P
rot [days] ∼ 48 Mayor et al. (2009a)
Age [Gyr]
∼ 4.5 Barnes (2007)
RV surveys aiming at finding low-mass planets. According to
the calibration of Barnes (2007), HD 40307 likely has an age
similar to that of the Sun (
∼ 4.5 Gyr).
3. Statistical analyses
3.1. Statistical models
We modelled the HARPS RVs using a statistical model with a
moving average (MA) term and two additional Gaussian white
noise components consisting of two independent random variables.
The choice of anMA approach instead of binning is based on
the results of Tuomi et al. (2012b) and accounts for the fact that
uncertainties of subsequent measurements likely correlate with
one another at time-scales of an hour in an unknownmanner.We
limit our analysis to MA models of third order (MA(3) models)
because higher order choices did not improve the noise model
significantly. E
ffectively, the MA(3) component in our noise
model corresponds to binning. However, unlike when binning
measurements and artificially decreasing the size of the data set,
this approach better preserves information on possible signals in
the data. The two Gaussian components of the noise model are
the estimated instrument noise with zero-mean and known variance
(nominal uncertainties in the RVs) and another with zeromean
but unknown variance corresponding to all excess noise in
the data. The latter contains the white noise component of the
stellar surface, usually referred to as stellar “jitter”, and any additional
instrumental systematic e
ffects not accounted for in the
nominal uncertainties. Keplerian signals and white noise component
were modelled as in Tuomi et al. (2011).
In mathematical terms, an MA(
p) model is implemented on
measurement
mi as
m
i
= rk (ti) + γ + ǫi +
p
X
j
=1
φ
jhmi−j − rk(ti−j) − γi exp ti−j − ti, (1)
where
rk(ti) is the superposition of k Keplerian signals at epoch
t
i
and γ is the reference velocity. The random variable ǫi is
the Gaussian white noise component of the noise model with
zero-mean and variance
σ2 = σ2i
+
σ2J
, where
σ2i
is the (fixed)
nominal uncertainty of the
ith measurements and σ2J
is a free
parameter describing the magnitude of the jitter component.
Finally, the free parameters of the MA(
p) model are denoted as
φ
j, j = 1, ..., p – they describe the amount of correlation between
the noise of the
ith and i − jth measurements. The exponential
term in Equation (1) ensures that correlations in the noise are
modelled on the correct time-scale. Specifically, using hours as
units of time, the exponential term (that is always
< 1 because
t
i
> ti −j) vanishes in few hours.
To demonstrate the impact of binning on this data set,
Section 4.1 shows the analyses of binned RVs with the common
assumption that the excess noise is purely Gaussian. This
corresponds to using the nightly average as the individual RV
measurements and setting
φj = 0 for all j in Eq. (1).
3.2. Bayesian analyses and detection thresholds
To estimate the model parameters and, especially, their uncertainties
as reliably as possible, we drew random samples from
the parameter posterior densities using posterior sampling algorithms.
We used the adaptive Metropolis algorithm of Haario
et al. (2001) because it can be used to receive robust samples
from the posterior density of the parameter vector when applied
to models with multiple Keplerian signals (e.g. Tuomi et al.,
2011; Tuomi, 2012). This algorithmis simply a modified version
of the famous Metropolis-Hastings Markov chain Monte Carlo
(MCMC) algorithm (Metropolis et al., 1953; Hastings, 1970),
which adapts the proposal density to the information gathered
from the posterior density. We performed samplings of models
with
k = 0, 1, ...7 Keplerian signals.
The samples from the posterior densities were then used to
perform comparisons of the di
fferent models. We used the oneblock
Metropolis-Hastings (OBMH) method (Chib & Jeliazkov,
2001; Clyde et al., 2007) to calculate the relative posterior probabilities
of models with di
ffering numbers of Keplerian signals
(e.g. Tuomi & Kotiranta, 2009; Tuomi, 2011; Tuomi et al., 2011;
Tuomi, 2012). We performed several samplings using di
fferent
initial values of the parameter vector and calculated the means
and the corresponding deviations as measures of uncertainties of
our Bayesian evidence numbers
P(m|Mk), where m is the measurement
vector and
Mk denotes the model with k Keplerian
signals.
The prior probability densities in our analyses were essentially
uniform densities. As in Tuomi (2012), we adopted the
priors of the RV amplitude
π(Ki) = U(0, aRV), reference velocity
π
(γ) = U(−aRV, aRV), and jitter π(σJ) = U(0, aRV), where
U
(a, b) denotes a uniform density in the interval [a, b]. Since
the observed peak-to-peak di
fference in the raw RVs is lower
than 10 m s
−1, the hyperparameter aRV was conservatively selected
to have a value of 20 ms
−1. The priors of the longitude of
pericentre (
ω) and the mean anomaly (M0) were set to U(0, 2π),
in accordance with the choice of Ford & Gregory (2007) and
Tuomi (2012). We used the logarithm of the orbital period as
a parameter of our model because, unlike the period as such, it
is a scale-invariant parameter. The prior of this parameter was
set uniform such that the two cut-o
ff periodicities were Tmin and
T
max
. These hyperparameters were selected as Tmin = 1.0 days
and
Tmax = 10Tobs because we did not expect to find signals with
periods less than 1 day. Also, we did not limit the period space
to the length of the baseline of the HARPS time series (
Tobs), because
signals in excess of that can be detected in RV data (Tuomi
et al., 2009) and because there might be long-period signals apparent
as a trend with or without curvature in the data set.
Unlike in traditional Bayesian analyses of RV data, we did
not use uniform prior densities for the orbital eccentricities.
Instead, we used a semi-Gaussian as
π(ei) ∝ N(0, σ2e
) with the
corresponding normalisation, where the hyperparameter
σe was
chosen to have a value of 0.3. This value decreases the posterior
probabilities of very high eccentricities in practice, but still enables
themif they explain the data better than lower ones (Tuomi,
2012; Tuomi et al., 2012a).
For the MA components
φj, we selected uniform priors as
π
(φj) = U(−1, 1), for all j = 1, 2, 3. This choice was made to
ensure that theMA model was stationary, i.e. time-shift invariant
– a condition that is satisfied exactly when the values are in the
interval [-1, 1].
Finally, we did not use equal prior probabilities for the models
with di
ffering numbers of Keplerian signals. Instead, following
Tuomi (2012), we set them as
P(Mk) = 2P(Mk+1), which
means that the model with
k Keplerian signals was always twice
as probable prior to the analyses than the model with
k + 1
Keplerian signals. While this choice makes our results more robust
in the sense that a posterior probability that exceeds our
detection threshold is actually already underestimated with respect
to equal prior probabilities, there is a physical motivation
as well. We expect that the dynamical interactions of planets in
any given system make the existence of an additional planet less
probable because there are fewer dynamically stable orbits. This
also justifies the qualitative form of our prior probabilities for
the eccentricities.
Our criterion for a positive detection of
k Keplerian signals
is as follows. First, we require that the posterior probability of
a
k-Keplerian model is at least 150 times greater than that of
the
k − 1-Keplerian model (Kass & Raftery, 1995; Tuomi, 2011,
2012; Tuomi et al., 2011; Feroz et al., 2011). Second, we require
that the radial velocity amplitudes of all signals are statistically
significantly di
fferent from zero. Third, we also require that the
periods of all signals are well-constrained fromabove and below.
These criteria were also applied in Tuomi (2012).
We describe the parameter posterior densities using three
numbers, namely, the maximum
a posteriori (MAP) estimate
and the limits of the corresponding 99%Bayesian credibility sets
(BCSs) or intervals in one dimension (e.g. Tuomi & Kotiranta,
2009).
3.3. Periodogram analysis
As is traditionally the case when searching for periodic signals
in time series, we used least-squares periodograms (Lomb, 1976;
Scargle, 1982) to probe the next most significant periods left in
the data. In particular, we used the least-squares periodograms
described in Cumming (2004), which adjust for a sine wave and
an o
ffset at each test period and plot each test period against the
F-ratio statistic (or power) of the fit. While strong powers likely
indicate the existence of a periodic signal (though strong powers
may be caused by sampling-related features in the data as well),
the lack of them does not necessarily mean that there are no significant
periodicities left (e.g. Tuomi, 2012). This is especially
so in multi-Keplerian fits due to strong correlations and aliases
between clearly detected signals and yet-undetected lower amplitude
companions (e.g. Anglada-Escud´e, et al., 2010). The reason
is that residuals must necessarily be calculated with respect
to a model that is assumed to be correct, which is clearly not the
case when adding additional degrees of freedom, i.e. additional
planetary signals, to the model. Therefore, determining the reliability
of a new detection based on goodness-of-fit comparisons
is prone to biases, which e
ffectively reduces the sensitivity and
reliability of these detections (Tuomi, 2012).
While periodograms of the residuals are very useful, they do
not properly quantify the significance of the possibly remaining
periodicities and, therefore, we used them as a secondary rather
than a primary tool to assess the significance of new signals. The
analytic false-alarm probability (FAP) thresholds as derived by
Cumming (2004) are provided in the figures as a reference and
for illustrative purposes only. We used the same periodogram
tools to assess the presence of periodicities in the time series of
a few activity indicators.
4. Analysis of the RV data
The 345 measurements taken on 135 separate nights were obtained
over a baseline of
∼ 1900 days. In contrast to the discussion
in Mayor et al. (2009a), we could not confirm a long-period
trend using our new RVs and we did not detect evidence for
a trend in the new CCF RVs obtained using the HARPS-DRS.
As shown in Anglada-Escud´e & Butler (2012) (Fig. 3), changes
in the continuum flux accross each echelle order (also called
blaze function) induce RV shifts of several ms
−1 if not properly
accounted for. This e
ffect was reported to affect HARPS
measurements in Pepe (2010) and appears to have been fixed
by HARPS-DRS v3.5 based on the release notes
1 (issued on
29 October, 2010). With respect to HD 40307 in particular, we
found that when RVs were derived without blaze function correction,
a strong positive drift (
∼ 2 ma−1yr−1) was left in the
Doppler time series. We speculate that the trend reported earlier
may be caused by this blaze function variability. A notable feature
of the HD 40307 data set is that the epochs have a long gap
of 638 days between 3055 - 3693 [JD-2450000].This feature can
e
ffectively decrease the phase-coverage of the data on longer periods
and complicates the interpretation of periodograms due to
severe aliases.
4.1. Analysis of binned data
In a first quicklook analysis, we worked with the nightly averages
of the radial velocities as obtained from HARPS-TERRA
using the standard setup for K dwarfs. In this setup, all HARPS
echelle apertures are used and a cubic polynomial is fitted to
correct for the blaze function of each aperture. After this correction,
the weighted means ˆ
ve of all RV measurements within each
night
e are calculated. As a result, we obtain internal uncertainties
of the order of 0.3-0.4 ms
−1 for the HARPS-TERRA RVs.
Because of stellar and
/or instrumental systematic errors, we observed
that these individual uncertainties are not representative
of the real scatter within most nights with five or more measurements.
Using three of those nights we estimate that at least 0.6
m
s−1 must be added in quadrature to each individual uncertainty
estimate. After this, the uncertainty of a given epoch is obtained
as
σ−1
e
= PNe
i
(σei
)
−1, where the sum is calculated over all exposures
obtained during a given night. Finally, based on their longterm
monitoring of inactive stars, Pepe et al. (2011) inferred a
noise level of 0.7 ms
−1 to account for instrumental and stellar
noise. After some tests, we found that adding 0.5ms
−1 in quadrature
to the uncertainties of the nightly averages ensured that none
of the epochs had uncertainties below the 0.7 ms
−1 level. The
typical uncertainties of a single night derived this way were of
the order of 0.8 ms
−1. These corrections are basically only welleducated
guesses based on the prior experiencewith RV data and
reported stability of the instrument. Therefore, onemust be especially
careful not to over-interpret the results derived from them
(e.g., powers in periodograms and significance of the signals). In
the fully Bayesian approach, we treat the excess noise as a free
parameter of the model, therefore the Bayesian estimates of the
noise properties should in principle also be more reliable.
First, we re-analysed the nighly binned RVs to see whether
we could independently reproduce the results of Mayor et al.
1
www.eso.org/sci/facilities/lasilla/instruments/harps/tools/drs.html
5
10
15
20
0
5
10
15
20
Power
0
5
10
15
20
Power
0
Power
10 100 1000
5
10
15
20
Period [days]
0
Power
Nightly binned RVs
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
51-d
34-d
320-d
4-th signal
5-th signal
6-th signal
7-th signal
200-d
200-d
200-d
Fig. 1.
Least-squares periodograms of the binned HD 40307 radial velocities
for the residuals of the models with three (top) to six (bottom)
periodic signals. The analytic 10%, 1%, and 0.1% FAPs are shown as
horizontal lines.
(2009a) when HARPS-TERRA measurements and our Bayesian
methods were used. Assuming an unknown Gaussian noise parameter
(e.g. Tuomi et al., 2011) in addition to the estimated
measurement uncertainties, the posterior samplings and the corresponding
model probabilities easily revealed the three strong
signals corresponding to periods of 4.3, 9.6, and 20.4 days. The
residual periodogram of the three-Keplerian model revealed additional
strong periodicities exceeding the 1% FAP level (Fig.
1, top panel) and we tested more complicated models with up
to six Keplerian signals. Especially, we tested whether the additional
power present in the three-Keplerianmodel residuals (Fig.
1, top panel) at periods of 28.6, 34.8, 51.3, and 308 days, peaking
above the 10% FAP level, are statistically significant by starting
our MCMC samplings at nearby seed periods.
The global four-Keplerian solution was found to correspond
to the three previously known super-Earth signals and an additional
signal with an MAP period of 320 days. This period was
bounded from above and below and its amplitude was strictly
positive – in accordance with our detection criteria. The corresponding
posterior probability of the four-Keplerian model was
1.9
??105 times greater than that of a three-Keplerian one, making
the 320-day signal significant. In addition to this signal, we
could identify a 51-day periodicity (Fig. 1, second panel) that
satisfied the detection criteria as well. Including this fifth signal
in the model further increased the model probabilitity by a
factor of 6.6
??106. We could furthermore identify a sixth signal
with our six-Keplerian model, correponding to a period of 34.4
days. However, even though the samplings converged well and
the solution looked well-constrained, the six-Keplerian model
was only five times more probable than the five-Keplerian one
and would not be detected using our criteria in Section 3.2. Both
of the two new significant signals had MAP estimates of their
radial velocity amplitudes slightly lower than 1.0 ms
−1 – the signals
at 51 and 320 days had amplitudes of 0.70 [0.31, 1.09] ms
−1
and 0.75 [0.38, 1.12] ms
−1, respectively, where the uncertainties
are denoted using the intervals corresponding to the 99%
BCSs. We note that the periodogram of sampling does not have
strong powers at the periods we detect (see Fig. 1 in Mayor et
al., 2009a).
Given the uncertain nature of the signal at 34.4 days and
the potential loss of information when using the nightly averaged
RVs (artificial reduction of the number of measurements),
we performed a complete Bayesian reanalysis of the full dataset
(345 RVs), now including the aforementioned moving average
approach to model the velocities.
4.2. Analysis using all RV measurements
The analyses of the unbinned data immediately showed the three
previously announced signals (Mayor et al., 2009a) with periods
of 4.3, 9.6, and 20.4 days. Modelling the data with the superposition
of
k Keplerian signals and an MA(3) noise model plus the
two Gaussian white noise components, our posterior samplings
and periodogram analyses identified these signals very rapidly,
enabling us to draw statistically representative samples from the
corresponding parameter densities.
The residual periodogram of this model (three Keplerians
and MA(3) components of the noise removed) revealed some
significant powers exceeding the 0.1% and 1% FAP level at 320
and 50.8 days, respectively (Fig. 2, top panel). Samplings of the
parameter space of a four-Keplerian model indicated that the
global solution contained the 320-day periodicity as the fourth
signal and yielded a posterior probability for the four-Keplerian
model roughly 1
.5??106 times higher than for the three-Keplerian
one. The nature of this signal and its relation to the stellar activity
(Section 5) is discussed in Section 5.3.
We continued by calculating the periodogram of the residuals
of our four-Keplerian model (Fig. 2, second panel) and observed
a periodogram power that almost reached the 1% FAP
level at a period of 50.8 days. Including this fifth signal further
increased the posterior probability of our model by a factor of
6
.4 ?? 105.
The residuals of the five-Keplerian model contained a periodicity
at 34.7 days (Fig. 2, third panel) exceeding the 1% FAP
level. The corresponding six-Keplerian model with this candidate
received the highest posterior probability – roughly 5
.0??108
times higher than that of the five-Keplerian model. Since the
parameters of this sixth candidate were also well-constrained,
we conclude that including the 34.7-day signal in the statistical
model is fully justified by the data.
We also attempted to sample the parameter space of a seven-
Keplerian model but failed to find a clear probability maximum
for a seventh signal (see also the residual periodogram
of the six-Keplerian model in Fig. 2, bottom panel). Although
the periodicity of the seventh signal did not converge to a wellconstrained
probability maximum, all periodicities in the six-
Keplerian model at 4.3, 9.6, 20.4, 34.7, 50.8, and 320 days were
still well-constrained, i.e. their radial velocity amplitudes were
statistically distinguishable from zero and their periods had clear
5
10
15
20
0
5
10
15
20
Power
0
5
10
15
20
Power
0
Power
10 100 1000
5
10
15
20
Period [days]
0
Power
Full spectrum RVs
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
51-d
34-d
320-d
4-th signal
5-th signal
6-th signal
7-th signal
200-d
200-d
200-d
Fig. 2.
Least-squares periodograms of all 345 RVs of HD 40307 for the
residuals of the models with three (top) to six (bottom) periodic signals
together with the analytic 10%, 1%, and 0.1% FAPs.
parameters of a seventh signal using the posterior samplings, we
cannot be sure whether the corresponding Markov chains had
converged to the posterior density and cannot reliably calculate
an estimate for the posterior probability of the seven-Keplerian
model. Therefore we stopped looking for additional signals.
From this analysis, we can state confidently that there are
six significant periodicities in the HARPS-TERRA radial velocities
of HD 40307 when the whole spectral range of HARPS is
used. As we show in the next section, one of them has the same
period as the chromospheric activity indicator (S-index) and requires
more detailed investigation. The analysis of all 345 RVs
indicates that for these data binning appears to be a retrograde
step in extracting periodic signals from the RV data. We infer
that binning serves to alter measurement uncertainties and damp
the significance levels of the periodicities in the data.
probability maxima. Because we were unable to constrain the
5. Stellar activity
We examined the time series of two activity indicators derived
from the cross correlation function properties as provided by the
HARPS-DRS. They are the bisector span (BIS) and the fullwidth
at half-maximum (FWHM) of the CCF. These indices
monitor di
fferent features of the average stellar line. Briefly, BIS
is a measure of the stellar line asymmetry and should correlate
with the RVs if the observed o
ffsets are caused by spots or
plages rotating with the star (Queloz et al., 2001). The FWHM
is a measure of the mean spectral line width. Its variability
(when not instrumental) is usually associated with changes in
the convective patterns on the surface of the star. A third index,
the so-called S-index in theMountWilson system (Baliunas
et al., 1995), is automatically measured by HARPS-TERRA
on the blaze-corrected one-dimensional spectra provided by the
HARPS-DRS. The S-index is a measure of the flux of the CaII
H and K lines (
λH = 3933.664 Å and λK = 3968.470 Å,
respectively) relative to a locally defined continuum (Lovis et
al., 2011a) and is an indirect measurement of the total chromospheric
activity of the star. For simplicity, the analysis of the
activity indicators was performed throughout for the 135 nightly
averaged values using sequential least-squares fitting of periodic
signals that are each described by a sine-wave model (period,
amplitude and phase).
5.1. Analysis of the FWHM and BIS
The BIS was remarkably stable (RMS
∼ 0.5 ms−1) and the periodogram
of its time series did not show any significant powers.
Visual inspection of the time series for the FWHM already
shows a very significant trend of 5.3 ms
−1 yr−1. The 345 measurements
of the FWHM are listed in Table A.2. A sinusoidal
fit to this trend suggested a period of 5000 days or more (see
top panel in Fig. 3). After removing the trend, two more signals
strongly show up in the residuals. The first one was found at
23 days and had an analytic FAP of 0.005%. After fitting a sinusoid
to this signal and calculating the residuals, an extremely
significant peak appeared at 1170 days with an analytic FAP of
0.002%. After including this in a model with three sinusoids, no
additional signals could be seen in the periodogram of the residuals
with analytic FAP estimates lower than 10% (Fig. 3, bottom
panel). We also show the FWHM values together with the fitted
periodic curves in Fig. 4. While the signals in the FWHM were
significant, we did not clearly detect their counterparts in the
RVs. Given that BIS does not show any obvious signals either,
we suspect that the periodicities in the FWHM might be caused
by instrumental e
ffects, e.g. tiny changes of the focus inside the
spectrograph, rather than intrinsic variability of the stellar lines.
The instrumental origin would reconcile the absence of correspondent
drifts in the BIS and in the RVs. A similar indication
of drifts and sensitivity of the FWHM to instrumental issues has
been reported in e.g. Lovis et al. (2011a).While this adds some
caveats on the long-term stability of the HARPS instrumental
profile (and therefore its long-term precision), unless it is found
to have similar periods, we see no reason to suspect that any of
the signals in the RVs are spuriously induced by changes in the
FWHM (intrinsic or instrumental).
5.2. Analysis of the S-index
of 345 measurements of the S-index are provided in Table A.2.
Again, this analysis was performed for the nightly binned measurements
using least-squares periodograms and the sequential
inclusion of sinusoidal signals. The last two S-index measurements
were well above the average and could not be reproduced
by any smooth function. Even if they are representative
of a physical process, such outlying points cannot be easily
modelled by a series of a few sinusoids because these would
add many ambiguities to the interpretation of the results. When
these two points were removed, the periodograms looked much
The S-index also shows strong coherent variablity. The full set
5
10
15
20
25
30
0
5
10
15
20
25
30
Power
0
5
10
15
20
25
30
Power
0
Power
10 100 1000
5
10
15
20
25
30
Period [days]
0
Power
FWHM
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
5000+ d
23-d
1170-d
1st signal
2nd signal
3rd signal
4th signal
Fig. 3.
Periodogram series of the signals detected in the FWHM, from
most significant to less significant (top to bottom).
53000 53500 54000 54500 55000
-20
-10
0
10
20
30
JD-2400000 [days]
-30
FWHM - 5910 [m s
-1]
Fig. 4.
Habitable-zone super-Earth candidate in a six-planet system
around the K2.5V star HD 40307
Time series of the FWHM activity index. The solid black line
represents the best fit to a model containing three sinusoids (periods of
Mikko Tuomi
⋆1,2, Guillem Anglada-Escud´e⋆⋆3, Enrico Gerlach4, Hugh R. A. Jones1, Ansgar Reiners3,
Eugenio J. Rivera5, Steven S. Vogt5, and R. Paul Butler6
1
University of Hertfordshire, Centre for Astrophysics Research, Science and Technology Research Institute, College Lane, AL10
9AB, Hatfield, UK
2
University of Turku, Tuorla Observatory, Department of Physics and Astronomy, V¨ais¨al¨antie 20, FI-21500, Piikki¨o, Finland
3
Universit¨at G¨ottingen, Institut f¨ur Astrophysik, Friedrich-Hund-Platz 1, 37077 G¨ottingen, Germany
4
Lohrmann Observatory, Technical University Dresden, D-01062 Dresden, Germany
5
UCO/Lick Observatory, Department of Astronomy and Astrophysics, University of California at Santa Cruz, Santa Cruz, CA 95064,
USA
ABSTRACT
Context.
Aims.
Methods.
6
Department of Terrestrial Magnetism, Carnegie Institute of Washington, Washington, DC 20015, USA
Received XX.XX.2012 / Accepted XX.XX.XXXX
The K2.5 dwarf HD 40307 has been reported to host three super-Earths. The system lacks massive planets and is therefore
a potential candidate for having additional low-mass planetary companions.
We re-derive Doppler measurements from public HARPS spectra of HD 40307 to confirm the significance of the reported
signals using independent data analysis methods. We also investigate these measurements for additional low-amplitude signals.
We used Bayesian analysis of our radial velocities to estimate the probability densities of different model parameters. We
also estimated the relative probabilities of models with differing numbers of Keplerian signals and verified their significance using
periodogram analyses. We investigated the relation of the detected signals with the chromospheric emission of the star. As previously
reported for other objects, we found that radial velocity signals correlated with the S-index are strongly wavelength dependent.
Results.
Conclusions.
We identify two additional clear signals with periods of 34 and 51 days, both corresponding to planet candidates with
minimum masses a few times that of the Earth. An additional sixth candidate is initially found at a period of 320 days. However, this
signal correlates strongly with the chromospheric emission from the star and is also strongly wavelength dependent. When analysing
the red half of the spectra only, the five putative planetary signals are recovered together with a very significant periodicity at about 200
days. This signal has a similar amplitude as the other new signals reported in the current work and corresponds to a planet candidate
with M sin i ∼ 7 M⊕ (HD 40307 g).
We show that Doppler measurements can be filtered for activity-induced signals if enough photons and a sufficient
wavelength interval are available. If the signal corresponding to HD 40307 g is a genuine Doppler signal of planetary origin, this
candidate planet might be capable of supporting liquid water on its surface according to the current definition of the liquid water
habitable zone around a star and is not likely to su ffer from tidal locking. Also, at an angular separation of ∼ 46 mas, HD 40307 g
would be a primary target for a future space-based direct-imaging mission.
Key words.
1. Introduction
Methods: Statistical, Numerical – Techniques: Radial velocities – Stars: Individual: HD 40307
Current high-precision spectrographs, such as the High
Accuracy Radial Velocity Planet Searcher (HARPS; Mayor
et al., 2003) and the High Resolution Echelle Spectrograph
(HIRES; Vogt et al., 1994), enable detections of low-mass planets
orbiting nearby stars. During recent years, radial velocity
(RV) planet searches have revealed several systems of super-
Earths and
/or Neptune-mass planets around nearby stars (e.g.
Mayor et al., 2009a,b; Lovis et al., 2011a; Pepe et al., 2011;
Tuomi, 2012).
The system of three super-Earths orbiting HD 40307 has received
much attention because the planets appear in dynamically
packed orbits close to mean motion resonances (Mayor et al.,
2009a). This has been used as an argument to suggest that lowmass
planets may be found in highly compact multiple systems
⋆
⋆⋆
that are still stable in long-term, e.g. a possibility of having ten
planets with masses of 17 M
⊕ within a distance of 0.26 AU on
stable orbits (Funk et al., 2010). However, the physical nature of
these companions as scaled-up versions of the Earths is not entirely
clear (Barnes et al., 2009a). Their masses, between those
of Earth and Neptune, suggest they are Neptune-like proto-gas
giants that could not accumulate enough gas before it was blown
away by the newly born star. On the other hand, recent transit observations
of hot super-Earths around bright nearby stars (L´eger
et al., 2009; Batalha et al., 2011;Winn et al., 2011) indicate that
a good fraction of these hot super-Earth mass objects can have
rocky compositions.
In this article we re-analyse the 345 HARPS spectra publicly
available through the ESO archive using a newly developed software
tool called HARPS-TERRA (template-enhanced radial velocity
re-analysis application; Anglada-Escud´e& Butler, 2012).
Instead of the classic cross-correlation function method (CCF)
implemented by the standard HARPS-ESO data reduction software
(HARPS-DRS), we derive Dopplermeasurements by leastsquares
matching of each observed spectrum to a high signal-tonoise
ratio template built from the same observations. A description
of the method and the implementation details are given in
Anglada-Escud´e & Butler (2012). In addition to an increase in
precision (especially for K andMdwarfs), this method allows us
to performadditional analyses and tests beyond those enabled by
the CCF data products provided by the HARPS-DRS. As an example,
it allows us to re-obtain the RV measurements using only
a restricted wavelength range. As we show in Section 5, this capability
can be instrumental in ruling out the planetary nature of
prominent signals correlated with stellar activity.
We rely on the Bayesian framework when estimating the orbital
parameters supported by the data, determining the significances
of the signals, and the modelling of the noise in the measurements.
In previous studies, radial velocities received within
an interval of an hour or so have been commonly binned together
in an attempt to reduce the noise caused by stellar- surfacerelated
e ffects, i.e. stellar oscillations and granulation, and other
factors within this timescale (Dumusque et al., 2010). In principle,
this would enable the detections of planets smaller than
roughly 5 M ⊕ with HARPS over a variety of orbital distances,
even at or near the stellar habitable zone (Dumusque et al., 2010,
2011a). In our approach, and instead of binning, we apply a selfconsistent
scheme to account for and quantify correlated noise
in the Bayesian framework and use Bayesian model probabilities
to show that a solution containing up to six planets is clearly
favoured by the data, especially when the redmost part of the
stellar spectrum is used in the RV analysis. Only the confluence
of refinements in these data analysis methods (re-analysis of the
spectra and Bayesian inference) allows the detection and verification
of these low-amplitude signals.
We start with a brief description of the stellar properties of
HD 40307 (Section 2) and describe the statistical modelling
of the observations and the data analysis techniques we used
(Section 3). In Section 4 we describe the properties of the RV
measurements and perform a detailed Bayesian analysis that
identifies up to three new candidate signals. We discuss the stellar
activity indicators and their possible correlations with the RV
signals in Section 5. In this same section, we find that one of the
candidates is spuriously induced by stellar activity by showing
that the corresponding periodic signal (P ∼ 320 days) is strongly
wavelength dependent. When the RVs obtained on the redmost
part of the spectrum are analysed (Section 6), the 320-day signal
is replaced by a signal of a super-Earth-mass candidate with a
200-day period with a minimum mass of about ∼ 7M⊕ orbiting
within the liquid water habitable zone of HD 40307. The analysis
of the dynamical stability of the system (Section 7) shows
that stable solutions compatible with the data are feasible and the
potential habitability of the candidate at 200 days (HD 40307 g)
is discussed in Section 8.We give some concluding remarks and
discuss the prospects of future work in Section 9.
2. Stellar properties of HD 40307
We list the basic stellar properties of HD 40307 in Table 1. This
K2.5 V star is a nearby dwarf with a Hipparcos parallax of 77.95
??
0.53 mas, which implies a distance of 12.83 ?? 0.09 pc. It is
somewhat smaller (M ⋆ = 0.77 ?? 0.05 M⊕; Sousa et al., 2008)
and less luminous (log L star/L⊙ = −0.639 ?? 0.060; Ghezzi et
al., 2010) than the Sun. The star is quiescent (log R′
HK
< −4.99;
Mayor et al., 2009a) and relatively metal-poor with [Fe /H] = -
0.31 ??0.03 (Sousa et al., 2008). It also lacks massive planetary
companions, which makes it an ideal target for high-precision
Table 1.
Stellar properties of HD 40307.
Parameter Estimate Reference
Spectral Type K2.5 V Gray et al. (2006)
log R′
HK
-4.99 Mayor et al. (2009a)
π
v
P
[mas] 77.95??0.53 van Leeuwen (2007)
log Lstar/L⊙ -0.639??0.060 Ghezzi et al. (2010)
log g 4.47??0.16 Sousa et al. (2008)
M star [M⊙] 0.77??0.05 Sousa et al. (2008)
T eff [K] 4956??50 Ghezzi et al. (2010)
[Fe /H] -0.31??0.03 Sousa et al. (2008)
sin i [kms−1] <1 Mayor et al. (2009a)
rot [days] ∼ 48 Mayor et al. (2009a)
Age [Gyr] ∼ 4.5 Barnes (2007)
3. Statistical analyses
3.1. Statistical models
m
i
RV surveys aiming at finding low-mass planets. According to
the calibration of Barnes (2007), HD 40307 likely has an age
similar to that of the Sun (
∼ 4.5 Gyr).
We modelled the HARPS RVs using a statistical model with a
moving average (MA) term and two additional Gaussian white
noise components consisting of two independent random variables.
The choice of anMA approach instead of binning is based on
the results of Tuomi et al. (2012b) and accounts for the fact that
uncertainties of subsequent measurements likely correlate with
one another at time-scales of an hour in an unknownmanner.We
limit our analysis to MA models of third order (MA(3) models)
because higher order choices did not improve the noise model
significantly. E
ffectively, the MA(3) component in our noise
model corresponds to binning. However, unlike when binning
measurements and artificially decreasing the size of the data set,
this approach better preserves information on possible signals in
the data. The two Gaussian components of the noise model are
the estimated instrument noise with zero-mean and known variance
(nominal uncertainties in the RVs) and another with zeromean
but unknown variance corresponding to all excess noise in
the data. The latter contains the white noise component of the
stellar surface, usually referred to as stellar “jitter”, and any additional
instrumental systematic e ffects not accounted for in the
nominal uncertainties. Keplerian signals and white noise component
were modelled as in Tuomi et al. (2011).
In mathematical terms, an MA( p) model is implemented on
measurement mi as
= rk(ti) + γ + ǫi +
p
X
j
=1
φ
jhmi−j − rk(ti−j) − γi exp ti−j − ti, (1)
where rk(ti) is the superposition of k Keplerian signals at epoch
t
i
+
and γ is the reference velocity. The random variable ǫi is
the Gaussian white noise component of the noise model with
zero-mean and variance σ2 = σ2i
σ2J
, where
σ2i
is the (fixed)
nominal uncertainty of the
ith measurements and σ2J
is a free
parameter describing the magnitude of the jitter component.
Finally, the free parameters of the MA(
p) model are denoted as
φ
t
i
j, j = 1, ..., p – they describe the amount of correlation between
the noise of the ith and i − jth measurements. The exponential
term in Equation (1) ensures that correlations in the noise are
modelled on the correct time-scale. Specifically, using hours as
units of time, the exponential term (that is always < 1 because
> ti−j) vanishes in few hours.
To demonstrate the impact of binning on this data set,
Section 4.1 shows the analyses of binned RVs with the common
assumption that the excess noise is purely Gaussian. This
corresponds to using the nightly average as the individual RV
measurements and setting φj = 0 for all j in Eq. (1).
3.2. Bayesian analyses and detection thresholds
To estimate the model parameters and, especially, their uncertainties
as reliably as possible, we drew random samples from
the parameter posterior densities using posterior sampling algorithms.
We used the adaptive Metropolis algorithm of Haario
et al. (2001) because it can be used to receive robust samples
from the posterior density of the parameter vector when applied
to models with multiple Keplerian signals (e.g. Tuomi et al.,
2011; Tuomi, 2012). This algorithmis simply a modified version
of the famous Metropolis-Hastings Markov chain Monte Carlo
(MCMC) algorithm (Metropolis et al., 1953; Hastings, 1970),
which adapts the proposal density to the information gathered
from the posterior density. We performed samplings of models
with
k = 0, 1, ...7 Keplerian signals.
The samples from the posterior densities were then used to
perform comparisons of the di fferent models. We used the oneblock
Metropolis-Hastings (OBMH) method (Chib & Jeliazkov,
2001; Clyde et al., 2007) to calculate the relative posterior probabilities
of models with di ffering numbers of Keplerian signals
(e.g. Tuomi & Kotiranta, 2009; Tuomi, 2011; Tuomi et al., 2011;
Tuomi, 2012). We performed several samplings using di fferent
initial values of the parameter vector and calculated the means
and the corresponding deviations as measures of uncertainties of
our Bayesian evidence numbers P(m|Mk), where m is the measurement
vector and Mk denotes the model with k Keplerian
signals.
The prior probability densities in our analyses were essentially
uniform densities. As in Tuomi (2012), we adopted the
priors of the RV amplitude π(Ki) = U(0, aRV), reference velocity
π
U
(γ) = U(−aRV, aRV), and jitter π(σJ) = U(0, aRV), where
(a, b) denotes a uniform density in the interval [a, b]. Since
the observed peak-to-peak difference in the raw RVs is lower
than 10 m s −1, the hyperparameter aRV was conservatively selected
to have a value of 20 ms −1. The priors of the longitude of
pericentre ( ω) and the mean anomaly (M0) were set to U(0, 2π),
in accordance with the choice of Ford & Gregory (2007) and
Tuomi (2012). We used the logarithm of the orbital period as
a parameter of our model because, unlike the period as such, it
is a scale-invariant parameter. The prior of this parameter was
set uniform such that the two cut-o ff periodicities were Tmin and
T
max
. These hyperparameters were selected as Tmin = 1.0 days
and Tmax = 10Tobs because we did not expect to find signals with
periods less than 1 day. Also, we did not limit the period space
to the length of the baseline of the HARPS time series ( Tobs), because
signals in excess of that can be detected in RV data (Tuomi
et al., 2009) and because there might be long-period signals apparent
as a trend with or without curvature in the data set.
Unlike in traditional Bayesian analyses of RV data, we did
not use uniform prior densities for the orbital eccentricities.
Instead, we used a semi-Gaussian as π(ei) ∝ N(0, σ2e
) with the
corresponding normalisation, where the hyperparameter
σe was
chosen to have a value of 0.3. This value decreases the posterior
probabilities of very high eccentricities in practice, but still enables
themif they explain the data better than lower ones (Tuomi,
2012; Tuomi et al., 2012a).
For the MA components φj, we selected uniform priors as
π
3.3. Periodogram analysis
(φj) = U(−1, 1), for all j = 1, 2, 3. This choice was made to
ensure that theMA model was stationary, i.e. time-shift invariant
– a condition that is satisfied exactly when the values are in the
interval [-1, 1].
Finally, we did not use equal prior probabilities for the models
with di ffering numbers of Keplerian signals. Instead, following
Tuomi (2012), we set them as P(Mk) = 2P(Mk+1), which
means that the model with k Keplerian signals was always twice
as probable prior to the analyses than the model with k + 1
Keplerian signals. While this choice makes our results more robust
in the sense that a posterior probability that exceeds our
detection threshold is actually already underestimated with respect
to equal prior probabilities, there is a physical motivation
as well. We expect that the dynamical interactions of planets in
any given system make the existence of an additional planet less
probable because there are fewer dynamically stable orbits. This
also justifies the qualitative form of our prior probabilities for
the eccentricities.
Our criterion for a positive detection of k Keplerian signals
is as follows. First, we require that the posterior probability of
a k-Keplerian model is at least 150 times greater than that of
the k − 1-Keplerian model (Kass & Raftery, 1995; Tuomi, 2011,
2012; Tuomi et al., 2011; Feroz et al., 2011). Second, we require
that the radial velocity amplitudes of all signals are statistically
significantly di fferent from zero. Third, we also require that the
periods of all signals are well-constrained fromabove and below.
These criteria were also applied in Tuomi (2012).
We describe the parameter posterior densities using three
numbers, namely, the maximum a posteriori (MAP) estimate
and the limits of the corresponding 99%Bayesian credibility sets
(BCSs) or intervals in one dimension (e.g. Tuomi & Kotiranta,
2009).
As is traditionally the case when searching for periodic signals
in time series, we used least-squares periodograms (Lomb, 1976;
Scargle, 1982) to probe the next most significant periods left in
the data. In particular, we used the least-squares periodograms
described in Cumming (2004), which adjust for a sine wave and
an o
ffset at each test period and plot each test period against the
F-ratio statistic (or power) of the fit. While strong powers likely
indicate the existence of a periodic signal (though strong powers
may be caused by sampling-related features in the data as well),
the lack of them does not necessarily mean that there are no significant
periodicities left (e.g. Tuomi, 2012). This is especially
so in multi-Keplerian fits due to strong correlations and aliases
between clearly detected signals and yet-undetected lower amplitude
companions (e.g. Anglada-Escud´e, et al., 2010). The reason
is that residuals must necessarily be calculated with respect
to a model that is assumed to be correct, which is clearly not the
case when adding additional degrees of freedom, i.e. additional
planetary signals, to the model. Therefore, determining the reliability
of a new detection based on goodness-of-fit comparisons
is prone to biases, which e ffectively reduces the sensitivity and
reliability of these detections (Tuomi, 2012).
While periodograms of the residuals are very useful, they do
not properly quantify the significance of the possibly remaining
periodicities and, therefore, we used them as a secondary rather
than a primary tool to assess the significance of new signals. The
analytic false-alarm probability (FAP) thresholds as derived by
Cumming (2004) are provided in the figures as a reference and
for illustrative purposes only. We used the same periodogram
tools to assess the presence of periodicities in the time series of
a few activity indicators.
4. Analysis of the RV data
The 345 measurements taken on 135 separate nights were obtained
over a baseline of
∼ 1900 days. In contrast to the discussion
in Mayor et al. (2009a), we could not confirm a long-period
trend using our new RVs and we did not detect evidence for
a trend in the new CCF RVs obtained using the HARPS-DRS.
As shown in Anglada-Escud´e & Butler (2012) (Fig. 3), changes
in the continuum flux accross each echelle order (also called
blaze function) induce RV shifts of several ms −1 if not properly
accounted for. This e ffect was reported to affect HARPS
measurements in Pepe (2010) and appears to have been fixed
by HARPS-DRS v3.5 based on the release notes 1 (issued on
29 October, 2010). With respect to HD 40307 in particular, we
found that when RVs were derived without blaze function correction,
a strong positive drift ( ∼ 2 ma−1yr−1) was left in the
Doppler time series. We speculate that the trend reported earlier
may be caused by this blaze function variability. A notable feature
of the HD 40307 data set is that the epochs have a long gap
of 638 days between 3055 - 3693 [JD-2450000].This feature can
e ffectively decrease the phase-coverage of the data on longer periods
and complicates the interpretation of periodograms due to
severe aliases.
4.1. Analysis of binned data
e
i
In a first quicklook analysis, we worked with the nightly averages
of the radial velocities as obtained from HARPS-TERRA
using the standard setup for K dwarfs. In this setup, all HARPS
echelle apertures are used and a cubic polynomial is fitted to
correct for the blaze function of each aperture. After this correction,
the weighted means ˆ
ve of all RV measurements within each
night e are calculated. As a result, we obtain internal uncertainties
of the order of 0.3-0.4 ms −1 for the HARPS-TERRA RVs.
Because of stellar and /or instrumental systematic errors, we observed
that these individual uncertainties are not representative
of the real scatter within most nights with five or more measurements.
Using three of those nights we estimate that at least 0.6
m s−1 must be added in quadrature to each individual uncertainty
estimate. After this, the uncertainty of a given epoch is obtained
as σ−1
= PNe
(σei
)
−1, where the sum is calculated over all exposures
obtained during a given night. Finally, based on their longterm
monitoring of inactive stars, Pepe et al. (2011) inferred a
noise level of 0.7 ms −1 to account for instrumental and stellar
noise. After some tests, we found that adding 0.5ms −1 in quadrature
to the uncertainties of the nightly averages ensured that none
of the epochs had uncertainties below the 0.7 ms −1 level. The
typical uncertainties of a single night derived this way were of
the order of 0.8 ms −1. These corrections are basically only welleducated
guesses based on the prior experiencewith RV data and
reported stability of the instrument. Therefore, onemust be especially
careful not to over-interpret the results derived from them
(e.g., powers in periodograms and significance of the signals). In
the fully Bayesian approach, we treat the excess noise as a free
parameter of the model, therefore the Bayesian estimates of the
noise properties should in principle also be more reliable.
First, we re-analysed the nighly binned RVs to see whether
we could independently reproduce the results of Mayor et al.
1
www.eso.org/sci/facilities/lasilla/instruments/harps/tools/drs.html
[www.eso.org]
0
5
10
15
20
Power
0
5
10
15
20
Power
0
5
10
15
20
Power
10 100 1000
Period [days]
0
5
10
15
20
Power
Nightly binned RVs
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
320-d
51-d
34-d
4-th signal
5-th signal
6-th signal
7-th signal
200-d
200-d
200-d
Fig. 1.
Least-squares periodograms of the binned HD 40307 radial velocities
for the residuals of the models with three (top) to six (bottom)
periodic signals. The analytic 10%, 1%, and 0.1% FAPs are shown as
horizontal lines.
(2009a) when HARPS-TERRA measurements and our Bayesian
methods were used. Assuming an unknown Gaussian noise parameter
(e.g. Tuomi et al., 2011) in addition to the estimated
measurement uncertainties, the posterior samplings and the corresponding
model probabilities easily revealed the three strong
signals corresponding to periods of 4.3, 9.6, and 20.4 days. The
residual periodogram of the three-Keplerian model revealed additional
strong periodicities exceeding the 1% FAP level (Fig.
1, top panel) and we tested more complicated models with up
to six Keplerian signals. Especially, we tested whether the additional
power present in the three-Keplerianmodel residuals (Fig.
1, top panel) at periods of 28.6, 34.8, 51.3, and 308 days, peaking
above the 10% FAP level, are statistically significant by starting
our MCMC samplings at nearby seed periods.
The global four-Keplerian solution was found to correspond
to the three previously known super-Earth signals and an additional
signal with an MAP period of 320 days. This period was
bounded from above and below and its amplitude was strictly
positive – in accordance with our detection criteria. The corresponding
posterior probability of the four-Keplerian model was
1.9
??105 times greater than that of a three-Keplerian one, making
the 320-day signal significant. In addition to this signal, we
could identify a 51-day periodicity (Fig. 1, second panel) that
satisfied the detection criteria as well. Including this fifth signal
in the model further increased the model probabilitity by a
factor of 6.6 ??106. We could furthermore identify a sixth signal
with our six-Keplerian model, correponding to a period of 34.4
days. However, even though the samplings converged well and
the solution looked well-constrained, the six-Keplerian model
was only five times more probable than the five-Keplerian one
and would not be detected using our criteria in Section 3.2. Both
of the two new significant signals had MAP estimates of their
radial velocity amplitudes slightly lower than 1.0 ms −1 – the signals
at 51 and 320 days had amplitudes of 0.70 [0.31, 1.09] ms −1
and 0.75 [0.38, 1.12] ms
−1, respectively, where the uncertainties
are denoted using the intervals corresponding to the 99%
BCSs. We note that the periodogram of sampling does not have
strong powers at the periods we detect (see Fig. 1 in Mayor et
al., 2009a).
Given the uncertain nature of the signal at 34.4 days and
the potential loss of information when using the nightly averaged
RVs (artificial reduction of the number of measurements),
we performed a complete Bayesian reanalysis of the full dataset
(345 RVs), now including the aforementioned moving average
approach to model the velocities.
4.2. Analysis using all RV measurements
The analyses of the unbinned data immediately showed the three
previously announced signals (Mayor et al., 2009a) with periods
of 4.3, 9.6, and 20.4 days. Modelling the data with the superposition
of
k Keplerian signals and an MA(3) noise model plus the
two Gaussian white noise components, our posterior samplings
and periodogram analyses identified these signals very rapidly,
enabling us to draw statistically representative samples from the
corresponding parameter densities.
The residual periodogram of this model (three Keplerians
and MA(3) components of the noise removed) revealed some
significant powers exceeding the 0.1% and 1% FAP level at 320
and 50.8 days, respectively (Fig. 2, top panel). Samplings of the
parameter space of a four-Keplerian model indicated that the
global solution contained the 320-day periodicity as the fourth
signal and yielded a posterior probability for the four-Keplerian
model roughly 1 .5??106 times higher than for the three-Keplerian
one. The nature of this signal and its relation to the stellar activity
(Section 5) is discussed in Section 5.3.
We continued by calculating the periodogram of the residuals
of our four-Keplerian model (Fig. 2, second panel) and observed
a periodogram power that almost reached the 1% FAP
level at a period of 50.8 days. Including this fifth signal further
increased the posterior probability of our model by a factor of
6 .4 ?? 105.
The residuals of the five-Keplerian model contained a periodicity
at 34.7 days (Fig. 2, third panel) exceeding the 1% FAP
level. The corresponding six-Keplerian model with this candidate
received the highest posterior probability – roughly 5 .0??108
0
5
10
15
20
Power
0
5
10
15
20
Power
0
5
10
15
20
Power
10 100 1000
Period [days]
0
5
10
15
20
Power
Full spectrum RVs
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
320-d
51-d
34-d
4-th signal
5-th signal
6-th signal
7-th signal
200-d
200-d
200-d
Fig. 2.
probability maxima. Because we were unable to constrain the
parameters of a seventh signal using the posterior samplings, we
cannot be sure whether the corresponding Markov chains had
converged to the posterior density and cannot reliably calculate
an estimate for the posterior probability of the seven-Keplerian
model. Therefore we stopped looking for additional signals.
From this analysis, we can state confidently that there are
six significant periodicities in the HARPS-TERRA radial velocities
of HD 40307 when the whole spectral range of HARPS is
used. As we show in the next section, one of them has the same
period as the chromospheric activity indicator (S-index) and requires
more detailed investigation. The analysis of all 345 RVs
indicates that for these data binning appears to be a retrograde
step in extracting periodic signals from the RV data. We infer
that binning serves to alter measurement uncertainties and damp
the significance levels of the periodicities in the data.
5. Stellar activity
times higher than that of the five-Keplerian model. Since the
parameters of this sixth candidate were also well-constrained,
we conclude that including the 34.7-day signal in the statistical
model is fully justified by the data.
We also attempted to sample the parameter space of a seven-
Keplerian model but failed to find a clear probability maximum
for a seventh signal (see also the residual periodogram
of the six-Keplerian model in Fig. 2, bottom panel). Although
the periodicity of the seventh signal did not converge to a wellconstrained
probability maximum, all periodicities in the six-
Keplerian model at 4.3, 9.6, 20.4, 34.7, 50.8, and 320 days were
still well-constrained, i.e. their radial velocity amplitudes were
statistically distinguishable from zero and their periods had clear
Least-squares periodograms of all 345 RVs of HD 40307 for the
residuals of the models with three (top) to six (bottom) periodic signals
together with the analytic 10%, 1%, and 0.1% FAPs.
We examined the time series of two activity indicators derived
from the cross correlation function properties as provided by the
HARPS-DRS. They are the bisector span (BIS) and the fullwidth
at half-maximum (FWHM) of the CCF. These indices
monitor di
fferent features of the average stellar line. Briefly, BIS
is a measure of the stellar line asymmetry and should correlate
with the RVs if the observed o ffsets are caused by spots or
plages rotating with the star (Queloz et al., 2001). The FWHM
is a measure of the mean spectral line width. Its variability
(when not instrumental) is usually associated with changes in
the convective patterns on the surface of the star. A third index,
the so-called S-index in theMountWilson system (Baliunas
et al., 1995), is automatically measured by HARPS-TERRA
on the blaze-corrected one-dimensional spectra provided by the
HARPS-DRS. The S-index is a measure of the flux of the CaII
H and K lines ( λH = 3933.664 Å and λK = 3968.470 Å,
respectively) relative to a locally defined continuum (Lovis et
al., 2011a) and is an indirect measurement of the total chromospheric
activity of the star. For simplicity, the analysis of the
activity indicators was performed throughout for the 135 nightly
averaged values using sequential least-squares fitting of periodic
signals that are each described by a sine-wave model (period,
amplitude and phase).
5.1. Analysis of the FWHM and BIS
The BIS was remarkably stable (RMS
∼ 0.5 ms−1) and the periodogram
of its time series did not show any significant powers.
Visual inspection of the time series for the FWHM already
shows a very significant trend of 5.3 ms −1 yr−1. The 345 measurements
of the FWHM are listed in Table A.2. A sinusoidal
fit to this trend suggested a period of 5000 days or more (see
top panel in Fig. 3). After removing the trend, two more signals
strongly show up in the residuals. The first one was found at
23 days and had an analytic FAP of 0.005%. After fitting a sinusoid
to this signal and calculating the residuals, an extremely
significant peak appeared at 1170 days with an analytic FAP of
0.002%. After including this in a model with three sinusoids, no
additional signals could be seen in the periodogram of the residuals
with analytic FAP estimates lower than 10% (Fig. 3, bottom
panel). We also show the FWHM values together with the fitted
periodic curves in Fig. 4. While the signals in the FWHM were
significant, we did not clearly detect their counterparts in the
RVs. Given that BIS does not show any obvious signals either,
we suspect that the periodicities in the FWHM might be caused
by instrumental e ffects, e.g. tiny changes of the focus inside the
spectrograph, rather than intrinsic variability of the stellar lines.
The instrumental origin would reconcile the absence of correspondent
drifts in the BIS and in the RVs. A similar indication
of drifts and sensitivity of the FWHM to instrumental issues has
been reported in e.g. Lovis et al. (2011a).While this adds some
caveats on the long-term stability of the HARPS instrumental
profile (and therefore its long-term precision), unless it is found
to have similar periods, we see no reason to suspect that any of
the signals in the RVs are spuriously induced by changes in the
FWHM (intrinsic or instrumental).
5.2. Analysis of the S-index
The S-index also shows strong coherent variablity. The full set
of 345 measurements of the S-index are provided in Table A.2.
Again, this analysis was performed for the nightly binned measurements
using least-squares periodograms and the sequential
inclusion of sinusoidal signals. The last two S-index measurements
were well above the average and could not be reproduced
by any smooth function. Even if they are representative
of a physical process, such outlying points cannot be easily
modelled by a series of a few sinusoids because these would
add many ambiguities to the interpretation of the results. When
these two points were removed, the periodograms looked much
0
5
10
15
20
25
30
Power
0
5
10
15
20
25
30
Power
0
5
10
15
20
25
30
Power
10 100 1000
Period [days]
0
5
10
15
20
25
30
Power
FWHM
10%
1%
0.1%
10%
10%
10%
1%
1%
1%
0.1%
0.1%
0.1%
5000+ d
23-d
1170-d
1st signal
2nd signal
3rd signal
4th signal
Fig. 3.
Periodogram series of the signals detected in the FWHM, from
most significant to less significant (top to bottom).
53000 53500 54000 54500 55000
JD-2400000 [days]
-30
-20
-10
0
10
20
30
FWHM - 5910 [m s
-1]
Fig. 4.
Time series of the FWHM activity index. The solid black line
represents the best fit to a model containing three sinusoids (periods of
.