moult-likelihoods.Rmd
The moultmcmc
package implements the regression models
outlined in Les G. Underhill and Zucchini
(1988) and L. G. Underhill, Zucchini, and
Summers (1990). In their notation1, samples consist of
pre-moult birds,
birds in active moult, and
post-moult birds. Birds in each category are observed on days
;
;
,
respectively. Moult scores for actively moulting birds, where available,
are encoded as
.
Each moult state has a probability of occurrence
Further, assuming a linear progression
of the moult indices over time, the probability density of a particular
moult score at time
is
In moultmcmc
the
unobserved start date of the study population is assumed to follow a
normal distribution with mean
and standard deviation
,
such that
where
is the standard normal distribution function and
.
We further assume that has parameters and for convenience the start date , duration , and population standard deviation of moult will be elements of .
Type 1 data consist of observations of categorical moult state (pre-moult, active moult, post-moult) and sampling is representative in all three categories. The likelihood of these observations is
Type 2 data consist of observations of birds in all three moult states (pre-moult, active moult, post-moult). Sampling is representative for all three categories, and for actively moulting birds a sufficiently linear moult index (e.g. percent feather mass grown) is known. The likelihood of these observations is where
Lumped type 2 data consist of observations of birds in all three moult states (pre-moult, active moult, post-moult), but where the pre-moult and post-moult states cannot be distinguished from each other, yielding observations of dates on which non-moulting birds were observed. Sampling is representative for all three categories, and for actively moulting birds a sufficiently linear moult index (e.g. percent feather mass grown) is known. The likelihood of these observations is where and
Type 3 data consist of observations of actively moulting birds only, and a sufficiently linear moult index (e.g. percent feather mass grown) is known for each individual. The likelihood of these observations is
Type 4 data consist of observations of birds in active moult and post-moult only. Sampling is representative for these two categories, and for actively moulting birds a sufficiently linear moult index (e.g. percent feather mass grown) is known. The likelihood of these observations is
Type 5 data consist of observations of birds in pre-moult and active moult. Sampling is representative for these two categories, and for actively moulting birds a sufficiently linear moult index (e.g. percent feather mass grown) is known. The likelihood of these observations is
As outlined in Les G. Underhill and Zucchini (1988) estimates can also be derived from mixtures of data types. Type 1 + 2 data consist of observations of birds in all three moult states (pre-moult, active moult, post-moult). Sampling is representative for all three categories, but a sufficiently linear moult index (e.g. percent feather mass grown) is known only for some of the actively moulting birds. This means the sample consist of pre-moult birds, birds in active moult with known indices, birds in active moult without known indices but known capture dates , and post-moult birds. The likelihood of these observations is
moultmcmc
currently implements a recaptures model which
allows for heterogeneity in start dates
but assumes a common moult duration
.
When repeat observations are available an individual’s start date
then becomes
where is a row vector containing the values of individual-specific predictors (in the same format as ), and is an individual-level random effect intercept
where is the individual-specific standard deviation. We can then exploit the linearity assumption and treat observed moult scores as where captures any unmodelled variance in as well as any measurement error in .
The likelihood for the Type 3-like model for a sample of birds in active moult without repeated observations, and birds in active moult with a total of repeated observations and then is
where follows from above.
Users have a choice between two set of priors for the intercept terms of the linear predictors on the start date , the duration , and the population standard deviation of the start date , respectively. By default flat priors are used for and and a vaguely informative normal prior on
In some cases the models sample poorly with these priors, and better
convergence can be achieved by setting the argument
flat_prior = FALSE
. In this case vaguely informative
truncated normal priors are used for
and
:
These priors work well for data from passerines in seasonal environments, i.e. when the sampling occasion data is encoded as days from mid-winter.
For any additional regression coefficients an improper flat prior is used as a default.
Note that in Les G. Underhill and Zucchini (1988) the variable is doubly defined. It is both a generic variable of time in the model derivation, and denotes the sample dates of pre-moult birds in the data likelihoods↩︎