[R-sig-eco] repeated measures NMDS?

Discussion:

Eduard Szöcs

2010-11-08 14:39:31 UTC

Hi listers,

I have species and environmental data for 24 sites that were sampled thrice. If I want to analyze the data with NMDS I could run metaMDS on the whole dataset (24 sites x 3 times = 72 samples) and then fit environmental data, but this would be some kind of pseudoreplication, given that the samplings are not independent and the gradients may be overestimated, wouldn't it?

For environmental data a factor could be included for the sampling dates - but this would not be possible for species data.

Is there an elegant way either to aggregate the data before ordination or to conduct something like a repeated measures NMDS?

Thank you in advance,
Eduard Szöcs

Gavin Simpson

2010-11-08 21:01:40 UTC

It depends on how you want to fit the env data - the pseudo-replication
isn't relevant to the nMDS itself. If you are doing it via function
`envfit()`, then look at the argument `strata`, which should, in your
case, be set to a factor with 24 levels (one per site). This won't be
perfect because your data are a time series and, strictly, one should
permute them whilst maintaining their ordering in time, but as yet we
don't have these types of permutations hooked into vegan.
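
A minimal sketch of the `envfit()` route described above, assuming the vegan package is installed. The community matrix `spe`, the variables in `env` (`pH`, `temp`) and the object names are simulated stand-ins, not the poster's data; only the 24 sites x 3 samplings layout is taken from the thread.

```r
library(vegan)

set.seed(42)
n_sites <- 24
n_times <- 3
n_samples <- n_sites * n_times

# One factor level per site: permutations are restricted to within a site
site <- factor(rep(seq_len(n_sites), each = n_times))

# Simulated community matrix (72 samples x 10 species); an assumption
spe <- matrix(rpois(n_samples * 10, lambda = 5), ncol = 10)
colnames(spe) <- paste0("sp", 1:10)

# Simulated environmental variables; names are hypothetical
env <- data.frame(pH = rnorm(n_samples, 7, 0.5),
                  temp = rnorm(n_samples, 15, 3))

# Ordinate all 72 samples together, then fit env vectors with
# samples shuffled only among the three samplings of the same site
ord <- metaMDS(spe, distance = "bray", k = 2, trace = FALSE)
fit <- envfit(ord, env, permutations = 999, strata = site)
fit
```

With random data like this no variable should come out as strongly related to the ordination; the point is only where `strata` goes in the call.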

If you are doing the fitting some other way you'll need to include
"site" as a fixed effect factor to account for the within site
correlation.
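
One possible reading of "some other way" is a plain regression of an environmental variable on the ordination scores. The sketch below uses base R only; the scores, the `pH` variable and the effect sizes are simulated assumptions, and in practice the scores would come from the ordination (for example via `scores()` in vegan).

```r
set.seed(1)
site <- factor(rep(1:24, each = 3))

# Hypothetical NMDS sample scores (stand-in for real ordination output)
nmds <- data.frame(NMDS1 = rnorm(72), NMDS2 = rnorm(72))

# Simulated response with a site-level offset shared by the three
# samplings of each site (this is the within-site correlation)
site_effect <- rnorm(24)[as.integer(site)]
pH <- 7 + 0.3 * nmds$NMDS1 + site_effect + rnorm(72, sd = 0.2)
dat <- cbind(nmds, pH = pH, site = site)

# Naive model treats all 72 samples as independent
m_naive <- lm(pH ~ NMDS1 + NMDS2, data = dat)

# Including site as a fixed-effect factor absorbs between-site offsets
m_site <- lm(pH ~ NMDS1 + NMDS2 + site, data = dat)

anova(m_naive, m_site)
```

Note that the site factor uses 23 degrees of freedom, so the axis effects are then estimated from within-site variation only.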

You don't need to worry about the species data and accounting for
sampling interval. You aren't testing the nMDS "axes" or anything like
that, and all the species info has been reduced to dissimilarities and
thence to a set of nMDS coordinates. You need to account for the
pseudo-replication at the environmental modelling level, not the
species level.

HTH

G

--
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
Dr. Gavin Simpson [t] +44 (0)20 7679 0522
ECRC, UCL Geography, [f] +44 (0)20 7679 0565
Pearson Building, [e] gavin.simpsonATNOSPAMucl.ac.uk
Gower Street, London [w] http://www.ucl.ac.uk/~ucfagls/
UK. WC1E 6BT. [w] http://www.freshwaters.org.uk
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%

Vít Syrovátka

2010-11-08 22:41:01 UTC

Hi listers,

I am facing a problem with pseudoreplication similar to the one raised
by Eduard Szöcs.
However, I want to try to explain the variation in species composition
by some environmental variables.
I decided to use adonis() for this, as there are many zeros in the
data set, but I don't know whether this is correct or, if not, how to
do it correctly.

The design is:
Aquatic insect larvae were collected at 17 sites three times (spring,
summer, autumn), giving a total of 51 samples.
During each sampling campaign several chemical and substratum
variables were measured, while others were measured only once, as they
characterize the whole site and are not expected to change in time.

Now I want to relate the species composition to environmental
variables. I see that the samples are not independent, so I thought
about setting strata in the adonis model, but that didn't make much
sense to me. I am interested in the differences among sites, so
strata should probably be season (?). Moreover, the effect of
environmental variables might differ among seasons, so an
interaction with season should probably be included.

I ended up with this model:
adonis(log(spe + 1) ~ env1 %in% season + env2 %in% season + ...,
       method = "bray", permutations = 999, data = env)
(where season is a factor with 3 levels)

I don't feel good about this model and would greatly appreciate any
suggestions on how to build an appropriate model.
Is there a way to analyze such data without splitting them into 3 data
sets by season and analyzing them separately?

Thanks in advance for any suggestions,
Vit Syrovatka

Eduard Szöcs

2010-11-10 12:57:25 UTC

Thanks, that helped.

permuted.index2() generates these types of permutations. But envfit()
does not use this yet.
What if I modify vectorfit() (used by envfit() ) in such a way that it
uses permuted.index2() instead of permuted.index()?
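
To make the idea of order-preserving permutations concrete, here is a base-R sketch of one common scheme: a random cyclic shift of the time points within each site. The helper `permute_series_within()` is hypothetical and written only for illustration; it is not the code of permuted.index2() and makes no claim about how that function works internally. It assumes the rows of each site are already sorted by sampling time.

```r
# Hypothetical helper: cyclic-shift permutation within each stratum.
# Returns an index vector that keeps samples inside their own site and
# preserves their time order up to a rotation.
permute_series_within <- function(strata) {
  idx <- seq_along(strata)
  for (s in unique(strata)) {
    pos <- which(strata == s)       # rows of this site, in time order
    n <- length(pos)
    shift <- sample.int(n, 1) - 1   # random offset between 0 and n - 1
    idx[pos] <- pos[((seq_len(n) - 1 + shift) %% n) + 1]
  }
  idx
}

set.seed(7)
site <- factor(rep(1:24, each = 3))
perm <- permute_series_within(site)
head(perm, 9)
```

With only three samplings per site each site has just three possible rotations, so the set of distinct permutations is much smaller than under free shuffling within sites.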

Eduard Szöcs