Project

General

Profile

Firstwrkshnotes » History » Version 15

Corinna Gries, 03/11/2014 08:13 AM

1 1 Corinna Gries
h1. Workshop Notes
2
3 3 Corinna Gries
are on etherpad: https://epad.nceas.ucsb.edu/p/commdyn-20140105
4 4 Corinna Gries
5 1 Corinna Gries
6 10 Corinna Gries
7
h1. Breakout session 1: Metrics brainstorming
8
9 7 Corinna Gries
* what are you currently using
10
* what would you like to use
11
* how widely is it used
12
* can it be applied to different biological community datasets (sampling approach)
13
* is it already coded {in R}
14 1 Corinna Gries
15 8 Corinna Gries
16 5 Corinna Gries
h2. Metrics
17 6 Corinna Gries
18 5 Corinna Gries
# *Diversity* (all of these are generally in R, mostly in vegan)
19
## Jaccard index
20
## Simpson's diversity 
21
## Shannons index
22
## Turnover - different ways to calculate
23
## Dominance 
24
## Evenness
25
## Richness
26
## Rank abundance shift
27 9 Corinna Gries
## Proportion of overall diversity
28 5 Corinna Gries
## Beta diversity
29
# *Community metrics/ordination*
30
## NMDS (vegan)
31
## PCA (vegan)
32
## Bray curtis (vegan)
33
## Variance tracking, quantify variability change
34 9 Corinna Gries
## Position in ordination-space
35 5 Corinna Gries
# *Spatial*
36
## patch scale 
37
## spatial autoregression
38 9 Corinna Gries
## Endemism
39
## Summary of species' positions within their ranges
40 5 Corinna Gries
## meta community statistics
41
# *Mechanistic models*
42
## MAR, needs driver matrix, problem auto-corelation, mostly fresh water or marine (Eli Holmes has state-space MAR in R implemented, not sure if it's on CRAN)   http://cran.r-project.org/web/packages/MARSS/index.html
43
## MANOVA (vegan? Also, permanova is in vegan)
44 9 Corinna Gries
## Ecosystem function (e.g. N deposition)
45 5 Corinna Gries
## interaction population models - inter specific competition (Ben Bolker's book and corresponding package)
46 9 Corinna Gries
## Economically/legally relevant metrics (e.g. Maximum sustainable yield)
47 5 Corinna Gries
# *Food webs*
48
## connectance
49
## network analysis
50
# *Traits/phylogentic*
51 9 Corinna Gries
## functional/phylogenetic diversity
52 1 Corinna Gries
## species aggregation (functional groups, trophic levels
53 9 Corinna Gries
## phylogenetic dispersion
54 1 Corinna Gries
## Native/exotic
55 9 Corinna Gries
## Phylogeographic history
56 1 Corinna Gries
# *Temporal indices*
57 9 Corinna Gries
## species turnover
58
## rate of return
59 4 Corinna Gries
## Variance ratio
60
## Mean-variance scaling
61 5 Corinna Gries
## Spectral analysis
62 1 Corinna Gries
## Regresssion windows (strucchange)
63 9 Corinna Gries
## time series models of abundance -- metric would be parameters of model
64 1 Corinna Gries
# *null models*
65 9 Corinna Gries
# *Comparative analysis of small noise vs large noise systems. What drives differences?*
66 5 Corinna Gries
67 8 Corinna Gries
h2. Coded in R
68
69
* Richness/diversity metrics: http://cran.r-project.org/web/packages/vegan/index.html
70
* Diversity metrics (alpha, beta, gamma): http://cran.r-project.org/web/packages/vegetarian/index.html
71
* Hubble metrics: http://cran.r-project.org/web/packages/untb/index.html
72 1 Corinna Gries
* Leading indicators, variance, autocorrelation, skew, heteroscedasticity: http://cran.at.r-project.org/web/packages/earlywarnings/index.html
73
74 8 Corinna Gries
not yet coded:
75
* state-space models and community level resilience
76 13 Corinna Gries
* variance components analysis
77 8 Corinna Gries
78 12 Corinna Gries
h1. Breakout Session 2: Identify research questions
79 10 Corinna Gries
80 15 Corinna Gries
# Group 1
81
## Data set transformation to allow compute of many metrics
82
## Time series analysis of community level metrics (consider higher freq data too)(earlywarnings R package)
83
# Group 2
84
## New R code for capturing climate variance at seasonal and interannual scales and residuals
85
## R model for analyzing more spatial variability (Eric's LTER project)
86
## Review of non-stationarity
87
### Variance partitioning 
88
### Temporal and spatial variance
89 13 Corinna Gries
90
h1. Discussion and Feedback: Collaboration Approaches
91
92
*Most Important Limitations*
93
94
* Data
95 14 Corinna Gries
** Lack of coordinated long-term measurements
96
** Time necessary to find data
97
** Determine usability of data, e.g. stations within a boundary envelope with at least 2 samples over 2 years
98 13 Corinna Gries
** Time necessary to clean data
99
** Quality control data and deal with problems
100
** Data sharing permission issues 
101
102 14 Corinna Gries
* Workflows
103
** Need incentive to document as you work; would be different if pushed to KNB as work progresses and get credit for that work done
104
105 13 Corinna Gries
* Collaboration
106
** Scattered resources: data and code in different locations, hard to move back and forth, hard to work on the code together, hard to know who's working on which parts of the code
107
** Workspace integration and accessibility
108 1 Corinna Gries
** Project management/tool integration
109 14 Corinna Gries
** Time investment in learning different tools, training needs
110
** Github is too technical
111 13 Corinna Gries
112
*Recommendations*
113
114
* Data
115
** Dataset format: long format with columns for species and count/biomass, plus columns for site (plot, subplot, etc.) and date. Separate table with species name to be able to add functional groups, taxonomic rank, etc.. Separate table for site descriptions (manipulations, land use, etc.)
116
** Gather additional data on biogeochemistry, climate etc.
117 1 Corinna Gries
** Develop standard methods for dealing with outliers, large gaps, species names and spellings
118 14 Corinna Gries
** Develop standards for classifying data points into aggregated 
119
** Create library of cleaned data sets that are massaged into one format
120 1 Corinna Gries
121 14 Corinna Gries
* Workflows
122
** Create library of workflows that provide general cleaning routines that can be applied to arbitrary data, possibly interactive with some user input
123
** Create library of workflows that make reshaping more accessible to people with little coding experience
124
** Create library of workflows specifically for dealing with taxonomic names.
125
** Link workflows to publications, e.g., via a website (repository) where scientists can publish citeable workflows (ecologicalworkflows.org, like myexperiment.org, but possibly more agnostic with respect to dependencies/tools that connect to it (package descriptions))
126
** Make this repository more accessible by keeping the 'ecology' emphasis, make workflows much more visible in existing repositories (KNB, DataONE) by linking to datasets.
127
** Create library of workflows for training purposes (e.g. Dan Bunker's R tutorial), link to datasets in a repository
128
129
130 1 Corinna Gries
* Collaboration Tool
131 14 Corinna Gries
** Pair programming: changes how you work; divide and conquer worked well
132
** Git repository, has been used successfully in this workshop when some people were familiar with it a could bootstrap the use for other people quickly
133 1 Corinna Gries
** Way to replicate or interface with services like {Google open refine, db constraints, taxize, TNRS)
134 14 Corinna Gries
** Develop a 'Redmine' that is more useful for academics; becomes the point for integration of multiple tools; also BaseCamp/Trello, Digital notebook environments
135
** Run workflows, organize outputs, communicate with collaborators
136
** Ability to couple models at multiple scales (e.g., spatial or temporal scales), scale up computing as well
137
** Incorporate writing process, version control for documents (Google docs is not sufficient)
138
** Incorporate mechanisms to maintain social connection even in absence of face to face meetings
139 13 Corinna Gries
140
*Datasets*
141
142
* small mammal (VCR, SEV)
143
* arthropod data (CAP, KNZ, FCE)
144
* datasets on kelp published in ESA journal
145
* Cedar Creek : 
146
** species compostion data Accessible at: http://doi.org/10.6073/pasta/50db8bde41c9ea8b32dfbdde8bb0fad2
147
** climate data accessible at http://doi.org/10.6073/pasta/24eb99ad3102cdcb2f8d02de93dd551e
148
	
149
* PISCO intertidal biodiversity surveys
150
** Methods: http://cbsurveys.ucsc.edu/sampling/images/dataprotocols.pdf
151
** Point contact data (percent cover, good for sessile/common spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.50.6
152
** Quadrat data (percent cover, good for mobile spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.52.7
153
** Swath data (extensive, only select rare species like seastars): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.51.6
154
155
* Konza
156
** climate data (KNZ headquarters): doi:10.6073/pasta/ac19b27f2c28a63890d59ece32f5116b
157
** Konza species composition (belowground experiment for N addition contrasts): doi:10.6073/pasta/b6653594d336bddf9d5f7f72c7d9200c Konza only collects cover for N addition treatments every 5 years, so we will abandon for now