Firstwrkshnotes » History » Version 16

Corinna Gries, 03/11/2014 08:14 AM

h1. Workshop Notes

h1. Breakout session 1: Metrics brainstorming

* What are you currently using?
* What would you like to use?
* How widely is it used?
* Can it be applied to different biological community datasets (sampling approach)?
* Is it already coded (in R)?

h2. Metrics

# *Diversity* (all of these are generally available in R, mostly in vegan)
## Jaccard index
## Simpson's diversity
## Shannon's index
## Turnover - different ways to calculate
## Dominance
## Evenness
## Richness
## Rank abundance shift
## Proportion of overall diversity
## Beta diversity
# *Community metrics/ordination*
## NMDS (vegan)
## PCA (vegan)
## Bray-Curtis (vegan)
## Variance tracking, quantify variability change
## Position in ordination-space
# *Spatial*
## patch scale
## spatial autoregression
## Endemism
## Summary of species' positions within their ranges
## metacommunity statistics
# *Mechanistic models*
## MAR: needs a driver matrix; autocorrelation is a problem; mostly used for freshwater or marine systems (Eli Holmes has implemented state-space MAR in R; see the MARSS package on CRAN: http://cran.r-project.org/web/packages/MARSS/index.html)
## MANOVA (vegan? PERMANOVA is also in vegan)
## Ecosystem function (e.g. N deposition)
## interaction population models - interspecific competition (Ben Bolker's book and corresponding package)
## Economically/legally relevant metrics (e.g. maximum sustainable yield)
# *Food webs*
## connectance
## network analysis
# *Traits/phylogenetic*
## functional/phylogenetic diversity
## species aggregation (functional groups, trophic levels)
## phylogenetic dispersion
## Native/exotic
## Phylogeographic history
# *Temporal indices*
## species turnover
## rate of return
## Variance ratio
## Mean-variance scaling
## Spectral analysis
## Regression windows (strucchange)
## time series models of abundance: the metric would be the parameters of the model
# *null models*
# *Comparative analysis of small noise vs large noise systems. What drives differences?*
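
For reference, several of the diversity and dissimilarity metrics listed above can be written out in a few lines. The notes point to R (mostly vegan) for the real implementations; the following is only a minimal Python sketch of the textbook definitions, and the function names and the example plots are made up for illustration. Simpson's diversity is assumed here to mean the Gini-Simpson form (1 minus the sum of squared proportions); other conventions exist.

```python
import math

def richness(counts):
    """Number of taxa with a nonzero count."""
    return sum(1 for c in counts if c > 0)

def shannon(counts):
    """Shannon index H' = -sum(p_i * ln p_i) over taxa with p_i > 0."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

def simpson(counts):
    """Simpson's diversity, assumed here in the Gini-Simpson form 1 - sum(p_i^2)."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def pielou_evenness(counts):
    """Pielou's evenness J = H' / ln(S); returned as 0.0 when fewer than 2 taxa."""
    s = richness(counts)
    return shannon(counts) / math.log(s) if s > 1 else 0.0

def jaccard_similarity(a, b):
    """Shared taxa divided by taxa present in either sample (presence/absence)."""
    present_a = {i for i, c in enumerate(a) if c > 0}
    present_b = {i for i, c in enumerate(b) if c > 0}
    union = present_a | present_b
    return len(present_a & present_b) / len(union) if union else 1.0

def bray_curtis(a, b):
    """Bray-Curtis dissimilarity: sum|a_i - b_i| / sum(a_i + b_i)."""
    denom = sum(a) + sum(b)
    return sum(abs(x - y) for x, y in zip(a, b)) / denom if denom else 0.0

# Hypothetical counts per taxon for three plots (same taxon order in each).
plot_a = [10, 10, 10, 10]   # perfectly even
plot_b = [37, 1, 1, 1]      # one dominant taxon
plot_c = [5, 0, 15, 0]      # two taxa absent

for name, plot in [("a", plot_a), ("b", plot_b), ("c", plot_c)]:
    print(name, richness(plot), round(shannon(plot), 3),
          round(simpson(plot), 3), round(pielou_evenness(plot), 3))
print("Jaccard a/c:", jaccard_similarity(plot_a, plot_c))
print("Bray-Curtis a/c:", bray_curtis(plot_a, plot_c))
```

The even plot and the dominated plot have the same richness but very different Shannon, Simpson and evenness values, which is one reason the list above treats richness, evenness and dominance as separate metrics.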

h2. Coded in R

* Richness/diversity metrics: http://cran.r-project.org/web/packages/vegan/index.html
* Diversity metrics (alpha, beta, gamma): http://cran.r-project.org/web/packages/vegetarian/index.html
* Hubbell metrics: http://cran.r-project.org/web/packages/untb/index.html
* Leading indicators, variance, autocorrelation, skew, heteroscedasticity: http://cran.at.r-project.org/web/packages/earlywarnings/index.html
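
To make the "leading indicators" item concrete: two of the indicators named above, variance and lag-1 autocorrelation, are typically tracked in a moving window along a time series. The sketch below is a generic illustration in Python and does not reproduce the interface or the detrending options of the earlywarnings package; the function names and the abundance series are invented for this example.

```python
from statistics import mean, pvariance

def lag1_autocorrelation(window):
    """Lag-1 autocorrelation of a sequence; 0.0 if the sequence is constant."""
    m = mean(window)
    denom = sum((x - m) ** 2 for x in window)
    if denom == 0:
        return 0.0
    num = sum((window[i] - m) * (window[i + 1] - m)
              for i in range(len(window) - 1))
    return num / denom

def rolling_indicators(series, width):
    """Return (variance, lag-1 autocorrelation) for each window of `width` points."""
    out = []
    for start in range(len(series) - width + 1):
        window = series[start:start + width]
        out.append((pvariance(window), lag1_autocorrelation(window)))
    return out

# Made-up abundance series: stable at first, then increasingly variable.
abundance = [10, 11, 10, 11, 10, 11, 10, 14, 6, 15, 5, 16]
indicators = rolling_indicators(abundance, width=6)

for i, (var, ac1) in enumerate(indicators):
    print(f"window {i}: variance={var:.2f}, lag-1 autocorrelation={ac1:.2f}")
```

In this toy series the window variance rises toward the end, which is the kind of trend such indicators are meant to expose. Window width is a free choice and changes the result, so it should be reported alongside any indicator.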

Not yet coded:

* state-space models and community-level resilience
* variance components analysis

h1. Breakout Session 2: Identify research questions

# Group 1
## Data set transformation to allow computation of many metrics
## Time series analysis of community-level metrics (consider higher-frequency data too) (earlywarnings R package)
# Group 2
## New R code for capturing climate variance at seasonal and interannual scales and residuals
## R model for analyzing more spatial variability (Eric's LTER project)
## Review of non-stationarity
### Variance partitioning
### Temporal and spatial variance

h1. Discussion and Feedback: Collaboration Approaches

*Most Important Limitations*

* Data
** Lack of coordinated long-term measurements
** Time necessary to find data
** Determining usability of data, e.g. stations within a boundary envelope with at least 2 samples over 2 years
** Time necessary to clean data
** Quality control of data and dealing with problems
** Data sharing permission issues 
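
The usability criterion mentioned in the data limitations (stations within a boundary envelope with at least 2 samples over 2 years) can be expressed as a simple filter. The following Python sketch assumes one reading of that criterion: a station qualifies if it lies inside a latitude/longitude bounding box and has at least 2 samples falling in at least 2 distinct years. The record layout, station labels and coordinates are hypothetical.

```python
from collections import defaultdict

def usable_stations(samples, envelope, min_samples=2, min_years=2):
    """Return sorted station ids inside the envelope that meet the sampling thresholds.

    samples:  iterable of (station, lat, lon, year) tuples (assumed layout)
    envelope: (lat_min, lat_max, lon_min, lon_max) bounding box
    """
    lat_min, lat_max, lon_min, lon_max = envelope
    years_by_station = defaultdict(list)
    for station, lat, lon, year in samples:
        if lat_min <= lat <= lat_max and lon_min <= lon <= lon_max:
            years_by_station[station].append(year)
    return sorted(
        station for station, years in years_by_station.items()
        if len(years) >= min_samples and len(set(years)) >= min_years
    )

# Hypothetical sample records.
samples = [
    ("A", 34.4, -119.8, 2010), ("A", 34.4, -119.8, 2011),  # inside, 2 years
    ("B", 34.5, -119.7, 2010), ("B", 34.5, -119.7, 2010),  # inside, 1 year only
    ("C", 40.0, -119.8, 2010), ("C", 40.0, -119.8, 2012),  # outside the envelope
    ("D", 34.1, -119.9, 2009),                             # inside, 1 sample
]
envelope = (34.0, 35.0, -120.0, -119.0)

usable = usable_stations(samples, envelope)
print(usable)
```

Keeping the thresholds as parameters makes it easy to see how many stations are lost as the criterion is tightened, which helps when judging whether a dataset is worth cleaning at all.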

* Workflows
** Need an incentive to document as you work; this would be different if work were pushed to KNB as it progresses and credit were given for it

* Collaboration
** Scattered resources: data and code in different locations, hard to move back and forth, hard to work on the code together, hard to know who is working on which parts of the code
** Workspace integration and accessibility
** Project management/tool integration
** Time investment in learning different tools, training needs
** GitHub is too technical

*Recommendations*

* Data
** Dataset format: long format with columns for species and count/biomass, plus columns for site (plot, subplot, etc.) and date. Separate table with species names so that functional groups, taxonomic rank, etc. can be added. Separate table for site descriptions (manipulations, land use, etc.)
** Gather additional data on biogeochemistry, climate, etc.
** Develop standard methods for dealing with outliers, large gaps, species names and spellings
** Develop standards for classifying data points into aggregated
** Create library of cleaned data sets that are massaged into one format
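
As an illustration of the recommended dataset format, the sketch below holds observations in long format (one row per site, date and species), keeps species attributes in a separate table, and reshapes the observations into the site-by-species matrix that most community metrics expect. It is a minimal Python sketch; all column choices, species codes and functional groups are invented for the example.

```python
from collections import defaultdict

# Long-format observations: (site, date, species, count). Hypothetical values.
observations = [
    ("plot1", "2013-06-01", "sp_a", 4),
    ("plot1", "2013-06-01", "sp_b", 1),
    ("plot2", "2013-06-01", "sp_a", 2),
    ("plot2", "2013-06-01", "sp_c", 7),
]

# Separate species table, so attributes can be added without touching the counts.
species_table = {
    "sp_a": {"functional_group": "grass"},
    "sp_b": {"functional_group": "forb"},
    "sp_c": {"functional_group": "grass"},
}

def long_to_wide(rows):
    """Reshape long rows into {(site, date): [count per species]}, zero-filled."""
    species = sorted({sp for _, _, sp, _ in rows})
    totals = defaultdict(lambda: defaultdict(int))
    for site, date, sp, count in rows:
        totals[(site, date)][sp] += count
    matrix = {key: [by_sp.get(sp, 0) for sp in species]
              for key, by_sp in sorted(totals.items())}
    return species, matrix

def totals_by_group(rows, table, attribute):
    """Sum counts per (site, date, attribute value) using the species table."""
    out = defaultdict(int)
    for site, date, sp, count in rows:
        out[(site, date, table[sp][attribute])] += count
    return dict(out)

species, matrix = long_to_wide(observations)
group_totals = totals_by_group(observations, species_table, "functional_group")

print(species)
for key, row in matrix.items():
    print(key, row)
print(group_totals)
```

The long format stores only what was observed, and absences appear as zeros only after reshaping. Aggregating by functional group is then a lookup against the species table rather than a change to the raw data.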

* Workflows
** Create library of workflows that provide general cleaning routines that can be applied to arbitrary data, possibly interactive with some user input
** Create library of workflows that make reshaping more accessible to people with little coding experience
** Create library of workflows specifically for dealing with taxonomic names
** Link workflows to publications, e.g. via a website (repository) where scientists can publish citable workflows (ecologicalworkflows.org, like myexperiment.org, but possibly more agnostic with respect to the dependencies/tools that connect to it (package descriptions))
** Make this repository more accessible by keeping the 'ecology' emphasis; make workflows much more visible in existing repositories (KNB, DataONE) by linking them to datasets
** Create library of workflows for training purposes (e.g. Dan Bunker's R tutorial), link to datasets in a repository
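
A small example of what a taxonomic-name cleaning routine might look like: normalize capitalization and whitespace, then match against a list of accepted names, tolerating small spelling differences. This is a rough Python sketch using the standard library's difflib and is not a substitute for a name resolution service. The accepted-name list, the cutoff of 0.85 and the function names are assumptions chosen for illustration.

```python
import difflib

def normalize(name):
    """Collapse whitespace and apply 'Genus species' capitalization."""
    parts = name.strip().split()
    if not parts:
        return ""
    return " ".join([parts[0].capitalize()] + [p.lower() for p in parts[1:]])

def match_name(raw, accepted, cutoff=0.85):
    """Return the accepted name for `raw`, or None if nothing is close enough.

    `cutoff` is an assumed similarity threshold (0 to 1); unmatched names
    should go to a person for review rather than be guessed.
    """
    cleaned = normalize(raw)
    if cleaned in accepted:
        return cleaned
    close = difflib.get_close_matches(cleaned, accepted, n=1, cutoff=cutoff)
    return close[0] if close else None

# Illustrative reference list only.
accepted_names = ["Andropogon gerardii", "Schizachyrium scoparium", "Poa pratensis"]

raw_names = ["  andropogon  GERARDII ", "Andropogon gerardi",
             "Schizachyrium scoparum", "Quercus alba"]
for raw in raw_names:
    print(repr(raw), "->", match_name(raw, accepted_names))
```

Returning None for names without a close match keeps the routine conservative, which fits the idea above of cleaning workflows that are interactive with some user input.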

* Collaboration Tool
** Pair programming: changes how you work; divide and conquer worked well
** Git repository: used successfully in this workshop when some people were familiar with it and could quickly bootstrap its use for others
** Way to replicate or interface with services like Google Open Refine, db constraints, taxize, TNRS
** Develop a 'Redmine' that is more useful for academics; it becomes the point of integration for multiple tools; also BaseCamp/Trello, digital notebook environments
** Run workflows, organize outputs, communicate with collaborators
** Ability to couple models at multiple scales (e.g. spatial or temporal scales), scale up computing as well
** Incorporate the writing process, version control for documents (Google Docs is not sufficient)
** Incorporate mechanisms to maintain social connection even in the absence of face-to-face meetings

*Datasets*

* small mammal (VCR, SEV)
* arthropod data (CAP, KNZ, FCE)
* datasets on kelp published in ESA journal
* Cedar Creek:
** species composition data accessible at: http://doi.org/10.6073/pasta/50db8bde41c9ea8b32dfbdde8bb0fad2
** climate data accessible at: http://doi.org/10.6073/pasta/24eb99ad3102cdcb2f8d02de93dd551e

* PISCO intertidal biodiversity surveys
** Methods: http://cbsurveys.ucsc.edu/sampling/images/dataprotocols.pdf
** Point contact data (percent cover, good for sessile/common spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.50.6
** Quadrat data (percent cover, good for mobile spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.52.7
** Swath data (extensive, only select rare species like seastars): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.51.6

* Konza
** climate data (KNZ headquarters): doi:10.6073/pasta/ac19b27f2c28a63890d59ece32f5116b
** Konza species composition (belowground experiment for N addition contrasts): doi:10.6073/pasta/b6653594d336bddf9d5f7f72c7d9200c. Konza only collects cover for N addition treatments every 5 years, so we will abandon it for now.

*Detailed notes are on etherpad: https://epad.nceas.ucsb.edu/p/commdyn-20140105*