Project

General

Profile

Firstwrkshnotes » History » Version 14

Corinna Gries, 03/11/2014 08:10 AM

1 1 Corinna Gries
h1. Workshop Notes
2
3 3 Corinna Gries
are on etherpad: https://epad.nceas.ucsb.edu/p/commdyn-20140105
4 4 Corinna Gries
5 1 Corinna Gries
6 10 Corinna Gries
7
h1. Breakout session 1: Metrics brainstorming
8
9 7 Corinna Gries
* what are you currently using
10
* what would you like to use
11
* how widely is it used
12
* can it be applied to different biological community datasets (sampling approach)
13
* is it already coded {in R}
14 1 Corinna Gries
15 8 Corinna Gries
16 5 Corinna Gries
h2. Metrics
17 6 Corinna Gries
18 5 Corinna Gries
# *Diversity* (all of these are generally in R, mostly in vegan)
19
## Jaccard index
20
## Simpson's diversity 
21
## Shannons index
22
## Turnover - different ways to calculate
23
## Dominance 
24
## Evenness
25
## Richness
26
## Rank abundance shift
27 9 Corinna Gries
## Proportion of overall diversity
28 5 Corinna Gries
## Beta diversity
29
# *Community metrics/ordination*
30
## NMDS (vegan)
31
## PCA (vegan)
32
## Bray curtis (vegan)
33
## Variance tracking, quantify variability change
34 9 Corinna Gries
## Position in ordination-space
35 5 Corinna Gries
# *Spatial*
36
## patch scale 
37
## spatial autoregression
38 9 Corinna Gries
## Endemism
39
## Summary of species' positions within their ranges
40 5 Corinna Gries
## meta community statistics
41
# *Mechanistic models*
42
## MAR, needs driver matrix, problem auto-corelation, mostly fresh water or marine (Eli Holmes has state-space MAR in R implemented, not sure if it's on CRAN)   http://cran.r-project.org/web/packages/MARSS/index.html
43
## MANOVA (vegan? Also, permanova is in vegan)
44 9 Corinna Gries
## Ecosystem function (e.g. N deposition)
45 5 Corinna Gries
## interaction population models - inter specific competition (Ben Bolker's book and corresponding package)
46 9 Corinna Gries
## Economically/legally relevant metrics (e.g. Maximum sustainable yield)
47 5 Corinna Gries
# *Food webs*
48
## connectance
49
## network analysis
50
# *Traits/phylogentic*
51 9 Corinna Gries
## functional/phylogenetic diversity
52 1 Corinna Gries
## species aggregation (functional groups, trophic levels
53 9 Corinna Gries
## phylogenetic dispersion
54 1 Corinna Gries
## Native/exotic
55 9 Corinna Gries
## Phylogeographic history
56 1 Corinna Gries
# *Temporal indices*
57 9 Corinna Gries
## species turnover
58
## rate of return
59 4 Corinna Gries
## Variance ratio
60
## Mean-variance scaling
61 5 Corinna Gries
## Spectral analysis
62 1 Corinna Gries
## Regresssion windows (strucchange)
63 9 Corinna Gries
## time series models of abundance -- metric would be parameters of model
64 1 Corinna Gries
# *null models*
65 9 Corinna Gries
# *Comparative analysis of small noise vs large noise systems. What drives differences?*
66 5 Corinna Gries
67 8 Corinna Gries
h2. Coded in R
68
69
* Richness/diversity metrics: http://cran.r-project.org/web/packages/vegan/index.html
70
* Diversity metrics (alpha, beta, gamma): http://cran.r-project.org/web/packages/vegetarian/index.html
71
* Hubble metrics: http://cran.r-project.org/web/packages/untb/index.html
72 1 Corinna Gries
* Leading indicators, variance, autocorrelation, skew, heteroscedasticity: http://cran.at.r-project.org/web/packages/earlywarnings/index.html
73
74 8 Corinna Gries
not yet coded:
75
* state-space models and community level resilience
76 13 Corinna Gries
* variance components analysis
77 8 Corinna Gries
78 12 Corinna Gries
h1. Breakout Session 2: Identify research questions
79 10 Corinna Gries
80
# Data set transformation to allow compute of many metrics
81
# Time series analysis of community level metrics (consider higher freq data too)(earlywarnings R package)
82
# New R code for capturing climate variance at seasonal and interannual scales and residuals
83 13 Corinna Gries
# R model for analyzing more spatial variability (Eric's LTER project)
84 10 Corinna Gries
# Review of non-stationarity
85 12 Corinna Gries
## Variance partitioning 
86
## Temporal and spatial variance
87 13 Corinna Gries
88
h1. Discussion and Feedback: Collaboration Approaches
89
90
*Most Important Limitations*
91
92
* Data
93 14 Corinna Gries
** Lack of coordinated long-term measurements
94
** Time necessary to find data
95
** Determine usability of data, e.g. stations within a boundary envelope with at least 2 samples over 2 years
96 13 Corinna Gries
** Time necessary to clean data
97
** Quality control data and deal with problems
98
** Data sharing permission issues 
99
100 14 Corinna Gries
* Workflows
101
** Need incentive to document as you work; would be different if pushed to KNB as work progresses and get credit for that work done
102
103 13 Corinna Gries
* Collaboration
104
** Scattered resources: data and code in different locations, hard to move back and forth, hard to work on the code together, hard to know who's working on which parts of the code
105
** Workspace integration and accessibility
106 1 Corinna Gries
** Project management/tool integration
107 14 Corinna Gries
** Time investment in learning different tools, training needs
108
** Github is too technical
109 13 Corinna Gries
110
*Recommendations*
111
112
* Data
113
** Dataset format: long format with columns for species and count/biomass, plus columns for site (plot, subplot, etc.) and date. Separate table with species name to be able to add functional groups, taxonomic rank, etc.. Separate table for site descriptions (manipulations, land use, etc.)
114
** Gather additional data on biogeochemistry, climate etc.
115 1 Corinna Gries
** Develop standard methods for dealing with outliers, large gaps, species names and spellings
116 14 Corinna Gries
** Develop standards for classifying data points into aggregated 
117
** Create library of cleaned data sets that are massaged into one format
118 1 Corinna Gries
119 14 Corinna Gries
* Workflows
120
** Create library of workflows that provide general cleaning routines that can be applied to arbitrary data, possibly interactive with some user input
121
** Create library of workflows that make reshaping more accessible to people with little coding experience
122
** Create library of workflows specifically for dealing with taxonomic names.
123
** Link workflows to publications, e.g., via a website (repository) where scientists can publish citeable workflows (ecologicalworkflows.org, like myexperiment.org, but possibly more agnostic with respect to dependencies/tools that connect to it (package descriptions))
124
** Make this repository more accessible by keeping the 'ecology' emphasis, make workflows much more visible in existing repositories (KNB, DataONE) by linking to datasets.
125
** Create library of workflows for training purposes (e.g. Dan Bunker's R tutorial), link to datasets in a repository
126
127
128 1 Corinna Gries
* Collaboration Tool
129 14 Corinna Gries
** Pair programming: changes how you work; divide and conquer worked well
130
** Git repository, has been used successfully in this workshop when some people were familiar with it a could bootstrap the use for other people quickly
131 1 Corinna Gries
** Way to replicate or interface with services like {Google open refine, db constraints, taxize, TNRS)
132 14 Corinna Gries
** Develop a 'Redmine' that is more useful for academics; becomes the point for integration of multiple tools; also BaseCamp/Trello, Digital notebook environments
133
** Run workflows, organize outputs, communicate with collaborators
134
** Ability to couple models at multiple scales (e.g., spatial or temporal scales), scale up computing as well
135
** Incorporate writing process, version control for documents (Google docs is not sufficient)
136
** Incorporate mechanisms to maintain social connection even in absence of face to face meetings
137 13 Corinna Gries
138
*Datasets*
139
140
* small mammal (VCR, SEV)
141
* arthropod data (CAP, KNZ, FCE)
142
* datasets on kelp published in ESA journal
143
* Cedar Creek : 
144
** species compostion data Accessible at: http://doi.org/10.6073/pasta/50db8bde41c9ea8b32dfbdde8bb0fad2
145
** climate data accessible at http://doi.org/10.6073/pasta/24eb99ad3102cdcb2f8d02de93dd551e
146
	
147
* PISCO intertidal biodiversity surveys
148
** Methods: http://cbsurveys.ucsc.edu/sampling/images/dataprotocols.pdf
149
** Point contact data (percent cover, good for sessile/common spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.50.6
150
** Quadrat data (percent cover, good for mobile spp): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.52.7
151
** Swath data (extensive, only select rare species like seastars): https://knb.ecoinformatics.org/m/#view/doi:10.6085/AA/pisco_intertidal.51.6
152
153
* Konza
154
** climate data (KNZ headquarters): doi:10.6073/pasta/ac19b27f2c28a63890d59ece32f5116b
155
** Konza species composition (belowground experiment for N addition contrasts): doi:10.6073/pasta/b6653594d336bddf9d5f7f72c7d9200c Konza only collects cover for N addition treatments every 5 years, so we will abandon for now