-
Notifications
You must be signed in to change notification settings - Fork 2
/
coll-data-entry-protocol.Rmd
454 lines (231 loc) · 15.9 KB
/
coll-data-entry-protocol.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
---
title: "specimen-data_protocol"
author: "Emma Menchions"
date: "`r Sys.Date()`"
output:
pdf_document: default
html_document: default
---
# Data Entry
## Template
*Start new template for every field journal.*
- Copy and paste the template "HJ-occ-entry-template.xlsx"
- Rename to "HJ-[input archiveID]-occ-entry-[YY-MM-DD].xlsx"
## Setting up
1. Open the data entry excel sheet
2. Open the field journal to the first page
3. Open two internet search windows
4. In the first window --\> [https://www.inaturalist.org](https://www.inaturalist.org/home)
- click "Explore" tab
- zoom in on south western BC
- click the red rectangular select button
- click and drag to select an area that extends from around Hope BC to the east, encompasses all of Vancouver Island, and goes down to Seattle/ Tacoma in the south
- set this window aside and return to excel sheet
5. Second window will be used to search up localities that you are uncertain of
## Entering data
Begin with the **first page in a journal.**
1. Enter "**pageNum"** (page number) --\> e.g 1 . (circled in bottom corner of pages)
2. **Scan the page for collection numbers with species names** starting at the top of the page.
- To look for what location and date information is associated with each name, he often put the location and a date and then species associated with it BELOW or BESIDE
3. Enter "**numPage"** **[REQUIRED]** (number on page) as the number of the observation on the page (e.g. the first species name that has a collection number gets"1", the next one "2", etc.)
- easiest to enter all of the other data for the observation sequentially on the page and then click and drag a sequence of numbers in excel to number each observation
- enter observations by columns (e.g. if there are two columns of names on a page, enter the first column and then the second)
4. **"vName"** **[REQUIRED]** = the verbatim name written in the notes
- this may be an abbreviation (e.g. Stel med)
- or full species name (could be misspelled, ***include these spelling mistakes*****)**
- or a full species name which is outdated, ***keep this outdated name***, we want to know exactly what he wrote so we can be confident in assigning an updated species names
5. **"vSciName" [REQUIRED]** = your estimation of what the full species name is
- This is where iNaturalist is helpful. Often the taxonomy might be outdated (indicated by a bracketed name in the search results on iNat).
- start entering first 3 letters of generic name and species name
- see if it occurs in the region
- keep trying with different letters to find best match
- if there are multiple species possible and/ or you can't decipher the name, place a note in the "dataEntryRemarks" column (e.g. "could not decipher taxon name")
- Once you are reasonably certain of a full name, start typing it into search bar and finish typing by looking at the result. If the result still pops up then its proper spelling and you can copy it to spreadsheet
- If it is an outdated name, it will often be bracketed beside another name for the same species in the iNaturalist search, can use this as a spelling guide as well
- Enter your best guess of what the full verbatim name was meant to be
- there should be no spelling mistakes, but outdated taxon names should still be used here
6. **"Conf"** **[REQUIRED]** = enter your confidence level in the **taxon name** estimate (low = l, medium = m, high = h)
- the names with low confidence will be checked over once all of the occurrences have been entered
- entries with medium confidence may be returned to if there is time
7. **"sciName" [REQUIRED] =** **full, properly spelled, updated taxon names.**
- refer to iNaturalist for updates taxon names
- If subspecies or variety, just include this epithet, and do not include the "ssp." or "var." abbreviations. (e.g. Instead of Lupinus latifolius ssp. subalpinus, put "Lupinus latifolius subalpinus")
8. **"date" [REQUIRED] =** fill in YYYYMMDD
9. **"locality" [REQUIRED] =** string describing specific location. Does not contain country, province information, but includes place name (island, town, city, area, landmark, trail, road) or position relative to to these features. **Go from GENERAL to SPECIFIC and separate names with ";"**. If there are any words you can't make out or are uncertain of, use [brackets] around the estimated word. I*mportant: Use a web search to make ensure entry of proper spelling of place name!*
**USE CAPITAL LETTERS** for each word of city/ town/ municipality/ island names
**SPELL OUT ABBREVIATIONS**
- e.g. Mt --\> Mount, Rd --\> Road
**SEPARATE PLACE NAMES WITH SEMI-COLON**. Do not tack on terms like "area" .
e.g. "George Hill area" --\> "George Hill; area around George Hill"
Start with island/ town/ area name...
- e.g. "Mt. Sutil, Galiano" becomes --\> "Galiano Island; Mount Sutil"
- e.g. "Magic lake horse padoc" becomes --\> "North Pender Island; Magic Lake; Magic Lake horse padoc"
- e.g. "fen below Magic Lake" becomes --\> "North Pender Island; Magic Lake; fen below lake"
- e.g. "small island S of prevost island" --\> "Prevost Island; small island to south"
- e.g. "north side of Cowichan Lake" becomes --\> "Cowichan; Cowichan Lake; north side of lake"
- If there is a string of words like 10 km north of Mount Suitl
- e.g. Galiano Island; Mount Sutil; 10 kilometers north of Mt. Sutil
10. **"Country" [REQUIRED] =** full name of country of collection (either Canada or United States of America)
11. **"stateProvince" [REQUIRED] =** Full name of province/ state of collection (either British Columbia or Washington)
12. **"island" [REQUIRED] =** full name of island (if collected on island)
- e.g. Galiano Island or Saltspring Island
- must contain the word "Island" capitalized
13. **"idQualifier" =** identification Qualifier.
- only fill out if there was a question mark or other note of uncertainty about the taxon identification from HJ
- basically replacing question marks and brackets with more formal annotations of uncertainty in ID
- e.g. Festuca (?) rubra or Festuca (rubra) --\> enter here as Festuca cf. rubra
- e.g. Festuca sp ? --\> enter here as Festuca cf. sp
- e.g Festuca rubra ? --\> enter here as cf. Festuca rubra
14. **"county" =** only fill out if on Vancouver Island or Mainland (otherwise it will be assigned depending on the island that it is on)
15. **"habitat" =** string describing the habitat conditions
- you can use commas when listing things
- to start a new sentence use a semicolon ";" rather than a period
- descriptions about type of species growing in area, ***but do not list associate species, this will be entered automatically at a later stage***
- If there are any words you can't make out or are uncertain of, use [brackets]? around the estimated word
- e.g. "Quercus woodland",
- e.g. "vernal seepage in limestone rock outcropping"
- e.g. "edges of lake"
- e.g. "floating in lake"
- e.g. "ditch"
- e.g. "aquatic"
- e.g. "bog"
- e.g. "marsh"
- e.g. "[intertidal]? marsh
- For forest type codes like Cw, Fd, write out the full species names: <https://www2.gov.bc.ca/gov/content/industry/forestry/managing-our-forest-resources/tree-seed/tree-seed-centre/seed-testing/codes>
16. **"locationRemarks" =** additional information about location. Separate sentences with semicolons
Anything pertaining to the general location and not what has already been said in the habitat field. If there are any words you can't make out or are uncertain of, use [brackets] around the estimated word
- e.g. "poly 15" (indicating geographic polygon for surveys)
- e.g. "slope 15 percent; aspect 240 degrees" (try to spell out degrees and percents)
- e.g. "mostly open water in lake"
17. **"vElevM" = verbatim elevation in meters.** If there is any information about elevation of the site, input into this column, WITHOUT units. It seems like all of HJ's measurements were in meters. On the off chance that they're not, convert to meters before entering.
18. **"vLat" & "vLon" =** verbatim latitude and longitude.
For some locations, HJ provided coordinate infromation in lat, long coordinates. If this is the case input this info here either as:
1. decimal degrees --\> e.g vLat = "45.678"
2. degrees, minuntes, seconds --\> e.g. "45 67 89" (do not include " symbols)
Everything will be converted to decimal degrees at a later stage.
19. **"vUTM" =** verbatim UTM coordinates. If provided, enter in the format:
- "10U 45000 56890"
20. **"vCoordUncM" = verbatim coordinate uncertainty in meters.** If coordinate uncertainty was clearly stated, input this in meters, without the "m" indicating meters.
21. **numPlantsCode = number of plants code.** HJ often used a number code beside taxon names to quickly indicate rough abundance. The key is as follows:
- "+" = 1 plants
- "1" = 1-5 plants
- "2" = 5-25 plants
- "3" = 25-50 plants
- "4" = 50-75 plants
- "5" = 75+ plants
When entering these codes, enter "+" as 0 and the rest of the numbers as they are stated
22. **"orgQuantity" & "orgQuantityType" = organism quantity and type.** Only use if HJ indicated abundance in some other way other than the number of plants code.
- e.g. note about particular species being abundant/ persistant/ dominant/ very few at site --\> orgQuantity = "abundant"/ "persistant" / "dominant", "very few" and "orgQuantityType = **"qualitative"**
- e.g. note about there only being one individual --\> orgQuantity = "1", orgQuantityType = "**individuals**"
- e.g. note about percent cover --\> orQuantity = "30 percent", orgQuantityType = "percent cover"
23. **"occRemarks" =** occurrence remarks. Anything pertaining to extra information about the observation that hasn't already been entered OR note about other annotations in book for that collection such as an asterisk or if the name has been crossed out and replaced with a new one.
- e.g. "no usual travel off"
- e.g. "dominant"
- e.g. "very little, tufts"
- e.g. "asterisk next to taxon name"
- e.g. "genus name originally X but crossed out"
- e.g. "uncollected"
- e.g. "mostly Typha at edge"
- e.g. "two large beds"
- e.g. "varigated"
- e.g. "non-flowering"
24. **"phenology"** = if there were any notes about flowering phenology or life stage (either vegetative, flowering, flowering and fruiting, fruiting and flowering, budding, budding and flowering)
- e.g. "non-flowering" enter as --\> "vegetative"
- e.g. "fl" enter as -\> "flowering"
- e.g. "has fruits" or "has seeds" --\> "fruiting"
25. **"recordedBy"** = people there at time of collection (full names or initials) only enter if there are additional names that Harvey denoted of people with him collecting.
- e.g. "PJ" = enter as"Pam Janszen"
26. **"idBy" = identification by**. Only fill out with names if there was a note about someone other than Harvey identifying it.
27. **"dataEntryRemaks" =** any observation with remarks here will be checked over. Write whatever notes are important to completing data entry for that row/ what needs to be helped with.
## When finished entering all data for book:
- ***Save the final raw data book as both .xlsx and .csv files***
## Important Notes:
1. **SAVE FREQUENTLY** (by pushing to github or backing up on OSF or Google Drive)
2. If certain names can't be deciphered or a location/ time can't be attributed to it, **still include as a row in the template,** with a valid **pageNum** an **numPage** entry. It can be removed later (this will help for re-finding things on the page). To help remember what you had a difficult time deciphering, make a note of it in "dataEntryRemarks" and you can additionally attempt to fill out the information and place square brackets around the uncertain parts --\> e.g. [add tophar]?
## **Tips for efficiency**
1. When you first start a new page or new event (location and date), **count the** number of observations you think you will have. Enter the date and locality info for the first observation and then **copy for the amount of rows you think you will need.**
2. Using iNaturalist for efficient species type and spelling check
- Starting to write a name in the search bar on the explore page and then filling the rest of the name out by looking at the result that pops up
- copy and paste this to sheet
# Metadata
- Abundance codes: from page 1 of the HJ-7 field journal
- "+" = 1
- "1" = 1-5
- "2" = 5-25
- "3" = 25-50"
- "4" = 50-75
- "5" = 75+
- Codes like Cw, Fd, Md all refer to forest type codes which can be found here: <https://www2.gov.bc.ca/gov/content/industry/forestry/managing-our-forest-resources/tree-seed/tree-seed-centre/seed-testing/codes>
- List of common abbreviations:
- Aira car = Aira caryophyllea
- Alli acu = Allium acuminatum
- Alli amp = Allium amplectens
- Atri pat = Atriplex patula
- Anth odo = Anthoxanthum odoratum
- Apha arv = Aphanes arvensis
- Arct col = Arctostaphylos columbiana
- Bra arv = Sinapis arvensis (formerly Brassica)
- Brom ste = Bromus sterilis
- Brom hor = Bromus hordeaceus
- Brom sit = Bromus sitchensis
- Card oli = Cardamine oligosperma
- Care ino = Carex inops
- Cerast arv = Cerastium arvense
- Clar amo = Clarkia amoena
- Clay par = Claytonia perfoliata
- Cyno cri = Cynosurus cristatus
- Cyno ech = Cynosurus echinatus
- Dact glo = Dactylis glomerata
- Dauc pus = Daucus pusillis
- Delp men = Delphinium menziesii
- Distich spi = Distichlis spicata
- Dant cal = Danthonia californica
- Eleo pal = Eleocharis palustris
- Elym gla = Elymus glaucus
- Epil ade = Epilobium adenocaulon?
- Epil pau = Epilobium parviflorum?
- Erod cic = Erodium cicutarium
- Erio lam = Eriophyllum lanatum
- Eryt ore = Erythronium oregonum
- Gali apa = Galium aparine
- Gara can = Rosa canina
- Gera mol = Geranium molle
- holo disc = Holodiscus discolor
- Holc lan = Holcus lanatus
- Hypo rad = Hypochaeris radicata
- Koel cri = Koelaria macrantha
- Lact mur = Lactuca muralis
- Lepi vir = Lepidium virginicum
- Loni his = Lonicera hispidula
- Loma utr = Lomatium utriculatum
- Litho par = Lithophragma parviflorum
- Lych cor = Silene coronaria (formerly Lychnis)
- Meli sub = Melica subulata
- Mimu gut = Erythranthe gutatta (formerly Mimulus)
- Madi gra = Madia gracilis
- Mont fon = Montia fontana
- Mont par = Montia parvifolia
- Nemo par = Nemophila parviflora
- Osmo chi = Osmorhiza berteroi (formerly chilensis)
- Plan mar = Plantago maritima
- Plect con = Plectritis congesta
- Pter aqu = Pteridium aquilinum
- Poa pra = Poa pratensis
- Raco can = Racomitrium canescens
- Rosa gym = Rosa gymnocarpa
- Sani cra = Sanicula crassicaulis
- Sedu lan = Sedum lanceolatum
- Sedu spa = Sedum spathulifolium
- Sela wal = Selaginella wallacei
- Stel med = Stellaria media
- Symp alb = Symphoricarpos albus
- Tori jap = Torilis japonica
- Trif dub = Trifolium dubium
- Trif micro = Trifolium microdon
- Trif miceph = Trifolium microcephalum
- Trif tri = Trifolium willdenovii
- Trit hya = Triteleia hyacinthina
- Vero arv = Veronica arvensis
- Vici hir = Vicia hirsuta
- Vici sat = Vicia sativa
- Vulp bro = Festuca bromoides