<!--
* Ecological Metadata Language (EML) - Data set structural descriptors
*
* Authors: Matt Jones, Zheng Wang, and Noah Goldstein
* Organization: National Center for Ecological Analysis and Synthesis
* For Details: http://www.nceas.ucsb.edu/
* Created: 1997 August 19
* Modified: 1999 June 23
* Version: 1.4
* File Info: '$Id$'
*
* Ecological Metadata Language is a general purpose metadata content
* specification for documenting ecological data. The specification
* consists of a series of modular document type definitions (DTD) that
* provide metadata content descriptors. It describes the owner and
* contents of the dataset (eml-dataset.dtd), the research context in
* which it was created (eml-context.dtd), the structural
* characteristics of data files (eml-file.dtd), the
* characteristics of variables in a file (eml-variable.dtd), current
* status of data and metadata files (eml-status.dtd), access control
* rules regarding the data and metadata (eml-access.dtd), software
* information (eml-software) and a variety of miscellaneous
* supplemental descriptors (eml-supplement.dtd).
*
* Files generated under the structural constraints of EML are
* plain-text files and therefore are editable in ordinary
* text-processors. However, these DTDs are intended for use within
* general purpose metadata editors, and within a more specific
* metadata editor being developed at NCEAS for the ecological
* community. This metadata editor will provide facilities for
* version control and efficient metadata entry.
* The purpose of this specification was to formalize the
* Michener et al. work in a structured language to examine its
* application to ecological data in a controlled manner.
*
* This specification was based on the work of the Ecological Society
* of America's Committee on the Future of Long Term Data, and more
* specifically on a related paper, Michener et al., 1997. See:
* Michener, William K., et al., 1997. Ecological Applications,
* "Nongeospatial metadata for the ecological sciences"
* Vol 7(1). pp. 330-342.
*
* Where appropriate, we have used elements of the ISO/TC 211 draft
* standard - the ISO Geographic information/Geomatics standard,
* which includes xml code, as well as ISO 8601 schema. Some elements
* in the ISO/TC 211 were expanded to allow for greater
* resolution.
*
* For an explanation of the classes of metadata and elements defined
* below, see Michener et al. 1997. In particular, the numbered comment
* labels found below refer to Table 1 (pp. 336-337) of Michener
* et al. 1997. In addition, each of the principal elements in the
* specification is accompanied by a FIXED attribute called "description"
* that provides a brief description of the content of the element. These
* descriptions are derived from Michener et al. 1997.
*
-->

<!-- * * * *
CLASS IV - DATA STRUCTURE DESCRIPTORS
* * * *
-->

<!-- Class 4 -->
<!ELEMENT eml-file (meta_file_id, file)>
<!ATTLIST eml-file description CDATA #FIXED "Data structural descriptors">
<!ELEMENT meta_file_id (#PCDATA)>
<!ATTLIST meta_file_id description CDATA #FIXED "Unique identifier of this metadata record">

<!-- Class 4.A -->
<!ELEMENT file (file_name, file_size?, format, orientation?, header_lines?,
                case_sensitive?, geographic_coverage*, temporal_coverage*,
                authentication*, data_problem*)>
<!ATTLIST file description CDATA #FIXED "Data set file">

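<!--
  Illustrative sketch only, not part of the original specification: a
  minimal instance that should satisfy the content model of "file"
  declared above. Child elements must appear in the declared order;
  only file_name and format are required. The file name, size, and
  delimiter values are hypothetical.

  <file>
    <file_name>plots1998.txt</file_name>
    <file_size>
      <amount>12</amount>
      <unit>kilobytes</unit>
    </file_size>
    <format>
      <base_file_type>
        <ASCII>
          <record_delimiter>newline</record_delimiter>
        </ASCII>
      </base_file_type>
    </format>
    <orientation columnorrow="columnmajor"/>
    <header_lines>1</header_lines>
    <case_sensitive yesorno="n"/>
  </file>
-->
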
<!-- Class 4.A.1 -->
<!ELEMENT file_name (#PCDATA) >
<!ATTLIST file_name description CDATA #FIXED "File name">

<!-- Class 4.A.2 -->
<!ELEMENT file_size (amount, unit) >
<!ATTLIST file_size description CDATA #FIXED "Size of file">
<!ELEMENT amount (#PCDATA) >
<!ATTLIST amount description CDATA #FIXED "Amount">
<!ELEMENT unit (#PCDATA) >
<!ATTLIST unit description CDATA #FIXED "Unit">

<!-- Class 4.A.3 -->
<!ELEMENT format (compressed|encoded|base_file_type)>
<!ATTLIST format description CDATA #FIXED "File type and compression schemes employed">
<!ELEMENT compressed (#PCDATA)>
<!ATTLIST compressed description CDATA #FIXED "Method of compression">
<!ELEMENT encoded (#PCDATA)>
<!ATTLIST encoded description CDATA #FIXED "Method of encoding">
<!ELEMENT base_file_type (ASCII|binary)>
<!ATTLIST base_file_type description CDATA #FIXED "Base file type">
<!ELEMENT ASCII (record_delimiter)>
<!ATTLIST ASCII description CDATA #FIXED "ASCII">
<!ELEMENT record_delimiter (#PCDATA)>
<!ATTLIST record_delimiter description CDATA #FIXED "Record delimiter">
<!ELEMENT binary EMPTY>
<!ATTLIST binary description CDATA #FIXED "Binary">

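<!--
  Illustrative sketch only, not part of the original specification:
  because "format" is declared as a choice group, an instance holds
  exactly one of compressed, encoded, or base_file_type. The value
  "gzip" is a hypothetical example; the DTD does not constrain the
  text of "compressed".

  <format>
    <compressed>gzip</compressed>
  </format>
-->
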
<!-- NEW SECTION -->
<!ELEMENT orientation EMPTY >
<!ATTLIST orientation columnorrow (columnmajor|rowmajor) "columnmajor"
                      description CDATA #FIXED "Is file column major or row major?">

<!-- Class 4.A.4 -->
<!ELEMENT header_lines (#PCDATA) >
<!ATTLIST header_lines description CDATA #FIXED "Number of header lines or information that prepares data">

<!-- Class 4.A.5 -->
<!ELEMENT case_sensitive EMPTY >
<!ATTLIST case_sensitive yesorno (y|n) "n"
                         description CDATA #FIXED "Case sensitivity of field">

<!-- Class 4.A.6 -->
<!-- the field "specialfield" was removed as there were questions about its usefulness -->

<!ELEMENT geographic_coverage (paragraph*, coordinates*)>
<!ATTLIST geographic_coverage description CDATA #FIXED "Geographic coverage">

<!ELEMENT coordinates (lattitude, longitude) >
<!ATTLIST coordinates description CDATA #FIXED "Cartesian Coordinates">
<!ELEMENT lattitude (#PCDATA)>
<!ATTLIST lattitude direction (north|south) #REQUIRED
                    description CDATA #FIXED "Lattitude coordinates">
<!ELEMENT longitude (#PCDATA)>
<!ATTLIST longitude direction (east|west) #REQUIRED
                    description CDATA #FIXED "Longitude coordinates">

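<!--
  Illustrative sketch only, not part of the original specification: a
  geographic_coverage instance with one descriptive paragraph and one
  coordinate pair. The element name "lattitude" is spelled exactly as
  declared above. The site description and the decimal-degree values
  are hypothetical; the DTD does not prescribe a numeric format, and
  the hemisphere is carried by the required "direction" attribute.

  <geographic_coverage>
    <paragraph>Hypothetical coastal study site.</paragraph>
    <coordinates>
      <lattitude direction="north">34.41</lattitude>
      <longitude direction="west">119.85</longitude>
    </coordinates>
  </geographic_coverage>
-->
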
<!ELEMENT temporal_coverage (start_date,stop_date?)>
<!ATTLIST temporal_coverage description CDATA #FIXED "Temporal coverage">
<!ELEMENT start_date (datetime)>
<!ATTLIST start_date description CDATA #FIXED "Date data collection commenced">
<!ELEMENT datetime (year?, month?, day?, hour?, minute?, second?, second_fraction?, time_zone_designator?)>
<!ATTLIST datetime description CDATA #FIXED "Date and time as designated in ISO 8601">
<!ELEMENT year (#PCDATA)>
<!ATTLIST year description CDATA #FIXED "Year in YYYY format">
<!ELEMENT month (#PCDATA)>
<!ATTLIST month description CDATA #FIXED "Month in MM format">
<!ELEMENT day (#PCDATA)>
<!ATTLIST day description CDATA #FIXED "Day of month in DD format">
<!ELEMENT hour (#PCDATA)>
<!ATTLIST hour description CDATA #FIXED "Hour in hh">
<!ELEMENT minute (#PCDATA)>
<!ATTLIST minute description CDATA #FIXED "Minute in mm">
<!ELEMENT second (#PCDATA)>
<!ATTLIST second description CDATA #FIXED "Seconds in ss">
<!ELEMENT second_fraction (#PCDATA)>
<!ATTLIST second_fraction description CDATA #FIXED "One or more digits representing a decimal fraction of a second">
<!ELEMENT time_zone_designator (#PCDATA)>
<!ATTLIST time_zone_designator description CDATA #FIXED "Time zone designator">

<!ELEMENT stop_date (datetime)>
<!ATTLIST stop_date description CDATA #FIXED "Date data collection terminated">

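<!--
  Illustrative sketch only, not part of the original specification: a
  temporal_coverage instance that records dates to day precision. Every
  child of "datetime" is optional, so the time-of-day elements are
  simply omitted here. The dates themselves are hypothetical.

  <temporal_coverage>
    <start_date>
      <datetime>
        <year>1998</year>
        <month>03</month>
        <day>01</day>
      </datetime>
    </start_date>
    <stop_date>
      <datetime>
        <year>1998</year>
        <month>09</month>
        <day>30</day>
      </datetime>
    </stop_date>
  </temporal_coverage>
-->
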
<!-- Class 4.A.7 -->
<!ELEMENT authentication (#PCDATA) >
<!ATTLIST authentication description CDATA #FIXED "Authentication procedures or techniques">

<!-- Class 4.B -->
<!-- Class 4.B (variable) was moved to its own module (eml-variable.dtd) -->

<!-- Class 4.C -->
<!ELEMENT data_problem (paragraph+)>
<!ATTLIST data_problem problemtype (missing|anomaly|error) #REQUIRED
                       description CDATA #FIXED "Data anomalies; description of missing data">
<!ELEMENT paragraph (#PCDATA)>
<!ATTLIST paragraph description CDATA #FIXED "Paragraph of text">

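<!--
  Illustrative sketch only, not part of the original specification: a
  data_problem instance. The "problemtype" attribute is required and
  must be one of missing, anomaly, or error; at least one paragraph is
  required. The paragraph text is hypothetical.

  <data_problem problemtype="missing">
    <paragraph>No observations were recorded for two plots in week 12.</paragraph>
  </data_problem>
-->
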
<!-- End of file -->