Project

General

Profile

Bug #4008 » EML 2 Dataset Mixed Permissions trunk.xml

Oliver Soong, 04/21/2009 04:42 PM

 
1
<?xml version="1.0" standalone="no"?>
2
<!DOCTYPE entity PUBLIC "-//UC Berkeley//DTD MoML 1//EN"
3
    "http://ptolemy.eecs.berkeley.edu/xml/dtd/MoML_1.dtd">
4
<entity name="model" class="ptolemy.actor.TypedCompositeActor">
5
    <property name="_createdBy" class="ptolemy.kernel.attributes.VersionAttribute" value="7.2.devel">
6
    </property>
7
    <property name="SDF Director" class="ptolemy.domains.sdf.kernel.SDFDirector">
8
        <property name="iterations" class="ptolemy.data.expr.Parameter" value="1">
9
        </property>
10
        <property name="vectorizationFactor" class="ptolemy.data.expr.Parameter" value="1">
11
        </property>
12
        <property name="allowDisconnectedGraphs" class="ptolemy.data.expr.Parameter" value="false">
13
        </property>
14
        <property name="allowRateChanges" class="ptolemy.data.expr.Parameter" value="false">
15
        </property>
16
        <property name="constrainBufferSizes" class="ptolemy.data.expr.Parameter" value="true">
17
        </property>
18
        <property name="period" class="ptolemy.data.expr.Parameter" value="0.0">
19
        </property>
20
        <property name="synchronizeToRealTime" class="ptolemy.data.expr.Parameter" value="false">
21
        </property>
22
        <property name="timeResolution" class="ptolemy.actor.parameters.SharedParameter" value="1E-10">
23
        </property>
24
        <property name="Scheduler" class="ptolemy.domains.sdf.kernel.SDFScheduler">
25
            <property name="constrainBufferSizes" class="ptolemy.data.expr.Parameter" value="constrainBufferSizes">
26
            </property>
27
        </property>
28
<property name="KeplerDocumentation" class="ptolemy.vergil.basic.KeplerDocumentationAttribute">
29
<property name="description" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
30
<property name="author" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Steve Neuendorffer</configure></property>
31
<property name="version" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
32
<property name="userLevelDocumentation" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>&#10;&lt;p&gt;The SDF Director is often used to oversee fairly simple, sequential workflows in which the director can determine the order of actor invocation from the workflow. Types of workflows that would run well under an SDF Director include processing and reformatting tabular data, converting one data type to another, and reading and plotting a series of data points. A workflow in which an image is read, processed (rotated, scaled, clipped, filtered, etc.), and then displayed, is also an example of a sequential workflow that requires a director simply to ensure that each actor fires in the proper order (i.e., that each actor executes only after it receives its required inputs).&lt;/p&gt;&#10;&#10;&lt;p&gt;The SDF Director is very efficient and will not tax system resources with overhead. However, this efficiency requires that certain conditions be met, namely that the data consumption and production rate of each actor in an SDF workflow be constant and declared. If an actor reads one piece of data and calculates and outputs a single result, it must always read and output a single token of data. This data rate cannot change during workflow execution and, in general, workflows that require dynamic scheduling and/or flow control cannot use this director. Additionally, the SDF Director has no understanding of passing time (at least by default), and actors that depend on a notion of time may not work as expected. For example, a TimedPlotter actor will plot all values at time zero when used in SDF. &lt;/p&gt;&#10;&#10;&lt;p&gt;By default, the SDF Director requires that all actors in its workflow be connected. Otherwise, the director cannot account for concurrency between disconnected workflow parts. Usually, a PN Director should be used for workflows that contain disconnected actors; however, the SDF Director's allowDisconnectedGraphs parameter may also be set to true. The SDF Director will then schedule each disconnected &quot;island&quot; independently. The director cannot infer the sequential relationship between disconnected actors (i.e., nothing forces the director to finish executing all actors on one island before firing actors on another). However, the order of execution within each island should be correct. Usually, disconnected graphs in an SDF model indicate an error.&lt;/p&gt;&#10; &#10;&lt;p&gt;Because SDF Directors schedule actors to fire only after they receive their inputs, workflows that require loops (feeding an actor's output back into its input port for further processing) can cause &quot;deadlock&quot; errors. The deadlock errors occur because the actor depends on its own output value as an initial input. To fix this problem, use a SampleDelay actor to generate and inject an initial input value into the workflow.&lt;/p&gt;&#10;&#10;&lt;p&gt;The SDF Director determines the order in which actors execute and how many times each actor needs to be fired to complete a single iteration of the workflow. This schedule is calculated BEFORE the director begins to iterate the workflow. Because the SDF Director calculates a schedule in advance, it is quite efficient. However, SDF workflows must be static. In other words, the same number of tokens must be consumed/produced at every iteration of the workflow. Workflows that require dynamic control structures, such as a BooleanSwitch actor that sends output on one of two ports depending on the value of a 'control', cannot be used with an SDF Director because the number of tokens on each output can change for each execution.&lt;/p&gt;&#10;&#10;&lt;p&gt;Unless otherwise specified, the SDF Director assumes that each actor consumes and produces exactly one token per channel on each firing. Actors that do not follow the one-token-per-channel firing convention (e.g., Repeat or Ramp) must declare the number of tokens they produce or consume via the appropriate parameters. &lt;/p&gt;&#10;&#10;&lt;p&gt;The number of times a workflow is iterated is controlled by the director's iterations parameter. By default, this parameter is set to &quot;0&quot;. Note that &quot;0&quot; does not mean &quot;no iterations.&quot; Rather, &quot;0&quot; means that the workflow will iterate forever. Values greater than zero specify the actual number of times the director should execute the entire workflow. A value of 1, meaning that the director will run the workflow once, is often the best setting when building an SDF workflow. &lt;/p&gt;&#10;&#10;&lt;p&gt;The amount of data processed by an SDF workflow is a function of both the number of times the workflow iterates and the value of the director's vectorizationFactor parameter. The vectorizationFactor is used to increase the efficiency of a workflow by increasing the number of times actors fire each time the workflow iterates. If the parameter is set to a positive integer (other than 1), the director will fire each actor the specified number of times more than normal. The default is 1, indicating that no vectorization should be performed. Keep in mind that changing the vectorizationFactor parameter changes the meaning of a nested SDF workflow and may cause deadlock in a workflow that uses it. &lt;/p&gt;&#10;&#10;&lt;p&gt;The SDF Director has several advanced parameters that are generally only relevant when an SDF workflow contains composite components. In most cases the period, timeResolution, synchronizeToRealTime, allowRateChanges, timeResolution, and constrainBufferSizes parameters can be left at their default values.&lt;/p&gt;&#10;&#10;&lt;p&gt;For more information about the SDF Director, see the Ptolemy documentation (http://ptolemy.eecs.berkeley.edu/papers/05/ptIIdesign3-domains/ptIIdesign3-domains.pdf).&lt;/p&gt;&#10;&#10;</configure></property>
33
<property name="prop:allowDisconnectedGraphs" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify whether to allow disconnected actors in the workflow (by default, all actors are required to be connected). If disconnected actors are permitted, the SDF Director will schedule each disconnected 'island' independently. Nothing &quot;forces&quot; the director to finish executing all actors on one island before firing actors on another. However, the order of execution within each island should be correct. Usually, disconnected graphs in an SDF workflow indicate an error.</configure></property>
34
<property name="prop:allowRateChanges" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify whether dynamic rate changes are permitted or not. By default, rate changes are not permitted, and the director will perform a check to disallow such workflows. If the parameter is selected, then workflows that require rate parameters to be modified during execution are valid, and the SDF Director will dynamically compute a new schedule at runtime. This is an advanced parameter that can usually be left at its default value.</configure></property>
35
<property name="prop:timeResolution" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The time precision used by this director. All time values are rounded to the nearest multiple of this number. The value is a double that defaults to &quot;1E-10&quot; (which is 10-10). This is an advanced parameter that can usually be left at its default value.</configure></property>
36
<property name="prop:constrainBufferSizes" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify whether buffer sizes are fixed. By default, buffers are fixed, and attempts to write to the buffer that cause the buffer to exceed its scheduled size result in an error. This is an advanced parameter that can usually be left at its default value.</configure></property>
37
<property name="prop:iterations" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify the number of times a workflow is iterated. By default, this parameter is set to &quot;0&quot;. Note that &quot;0&quot; does not mean &quot;no iterations.&quot; Rather, &quot;0&quot; means that the workflow will iterate forever. Values greater than zero specify the actual number of times the director should execute the entire workflow. A value of 1, meaning that the director will run the workflow once, is often the best setting when building an SDF workflow. </configure></property>
38
<property name="prop:vectorizationFactor" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The vectorizationFactor is used to increase the efficiency of a workflow by increasing the number of times actors fire each time the workflow iterates. If the parameter is set to a positive integer (other than 1), the director will fire each actor the specified number of times more than normal. The default is 1, indicating that no vectorization should be performed. Keep in mind that changing the vectorizationFactor parameter changes the meaning of a nested SDF workflow and may cause deadlock in a workflow that uses it. </configure></property>
39
<property name="prop:synchronizeToRealTime" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify whether the execution should synchronize to real time or not. By default, the director does not synchronize to real time. If synchronize is selected, the director will only process the workflow when elapsed real time matches the product of the period parameter and the iteration count. Note: if the period parameter has a value of 0.0 (the default), then selecting this parameter has no effect. This is an advanced parameter that can usually be left at its default value.</configure></property>
40
<property name="prop:period" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The time period of each iteration. The value is a double that defaults to 0.0, which means that the director does not increment workflow time. If the value greater than 0.0, the actor will increment workflow time each time it fires. This is an advanced parameter that can usually be left at its default value. </configure></property>
41
</property>        <property name="entityId" class="org.kepler.moml.NamedObjId" value="urn:lsid:kepler-project.org:director:1:1">
42
        </property>
43
        <property name="class" class="ptolemy.kernel.util.StringAttribute" value="ptolemy.domains.sdf.kernel.SDFDirector">
44
            <property name="id" class="ptolemy.kernel.util.StringAttribute" value="urn:lsid:kepler-project.org:directorclass:1:1">
45
            </property>
46
        </property>
47
        <property name="semanticType00" class="org.kepler.sms.SemanticType" value="urn:lsid:localhost:onto:1:1#Director">
48
        </property>
49
        <property name="semanticType11" class="org.kepler.sms.SemanticType" value="urn:lsid:localhost:onto:2:1#Director">
50
        </property>
51
        <property name="_location" class="ptolemy.kernel.util.Location" value="[60.0, 35.0]">
52
        </property>
53
    </property>
54
    <property name="_windowProperties" class="ptolemy.actor.gui.WindowPropertiesAttribute" value="{bounds={203, 195, 873, 633}, maximized=false}">
55
    </property>
56
    <property name="_vergilSize" class="ptolemy.actor.gui.SizeAttribute" value="[600, 485]">
57
    </property>
58
    <property name="_vergilZoomFactor" class="ptolemy.data.expr.ExpertParameter" value="1.0">
59
    </property>
60
    <property name="_vergilCenter" class="ptolemy.data.expr.ExpertParameter" value="{300.0, 242.5}">
61
    </property>
62
    <entity name="Kruger National Park megaherbivore census 2001-" class="org.ecoinformatics.seek.datasource.eml.eml2.Eml200DataSource">
63
        <property name="emlFilePath" class="ptolemy.data.expr.FileParameter" value="">
64
            <display name="EML File"/>
65
        </property>
66
        <property name="dataFilePath" class="ptolemy.data.expr.FileParameter" value="">
67
            <display name="Data File"/>
68
        </property>
69
        <property name="schemaDef" class="ptolemy.kernel.util.StringAttribute" value="&lt;schema&gt;&#10;  &lt;table name=&quot;elephantbuffalo2003.txt&quot;&gt;&#10;    &lt;field name=&quot;year&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;MEGAHERBIVORE CENSUS LUT.txt&quot;&gt;&#10;    &lt;field name=&quot;Code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Species&quot; dataType=&quot;STRING&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;elephantbuffalo2001.txt&quot;&gt;&#10;    &lt;field name=&quot;Date&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;elephantbuffalo2002.txt&quot;&gt;&#10;    &lt;field name=&quot;year&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves_counted&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;&lt;/schema&gt;&#10;">
70
            <display name="Schema Definition"/>
71
            <property name="schemaDef" class="ptolemy.actor.gui.style.TextStyle">
72
                <property name="height" class="ptolemy.data.expr.Parameter" value="10">
73
                </property>
74
                <property name="width" class="ptolemy.data.expr.Parameter" value="30">
75
                </property>
76
            </property>
77
        </property>
78
        <property name="sqlDef" class="ptolemy.kernel.util.StringAttribute">
79
            <display name="SQL Command"/>
80
            <property name="sqlDef" class="ptolemy.actor.gui.style.TextStyle">
81
                <property name="height" class="ptolemy.data.expr.Parameter" value="10">
82
                </property>
83
                <property name="width" class="ptolemy.data.expr.Parameter" value="30">
84
                </property>
85
            </property>
86
        </property>
87
        <property name="selectedEntity" class="ptolemy.data.expr.StringParameter" value="elephantbuffalo2001.txt">
88
            <display name="Selected Entity"/>
89
        </property>
90
        <property name="dataOutputFormat" class="ptolemy.data.expr.StringParameter" value="As Cache File Name">
91
            <display name="Data Output Format"/>
92
        </property>
93
        <property name="fileExtensionFilter" class="ptolemy.data.expr.StringParameter" value="">
94
            <display name="File Extension Filter"/>
95
        </property>
96
        <property name="isLenient" class="ptolemy.data.expr.Parameter" value="false">
97
            <display name="Allow lenient data parsing"/>
98
        </property>
99
        <property name="checkVersion" class="ptolemy.data.expr.Parameter" value="false">
100
            <display name="Check for latest version"/>
101
        </property>
102
        <property name="_tableauFactory" class="org.kepler.objectmanager.data.db.QBTableauFactory">
103
            <property name="sqlName" class="ptolemy.kernel.util.StringAttribute" value="sqlDef">
104
            </property>
105
            <property name="schemaName" class="ptolemy.kernel.util.StringAttribute" value="schemaDef">
106
            </property>
107
        </property>
108
        <property name="recordid" class="ptolemy.kernel.util.StringAttribute" value="judithk.306.16">
109
        </property>
110
        <property name="endpoint" class="ptolemy.kernel.util.StringAttribute" value="http://ecogrid.ecoinformatics.org/knb/services/QueryService">
111
        </property>
112
        <property name="namespace" class="ptolemy.kernel.util.StringAttribute" value="eml://ecoinformatics.org/eml-2.0.1">
113
        </property>
114
<property name="KeplerDocumentation" class="ptolemy.vergil.basic.KeplerDocumentationAttribute">
115
<property name="description" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
116
<property name="author" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>matt jones, jing tao, chad</configure></property>
117
<property name="version" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
118
<property name="userLevelDocumentation" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>&lt;p&gt;The EML2Dataset actor provides access to a wide variety of data packages that have been described using EML (Ecological Metadata Language). The actor accesses an EML dataset and handles the mechanical issues involved in parsing metadata, downloading the dataset (if applicable), and emitting data to downstream actors.&lt;/p&gt;&#10;&#10;&lt;p&gt;Each data package contains an EML metadata description and one or more data entities (e.g., data tables, spatial raster images, spatial vector images). The EML metadata allows the described data to be easily ingested into Kepler and exposed for use in downstream components. The data packages can be accessed from the local file system or through any EcoGrid server that provides access to its collection of data objects. The supported data transfer protocols include http, ftp, file, ecogrid, and srb. &lt;/p&gt;&#10;&#10;&lt;p&gt;After parsing the EML metadata, the actor automatically reconfigures its exposed ports to provide one port for each attribute described by the first entity of the EML description. For example, if the first entity is a data table with four columns, the ports might be &quot;Site&quot;, &quot;Date&quot;, &quot;Plot&quot;, and &quot;Rainfall.&quot; These details are obtained from the EML document. To preview the data, right-click the actor icon and select Preview from the drop-down menu.&lt;/p&gt;&#10;&#10;&lt;p&gt;By default, the ports created by the EML2Dataset actor represent data fields, and one tuple of data (e.g., one database row) is emitted on these ports each time the actor fires. Alternatively, the actor can be configured so that the ports represent an array of field values (&quot;AsColumnVector&quot;), or so that the ports represent an entire table of data (&quot;AsTable&quot;). Data tables are formatted in comma-separated-value (CSV) format.&lt;/p&gt;&#10;&#10;&lt;p&gt;If more than one data entity is described in the EML metadata, then the output of the actor defaults to the first entity listed in the EML. To select the other entities, specify an entity with the Selected parameter, or use the Query Builder to describe the filter and join that should be used to produce the data output. To use the Query Builder, right-click the actor and select &quot;Open Actor.&quot; Specify the fields to be output and any filtering constraints to be applied.&lt;/p&gt;</configure></property>
119
<property name="port:output" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Output ports are automatically configured to provide one port for each attribute in the first entity described in the EML description. For example, if the first entity is a data table with four columns, the ports might be &quot;Site&quot;, &quot;Date&quot;, &quot;Plot&quot;, and &quot;Rainfall.&quot; These details are obtained from the EML document.</configure></property>
120
<property name="port:trigger" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>A multiport that has no declared type (in other words, the port can accept any data type: double, int, array, etc.) If the port is connected, the actor will not fire until the trigger port receives an input token. Connecting the port is optional, but useful when scheduling the actor to perform at a certain time.</configure></property>
121
<property name="prop:isLenient" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>If this parameter is selected, &quot;extra&quot; columns of data (e.g., comments that people have entered on a line or something of that nature) that are not described in the metadata are ignored, allowing the workflow to execute. If the option is unchecked (the default), the workflow execution will halt until the discrepancy between the data and metadata is corrected.</configure></property>
122
<property name="prop:checkVersion" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Select this parameter to check the EarthGrid for updates to the data. If the actor finds a version of the data that is more recent than the cached data on your local system, the actor will prompt the user to either download the latest data and metadata or ignore the newer version. Note that different versions of the data can have vastly different structures (new columns, or even new tables of data might be included or removed). If this parameter is selected, users should be prepared to handle changes that might arise from differences in the data structure.</configure></property>
123
<property name="prop:dataOutputFormat" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify which ports are created for the actor and what data is emitted on those ports during each fire cycle. For example, this field can be configured to produce one port for each column in a data table, or one port that emits the entire data table at once in CSV format. Specifically, the output format choices are:&#10;&lt;table&gt;&#10;&lt;tr&gt;&lt;td&gt;As Field (the default)&lt;/td&gt;&lt;td&gt; The actor creates one output port for each field (i.e., column/attribute/variable) that is described in the EML metadata for the data package. The type of each port (e.g., string, int, double, etc.) matches the base type of the field. If a query statement has been used to subset the data, then only those fields selected in the query statement will be configured as ports. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Table&lt;/td&gt;&lt;td&gt; The selected data will be output as a string that contains the entire entity data. The actor creates three output ports: DataTable - the data itself, Delimiter - delimiter to separate fields, and NumColumns - the number of fields in the table. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Row&lt;/td&gt;&lt;td&gt;  One tuple of selected data is formatted as an array and output. The actor creates one output port (DataRow), and the data type is a record containing each of the individual data fields. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Byte Array&lt;/td&gt;&lt;td&gt; Selected data will be output as an array of bytes. The actor creates two output ports: BinaryData - contains the raw data itself, and EndOfStream - indicates whether the end of data stream has been reached.&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As UnCompressed File Name&lt;/td&gt;&lt;td&gt; This format is only used when the data package is a compressed file (zip, tar, et al). The compressed archive file is uncompressed after it is downloaded. The actor creates one output port, which contains an array of the filenames of all of the uncompressed archive files. If a FileExtensionFilter is specified, then the array will only contain files that match the specified extension.&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As Cache File Name&lt;/td&gt;&lt;td&gt;   Kepler stores data files downloaded from remote sites into its cache system. This output format will send the local cache file path for the data package so that workflow designers can directly access the cache files. The actor creates two output ports: CacheLocalFileName (the local file path) and CacheResourceName (the EML data link. e.g., ecogrid://knb/tao.2.1).&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As Column Vector&lt;/td&gt;&lt;td&gt; This output format is similar to &quot;As Field&quot;, except instead of sending out a single value on each port, the actor sends out an array of all of the data for each field. The type of each port is an array of the base type for the field.&lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As ColumnBased Record&lt;/td&gt;&lt;td&gt;  This output format sends all data on one port using a record structure that encapsulates the entire data entity. The record will contain one array for each data field, and the type of each array will be determined by the type of the field it represents.&lt;/td&gt;&lt;/tr&gt;&#10;&lt;/table&gt;&#10;&#10;</configure></property>
124
<property name="prop:selectedEntity" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>If this EML data package has multiple entities, the selectedEntity parameter specifies which entity should be output. When this parameter is unset (the default), data from the first entity described in an EML package is output. This parameter is only used if no query statement is specified, or if a query statement is used and the output format is one of &quot;As Table&quot;, &quot;As Byte Array&quot;, &quot;As Uncompressed File Name&quot;, and &quot;As Cache File Name&quot;. To specify a query statement, right-click the actor and select Open Actor.</configure></property>
125
<property name="prop:emlFilePath" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The file path of a local EML metadata file used to describe and access an EML data set.</configure></property>
126
<property name="prop:fileExtensionFilter" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>A file extension used to limit the array of file names returned by the actor when the selected output type is &quot;As UnCompressed File Name&quot;. This parameter is ignored for other output types.</configure></property>
127
<property name="prop:dataFilePath" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The path to a local data file described by EML (must be used in conjunction with a local EML file). The actor will retrieve the data and automatically configure its ports to output it.</configure></property>
128
</property>        <property name="_location" class="ptolemy.kernel.util.Location" value="[175.0, 125.0]">
129
        </property>
130
        <port name="CacheLocalFileName" class="ptolemy.actor.TypedIOPort">
131
            <property name="output"/>
132
        </port>
133
        <port name="CacheResourceName" class="ptolemy.actor.TypedIOPort">
134
            <property name="output"/>
135
        </port>
136
    </entity>
137
    <entity name="Kruger National Park megaherbivore census 2001-2" class="org.ecoinformatics.seek.datasource.eml.eml2.Eml200DataSource">
138
        <property name="emlFilePath" class="ptolemy.data.expr.FileParameter" value="">
139
            <display name="EML File"/>
140
        </property>
141
        <property name="dataFilePath" class="ptolemy.data.expr.FileParameter" value="">
142
            <display name="Data File"/>
143
        </property>
144
        <property name="schemaDef" class="ptolemy.kernel.util.StringAttribute" value="&lt;schema&gt;&#10;  &lt;table name=&quot;elephantbuffalo2003.txt&quot;&gt;&#10;    &lt;field name=&quot;year&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;MEGAHERBIVORE CENSUS LUT.txt&quot;&gt;&#10;    &lt;field name=&quot;Code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Species&quot; dataType=&quot;STRING&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;elephantbuffalo2001.txt&quot;&gt;&#10;    &lt;field name=&quot;Date&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;  &lt;table name=&quot;elephantbuffalo2002.txt&quot;&gt;&#10;    &lt;field name=&quot;year&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;Specie_code&quot; dataType=&quot;STRING&quot;/&gt;&#10;    &lt;field name=&quot;total&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Calves_counted&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;X_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;    &lt;field name=&quot;Y_coordinate&quot; dataType=&quot;FLOAT&quot;/&gt;&#10;  &lt;/table&gt;&#10;&lt;/schema&gt;&#10;">
145
            <display name="Schema Definition"/>
146
            <property name="schemaDef" class="ptolemy.actor.gui.style.TextStyle">
147
                <property name="height" class="ptolemy.data.expr.Parameter" value="10">
148
                </property>
149
                <property name="width" class="ptolemy.data.expr.Parameter" value="30">
150
                </property>
151
            </property>
152
        </property>
153
        <property name="sqlDef" class="ptolemy.kernel.util.StringAttribute">
154
            <display name="SQL Command"/>
155
            <property name="sqlDef" class="ptolemy.actor.gui.style.TextStyle">
156
                <property name="height" class="ptolemy.data.expr.Parameter" value="10">
157
                </property>
158
                <property name="width" class="ptolemy.data.expr.Parameter" value="30">
159
                </property>
160
            </property>
161
        </property>
162
        <property name="selectedEntity" class="ptolemy.data.expr.StringParameter" value="elephantbuffalo2002.txt">
163
            <display name="Selected Entity"/>
164
        </property>
165
        <property name="dataOutputFormat" class="ptolemy.data.expr.StringParameter" value="As Cache File Name">
166
            <display name="Data Output Format"/>
167
        </property>
168
        <property name="fileExtensionFilter" class="ptolemy.data.expr.StringParameter" value="">
169
            <display name="File Extension Filter"/>
170
        </property>
171
        <property name="isLenient" class="ptolemy.data.expr.Parameter" value="false">
172
            <display name="Allow lenient data parsing"/>
173
        </property>
174
        <property name="checkVersion" class="ptolemy.data.expr.Parameter" value="false">
175
            <display name="Check for latest version"/>
176
        </property>
177
        <property name="_tableauFactory" class="org.kepler.objectmanager.data.db.QBTableauFactory">
178
            <property name="sqlName" class="ptolemy.kernel.util.StringAttribute" value="sqlDef">
179
            </property>
180
            <property name="schemaName" class="ptolemy.kernel.util.StringAttribute" value="schemaDef">
181
            </property>
182
        </property>
183
        <property name="recordid" class="ptolemy.kernel.util.StringAttribute" value="judithk.306.16">
184
        </property>
185
        <property name="endpoint" class="ptolemy.kernel.util.StringAttribute" value="http://ecogrid.ecoinformatics.org/knb/services/QueryService">
186
        </property>
187
        <property name="namespace" class="ptolemy.kernel.util.StringAttribute" value="eml://ecoinformatics.org/eml-2.0.1">
188
        </property>
189
<property name="KeplerDocumentation" class="ptolemy.vergil.basic.KeplerDocumentationAttribute">
190
<property name="description" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
191
<property name="author" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>matt jones, jing tao, chad</configure></property>
192
<property name="version" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
193
<property name="userLevelDocumentation" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>&lt;p&gt;The EML2Dataset actor provides access to a wide variety of data packages that have been described using EML (Ecological Metadata Language). The actor accesses an EML dataset and handles the mechanical issues involved in parsing metadata, downloading the dataset (if applicable), and emitting data to downstream actors.&lt;/p&gt;&#10;&#10;&lt;p&gt;Each data package contains an EML metadata description and one or more data entities (e.g., data tables, spatial raster images, spatial vector images). The EML metadata allows the described data to be easily ingested into Kepler and exposed for use in downstream components. The data packages can be accessed from the local file system or through any EcoGrid server that provides access to its collection of data objects. The supported data transfer protocols include http, ftp, file, ecogrid, and srb. &lt;/p&gt;&#10;&#10;&lt;p&gt;After parsing the EML metadata, the actor automatically reconfigures its exposed ports to provide one port for each attribute described by the first entity of the EML description. For example, if the first entity is a data table with four columns, the ports might be &quot;Site&quot;, &quot;Date&quot;, &quot;Plot&quot;, and &quot;Rainfall.&quot; These details are obtained from the EML document. To preview the data, right-click the actor icon and select Preview from the drop-down menu.&lt;/p&gt;&#10;&#10;&lt;p&gt;By default, the ports created by the EML2Dataset actor represent data fields, and one tuple of data (e.g., one database row) is emitted on these ports each time the actor fires. Alternatively, the actor can be configured so that the ports represent an array of field values (&quot;AsColumnVector&quot;), or so that the ports represent an entire table of data (&quot;AsTable&quot;). Data tables are formatted in comma-separated-value (CSV) format.&lt;/p&gt;&#10;&#10;&lt;p&gt;If more than one data entity is described in the EML metadata, then the output of the actor defaults to the first entity listed in the EML. To select the other entities, specify an entity with the Selected parameter, or use the Query Builder to describe the filter and join that should be used to produce the data output. To use the Query Builder, right-click the actor and select &quot;Open Actor.&quot; Specify the fields to be output and any filtering constraints to be applied.&lt;/p&gt;</configure></property>
194
<property name="port:output" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Output ports are automatically configured to provide one port for each attribute in the first entity described in the EML description. For example, if the first entity is a data table with four columns, the ports might be &quot;Site&quot;, &quot;Date&quot;, &quot;Plot&quot;, and &quot;Rainfall.&quot; These details are obtained from the EML document.</configure></property>
195
<property name="port:trigger" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>A multiport that has no declared type (in other words, the port can accept any data type: double, int, array, etc.) If the port is connected, the actor will not fire until the trigger port receives an input token. Connecting the port is optional, but useful when scheduling the actor to perform at a certain time.</configure></property>
196
<property name="prop:isLenient" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>If this parameter is selected, &quot;extra&quot; columns of data (e.g., comments that people have entered on a line or something of that nature) that are not described in the metadata are ignored, allowing the workflow to execute. If the option is unchecked (the default), the workflow execution will halt until the discrepancy between the data and metadata is corrected.</configure></property>
197
<property name="prop:checkVersion" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Select this parameter to check the EarthGrid for updates to the data. If the actor finds a version of the data that is more recent than the cached data on your local system, the actor will prompt the user to either download the latest data and metadata or ignore the newer version. Note that different versions of the data can have vastly different structures (new columns, or even new tables of data might be included or removed). If this parameter is selected, users should be prepared to handle changes that might arise from differences in the data structure.</configure></property>
198
<property name="prop:selectedEntity" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>If this EML data package has multiple entities, the selectedEntity parameter specifies which entity should be output. When this parameter is unset (the default), data from the first entity described in an EML package is output. This parameter is only used if no query statement is specified, or if a query statement is used and the output format is one of &quot;As Table&quot;, &quot;As Byte Array&quot;, &quot;As Uncompressed File Name&quot;, and &quot;As Cache File Name&quot;. To specify a query statement, right-click the actor and select Open Actor.</configure></property>
199
<property name="prop:dataOutputFormat" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify which ports are created for the actor and what data is emitted on those ports during each fire cycle. For example, this field can be configured to produce one port for each column in a data table, or one port that emits the entire data table at once in CSV format. Specifically, the output format choices are:&#10;&lt;table&gt;&#10;&lt;tr&gt;&lt;td&gt;As Field (the default)&lt;/td&gt;&lt;td&gt; The actor creates one output port for each field (i.e., column/attribute/variable) that is described in the EML metadata for the data package. The type of each port (e.g., string, int, double, etc.) matches the base type of the field. If a query statement has been used to subset the data, then only those fields selected in the query statement will be configured as ports. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Table&lt;/td&gt;&lt;td&gt; The selected data will be output as a string that contains the entire entity data. The actor creates three output ports: DataTable - the data itself, Delimiter - delimiter to separate fields, and NumColumns - the number of fields in the table. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Row&lt;/td&gt;&lt;td&gt;  One tuple of selected data is formatted as an array and output. The actor creates one output port (DataRow), and the data type is a record containing each of the individual data fields. &lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As Byte Array&lt;/td&gt;&lt;td&gt; Selected data will be output as an array of bytes. The actor creates two output ports: BinaryData - contains the raw data itself, and EndOfStream - indicates whether the end of data stream has been reached.&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As UnCompressed File Name&lt;/td&gt;&lt;td&gt; This format is only used when the data package is a compressed file (zip, tar, et al). The compressed archive file is uncompressed after it is downloaded. The actor creates one output port, which contains an array of the filenames of all of the uncompressed archive files. If a FileExtensionFilter is specified, then the array will only contain files that match the specified extension.&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As Cache File Name&lt;/td&gt;&lt;td&gt;   Kepler stores data files downloaded from remote sites into its cache system. This output format will send the local cache file path for the data package so that workflow designers can directly access the cache files. The actor creates two output ports: CacheLocalFileName (the local file path) and CacheResourceName (the EML data link. e.g., ecogrid://knb/tao.2.1).&lt;/td&gt;&lt;/tr&gt; &#10;&#10;&lt;tr&gt;&lt;td&gt;As Column Vector&lt;/td&gt;&lt;td&gt; This output format is similar to &quot;As Field&quot;, except instead of sending out a single value on each port, the actor sends out an array of all of the data for each field. The type of each port is an array of the base type for the field.&lt;/td&gt;&lt;/tr&gt;&#10;&#10;&lt;tr&gt;&lt;td&gt;As ColumnBased Record&lt;/td&gt;&lt;td&gt;  This output format sends all data on one port using a record structure that encapsulates the entire data entity. The record will contain one array for each data field, and the type of each array will be determined by the type of the field it represents.&lt;/td&gt;&lt;/tr&gt;&#10;&lt;/table&gt;&#10;&#10;</configure></property>
200
<property name="prop:emlFilePath" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The file path of a local EML metadata file used to describe and access an EML data set.</configure></property>
201
<property name="prop:fileExtensionFilter" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>A file extension used to limit the array of file names returned by the actor when the selected output type is &quot;As UnCompressed File Name&quot;. This parameter is ignored for other output types.</configure></property>
202
<property name="prop:dataFilePath" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The path to a local data file described by EML (must be used in conjunction with a local EML file). The actor will retrieve the data and automatically configure its ports to output it.</configure></property>
203
</property>        <property name="_location" class="ptolemy.kernel.util.Location" value="[175.0, 205.0]">
204
        </property>
205
        <port name="CacheLocalFileName" class="ptolemy.actor.TypedIOPort">
206
            <property name="output"/>
207
        </port>
208
        <port name="CacheResourceName" class="ptolemy.actor.TypedIOPort">
209
            <property name="output"/>
210
        </port>
211
    </entity>
212
    <entity name="Display" class="ptolemy.actor.lib.gui.Display">
213
        <property name="_windowProperties" class="ptolemy.actor.gui.WindowPropertiesAttribute" value="{bounds={473, 395, 334, 234}, maximized=false}">
214
        </property>
215
        <property name="_paneSize" class="ptolemy.actor.gui.SizeAttribute" value="[326, 186]">
216
        </property>
217
        <property name="rowsDisplayed" class="ptolemy.data.expr.Parameter" value="10">
218
        </property>
219
        <property name="columnsDisplayed" class="ptolemy.data.expr.Parameter" value="40">
220
        </property>
221
        <property name="suppressBlankLines" class="ptolemy.data.expr.Parameter" value="false">
222
        </property>
223
        <property name="title" class="ptolemy.data.expr.StringParameter" value="">
224
        </property>
225
<property name="KeplerDocumentation" class="ptolemy.vergil.basic.KeplerDocumentationAttribute">
226
<property name="description" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
227
<property name="author" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Yuhong Xiong, Edward A. Lee</configure></property>
228
<property name="version" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>null</configure></property>
229
<property name="userLevelDocumentation" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>&lt;p&gt;The Display actor reads tokens of any type via its input multiport, and displays each token on a separate line in a text display window.&lt;/p&gt;&#10;&#10;&lt;p&gt;Specify the size of the text display window with the rowsDisplayed and columnsDisplayed parameters. Simply resizing the window onscreen does not persistently change the size when the workflow is saved, closed, and then re-opened. &lt;/p&gt;&#10;&#10;&lt;p&gt;If the input is a string token, then the actor strips the surrounding quotation marks before displaying the value.&lt;/p&gt; &#10;&#10;&lt;p&gt;Select the suppressBlankLines parameter to specify that the actor not add blank lines to the display. By default, the actor will add blank lines.&lt;/p&gt;&#10;&#10;&lt;p&gt;Note: this actor can consume large amounts of memory. It is not advisable to use it to display large output streams.&lt;/p&gt;</configure></property>
230
<property name="port:input" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>A multiport that accepts tokens of any type.</configure></property>
231
<property name="prop:suppressBlankLines" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>Specify whether the actor should display blank lines (the default) or suppress them.</configure></property>
232
<property name="prop:rowsDisplayed" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The vertical size of the display, in rows. The value is an integer that defaults to 10.</configure></property>
233
<property name="prop:columnsDisplayed" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The horizontal size of the display, in columns. The value is an integer that defaults to 40.</configure></property>
234
<property name="prop:title" class="ptolemy.kernel.util.ConfigurableAttribute"><configure>The title of the text display window. If specified, the value will appear in the title bar of the text display window.</configure></property>
235
</property>        <property name="entityId" class="org.kepler.moml.NamedObjId" value="urn:lsid:kepler-project.org:actor:7:1">
236
        </property>
237
        <property name="class" class="ptolemy.kernel.util.StringAttribute" value="ptolemy.actor.lib.gui.Display">
238
            <property name="id" class="ptolemy.kernel.util.StringAttribute" value="urn:lsid:kepler-project.org:class:883:1">
239
            </property>
240
        </property>
241
        <property name="semanticType00" class="org.kepler.sms.SemanticType" value="urn:lsid:localhost:onto:1:1#TextualOutputActor">
242
        </property>
243
        <property name="semanticType11" class="org.kepler.sms.SemanticType" value="urn:lsid:localhost:onto:2:1#TextualOutput">
244
        </property>
245
        <property name="_location" class="ptolemy.kernel.util.Location" value="{485, 135}">
246
        </property>
247
    </entity>
248
    <relation name="relation" class="ptolemy.actor.TypedIORelation">
249
        <property name="width" class="ptolemy.data.expr.Parameter" value="-1">
250
        </property>
251
    </relation>
252
    <relation name="relation2" class="ptolemy.actor.TypedIORelation">
253
        <property name="width" class="ptolemy.data.expr.Parameter" value="-1">
254
        </property>
255
    </relation>
256
    <link port="Kruger National Park megaherbivore census 2001-.CacheLocalFileName" relation="relation"/>
257
    <link port="Kruger National Park megaherbivore census 2001-2.CacheLocalFileName" relation="relation2"/>
258
    <link port="Display.input" relation="relation"/>
259
    <link port="Display.input" relation="relation2"/>
260
</entity>
(2-2/3)