1
|
<!--
|
2
|
* datafiles.html
|
3
|
*
|
4
|
* Authors: Chad Berkley
|
5
|
* Copyright: 2000 Regents of the University of California and the
|
6
|
* National Center for Ecological Analysis and Synthesis
|
7
|
* For Details: http://www.nceas.ucsb.edu/
|
8
|
* Created: 2001 January 23
|
9
|
* Version:
|
10
|
* File Info: '$ '
|
11
|
*
|
12
|
*
|
13
|
-->
|
14
|
<HTML>
|
15
|
<HEAD>
|
16
|
<TITLE>Metacat</TITLE>
|
17
|
<link rel="stylesheet" type="text/css" href="@docrooturl@default.css">
|
18
|
</HEAD>
|
19
|
<BODY>
|
20
|
<table width="100%">
|
21
|
<tr>
|
22
|
<td class="tablehead" colspan="2"><p class="label">Non-XML Data Files</p></td>
|
23
|
<td class="tablehead" colspan="2" align="right">
|
24
|
<a href="./replication.html">Back</a> | <a href="./metacattour.html">Home</a> |
|
25
|
<a href="./properties.html">Next</a>
|
26
|
</td>
|
27
|
</tr>
|
28
|
</table>
|
29
|
|
30
|
<p>Metacat has the capability of proxying a connection to a data file
|
31
|
server. This capability is provided through an abstract class called
|
32
|
DataFileUploadInterface. Currently, there is a data file server implemented
|
33
|
for this interface that can store non-XML data files on the local UNIX
|
34
|
file system. Theoretically, any file storage system could be proxied
|
35
|
through this interface.</p>
|
36
|
<P> <img alt="architecture diagram of the Metacat Data File Upload process"
|
37
|
src="dataload.gif"> </p>
|
38
|
|
39
|
<b>Reasoning</b>
|
40
|
<p>Since Metacat has been designed as a metadata storage system for
|
41
|
ecological data, there was a concern that data and metadata files were being
|
42
|
stored seperately and that users would find it impractical to have to use
|
43
|
two different systems to get their data and metadata. This system also allows
|
44
|
users to specify <a href="./acontrol.html">access control</a> restraints on their
|
45
|
data files the same way that they are specified on the xml metadata files.</p>
|
46
|
|
47
|
<b>Data Download (GET)</b>
|
48
|
<p>Metacat Server can retrieve data files that are stored on Metacat
|
49
|
file system or in any other file system on Internet. <br>
|
50
|
Metacat stores data files in a directory under the servlet context.
|
51
|
It writes information about these data files in xml_documents just like
|
52
|
about any XML file. <br>
|
53
|
An example of HTTP request for download of data file on Metacat is shown below: </p>
|
54
|
<pre><i>http://server.domain.com/metacat?action=read&docid=nceas.55</i></pre>
|
55
|
<p>An example of HTTP request for download of data file on Internet is shown below:</p>
|
56
|
<pre><i>http://server.domain.com/metacat?action=read&docid=http://otherserver.domain.com/filename</i></pre>
|
57
|
<p>Note in <i>docid=http://otherserver.domain.com/filename</i> HTTP protocol is used,
|
58
|
i.e. currently for file download HTTP protocol is only supported.</p>
|
59
|
|
60
|
<p><b>Data Upload (PUT)</b>
|
61
|
<p>Due to a Java limitation on the HTTP PUT comand, the data upload portion
|
62
|
of Metacat deviates from the standard HTTP interface. A standard bidirectional
|
63
|
TCP/IP socket is used for tranfering the data. The procedure for uploading a
|
64
|
file is as follows.</p>
|
65
|
<ol>
|
66
|
<li>The client must login to Metacat and get a session_id</li>
|
67
|
<li>The client sends a request to the servlet with an action of 'getdataport'.</li>
|
68
|
<li>The server responds with an xml message that includes a port number. The
|
69
|
message looks like: <pre><xml version="1.0"?><port>xxxx</port></pre>
|
70
|
where xxxx is an open port between 0 and 65000.</li>
|
71
|
<li>The client then can create a socket connection to the returned port.
|
72
|
Note that the client must make this connection within 30 seconds or the
|
73
|
port will close.</li>
|
74
|
<li>The data can now be sent but first, some extra information must be
|
75
|
appended onto the beginning of the data stream. The extra information looks
|
76
|
like: <pre> [filename]0[sessionID]0[filelength]0[DATA] </pre> The filename,
|
77
|
sessionID and filelength must be converted into a byte string, terminated
|
78
|
with a 0 (zero) byte and inserted into the stream in the order shown. The
|
79
|
filelength is in bytes. The DATA stream does not need to be terminated with
|
80
|
a 0 byte.</li>
|
81
|
<li>After the upload the server will return either an error message or
|
82
|
a success message which will include the docid of the new data file. Both
|
83
|
messages are encoded in xml (like the port message). The success message
|
84
|
looks like:
|
85
|
<pre><?xml version="1.0"?><docid>yyyy</docid></pre>
|
86
|
where yyyy is the new docid.</li>
|
87
|
</ul>
|
88
|
<p>The file DataStreamTest.java is a test class that shows how a client
|
89
|
must operate to successfully upload a data file to Metacat.</p>
|
90
|
|
91
|
<br>
|
92
|
<a href="./replication.html">Back</a> | <a href="./metacattour.html">Home</a> |
|
93
|
<a href="./properties.html">Next</a>
|
94
|
</BODY>
|
95
|
</HTML>
|