Bug #335
closeddecompose eml identifiers into familyid and revision
0%
Description
Current eml identifiers are a string that symbolizes a unique revision of an
object (e.g., jones.14.1). The same identifer should always be associated with
the same stream of bytes (ie, checksums would match).
Suggestion that eml identifiers should be decomposed into two parts. The first
part is a "family" id (string) that represents a group of related objects. The
second is a revision # (integer) that indicates the revision number of one of
the objects in the family. The combination of the familyid and revisionnum
would always be unique, and would be usable as an accession number. In XML,
this could look something like:
<identifier system="knb">
<familyid>jones.43</familyid>
<revision>13</revision>
</identifier>
Questions remain.
1) Would revision be required in eml, or optional?
If optional, then EML would allow description of objects that are not unique.
Is this a good thing that we want to encourage/allow as a community?
2) For citation in print publications or other non-xml environments, how would
one refer to the combination of familyid and revisionid?
Previously we were able to use the whole string -- how do we combine the parts
together now? Can we still concatenate them with a separator character?