Project

General

Profile

Actions

Bug #5497

closed

metacat accepts eml docs that fail knb parser

Added by gastil gastil over 12 years ago. Updated over 12 years ago.

Status:
Resolved
Priority:
Normal
Category:
metacat
Target version:
Start date:
09/21/2011
Due date:
% Done:

0%

Estimated time:
Bugzilla-Id:
5497

Description

This is from nis track ticket #325 https://trac.lternet.edu/trac/NIS/ticket/325
Im pasting the below in from that ticket.

docid knb-lter-sev.137.39511 is in the LTER metacat, recently harvested, and does not pass the knb parser. It has some empty elements.

Documents which do not pass the parser should not be harvested into metacat.

This is packageId knb-lter-sev.137.6933 last updated Sept 8, 2011.

I expect there are 43 eml documents in that harvest with this same problem. This one is just an example. knb-lter-sev.153.19842 is another example of the same thing.
Both are invalid with regard to the EML 2.1.0 schema, but have been successfully harvested into Metacat 1.9.3. How many other documents are invalid, but in Metacat?

In addition, a minor concern is that the HTML representation of this EML appears to use the packageId for generating the Metacat URI ( http://metacat.lternet.edu:8080/knb/metacat/knb-lter-sev.153.5657/lter) -- the revision value of the packageId (knb-lter-sev.153.5657) is not the same as the revision value of the documentId (knb-lter-sev.153.19842). The generated URI is not correct and results in a Metacat "document not found" error when used in web browser query field.

This issue is primarily the domain of the KNB and the Metacat developers, not necessarily the LTER NIS.

Actions

Also available in: Atom PDF