Activity
From 05/02/2014 to 05/31/2014
05/30/2014
- 05:06 PM Revision 8805 (metacat): handle multiple subprocessors (RDF and ORE) before the object gets indexed by making sure to merge the solr doc map before submitting to the index.
- 01:58 PM Revision 8804 (metacat): use maven for datamanager dependency now that EML project has been [partially] maven-ized.
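A minimal sketch of the merge step described in revision 8805, assuming each subprocessor returns a map from document id to field values. The class name, method name, field names, and the plain-map representation are all hypothetical and are not the metacat-index types. It shows the fields produced by several subprocessors for the same id being combined before anything is submitted to the index.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the real Solr document map: id -> field name -> values.
class DocMapMerger {
    /** Merges the fields of incoming into merged, keyed by document id. */
    static void mergeInto(Map<String, Map<String, List<String>>> merged,
                          Map<String, Map<String, List<String>>> incoming) {
        for (Map.Entry<String, Map<String, List<String>>> doc : incoming.entrySet()) {
            // Reuse the existing document for this id, or start a new one.
            Map<String, List<String>> target =
                merged.computeIfAbsent(doc.getKey(), id -> new LinkedHashMap<>());
            for (Map.Entry<String, List<String>> field : doc.getValue().entrySet()) {
                List<String> values =
                    target.computeIfAbsent(field.getKey(), name -> new ArrayList<>());
                for (String value : field.getValue()) {
                    // Skip values another subprocessor already contributed.
                    if (!values.contains(value)) {
                        values.add(value);
                    }
                }
            }
        }
    }
}
```

Calling mergeInto once per subprocessor result leaves a single combined document per id, which can then be submitted in one pass.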
05/29/2014
- 03:47 PM Revision 8803 (metacat): switch to use utilities from Maven repo instead of pulling from SVN and building locally. Added trunk/SNAPSHOT build to dev-testing Hudson (DataONE) for users who do not actively develop the utilities library. Motivated by the HTML->PDF utility, seemed like as good a time as any to convert Metacat's dependency on the project.
- 02:53 PM Revision 8802 (metacat): remove dependency on HttpMessage that was in the utilities project but is now removed in favor of newer (standard) http client library code.
05/28/2014
- 05:19 PM Revision 8801 (metacat): add more sections about extending the annotation model, serializations, and permissions.
- 11:49 AM Revision 8800 (metacat): Include PDF version of the metadata in the package download. https://projects.ecoinformatics.org/ecoinfo/issues/6053
- 11:48 AM Revision 8799 (metacat): use default "metacat" context name for metacat-index testing.
05/27/2014
- 04:14 PM Revision 8798 (metacat): Added more description for the model. Cleaned up a few formatting issues.
05/23/2014
- 05:29 PM Revision 8797 (metacat): draft of semantic model documentation
- 03:08 PM Revision 8796 (metacat): Change geohash type to "string" (was "text") so that no Solr analysis or tokenizing happens during indexing.
05/22/2014
05/21/2014
- 06:24 PM Revision 8794 (metacat): use a member instance of ezid service that only logs in every 24 hours (or other time TBD) instead of every time there is an interaction with the service. Saves us many calls when doing batch updates to ezid but keeps us from trying to use expired sessions. Motivated by https://projects.ecoinformatics.org/ecoinfo/issues/6440
- 02:59 PM Revision 8793 (metacat): Fixed an error in the ORE model diagrams
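A rough sketch of the login-reuse idea from revision 8794: keep one long-lived service instance and log in again only once the previous login is older than a configured maximum age. The CachedLogin class and its Runnable login hook are hypothetical stand-ins and do not reflect the EZID client API. The 24-hour figure comes from the commit message, which notes the interval is still to be decided.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical wrapper that decides when a fresh login is needed.
class CachedLogin {
    private final Runnable login;   // performs the actual login call
    private final Duration maxAge;  // how long a login is trusted
    private Instant lastLogin;      // null until the first login

    CachedLogin(Runnable login, Duration maxAge) {
        this.login = login;
        this.maxAge = maxAge;
    }

    /** Logs in if there is no login yet or the last one has aged out; returns true if it logged in. */
    synchronized boolean ensureLoggedIn(Instant now) {
        if (lastLogin == null || !now.isBefore(lastLogin.plus(maxAge))) {
            login.run();
            lastLogin = now;
            return true;
        }
        return false;
    }
}
```

A batch update would call ensureLoggedIn(Instant.now()) before each request, so only the first call in any 24-hour window pays for a login.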
05/20/2014
- 04:11 PM Revision 8792 (metacat): prevent js scriptlets from running when we return error messages to the client by escaping any potentially harmful xml blocks. https://projects.ecoinformatics.org/ecoinfo/issues/6224
- 04:09 PM Revision 8791 (metacat): allow updates to all doi: prefixes - realized we are already restricting to specific replica servers when updating these. worst case is we try to update a registration for which we are not the owner. https://projects.ecoinformatics.org/ecoinfo/issues/6440
- 04:08 PM Revision 8790 (metacat): show the SM and ORE generation buttons even if they have not registered/configured dataone. many potential MNs want to see their generated SM _before_ registering (and we want them to too!).
- 10:40 AM Revision 8789 (metacat): restrict DOI updates to DOIs that match our server shoulder -- may consider opening this up to any "doi:" prefix if this is too restrictive. https://projects.ecoinformatics.org/ecoinfo/issues/6440
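For the error-message escaping in revision 8792, something along these lines would neutralize markup before it is echoed back to the client. This is an illustrative sketch only: ErrorEscaper and escapeXml are hypothetical names, and the actual change may rely on a library routine instead.

```java
// Hypothetical helper: replaces XML/HTML metacharacters with entity references.
class ErrorEscaper {
    static String escapeXml(String message) {
        StringBuilder out = new StringBuilder(message.length());
        for (int i = 0; i < message.length(); i++) {
            char c = message.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                case '\'': out.append("&#39;"); break;
                default: out.append(c);
            }
        }
        return out.toString();
    }
}
```

With this in place, a request parameter containing a script element is returned as inert text rather than as markup the browser would execute.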
05/19/2014
- 02:21 PM Revision 8788 (metacat): use separate surName and givenNames to lookup ORCIDs.
- 01:17 PM Revision 8787 (metacat): Fixed errors in the ORE model expansion diagrams in the Metacat documentation
05/16/2014
- 03:13 PM Revision 8786 (metacat): Describe the primary data resource in the derived resource RDF
- 10:03 AM Revision 8785 (metacat): Create derived data and metadata in the OrePackageTest
05/15/2014
- 03:17 PM Revision 8784 (metacat): allow full-text queries for ORCID, but it isn't that great because we might have a "PISCO" creator that shows up in many different ORCID profiles... false matches.
- 11:55 AM Revision 8783 (metacat): correct glaring errors -- still needs to be honed, but at least it runs without NPE and Jena/foresite errors
- 11:29 AM Revision 8782 (metacat): stub for testing ORE augmentation - this generates an ORE, adds a "wasDerivedFrom" triple and saves to Metacat MN for indexing. https://projects.ecoinformatics.org/ecoinfo/issues/6548
- 10:27 AM Revision 8781 (metacat): include ORE formatId as handled by the RDF subprocessor and index prov:wasDerivedFrom field where it exists in the RDF model. https://projects.ecoinformatics.org/ecoinfo/issues/6548
- 09:34 AM Revision 8780 (metacat): expand the sparql queries to include dcterms:identifier
05/14/2014
- 02:09 PM Revision 8779 (metacat): Text changes to ORE docs
- 01:22 PM Revision 8778 (metacat): Add documentation for the ORE model expansion for derived data
- 12:04 PM Revision 8777 (metacat): use HttpClient to query orcid so I can easily set headers and such -- getting 503s from their production server when I test on dev.nceas...odd
- 11:43 AM Revision 8776 (metacat): adjust tests for production service -- more "real" information shows additional return values from the query.
- 09:18 AM Revision 8775 (metacat): switch to the production ORCID server for looking up orcid matches for our creators. Add test to summarize how many creator matches we can actually find. https://projects.ecoinformatics.org/ecoinfo/issu...
05/13/2014
- 03:36 PM Revision 8774 (metacat): use a non-public rightsHolder for both EML and Annotation test documents now that the RDF subprocessor checks each annotation to see that it came from a user that has write permission for the object being annotated.
- 03:34 PM Revision 8773 (metacat): test for update using the updated EML file, not the original. Also add the SM to the shared map so that the indexing process can consult SM.accessPolicy when indexing annotations that assert things about those test documents.
- 03:32 PM Revision 8772 (metacat): ignore the metacat/solr comparator tests - they are one-offs.
- 03:31 PM Revision 8771 (metacat): change the hazelcast group name to be the default "metacat" instance so that the metacat-index tests pass without additional local configuration, at least when running a default metacat deployment.
05/12/2014
05/09/2014
05/08/2014
- 04:25 PM Revision 8768 (metacat): process all the returned annotation suggestions until we find one that is appropriately located in the subclass hierarchy for the given superclass.
- 04:23 PM Revision 8767 (metacat): use in-memory TDB dataset for querying annotations for indexing -- this comes with the same reasoning capabilities as the directory-based one, but has the benefit of not filling the directory with triples that will not be used again. prepping for d1 AHM
05/07/2014
- 11:16 PM Revision 8766 (metacat): pass around the object file path rather than the data stream so that multiple subprocessors can index the same object and not consume the stream before it gets to the next one. In preparation for extending the assertions stored in OREs. https://projects.ecoinformatics.org/ecoinfo/issues/6548
- 11:12 PM Revision 8765 (metacat): when indexing annotations directly, just use an in-memory triple store rather than TDB since we remove each graph as it is processed (and my TDB instance would get into the multi-GB range with a few runs, even if I removed the old models)
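The change in revision 8766 can be illustrated with a small sketch: when each subprocessor receives the object's file path, it opens its own stream, so an earlier subprocessor cannot exhaust the data before a later one reads it. The Subprocessor interface and the byte-counting example below are hypothetical and are not the metacat-index API.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

class PathBasedIndexing {
    // Hypothetical subprocessor contract: takes a path, not an already-open stream.
    interface Subprocessor {
        int process(Path objectFile) throws IOException;
    }

    /** Example subprocessor: opens a fresh stream and counts the bytes it reads. */
    static int countBytes(Path objectFile) throws IOException {
        try (InputStream in = Files.newInputStream(objectFile)) {
            int total = 0;
            byte[] buffer = new byte[4096];
            int read;
            while ((read = in.read(buffer)) != -1) {
                total += read;
            }
            return total;
        }
    }

    /** Runs every subprocessor against the same file; each one sees the full content. */
    static List<Integer> runAll(Path objectFile, List<Subprocessor> subprocessors) {
        List<Integer> results = new ArrayList<>();
        for (Subprocessor subprocessor : subprocessors) {
            try {
                results.add(subprocessor.process(objectFile));
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
        return results;
    }
}
```

Had a single shared InputStream been passed instead, the second subprocessor would start reading at end-of-stream and see no data.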
05/05/2014
05/02/2014