/ - Changes - Metacat - Ecoinformatics Redmine

metacat @ 8814

svn:ignore: build
svn:mergeinfo: /branches/D1_0_6_2_BRANCH:6107-6121

#	Date	Author	Comment
8814	07/29/2014 03:53 PM	Lauren Walker	Only lock the local docid file when creating a new docid, not when inserting, for faster upload times. Remove extra debug messages from testing.
8813	07/28/2014 05:22 PM	Lauren Walker	Fix bug in the online registry where data files were not using the new docid creation process
8812	07/28/2014 03:57 PM	Lauren Walker	Lock a local file while docids are being created so multiple docs can be uploaded at once
8811	07/24/2014 10:25 AM	ben leinfelder	remove CN.systemMetadataChanged in favor of the CN.updateSystemMetadata method. Otherwise there's no good way to know where to fetch the auth copy from since the SM change might be to switch the authMN!
8810	07/23/2014 04:19 PM	ben leinfelder	add support for v2 DataONE API.
8809	06/09/2014 04:06 PM	ben leinfelder	remove old EML jar -- datamanager.jar has the EMLParser now and is pulled in with maven.
8808	06/06/2014 04:54 PM	ben leinfelder	use css changes from EML project to render a PDF that fits on a printed page during export. Note that this also changes the default skin slightly (for the better, we think). https://projects.ecoinformatics.org/ecoinfo/issues/6053
8807	06/02/2014 12:16 PM	ben leinfelder	remove configxml.jar as the ConfigXML class is now included in the utilities library.
8806	06/01/2014 08:48 AM	ben leinfelder	handle login/logout when testing using metacat client (recent refactoring to use more standard http client code)
8805	05/30/2014 05:06 PM	ben leinfelder	handle multiple subprocessors (RDF and ORE) before the object gets indexed by making sure to merge the solr doc map before submitting to the index.
8804	05/30/2014 01:58 PM	ben leinfelder	use maven for datamanager dependency now that EML project has been [partially] maven-ized.
8803	05/29/2014 03:47 PM	ben leinfelder	switch to use utilities from Maven repo instead of pulling from SVN and building locally. Added trunk/SNAPSHOT build to dev-testing Hudson (DataONE) for users who do not actively develop the utilities library. Motivated by the HTML->PDF utility, seemed like as good a time as any to convert Metacat's dependency on the project.
8802	05/29/2014 02:53 PM	ben leinfelder	remove dependency on HttpMessage that was in the utilities project but is now removed in favor of newer (standard) http client library code.
8801	05/28/2014 05:19 PM	ben leinfelder	add more sections about extending the annotation model, serializations, and permissions.
8800	05/28/2014 11:49 AM	ben leinfelder	Include PDF version of the metadata in the package download. https://projects.ecoinformatics.org/ecoinfo/issues/6053
8799	05/28/2014 11:48 AM	ben leinfelder	use default "metacat" context name for metacat-index testing.
8798	05/27/2014 04:14 PM	ben leinfelder	Added more description for the model. cleaned up a few formatting issues.
8797	05/23/2014 05:29 PM	ben leinfelder	draft of semantic model documentation
8796	05/23/2014 03:08 PM	Peter Slaughter	Change geohash type to "string" (was "text") so that no Solr analysis or tokenizing happens during indexing.
8795	05/22/2014 01:59 PM	ben leinfelder	take advantage of the ezidclient for multi-threaded/asynchronous DOI registration. This will be most useful for doing large batch updates and not so much for the one-at-a-time publish actions but works in either context. https://projects.ecoinformatics.org/ecoinfo/issues/6440
8794	05/21/2014 06:24 PM	ben leinfelder	use a member instance of ezid service that only logs in every 24 hours (or other time TBD) instead of every time there is an interaction with the service. Saves us many calls when doing batch updates to ezid but keeps us from trying to use expired sessions. Motivated by https://projects.ecoinformatics.org/ecoinfo/issues/6440
8793	05/21/2014 02:59 PM	Lauren Walker	Fixed an error in the ORE model diagrams
8792	05/20/2014 04:11 PM	ben leinfelder	prevent js scriptlets from running when we return error messages to the client by escaping any potentially harmful xml blocks. https://projects.ecoinformatics.org/ecoinfo/issues/6224
8791	05/20/2014 04:09 PM	ben leinfelder	allow updates to all doi: prefixes - realized we are already restricting to specific replica servers when updating these. worst case is we try to update a registration for which we are not the owner. https://projects.ecoinformatics.org/ecoinfo/issues/6440
8790	05/20/2014 04:08 PM	ben leinfelder	show the SM and ORE generation buttons even if they have not registered/configured dataone. many potential MNs want to see their generated SM before registering (and we want them to too!).
8789	05/20/2014 10:40 AM	ben leinfelder	restrict DOI updates to DOIs that match our server shoulder -- may consider opening this up to any "doi:" prefix if this is too restrictive. https://projects.ecoinformatics.org/ecoinfo/issues/6440
8788	05/19/2014 02:21 PM	ben leinfelder	use separate surName and givenNames to lookup ORCIDs.
8787	05/19/2014 01:17 PM	Lauren Walker	Changed errors in the ORE model expansion metacat documentation diagrams
8786	05/16/2014 03:13 PM	Lauren Walker	Describe the primary data resource in the derived resource RDF
8785	05/16/2014 10:03 AM	Lauren Walker	Create derived data and metadata in the OrePackageTest
8784	05/15/2014 03:17 PM	ben leinfelder	allow full-text queries for ORCID, but it isn't that great because we might have a "PISCO" creator that shows up in many different orcid profiles...false matches.
8783	05/15/2014 11:55 AM	ben leinfelder	correct glaring errors -- still needs to be honed, but at least it runs without NPE and Jena/foresite errors
8782	05/15/2014 11:29 AM	ben leinfelder	stub for testing ORE augmentation - this generates an ORE, adds a "wasDerivedFrom" triple and saves to Metacat MN for indexing. https://projects.ecoinformatics.org/ecoinfo/issues/6548
8781	05/15/2014 10:27 AM	ben leinfelder	include ORE formatId as handled by the RDF subprocessor and index prov:wasDerivedFrom field where it exists in the RDF model. https://projects.ecoinformatics.org/ecoinfo/issues/6548
8780	05/15/2014 09:34 AM	ben leinfelder	expand the sparql queries to include dcterms:identifier
8779	05/14/2014 02:09 PM	Lauren Walker	Text changes to ORE docs
8778	05/14/2014 01:22 PM	Lauren Walker	Add documentation for the ORE model expansion for derived data
8777	05/14/2014 12:04 PM	ben leinfelder	use HttpClient to query orcid so I can easily set headers and such -- getting 503s from their production server when I test on dev.nceas...odd
8776	05/14/2014 11:43 AM	ben leinfelder	adjust tests for production service -- more "real" information shows additional return values from the query.
8775	05/14/2014 09:18 AM	ben leinfelder	switch to the production ORCID server for looking up orcid matches for our creators. add test to summarize how many creator matches we can actually find. https://projects.ecoinformatics.org/ecoinfo/issues/6423
8774	05/13/2014 03:36 PM	ben leinfelder	use a non-public rightsHolder for both EML and Annotation test documents now that the RDF subprocessor checks each annotation to see that it came from a user that has write permission for the object being annotated.
8773	05/13/2014 03:34 PM	ben leinfelder	test for update using the updated EML file, not the original. Also add the SM to the shared map so that the indexing process can consult SM.accessPolicy when indexing annotations that assert things about those test documents.
8772	05/13/2014 03:32 PM	ben leinfelder	ignore the metacat/solr comparator tests - they are one-offs.
8771	05/13/2014 03:31 PM	ben leinfelder	change the hazelcast group name to be the default "metacat" instance so that the metacat-index tests pass without additional local configuration, at least when running a default metacat deployment.
8770	05/12/2014 11:31 AM	ben leinfelder	do not set archived=false for all CN.create calls. The CN will use create() even when harvesting content that is new to it and needs to handle already-archived content. https://projects.ecoinformatics.org/ecoinfo/issues/6475
8769	05/09/2014 01:48 PM	ben leinfelder	cache the imported models to avoid timeouts from remote hosts (or being locked out for too many requests in a given time period).
8768	05/08/2014 04:25 PM	ben leinfelder	process all the returned annotation suggestions until we find one that is appropriately located in the subclass hierarchy for the given superclass.
8767	05/08/2014 04:23 PM	ben leinfelder	use in-memory TDB dataset for querying annotations for indexing -- this comes with the same reasoning capabilities as the directory-based one, but has the benefit of not filling the directory with triples that will not be used again. prepping for d1 AHM
8766	05/07/2014 11:16 PM	ben leinfelder	pass around the object file path rather than the data stream so that multiple subprocessors can index the same object and not consume the stream before it gets to the next one. In preparation for extending the assertions stored in OREs. https://projects.ecoinformatics.org/ecoinfo/issues/6548
8765	05/07/2014 11:12 PM	ben leinfelder	when indexing annotations directly, just use an in-memory triple store rather than TDB since we remove each graph as it is processed (and my TDB instance would get into the multi-GB range with a few runs, even if I removed the old models)
8764	05/05/2014 05:17 PM	ben leinfelder	redirect "short form" metacat read URIs to the new Metacat UI using the configured UI context. This translates the docid -> pid to use the correct identifier for the correct service. https://projects.ecoinformatics.org/ecoinfo/issues/6546
8763	05/02/2014 04:39 PM	ben leinfelder	simplify lookup for classes and orcid. remove the "random" annotation code branches -- just too confusing to look at those bogus classes especially now that we have "real" generated annotations.
8762	04/30/2014 03:47 PM	ben leinfelder	add 'test' for indexing annotations without actually storing the RDF of the generated annotation.
8761	04/30/2014 03:46 PM	ben leinfelder	only allow multiple values for multi-valued fields....
8760	04/30/2014 01:02 PM	Lauren Walker	Add admin service to update DOI registrations by specifying a list of formatIds or DOIs, or update all.
8759	04/29/2014 05:21 PM	ben leinfelder	use new method to override the CN URL when constructing a CNode instance. see https://redmine.dataone.org/issues/5142
8758	04/29/2014 04:55 PM	ben leinfelder	use newer httpclient library so that Jena's dependency is met - this goes all the way back to d1_common/libclient needing to pull in the newer library.
8757	04/29/2014 04:54 PM	ben leinfelder	first pass at direct EML->semantic index method. Still produces an RDF model, but does not persist it in Metacat, only in the triplestore. Allows us to re-run without adding stale RDF to the MN store.
8756	04/29/2014 04:52 PM	ben leinfelder	allow multivalued fields to be indexed using the "fields" pass through.
8755	04/29/2014 04:47 PM	Jing Tao	Remove the attribute disable from the update button if it has been submitted.
8754	04/29/2014 03:31 PM	Jing Tao	The image which has the default values.
8753	04/29/2014 03:13 PM	Jing Tao	Add a new screen shot which contains the cn url.
8752	04/28/2014 04:54 PM	Jing Tao	Localized the file which doesn't have the bean for dataUrl.
8751	04/28/2014 03:38 PM	Jing Tao	Add the cn url.
8750	04/28/2014 03:37 PM	Jing Tao	Add the text field for the cn url.
8749	04/28/2014 03:30 PM	Jing Tao	Store the cn url in the backup.
8748	04/24/2014 01:28 PM	ben leinfelder	switch to use FileUpload instead of O'Reilly COS library for handling chunked file uploads. https://projects.ecoinformatics.org/ecoinfo/issues/6517
8747	04/24/2014 01:26 PM	ben leinfelder	forgot to check in the actual class: first pass at allowing admins to update DOI registration. This only acts on EML objects at the moment and is meant to illustrate one mechanism for updating the DOIs. https://projects.ecoinformatics.org/ecoinfo/issues/6530
8746	04/23/2014 12:35 PM	ben leinfelder	first pass at allowing admins to update DOI registration. This only acts on EML objects at the moment and is meant to illustrate one mechanism for updating the DOIs. https://projects.ecoinformatics.org/ecoinfo/issues/6530
8745	04/23/2014 10:19 AM	ben leinfelder	correct the ORE lookup query syntax and add junit assertion to check that it continues to function as expected. https://projects.ecoinformatics.org/ecoinfo/issues/6529
8744	04/22/2014 03:38 PM	ben leinfelder	index the ORE after we submit the metadata for indexing. https://projects.ecoinformatics.org/ecoinfo/issues/6520
8743	04/22/2014 11:43 AM	ben leinfelder	include BioPortal lookup for Entity matches using the data table description. TODO: only associate measurements to the entity observation if they apply.
8742	04/21/2014 11:12 AM	ben leinfelder	recompile with java 1.6 for compatibility with our servers.
8741	04/18/2014 05:29 PM	Jing Tao	Remove the reference to the bean eml.fileID.
8740	04/17/2014 05:22 PM	ben leinfelder	use 1.5.1 tag for hudson to build metacat ui (for KNB deployment)
8739	04/17/2014 02:19 PM	Jing Tao	Index the document after it has been inserted.
8738	04/17/2014 02:18 PM	Jing Tao	Index the document after document is written to the db.
8737	04/16/2014 02:28 PM	Jing Tao	Use the ecogrid-1.2.3 branch which will be the next release.
8736	04/15/2014 09:36 AM	Jing Tao	Remove the bean named eml.fileID which used the ResolveSolrField class.
8735	04/14/2014 02:58 PM	ben leinfelder	calculate geohash_3 to three places (typo)
8734	04/14/2014 02:55 PM	ben leinfelder	use NSEW for the bounding box geohash calculation from EML - all versions
8733	04/14/2014 02:50 PM	ben leinfelder	up the field count to 111 to include the 9 geohash fields.
8732	04/14/2014 02:07 PM	Peter Slaughter	Using 1.3.0-SNAPSHOT from d1_cn_index_processor
8731	04/14/2014 01:34 PM	Peter Slaughter	Add fields for geohashes
8730	04/14/2014 01:33 PM	Peter Slaughter	Add beans to support geohashes
8729	04/14/2014 01:33 PM	Peter Slaughter	Add beans to support geohashes
8728	04/07/2014 01:25 PM	Lauren Walker	Check for undefined and null elements to avoid errors in IE 8 and earlier in the registry entry form JS
8727	04/04/2014 12:03 PM	Lauren Walker	Close a <span> HTML tag in the entry form MetacatUI template to avoid errors in older browsers
8726	04/03/2014 09:44 AM	Jing Tao	The package libdigest-sha1-perl was removed from ubuntu 12.04. We have to install it from cpan.
8725	04/02/2014 03:40 PM	ben leinfelder	add "test" for generating annotations based on the entity/attribute details of a datapackage. This iterates through all current EML revisions and either updates or creates annotations based on what it finds. It does add content to your metacat deployment (RDF files) but it can be safely re-run each time we change our annotation algorithm.
8724	04/02/2014 03:36 PM	ben leinfelder	check for null entities and/or attributes (typically when otherEntity is being used in EML).
8723	04/02/2014 03:35 PM	ben leinfelder	remove extra space in log message
8722	04/02/2014 03:35 PM	ben leinfelder	handle null Boolean in SM.archived field
8721	03/31/2014 04:04 PM	ben leinfelder	include sample data package for generating annotations. This is the classic Datos Meteorologicos set, but with Matthew Jones as the creator so that we can look up his ORCID in their sandbox environment. https://projects.ecoinformatics.org/ecoinfo/issues/6267
8720	03/31/2014 03:25 PM	ben leinfelder	use Matthew Jones for test creator since he has an ORCID in their staging environment.
8719	03/31/2014 12:31 PM	ben leinfelder	augment annotation indexing test/sample to include orcid annotation. https://projects.ecoinformatics.org/ecoinfo/issues/6267 https://projects.ecoinformatics.org/ecoinfo/issues/6423
8718	03/31/2014 11:11 AM	ben leinfelder	attribute the datapackage to the creator (using orcid if we can find it). https://projects.ecoinformatics.org/ecoinfo/issues/6267 https://projects.ecoinformatics.org/ecoinfo/issues/6423
8717	03/31/2014 10:31 AM	ben leinfelder	add test for BioPortal annotator service.
8716	03/28/2014 03:51 PM	ben leinfelder	refactor web service calls to bioportal and orcid outside of the annotator class. test with orcid sandbox server. include orcid uri for the annotations being generated (we can index these and drive our searches on these values down the road). related to this: https://projects.ecoinformatics.org/ecoinfo/issues/6423 and also some semtools tasks.
8715	03/27/2014 04:54 PM	ben leinfelder	remove leading '?' in the query parameter for MN.query() implementation. We want it to match CN behavior/expectations and comply with the DataONE specification for the interface. https://projects.ecoinformatics.org/ecoinfo/issues/6488
