/src/edu - Changes - Metacat - Ecoinformatics Redmine

metacat/src/edu @ 6934

#	Date	Author	Comment
6934	01/23/2012 11:08 AM	ben leinfelder	use RC-1 Dataone jars
6933	01/20/2012 10:46 PM	Matt Jones	Added DOI generation to the 2.0.0 upgrade process. To succeed, this script must be run on a fresh 2.0.0 database, or on a 1.9.5 version database, as those are the only ways to get the needed foreign keys to be marked as deferrable. The identifier conversion must be turned on by setting correct properties in metacat.properties. See the comments in GenerateGlobalIdentifiers for details. By default, conversion is set to false in the properties file. If you want to convert an instance to use DOIs, be sure to set metacat.properties up BEFORE running through the Metacat configuration and database upgrade.
6932	01/20/2012 10:38 PM	Matt Jones	Refactoring classes that throw generic Exception class to throw their more specific subclasses so that new exceptions are not hidden behind generic messages. Makes debugging easier.
6931	01/20/2012 03:45 PM	ben leinfelder	try to read the local document before making the localid->guid mapping (in cases where we fail to read the data locally like if it is referenced in an EML file but does not exist on this Metacat instance)
6927	01/20/2012 10:14 AM	Chris Jones	Ensure we have the object and sysmeta params for MN.create(). We were getting a fatal SAX parsing error encapsulated in a ServiceFailure when a science metadata object param was null. Cut it off at the pass after parsing the MMP entity.
6919	01/17/2012 04:21 PM	Chris Jones	Use the Collections class from java.util.
6918	01/17/2012 03:20 PM	Chris Jones	Remove null field tests in the IdentifierManager class. Schema-level required fields are checked on serialization/deserialization using JibX during the REST resource handler classes. Other required fields are checked in MNodeService and CNodeService, higher in the stack.
6917	01/17/2012 03:17 PM	Chris Jones	For MNs that haven't set the archived flag to false on create(), set it here. Also, ensure that the CN sync code sets the authoritative and origin member node fields.
6916	01/17/2012 03:15 PM	Chris Jones	On MN.create(), set the archived flag to be false. This field isn't required in the schema, but is needed by the DataONE indexer once objects are sync'd.
6912	01/17/2012 12:06 PM	ben leinfelder	-generate system meta for all docids, even those not originating on the server (replicas from the past) -generate ORE docs and download remote data only for those documents that originated on this server being upgraded. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522
6911	01/17/2012 11:43 AM	ben leinfelder	refactor generate system meta loop to the factory class -- to be reused in sysmeta and ORE generation http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522
6910	01/17/2012 11:34 AM	Chris Jones	When managing obsoletes/obsoletedBy system metadata fields, set the archived flag to false initially, and set it to true on system metadata for objects that a revision obsoletes.
6909	01/13/2012 04:57 PM	ben leinfelder	do NOT generate ORE maps or download data when we do the initial System Metadata generation -- this is deferred until D1 registration.
6908	01/13/2012 02:25 PM	ben leinfelder	make more generic so that a custom list of IDs can be passed in.
6907	01/13/2012 02:01 PM	ben leinfelder	check that the resourceMap (based on Id only) does not currently exist in the local metacat when generating OREs
6906	01/13/2012 01:31 PM	ben leinfelder	insert OR update system metadata -- no need to do an update right after initial insert...
6905	01/13/2012 01:05 PM	ben leinfelder	call the System Metadata generator during upgrade to 2.0.0
6904	01/13/2012 11:17 AM	Chris Jones	In IdentifierManager.updateSystemMetadata(), add a check for invalid system metadata (fields that throw a NullPointerException on access) to ensure that system metadata is populated correctly. Updated calling classes to handle the exception.
6901	01/13/2012 01:14 AM	Matt Jones	Properly initialize the servlet context when starting alternate servlets, which makes sure that the configuration files have been loaded and config properties are available.
6894	01/12/2012 01:56 PM	Chris Jones	Handle SQLExceptions when trying to save system metadata locally.
6893	01/12/2012 01:56 PM	Chris Jones	Convert SQLExceptions to RuntimeExceptions for Hazelcast MapStore operations.
6892	01/12/2012 01:54 PM	Chris Jones	In IdentifierManager, throw SQLExceptions rather than just logging them, and let them be handled higher up in the stack.
6891	01/12/2012 01:32 PM	ben leinfelder	use new endpoint/method: http://mule1.dataone.org/ArchitectureDocs-current/apis/CN_APIs.html#CNReplication.deleteReplicationMetadata
6890	01/12/2012 12:18 PM	ben leinfelder	use PUT /obsoletedBy/{pid} for CNCore.setObsoletedBy per our discussion today
6889	01/12/2012 07:53 AM	Chris Jones	Keep the hzIdentifiers set in sync with the Metacat systemmetadata table. If entries are added/updated in the hzSystemMetadata map, make sure the identifier is in the set. If (for some administrative reason) the entry is removed, remove the identifier from the set. This usually doesn't happen.
6888	01/12/2012 07:47 AM	Chris Jones	When loading all keys from Metacat into the hzSystemMetadata map, also load identifiers into the hzIdentifiers set if they are not already there. Although entries may be evicted from the map, the list of identifiers will remain. The list will have a fairly small memory footprint since it's just identifiers.
6887	01/12/2012 07:44 AM	Chris Jones	Add support for the distributed Set of unique identifiers in the storage cluster called 'hzIdentifiers'. This set is a persistent total list of all identifiers (even when entries in the hzSystemMetadata map are evicted). It reflects the state of the identifiers in the postgresql systemmetadata table, but is distributed across the cluster. Add the getIdentifiers() method, which returns the ISet of identifiers.
6884	01/11/2012 04:42 PM	ben leinfelder	include new methods needed for replication (in new d1 jars) https://redmine.dataone.org/issues/2203
6883	01/11/2012 01:25 PM	ben leinfelder	add method: setObsoletedBy (https://redmine.dataone.org/issues/2185) augement new method: deleteReplicationMetadata
6882	01/11/2012 11:31 AM	ben leinfelder	remove method: assertRelation https://redmine.dataone.org/issues/2158
6881	01/11/2012 11:24 AM	ben leinfelder	add method: deleteReplicationMetadata remove method: assertRelation update the D1 jars https://redmine.dataone.org/issues/2187 https://redmine.dataone.org/issues/2158
6880	01/11/2012 10:41 AM	ben leinfelder	serialize the Identifier for the systemMetadata being registered https://redmine.dataone.org/issues/2204
6876	01/10/2012 05:04 PM	Chris Jones	Simplify setReplicationStatus() to not call updateReplicationMetadata() if a replica doesn't exist. Just create it and update the system metadata, which we already have a lock for.
6875	01/10/2012 05:03 PM	Chris Jones	Minor null checks to avoid NPEs when calling replicate()
6874	01/10/2012 05:01 PM	Chris Jones	Don't throw a NotAuthorized exception in isAdminAuthorized() - just return false.
6873	01/10/2012 12:12 PM	ben leinfelder	do not download and save remote data resources which are HTML but are not expected to be such (login or info/splash pages before data content). http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522
6869	01/09/2012 05:08 PM	Chris Jones	Update the CN methods to throw a VersionMismatch where the API changed (where serialVersion is a required parameter). These were previously throwing an InvalidRequest exception. Change the exception handling for calls to Hazelcast to catch a RuntimeException (not Exception) so we don't catch exceptions that we purposefully throw....
6868	01/09/2012 04:59 PM	Chris Jones	Use a Logger instead of System.out for SystemMetadataMap.
6867	01/07/2012 06:01 PM	Chris Jones	Don't lock() on the map.get() in isNodeAuthorized() (this assumes that the CN has queued the task already). Add more lock/unlock debug statements, and fix setReplicationStatus() - I missed a finally statement to unlock the pid.
6866	01/07/2012 12:39 PM	Chris Jones	Modify CNReplication methods setReplicationStatus(), updateReplicationMetadata() and setReplicationPolicy() to allow administrative access from a Coordinating Node by calling isAdminAuthorized().
6865	01/07/2012 12:34 PM	Chris Jones	Add isAdminAuthorized() to D1NodeService to check if the operation is being requested from a CN. Consult the NodeList from the CN and test the NodeType of the given node and the X509 certificate Subject. Perhaps we should expand this to also check for service-level access in the future.
6864	01/06/2012 01:51 PM	ben leinfelder	store D1 configuration properties in the main backup so that they persist between upgrades.
6863	01/06/2012 12:51 PM	Chris Jones	In registerSystemMetadata(), lock the pid prior to calling map.containsKey(pid) since a put to the map could occur between the check and the subsequent put().
6862	01/06/2012 10:45 AM	ben leinfelder	update authoritative member node id when we change it (reconfiguration) and when we initially register as a MN with the CN.
6860	01/06/2012 07:40 AM	Chris Jones	Correctly deserialize the BaseException subclass in handling calls to setReplicationStatus()
6859	01/06/2012 07:23 AM	Chris Jones	Use Lock instead of ILock to be consistent across classes.
6858	01/05/2012 06:32 PM	Chris Jones	After reviewing CNodeService and D1NodeService prompted by Robert comparing the Hazelcast locking with the d1_synchronization locking, I've made a number of changes that will prevent locking problems: 1) Multiple methods contained try/catch blocks that would:...
6856	01/05/2012 02:49 PM	ben leinfelder	only delete replicated data files (server_location != 1)
6855	01/05/2012 01:37 PM	ben leinfelder	use inherited access control from EML for the data file we download from a remote source http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522
6852	01/05/2012 12:06 PM	ben leinfelder	download remote data and save locally when it is referenced by an EML package, then include it in the ORE map. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522
6837	01/03/2012 07:32 PM	Chris Jones	When the requested count in a call to listObjects() is 0, return an empty object list, not a full one. Fixes https://redmine.dataone.org/issues/2122
6836	01/03/2012 07:30 PM	Chris Jones	Minor formatting for querySystemMetadata().
6830	01/03/2012 03:56 PM	ben leinfelder	exapnd permissions on the exisiting access rule not on the permission being checked. (hierarchical permissions)
6826	01/03/2012 10:56 AM	ben leinfelder	upgrade routine to purge empty replicated data files so that they can be re-replicated http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5536
6822	12/22/2011 11:51 AM	Chris Jones	Make sure the local id isn't null when we try to get the object from the local instance.
6821	12/22/2011 11:16 AM	Chris Jones	Simplify the error handling, and throw the exception once the CN is updated with the new status.
6820	12/22/2011 11:13 AM	Chris Jones	Set the replica status to failed (not invalidated) when we get exceptions trying to read the object bytes. Not much of a difference, but only the CN, in theory, is supposed to be able to set the invalidated status.
6819	12/22/2011 11:07 AM	Chris Jones	Set the replication status to invalidated when we have a localId, but getting the object bytes fails for any reason.
6818	12/22/2011 10:51 AM	Chris Jones	Only call super.create() if there's no localId found on the MN (ie a replica is there from an out of band process).
6817	12/22/2011 10:40 AM	Chris Jones	Get the object inputstream from the local metacat instance using MetacatHandler.get() rather than MN.getReplica() so we don't throw an InvalidToken exception when passing in a null Session. The D1Client object is never used for this local call.
6816	12/22/2011 09:24 AM	ben leinfelder	interpret permissions as hierarchical https://redmine.dataone.org/issues/2150
6814	12/21/2011 01:00 PM	ben leinfelder	remove flag for independent system metadata replication -- these entries are replicated along with the data/metadata objects or via hazelcast when the actual object is not on the server.
6808	12/20/2011 11:25 AM	ben leinfelder	process the current revision, not the latest! use direct object/system metadata insertion for ORE maps.
6807	12/20/2011 11:21 AM	ben leinfelder	allow other Metacat process (system metadata and ORE generation) to directly insert objects and system metadata without having to go through the MN/CN methods.
6806	12/20/2011 11:19 AM	ben leinfelder	sort the docids so that "old" revisions are processed before newer ones
6805	12/19/2011 01:07 PM	ben leinfelder	only attempt to unlock a lock if it was created (in the finally block)
6803	12/16/2011 04:24 PM	ben leinfelder	new jars with many changes -- including new CN methods: ping, describe, listChecksumAlgorithm. Removed MN.setAccessPolicy. Refactored CN.setOwner() to CN.setRightsHolder().
6802	12/16/2011 12:21 PM	ben leinfelder	refresh the SystemMetadata entry for EML and referenced data files when parsing EML access rules -- this ensures our in-memory system metadata map is up to date WRT the DB entries.
6800	12/16/2011 11:00 AM	ben leinfelder	add revision history to the generated ORE objects -- we use the revision history of the EML package as a basis because the each ORE revision mirrors the revision of the EML package. Add a placeholder for checking if an equivalent ORE map exists in the DataONE infrastructure - this will be a call to CN.search() that looks at the solr index for OREs based on the EML package ID.
6799	12/16/2011 09:56 AM	Chris Jones	Update the parameter names expected for listObjects() to reflect the MN API changes in the architecture docs.
6798	12/16/2011 09:44 AM	Chris Jones	Change the query semantics such that we implement the MN.listObjects() where the lower datetime bound is inclusive (greater than or equal to" and the upper datetime bound in exclusive (less than). This allows easier paging in client applications.
6795	12/16/2011 07:42 AM	Chris Jones	In the call to MNReplication.replicate(), call back to CNReplication.setReplicationStatus() and set the status to failed when we get local exceptions, exceptions from the source MN when calling getReplica(). Send back an exception with a description when setting the status. Add a private setReplicationStatus() method to refactor these calls out.
6794	12/16/2011 07:36 AM	Chris Jones	Modify CNresourceHandler.setReplicationStatus() to use the new API signature, including the failure BaseException that is parsed out of the MMP as a file section. Log the exception message. Since this is an asynchronous call, ReplicationManager won't see a failed status, but the MNAuditTask eventually will.
6793	12/16/2011 07:32 AM	Chris Jones	Add collectReplicationStatus() to CNResourcHandler to return the BaseException or it's subclass, if any, provided in the the call to setReplicationStatus. The exception will be reported on the CN.
6792	12/16/2011 07:29 AM	Chris Jones	Change setReplicationStatus() to drop serialVersion and report the failure exception message in the CN log.
6787	12/15/2011 01:29 PM	ben leinfelder	set SystemMetadata.archived=true on MN.delete There is ongoing discussion on what the exact behavior should be here, but this mimics Metacat's delete-as-archive action. http://redmine.dataone.org/issues/882
6786	12/14/2011 08:48 AM	Chris Jones	In MNodeService.replicate(), check to see if we have a replica (via an out of band channel) before we call sourceMN.getReplica().
6783	12/13/2011 05:06 PM	ben leinfelder	only create guid->docid mapping during metadata replication if it does not already exist http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5520
6782	12/13/2011 04:50 PM	ben leinfelder	do not treat access change as an update -- it should not attempt to retrieve the contents of the object http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5520
6780	12/13/2011 04:18 PM	ben leinfelder	only create guid->docid mapping during data replication if it does not already exist http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5520
6779	12/13/2011 03:30 PM	ben leinfelder	remove xml_acccess.docid reference (oops) http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5560
6777	12/13/2011 12:22 PM	ben leinfelder	updated D1 API -- removed Permission.REPLICATE and associated parameters
6775	12/13/2011 11:34 AM	ben leinfelder	process system metadata before access rules (access control is now driven by GUID so the mapping needs to be there)
6774	12/13/2011 11:06 AM	Jing Tao	Change the key of query result cache. The key now has the real search value.
6773	12/13/2011 10:58 AM	ben leinfelder	include SerialVersion in describe response https://redmine.dataone.org/issues/2135 NOTE: d1 jars should be replaced once all schema changes are finalized and the generate d1_common code is committed to svn
6766	12/12/2011 10:50 AM	ben leinfelder	include 'archived' system metadata element in backing DB store
6757	12/09/2011 09:05 AM	Chris Jones	If a member node cannot be found in the node list matching the targetNodeSubject given in isNodeAuthorized(), throw a ServiceFailure exception.
6756	12/09/2011 08:04 AM	Chris Jones	Minor reformatting for readability.
6754	12/08/2011 01:51 PM	ben leinfelder	update with latest d1_common/d1_lib (includes latest schema changes)
6752	12/08/2011 01:12 PM	ben leinfelder	only handle 100 (consecutive!) docId generations per millisecond, otherwise the generated docid part is bigger than Long.MAX_VALUE and Metacat cannot fully handle that.
6750	12/08/2011 11:22 AM	ben leinfelder	check previous revision when attempting to update access control with EML 2.0.x docs http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5560
6749	12/08/2011 10:17 AM	ben leinfelder	remove old access rules for a data object when they are being updated by rules contained in an EML document. Now the OnlineDataAccessTest EML 2.1.0 tests pass. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5560
6748	12/07/2011 05:31 PM	ben leinfelder	construct the proper previousDocId when checking for update permission
6747	12/07/2011 05:05 PM	ben leinfelder	for now, look up SystemMetadata directly from the table otherwise we won't have the latest access information. Need to refresh the in-memory copy everytime we edit the access policy via Metacat (includes EML parser)
6746	12/07/2011 05:04 PM	ben leinfelder	check previous revision for permissions to update (includes data described by EML)
6745	12/07/2011 05:02 PM	ben leinfelder	use correct "rev" column in xml_revisions table
6744	12/07/2011 12:18 PM	ben leinfelder	refactor Metacat access handling to be on a per-revision basis so that it more closely aligns with the DataONE approach http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5560
6743	12/07/2011 12:16 PM	Chris Jones	To avoid id generation conflicts happening at the same millisecond, append a 5 character random string to the end of the docid.
6740	12/06/2011 03:03 PM	ben leinfelder	retry: add node name in the correct order for predicate navigation http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5561
6738	12/06/2011 02:50 PM	ben leinfelder	add node name in the correct order for predicate navigation http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5561

Project

General

Profile

Metacat