/src/edu/ucsb/nceas - Changes - Metacat - Ecoinformatics Redmine

metacat/src/edu/ucsb/nceas @ 7084

#	Date	Author	Comment
7084	03/21/2012 11:26 AM	ben leinfelder	use current datetime (at system metadata generation) as the date last modified
7083	03/19/2012 06:14 PM	Chris Jones	Don't check for populated obsoletes and obsoletedBy fields during CN.create(), only MN.create(). The CN should expect that the MN has populated this field because of existing revision information, and should trust the MN information. Addresses https://redmine.dataone.org/issues/2507.
7082	03/19/2012 06:08 PM	Chris Jones	Some minor logging changes.
7081	03/19/2012 12:40 PM	ben leinfelder	use scope prefix (knb-lter-%) for random test doc population
7079	03/19/2012 10:12 AM	ben leinfelder	use isAdminAuthorized() to check access to CN.create(). Note this method takes a pid and permission parameter and neither is used. Also removed the NotFound exception because it would never come up.
7078	03/19/2012 10:01 AM	ben leinfelder	check that caller is CN/admin for CN.delete() https://redmine.dataone.org/issues/2506
7077	03/19/2012 09:52 AM	ben leinfelder	include CN.delete() https://redmine.dataone.org/issues/2506
7076	03/16/2012 04:07 PM	Chris Jones	Notify each replica MN when critical portions of system metadata change so the MN can pull the latest copy into its store. AccessPolicy and RightsHolder changes are the most critical for the MN to keep updated on.
7075	03/16/2012 11:40 AM	Chris Jones	Only allow CNs to call MN.synchronizationFailed() by calling isAdminAuthorized(). The pid must also be valid.
7074	03/15/2012 07:50 PM	Chris Jones	Modify CNodeService.setReplicationStatus() slightly to restrict MN-based calls to only set the status to COMPLETED. The CNs should be setting failures or invalidations, or the status can remain at QUEUED or REQUESTED, and the MNAuditTask can revisit those replicas as needed.
7073	03/15/2012 07:14 PM	Chris Jones	Add a notifyReplicaNodes() method that calls MNStorage.systemMetadataChanged() on MN replica nodes for a given object identifier. This will be called when there are changes to AccessPolicy and rights holder since these are critical access metadata for an MN, but they can only be changed on the CN.
7072	03/15/2012 12:10 PM	Chris Jones	Add some debugging statements in isAuthorized().
7071	03/15/2012 12:08 PM	Chris Jones	In setReplicationStatus(), first check for a replica target MN subject match with the session subject. If this fails, look to see if CN admin access is allowed. Otherwise throw NotAuthorized. Addresses https://redmine.dataone.org/issues/2494
7070	03/15/2012 11:46 AM	ben leinfelder	do not allow "Metacat-conforming" identifiers to be used. "test.1.001" is interpreted as "test.1.1" which renders "test.1.002" unusable unless a traditional Metacat "update" is used for that id/revision which contradicts the DataONE use of Identifiers that have no lexical requirements for revisions.
7069	03/15/2012 08:08 AM	ben leinfelder	check for session when checking administrative authorization
7068	03/14/2012 12:02 PM	Chris Jones	Remove individual calls to isAdminAuthorized() in favor of the centralized isAuthorized() call that handles it now.
7067	03/14/2012 11:57 AM	Chris Jones	Incorporate isAdminAuthorized() into isAuthorized() for blanket CN access to objects.
7066	03/13/2012 04:00 PM	ben leinfelder	check for null Session before continuing with setReplicationStatus() https://redmine.dataone.org/issues/2476#note-3
7065	03/13/2012 03:09 PM	ben leinfelder	do not attempt to parse empty file for the failure (BaseException serialization). There are cases when this is not given (failure="") when there is not a failure. https://redmine.dataone.org/issues/2476
7064	03/13/2012 03:05 PM	ben leinfelder	check for null session (public) calls to MN.replicate() before passing it to the asynchronous implementation
7063	03/13/2012 02:58 PM	ben leinfelder	do not replicate if session is null
7062	03/13/2012 02:19 PM	ben leinfelder	throw not authorized when attempting to getReplica as an invalid/non-existent node
7061	03/13/2012 12:19 PM	ben leinfelder	transitive properties for mapped subjects: -group membership -verified flag https://redmine.dataone.org/issues/2430 https://redmine.dataone.org/issues/2432
7060	03/13/2012 11:24 AM	ben leinfelder	check group membership defined at group level (in addition to membership defined as part of of the Person level) https://redmine.dataone.org/issues/2429
7059	03/13/2012 10:43 AM	ben leinfelder	logging for permission checks - trying to nail down details of MN checking
7058	03/13/2012 10:08 AM	ben leinfelder	use Event.CREATE.xmlValue() when converting "insert" to "create" http://redmine.dataone.org/issues/2471
7057	03/12/2012 08:20 PM	ben leinfelder	log records should be inclusive of fromDate parameter (>=) https://redmine.dataone.org/issues/2471
7056	03/12/2012 04:38 PM	ben leinfelder	use Timestamp object from results, not a new Date object from that Timestamp object
7055	03/12/2012 12:12 PM	ben leinfelder	add an alternative method for loading system metadata identifiers but leave it commented out. We may find that using the ObjectList method is too much overhead, but it will always be consistent with what metacat reports for listObjects().
7054	03/12/2012 12:09 PM	ben leinfelder	add note about long-running load for shared system metadata map
7051	03/08/2012 04:16 PM	Chris Jones	If PID is not part of the multipart params, we end up with a NullPointerException. Throw an InvalidRequest in this case rather than ServiceFailure resulting from the NPE.
7049	03/08/2012 02:21 PM	ben leinfelder	translate "insert" events in Metacat as Event.CREATE events ("create") for DataONE https://redmine.dataone.org/issues/2461
7048	03/08/2012 11:18 AM	ben leinfelder	for good measure, use the D1 encoding util for url decoding the parameters for listObjects https://redmine.dataone.org/issues/2460
7047	03/08/2012 10:42 AM	ben leinfelder	log record paging: -use start and count parameters -if start+count exceeds the total number of records, then only return from start to the end of the list -if start exceeds total record count, start at the end of the list (will be empty list) https://redmine.dataone.org/issues/2458
7046	03/07/2012 02:49 PM	Chris Jones	Use 'fromDate' and 'toDate' as listObject param filters to comply with the API documentation. We had changed this in MNResourceHandler, but somehow missed it in CNResourceHandler.
7045	03/07/2012 02:30 PM	ben leinfelder	check whether mapping (
7044	03/06/2012 08:26 PM	ben leinfelder	catch additional NotFound exception for: "do not include log entries for documents that the caller is not allowed to read." https://redmine.dataone.org/issues/2444
7043	03/06/2012 02:03 PM	ben leinfelder	serialize exception in header for describe response when there is a BaseException https://redmine.dataone.org/issues/2440
7042	03/06/2012 01:41 PM	ben leinfelder	do not include log entries for documents that the caller is not allowed to read. https://redmine.dataone.org/issues/2444
7041	03/06/2012 01:35 PM	ben leinfelder	use revision provided in the docid when looking up guid. had been using latest revision which I think incorrectly reports on the log history. noticed this when looking at: https://redmine.dataone.org/issues/2444
7039	03/05/2012 06:30 PM	Chris Jones	A minor change to isAuthorized() - compare each Person in the SubjectInfo (not just the primary Subject) since each person could have an equivalent identity mapped to the primary Subject. Add debug logging for the comparison.
7038	03/05/2012 04:36 PM	ben leinfelder	added debug logging https://redmine.dataone.org/issues/2429
7037	03/05/2012 04:27 PM	ben leinfelder	check if verified flag is null before evaluating (NPE during MN Auth test) https://redmine.dataone.org/issues/2429
7034	03/01/2012 02:20 PM	ben leinfelder	do not include stylesheet for list of checksum algorithms -- there is no template for it and therefore looks blank in a browser
7030	02/24/2012 02:21 PM	Chris Jones	Globally change the property 'dataone.memberNodeId' to 'dataone.nodeId'. This is more useful for both MNs and CNs implemented in Metacat. Also, change D1NodeService.getLogRecords() to return log entries with the actual node id rather than the IP address (looks like a cut/paste error)....
7029	02/24/2012 09:46 AM	ben leinfelder	throw InvalidToken when an invalid Permission is passed in. THis requires that internal calls to the method also check for this exception. https://redmine.dataone.org/issues/2388
7026	02/22/2012 02:53 PM	ben leinfelder	call deleteReplica when we get that request (looks like an undetected copy and paste error)
7025	02/22/2012 02:31 PM	ben leinfelder	do not allow blank node references to be used. https://redmine.dataone.org/issues/2362
7024	02/22/2012 02:27 PM	ben leinfelder	only generate system metadata when the call comes from the legacy Metacat API, not the D1 API. https://redmine.dataone.org/issues/2362 (I think this was the culprit)
7023	02/22/2012 12:47 PM	ben leinfelder	do not "lookup" object format when retrieving system metadata -- just return what we have stored as the formatId and don't [erroneously] default it to binary when there's a problem with the lookup (cache or service or otherwise). https://redmine.dataone.org/issues/2365
7022	02/21/2012 01:11 PM	ben leinfelder	Get ReplicationPolicy correctly generated: -tweak the regular expression for getting the pref/blocked node list for default replication policy. -set blocked list (had mistakenly been two calls to set pref list)
7021	02/21/2012 10:55 AM	ben leinfelder	actually, let's set the serialVersion during the MN.create() call so that the HZ map and the backing store have the same information immediately. Also, this is how the docs specify it. http://mule1.dataone.org/ArchitectureDocs-current/design/SystemMetadata.html
7020	02/21/2012 10:08 AM	ben leinfelder	if serialVersion is null, use defualt value of 0
7019	02/20/2012 03:39 PM	ben leinfelder	handle both listing and getting checksums using the GET endpoint -- depends whether or not a pid is included in the URL https://redmine.dataone.org/issues/2089
7017	02/20/2012 12:38 PM	ben leinfelder	remove ID mapping when a create()/"insert" call fails so that subsequent calls do not return an IdentifierNotUnique error. In this case it was due to invalid XML. https://redmine.dataone.org/issues/2341
7016	02/20/2012 12:20 PM	ben leinfelder	use RC-3 DataONE jars and fix compilation error that arose. https://redmine.dataone.org/issues/2351
7015	02/17/2012 04:54 PM	ben leinfelder	overload getAllDocidsByType() method for backward semtools compatibility
7013	02/16/2012 02:47 PM	ben leinfelder	do not subset the list for MS generation testing -- at least not as the default in svn!
7012	02/15/2012 09:35 AM	Chris Jones	CNodeService.listChecksumAlgorithms() was returning null rather than the list. Fixed.
7010	02/11/2012 11:36 AM	Chris Jones	ObjectFormatCache.getFormat(String formatStr) has been deprecated, and now only takes a formatId instance to get a format from the cache. It also throws Service Failure and NotImplemented, so here just set the format to application/octet-stream in any case.
7009	02/11/2012 11:25 AM	Chris Jones	Update D1NodeService to reflect new ObjectFormatCache signature.
7007	02/09/2012 04:11 PM	ben leinfelder	only run ORE generation for EML docs -- no need to run this for all documents (yikes!)
7006	02/09/2012 03:48 PM	ben leinfelder	use IdMan method to find docids that do not already have system metadata records -- this lets us re-run without re computing system metadata for every entry (in case the process is interrupted). I haven' been using this option because I wanted to continually regenerate all SM for everything in my test DBs, but we are so close to release that I want to get this in there.
7005	02/09/2012 02:39 PM	ben leinfelder	for testing: limit and randomize the docs to generate metadata for
7004	02/09/2012 08:39 AM	ben leinfelder	FOR TESTING ONLY: limit number of records to 100 so that we can get an estimate
7003	02/08/2012 03:46 PM	ben leinfelder	update the memberNodeId in existing system metadata only after the register/update is successful with the CN -- we can avoid unneeded SM updates in cases when the register/update fails because we gave the CN bad info that it rejects. https://redmine.dataone.org/issues/2308
7002	02/08/2012 03:12 PM	ben leinfelder	include member node id text field now that the CN is not assigning random Ids. https://redmine.dataone.org/issues/2308
7001	02/08/2012 01:20 PM	ben leinfelder	1. lookup and use the guid when processing obsoletes/obsoletedBy entries -- had previously been assuming localId==guid but now that we have introduced DOIs as part of the Metacat upgrade process, we may have DOIs for the guid that map to localIds. 2. base ORE guids on the localid of the data package they are describing and not on their DOI -- otherwise we might mash up the DOI prefix (or other id scheme that we are unaware of). By using resourceMapPreix + localId we are sure to have a valid localid and guid for the ORE map we create and add to the system
7000	02/08/2012 11:23 AM	ben leinfelder	use updated authorization policies as discussed in: https://redmine.dataone.org/issues/2277 and http://epad.dataone.org/20120131-authn-authz-questions
6999	02/08/2012 10:58 AM	ben leinfelder	refactor D1-specific upgrade utilities into their own package
6998	02/08/2012 10:53 AM	ben leinfelder	remove createAndInsertSystemMetadat() method that acts on a single localId -- incorporated this into the localId-list-based method.
6997	02/08/2012 10:50 AM	ben leinfelder	refactor IdentityManager.createSystemMetadata(sm) to be insertSystemMetadata(sm) so that it is clear that this method inserts the SM object into the backing store. This differentiates it from the "generation" methods we use when we need to create SM about pre-existing objects or objects we get from non-D1 api calls.
6996	02/08/2012 10:44 AM	ben leinfelder	generate SystemMetadata during D1 registration (not 2.0.0 upgrade). This process runs in a thread and updates a metacat.properties value when it is complete.
6995	02/07/2012 09:54 PM	ben leinfelder	getMultipartParameters() outside of debug block -- thanks Mark Reyes @ CDL for catching this.
6994	02/07/2012 04:53 PM	ben leinfelder	dataone configuration and registration enhancements: -include flag to disable D1 services, currently only the MN side enforces this -do not allow multiple registration attempts if we have just submitted and are awaiting Node verification by the CN. -do not allow configuration "bypass" if D1 settings have been configured previously....
6993	02/07/2012 03:18 PM	ben leinfelder	use correct Collections import
6992	02/07/2012 11:08 AM	ben leinfelder	Show "Update" button if this MemberNodeId is already registered with DataONE, otherwise use the "Register" label
6991	02/07/2012 09:54 AM	ben leinfelder	match changes to MN service methods (return type as boolean)
6988	02/07/2012 12:02 AM	Matt Jones	Added new methods to generate a default replication policy based on properties from the metacat configuration. This is called during system metadata creation for objects that lack any system metadata.
6987	02/07/2012 12:00 AM	Matt Jones	Modify admin configuration to include default replication policy. Extensively revised the DataONE configuration page, including new wording for intro, improved tooltips throughout, new arrangement of sections, and other cosmetic changes.
6986	02/06/2012 11:56 PM	Matt Jones	Clean up warnings in class.
6984	02/06/2012 01:05 PM	Matt Jones	Remove ability to edit NodeID from D1 configuration page. Fix update of contactSubject and dataone.ore.generated property name.
6982	02/06/2012 12:38 PM	ben leinfelder	handle "BIN" objects so as to avoid repeated calls to lookup the non-existent ObjectFormat
6981	02/06/2012 11:40 AM	ben leinfelder	do not wait for SM generation to complete during the upgrade -- this way the web UI wont hang for days. the process sets a metacat property when it is complete.
6980	02/06/2012 11:38 AM	ben leinfelder
6977	02/02/2012 05:15 PM	ben leinfelder	do not shutdown hazelcast -- it needs to be running after the upgrade process so that Metacat actually works. I think the newer version of HZ makes it so the threads are all released as needed. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5572
6976	02/02/2012 03:17 PM	Matt Jones	Commenting out the parts of the upgrade script that started to refer to EXID. At this point, the registration of EZID identifiers will be done out-of-band with respect to the upgrade.
6975	02/02/2012 02:17 PM	ben leinfelder	use plain String parameter for {pid} instead of XML serialization of it.
6974	02/02/2012 11:23 AM	ben leinfelder	remove {pid} from POST URL on CN.registerSystemMetadata() https://redmine.dataone.org/issues/2284
6973	02/02/2012 11:15 AM	ben leinfelder	remove {pid} from POST URL on CN.create() https://redmine.dataone.org/issues/2284
6972	02/02/2012 11:10 AM	ben leinfelder	remove {pid} from POST URL on MN.create() https://redmine.dataone.org/issues/2284
6971	02/01/2012 04:09 PM	ben leinfelder	catch cases where the previous/next revision of objects have not had system metadata generated yet
6970	02/01/2012 03:52 PM	ben leinfelder	create system metadata object if it wasn't found in HZ
6968	02/01/2012 09:44 AM	ben leinfelder	process systemMetadata from the docInfo string before writing to the database so that we guarantee guid-docid mapping exists before attempting to look it up.
6966	01/30/2012 02:49 PM	ben leinfelder	upgrade to hazelcast 1.9.4.6 so that threadpools are released when not needed (http://code.google.com/p/hazelcast/issues/detail?id=765). include ant target to run a specific main class (mostly for debugging)
6965	01/30/2012 02:44 PM	ben leinfelder	use File.deleteOnExit() not a half hour timer thread to do it.
6964	01/27/2012 05:15 PM	ben leinfelder	multithreaded implementation for processing docids for system metadata generation. need to investigate ant/junit running that deadlocks hazelcast (config?)
6963	01/27/2012 05:12 PM	ben leinfelder	additional logging of the config file being used - seem to have thread locking on the xmlConfig use when running under ant/junit
6962	01/27/2012 10:53 AM	ben leinfelder	calculate object size using the size on the file system rather than re-reading as an input stream. Now only EML document bytes will be read twice: once for the checksum and again for parsing out datapackage details
6961	01/26/2012 11:14 PM	ben leinfelder	system metadata generation optionally skips entries that have already been generated (data size, checksum) but allows the latest EML that describes them to have the last word on object format

Project

General

Profile

Metacat