move the hzIdentifiers initialization into the resync thread so that it does not affect start up time. cleaned up unused methods and superfluous code.
use correct children of 'publisher' element
only load local pids into hzIdentifiers if t hey do not already exist in the shared set. increase logging severity and detail of messages emitted during this process to get a better sense of what is taking so long.
utility methods to update/reserialize existing ORE maps that were generated with older foresite (and included bad dateTime strings).https://redmine.dataone.org/issues/3046
include the resourceMapId for the metadata objects, not just the data files.
updated LDAP dump and corrected missing entries that had been removed from LDAP.
On the coordinating Nodes, we often get McdbDocNotFoundExceptions for data (doctype == 'BIN') documents because they are not synchronized to the CNs. Change the logging to only print the stack trace during load() and loadAll() when log debug is enabled.
check for invalid (!) pids. thanks, M. Reyes for catching thishttps://redmine.dataone.org/issues/3047
only look up the client timeout property once, not every time we make a callhttps://redmine.dataone.org/issues/3078
improve content type handling during the get() callshttps://redmine.dataone.org/issues/3070
check for whitespace in identifiers during create() and update()https://redmine.dataone.org/issues/3047
remove semtools skin as a configured skin -- will need to add that if we ever get back to deploying a semtools instance.
configurable replication client timeouthttps://redmine.dataone.org/issues/3078
order the listObjects() results by identifier to mitigate random paged resultshttps://redmine.dataone.org/issues/3065
correct the parameter/value setting in the prepared statements for retrieving log information.
use docid, not the guid when returning the accesscontrol block
handle null givenNames from the LDAP dump.
make sure we only get the publisher text content (not attribute value)
DOI registration:-include more revision history based on the identifier table not just the generated SM metadata-include ecogrid data urls for revisions (long query in xml_nodes_revisions table)
include new libclient jar that uses encoded pids in the resolve URLshttps://redmine.dataone.org/issues/3035
update D1 jars in preparation for 2.0.2 release. NOTE: still need libclient jar that includes ORE changes for encoding PIDs in resolve URLs
prep for 2.0.2 release by updating the version numbers.
include dataone.ore.downloaddata as a configurable property in case MNs (like LTER) want to have the process download externally-stored data files described in an EML data package.
updated foresite (snapshot) to include dateTime serialization fix.https://redmine.dataone.org/issues/3035
set date SM modified when we are setting obsoletes/obsoletedBy/archived values. This way the CN can actualy pick up the changes in revision history.
update creator and publisher using LDAP dump. unfortunately LDAP has shifted over the years and not all identities are still active in LDAP...but we did get quite a few creator names updated!https://redmine.dataone.org/issues/2815
log error when looking up non-existent local SM rather than completely bombing out of the resynch thread.
include parameter for deleting system metadata records (or not). Intending to also use this for https://redmine.dataone.org/issues/3055
look up docid using mapped guid when checking permission on described data fileAddresses: http://support.nceas.ucsb.edu/rt/Ticket/Display.html?id=7490
function/procedure for removing all content related to a PID from the DB.https://redmine.dataone.org/issues/3037
use docid (not guid) when instantiating the PermissionController. Was getting an error with DOI-ified identifier and the metacat getaccesscontrol action:https://knb.ecoinformatics.org/knb/metacat?action=getaccesscontrol&docid=Collinge.3.28<error>AccessControlForSingleFile.getACL() - MCDB error when getting ACL: No guid registered for docid doi:10.5063/AA/Collinge.3.28...
save point - adding more columns for access, data packaging, revision historyhttps://redmine.dataone.org/issues/2815
script to find and update missing SystemMetadata revision history. https://redmine.dataone.org/issues/2938
update the table to indicate which DOI account we are targetinghttps://redmine.dataone.org/issues/2815
make sure we have non-null values where jibx serialization expects them for LogEntry
use secure Metacat context URL for D1 registrationhttps://redmine.dataone.org/issues/3030
first pass: DataONE-specific log retrieval to avoid java-based post-processing.
use production cn url for the resolve url
remove the non-doi identifiers before updating the LTER - should save time on the update.https://redmine.dataone.org/issues/2858
use eml stylesheet tag (1.0.3)
use 1.0.2 d1_libclient jar (built by hudson)
set archived flag (true) when we set the obsoletedBy value in the ORE system metadata
update for 2.0.1 upgrade -- scripts, docs, readme
remove saxon jar (XSLT 2.0 support) with plans to re-add when we understand how to make it more lenient WRT to invalid character content in source XML
move to 1.0.2 d1_libclient jar for ORE generation change (URI for aggregation)
use the localId for obsoletes/obsoletedBy ORE system metadata (https://redmine.dataone.org/issues/2964)
use correct esa email list
include xml-apis*.jar when building the LSID authority.war -- without this file the authority webapp has a fatal error (no class def found) on init.
correct block formatting for tomcat config changes
add tomcat config options for DataONE identifiers with slashes
excluded the HzObjectPathMapTest$1 and HzObjectPathMapTest$2 classes which are not test classes in the test taraget.
Print the stack trace when the MMP cannot be resolved.
Use a final static string to replace the hard code.Search document title rather than id in testReplicateEML_AtoB method.
use 1.0.2 tag for EML stylesheets
update MN registration screen shot and amend instructions that say a nodeId will be assigned during registration.add section about generating SM for a new Member Node that has existing Metacat data.
use CN session when testing getLogRecords() and getOperationStatistics() becuase they are now protecting "sensitive" information
use RC for EML stylesheets before going to yet another minor revision number.
report errors during XML->HTML transformhttp://bugzilla.ecoinformatics.org/show_bug.cgi?id=5618
add 'fulldist' target to combine building of src and bin distributions
include gastil's changes re: pathquery and 'delete'http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5516
Add details for configuring Apache to use client certificate authentication.
use production CN url as the default (instead of cn-dev)
Oops, previous commit suffered from a happy trigger finger. During deleteReplicationMetadata(), don't delete the replica on the replica Member Node. Call CN.delete() for that functionality. This call just updates sytem metadata (according to the API description).
remove distribution tar.gz and zip files on fullclean.
use iframe id for the login anchor since the anchor inside the iframe does not work.http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5619
get utilities source from the correct checkout location
clarify release notes for 2.0.0 (minor)
correct Javadoc link http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5516
Minor logging change.
Add debug logging to delete() to understand why we're getting InsufficientKarmaException.
apply Gastil's edits. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5516
add section about configuring the metacat.properties with the paths to the server certificate and key
remove unused replication properties (all certificate file based now)
use 1.0.1 DataONE jars
use KNB identifier table (post DOI generation) to update LTER identifier table.https://redmine.dataone.org/issues/2857
exclude the MockObjectPathMap class from the test target since it is not a junit test.
exclude QueryRunner in test target. This is not a junit test file.
Since we already have determined access via isAuthorized() and isAdminAuthorized(), act as the Metacat administrator during calls to DocumentImpl.delete() in archive(), passing in null username and group.
restrict getLogRecrods (both MN and CN) to be called only by admin users (the CN)https://redmine.dataone.org/issues/2855
use updated tag for EML stylesheet-only changeshttp://bugzilla.ecoinformatics.org/show_bug.cgi?id=5597
In setReplicationStatus() and UpdateReplicationMetadata(), don't allow a status state change from COMPLETED to anything other than INVALIDATED. This prevents the completed status from being overwritten due to race conditions.
do not include releaseCandidate property - does not seem to be used except for debian? packaging which was never fully implemented.
use non-snapshot D1 jars in prep for tagging/release. Still RCs
script to aid in converting denyFirst rules to allowFirsthttps://redmine.dataone.org/issues/2613
use kepler account as default
use current default tomcat6 deploy directory
include test shoulder defaults for KNB, PISCO and LTER
Fixed formatting problem in a documentation file.
use metacat.properties to specify the default checksum algorithm to use -- this way it will be easy for us to switch to whatever DataONE decrees. https://redmine.dataone.org/issues/2834
put(sm) for every pid we have a SM value for so that all members receive the entry event and can save locally.
add section about importing self-signed certificates into the Java keystore (now that we use strict verification on the java client side when calling replication endpoints).
a few additional notes about Metacat replication configuration.
Throw an exception when NOT allowed, not when allowed =).
ignore partition owner -- always attempt to look up form local store if we were unable to get the SM from the shared map.
do not check if this CN has a "perfect" copy of the SM identifiers -- we need any CN coming online to contribute the records that they have locally so that in the event that all three CNs have a partial view of things they all eventually share each others' SM entries.
Also get the list size, which may throw an NPE.
Only add an AccessPolicy to SystemMetadata during generation when the AccessPolicy is not empty. We've had some scenarios where IdentifierManager.getaccessPolicy() is returning an empty policy because of an empty permission list coming from the db. This was causing InvalidSystemMetadata exceptions during MN to MN replication.
push SystemMetadata entries from the CN that has them all to the shared map where other nodes may not have all entries. The CN with the complete copy only pushes SM entries that it does not own and that return as null because those are the ones that are missing on the other, non-complete CNs....