Project

General

Profile

Statistics
| Revision:

# Date Author Comment
7328 07/23/2012 05:13 PM ben leinfelder

not every EML file has an ORE datapackage descriptor -- join only to those when setting the resourceMapId

7327 07/23/2012 04:29 PM ben leinfelder

correctly use document revision for object format and resource map joins.

7326 07/23/2012 11:55 AM ben leinfelder

use local Set processing to determine which pids (if any) should be contributed to the shared set by this node during the resync. Should save time rather than checking each and every pid against the shared set.

7325 07/20/2012 03:44 PM ben leinfelder

move the hzIdentifiers initialization into the resync thread so that it does not affect start up time. cleaned up unused methods and superfluous code.

7324 07/20/2012 02:28 PM ben leinfelder

use correct children of 'publisher' element

7323 07/20/2012 10:51 AM ben leinfelder

only load local pids into hzIdentifiers if t hey do not already exist in the shared set. increase logging severity and detail of messages emitted during this process to get a better sense of what is taking so long.

7322 07/19/2012 02:38 PM ben leinfelder

utility methods to update/reserialize existing ORE maps that were generated with older foresite (and included bad dateTime strings).
https://redmine.dataone.org/issues/3046

7321 07/18/2012 10:11 AM ben leinfelder

include the resourceMapId for the metadata objects, not just the data files.

7320 07/18/2012 08:56 AM ben leinfelder

updated LDAP dump and corrected missing entries that had been removed from LDAP.

7319 07/17/2012 03:57 PM Chris Jones

On the coordinating Nodes, we often get McdbDocNotFoundExceptions for data (doctype == 'BIN') documents because they are not synchronized to the CNs. Change the logging to only print the stack trace during load() and loadAll() when log debug is enabled.

7318 07/17/2012 01:34 PM ben leinfelder

check for invalid (!) pids. thanks, M. Reyes for catching this
https://redmine.dataone.org/issues/3047

7317 07/17/2012 12:06 PM ben leinfelder

only look up the client timeout property once, not every time we make a call
https://redmine.dataone.org/issues/3078

7316 07/17/2012 11:46 AM ben leinfelder

improve content type handling during the get() calls
https://redmine.dataone.org/issues/3070

7315 07/17/2012 11:09 AM ben leinfelder

check for whitespace in identifiers during create() and update()
https://redmine.dataone.org/issues/3047

7313 07/17/2012 10:19 AM ben leinfelder

configurable replication client timeout
https://redmine.dataone.org/issues/3078

7310 07/13/2012 07:57 AM ben leinfelder

order the listObjects() results by identifier to mitigate random paged results
https://redmine.dataone.org/issues/3065

7309 07/12/2012 04:04 PM ben leinfelder

correct the parameter/value setting in the prepared statements for retrieving log information.

7307 07/12/2012 12:21 PM ben leinfelder

use docid, not the guid when returning the accesscontrol block

7306 07/11/2012 05:05 PM ben leinfelder

handle null givenNames from the LDAP dump.

7305 07/11/2012 04:38 PM ben leinfelder

make sure we only get the publisher text content (not attribute value)

7303 07/11/2012 03:05 PM ben leinfelder

DOI registration:
-include more revision history based on the identifier table not just the generated SM metadata
-include ecogrid data urls for revisions (long query in xml_nodes_revisions table)

7300 07/10/2012 04:26 PM ben leinfelder

prep for 2.0.2 release by updating the version numbers.

7299 07/10/2012 02:12 PM ben leinfelder

include dataone.ore.downloaddata as a configurable property in case MNs (like LTER) want to have the process download externally-stored data files described in an EML data package.

7297 07/10/2012 10:20 AM ben leinfelder

set date SM modified when we are setting obsoletes/obsoletedBy/archived values. This way the CN can actualy pick up the changes in revision history.

7296 07/09/2012 04:58 PM ben leinfelder

update creator and publisher using LDAP dump. unfortunately LDAP has shifted over the years and not all identities are still active in LDAP...but we did get quite a few creator names updated!
https://redmine.dataone.org/issues/2815

7295 07/09/2012 04:23 PM ben leinfelder

log error when looking up non-existent local SM rather than completely bombing out of the resynch thread.

7294 07/09/2012 10:46 AM ben leinfelder

include parameter for deleting system metadata records (or not). Intending to also use this for https://redmine.dataone.org/issues/3055

7293 07/08/2012 10:05 AM ben leinfelder

look up docid using mapped guid when checking permission on described data file
Addresses: http://support.nceas.ucsb.edu/rt/Ticket/Display.html?id=7490

7292 07/07/2012 12:02 PM ben leinfelder

function/procedure for removing all content related to a PID from the DB.
https://redmine.dataone.org/issues/3037

7291 07/06/2012 05:29 PM ben leinfelder

use docid (not guid) when instantiating the PermissionController. Was getting an error with DOI-ified identifier and the metacat getaccesscontrol action:
https://knb.ecoinformatics.org/knb/metacat?action=getaccesscontrol&docid=Collinge.3.28
<error>
AccessControlForSingleFile.getACL() - MCDB error when getting ACL: No guid registered for docid doi:10.5063/AA/Collinge.3.28...

7290 07/05/2012 04:13 PM ben leinfelder

save point - adding more columns for access, data packaging, revision history
https://redmine.dataone.org/issues/2815

7289 07/03/2012 03:53 PM ben leinfelder

script to find and update missing SystemMetadata revision history. https://redmine.dataone.org/issues/2938

7288 07/03/2012 03:45 PM ben leinfelder

update the table to indicate which DOI account we are targeting
https://redmine.dataone.org/issues/2815

7287 07/02/2012 04:50 PM ben leinfelder

make sure we have non-null values where jibx serialization expects them for LogEntry

7286 07/02/2012 03:35 PM ben leinfelder

use secure Metacat context URL for D1 registration
https://redmine.dataone.org/issues/3030

7285 07/02/2012 12:06 PM ben leinfelder

first pass: DataONE-specific log retrieval to avoid java-based post-processing.

7284 06/22/2012 08:55 AM ben leinfelder

use production cn url for the resolve url

7283 06/20/2012 01:46 PM ben leinfelder

remove the non-doi identifiers before updating the LTER - should save time on the update.
https://redmine.dataone.org/issues/2858

7278 06/18/2012 03:43 PM ben leinfelder

set archived flag (true) when we set the obsoletedBy value in the ORE system metadata

7276 06/18/2012 02:27 PM ben leinfelder

update for 2.0.1 upgrade -- scripts, docs, readme

7273 06/18/2012 12:13 PM ben leinfelder

use the localId for obsoletes/obsoletedBy ORE system metadata (https://redmine.dataone.org/issues/2964)

7266 06/07/2012 12:03 PM Chris Jones

Print the stack trace when the MMP cannot be resolved.

7258 06/07/2012 12:25 AM ben leinfelder

report errors during XML->HTML transform
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5618

7252 06/06/2012 03:14 PM Chris Jones

Oops, previous commit suffered from a happy trigger finger. During deleteReplicationMetadata(), don't delete the replica on the replica Member Node. Call CN.delete() for that functionality. This call just updates sytem metadata (according to the API description).

7251 06/06/2012 03:10 PM Chris Jones
7245 06/06/2012 10:23 AM Chris Jones

Minor logging change.

7244 06/06/2012 10:01 AM Chris Jones

Add debug logging to delete() to understand why we're getting InsufficientKarmaException.

7239 06/05/2012 04:44 PM ben leinfelder

use KNB identifier table (post DOI generation) to update LTER identifier table.
https://redmine.dataone.org/issues/2857

7236 06/05/2012 02:07 PM Chris Jones

Since we already have determined access via isAuthorized() and isAdminAuthorized(), act as the Metacat administrator during calls to DocumentImpl.delete() in archive(), passing in null username and group.

7234 06/04/2012 08:49 PM ben leinfelder

restrict getLogRecrods (both MN and CN) to be called only by admin users (the CN)
https://redmine.dataone.org/issues/2855

7231 06/02/2012 05:46 AM Chris Jones

In setReplicationStatus() and UpdateReplicationMetadata(), don't allow a status state change from COMPLETED to anything other than INVALIDATED. This prevents the completed status from being overwritten due to race conditions.

7227 06/01/2012 10:45 AM ben leinfelder

script to aid in converting denyFirst rules to allowFirst
https://redmine.dataone.org/issues/2613

7226 06/01/2012 10:40 AM ben leinfelder

use kepler account as default

7222 05/31/2012 09:04 PM ben leinfelder

use metacat.properties to specify the default checksum algorithm to use -- this way it will be easy for us to switch to whatever DataONE decrees.
https://redmine.dataone.org/issues/2834

7221 05/31/2012 06:16 PM ben leinfelder

put(sm) for every pid we have a SM value for so that all members receive the entry event and can save locally.

7218 05/31/2012 10:56 AM Chris Jones

Throw an exception when NOT allowed, not when allowed =).

7217 05/31/2012 10:53 AM ben leinfelder

ignore partition owner -- always attempt to look up form local store if we were unable to get the SM from the shared map.

7216 05/31/2012 10:13 AM ben leinfelder

do not check if this CN has a "perfect" copy of the SM identifiers -- we need any CN coming online to contribute the records that they have locally so that in the event that all three CNs have a partial view of things they all eventually share each others' SM entries.

7215 05/31/2012 10:10 AM Chris Jones

Also get the list size, which may throw an NPE.

7214 05/31/2012 09:53 AM Chris Jones

Only add an AccessPolicy to SystemMetadata during generation when the AccessPolicy is not empty. We've had some scenarios where IdentifierManager.getaccessPolicy() is returning an empty policy because of an empty permission list coming from the db. This was causing InvalidSystemMetadata exceptions during MN to MN replication.

7213 05/31/2012 09:19 AM ben leinfelder

push SystemMetadata entries from the CN that has them all to the shared map where other nodes may not have all entries. The CN with the complete copy only pushes SM entries that it does not own and that return as null because those are the ones that are missing on the other, non-complete CNs....

7212 05/30/2012 10:00 PM ben leinfelder

trace level log for looping over EVERY pid in the system.

7211 05/30/2012 09:47 PM ben leinfelder

meant to log the guids (source) not the pids (target)

7210 05/30/2012 08:51 PM ben leinfelder

trace level log for looping over EVERY pid in the system.

7209 05/30/2012 08:18 PM ben leinfelder

logging for each step of shared identifiers loading.

7208 05/30/2012 08:07 PM ben leinfelder

remove pause/resume - seemed to make metacat just hang on SM retrieval. Add more logging when returned SM is null -- want to make sure it is becuase the local node "owns" the pid key even though there is no value for it.

7207 05/30/2012 06:12 PM ben leinfelder

due to hudson build issue, did not actually end up testing pause/resume -- trying that again

7206 05/30/2012 05:53 PM ben leinfelder

pause/resume was not enough. trying shutdown/restart

7205 05/30/2012 05:02 PM ben leinfelder

experiment with lifecycle pause/resume. hopefully it prevents our node from taking ownership of any keys before we are sure we have them all.

7204 05/30/2012 08:29 AM ben leinfelder

increase logging and add back in the call to saveLocally() in case the SM object has already been loaded into the shared map but before this node came back online.

7203 05/29/2012 11:21 PM ben leinfelder

no need to call saveLocally explicitly since loading from the shared store triggers that behavior locally because of the configured listeners.
use an iterator over the shared identifiers in case this set is constantly changing.

7202 05/29/2012 10:10 PM ben leinfelder

make only one DB call to look up local pids - no need to do a pstmt for every single shared pid.

7201 05/29/2012 09:05 PM ben leinfelder

on init (start up) launch a synchronization thread that ensures all shared identifier entries have a corresponding local System Metadata entry.

7200 05/29/2012 04:19 PM ben leinfelder

use 'allowFirst' for access rules. We have deprecated 'denyFirst' and deny rules in Metacat as of 2.0.0

7199 05/29/2012 03:02 PM ben leinfelder

handle https-only server configuration -- must pull resources from https not http for the skins etc.

7198 05/29/2012 02:53 PM ben leinfelder

handle https-only server configuration -- must pull resources from https not http for the skins etc.

7197 05/29/2012 10:31 AM ben leinfelder

fix NPE (logMetacat object was not initialized) that was occurring during store()

7196 05/29/2012 09:33 AM ben leinfelder

stack trace the HZ put exception during CN-CN replication

7195 05/29/2012 07:37 AM ben leinfelder

additional debugging statements for CONCURRENT_MAP_PUT error during CN-CN replication.

7194 05/28/2012 01:25 PM ben leinfelder

include eml2.0.0beta4 DTD during Metacat build so that we can continue to accept (and validate) beta4 documents.
This arose when testing Metacat as DataONE Coordinating Node where legacy documents are being housed in the CN.

7193 05/27/2012 09:03 AM ben leinfelder

encode '/' and ':' in the DOI used for the resolve URL

7192 05/25/2012 06:20 PM Chris Jones

Don't set the replication status to failed for an object when it is called by a public user. Just throw the NotAuthorized exception. This prevents this node from being de-prioritized because of public calls to the method.

7191 05/25/2012 04:23 PM ben leinfelder

include revisions table in the initial temp table population.
use the "first" creator listed in the EML (either org or person).
use other reasonable default values as needed to fully populate the spreadsheet columns
https://redmine.dataone.org/issues/2815

7190 05/25/2012 02:30 PM ben leinfelder

add columns: publisher and pub_date. include default values for all columns - even data files should have title.
still a few todos but closer.
https://redmine.dataone.org/issues/2815

7189 05/25/2012 12:07 PM ben leinfelder

script to generate DOI registration spreadsheet
https://redmine.dataone.org/issues/2815

7188 05/23/2012 04:41 PM ben leinfelder

share the same dbConnection when inserting and then updating SystemMetadata objects in the backing store.
any errors encountered during the update will rollback the entire transaction and the SM record will not exist, even in part.

7187 05/23/2012 03:28 PM ben leinfelder

Do not loadAllKeys() for SystemMetadataMap when Metacat first starts up. hzIdentifiers will be populated with a simple SQL statement rather than the serial loading of every single SystemMetadata object. It will remain in synch using the usual entryXXX() methods as before....

7184 05/23/2012 09:57 AM ben leinfelder

include pidFilter handling - only matches the complete pid. Issues a warning in the Metacat logs when pidFilter cannot be applied but allows the call to getLogs() to return as though there was no pidFilter given.
https://redmine.dataone.org/issues/2798

7183 05/23/2012 09:33 AM ben leinfelder

use at least one thread on single-processor machines.
https://redmine.dataone.org/issues/2800

7181 05/22/2012 04:47 PM ben leinfelder

script for re-applying missing FK constraints on KNB production DB.
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5608

7179 05/21/2012 02:31 PM Chris Jones

Add a few logging statemnts for round trip replication metrics.

7178 05/21/2012 02:12 PM ben leinfelder

add trace statements for measuring time to complete SM generation.

7175 05/18/2012 12:09 PM ben leinfelder

instead of generating SM and ORE maps during dataone configuration/MN registration, moved this all to the replication admin screen where we can target generation for specific nodes. That way it's more controlled as to when and where we generate DataONE required content....

7174 05/18/2012 12:00 PM ben leinfelder

include all EML versions (had been only eml 2.1 for testing)

7172 05/17/2012 05:43 PM Jing Tao

Append more information such as user name and group to the validating session response.

7171 05/17/2012 12:46 PM ben leinfelder

remove exception from method decl - was not matching the interface def and not compiling.

7170 05/14/2012 02:36 PM ben leinfelder

add "Generate System Metadata" button to the replication server list display. When clicked, we generate SM for records belonging to that source server. This is only enabled when DataONE has been configured.
https://redmine.dataone.org/issues/2762

7169 05/09/2012 03:45 PM ben leinfelder

expose serverLocation parameter to run GenerateSystemMetadata for different replication parters as needed.
https://redmine.dataone.org/issues/2740

7168 05/08/2012 04:30 PM ben leinfelder

only generate system metadata for original objects.
https://redmine.dataone.org/issues/2721

7162 05/02/2012 08:58 AM ben leinfelder

handle authorization for delete() differently for CN vs MN.
On the CN, only the CN (or tbd admin user) can call it.
On the MN, both the CN (or admin user) and the same MN can call it.