Project

General

Profile

Statistics
| Revision:

# Date Author Comment
7520 03/22/2013 12:43 PM ben leinfelder

select only distinct guids (synch may have failed more than once for any given guid)
https://redmine.dataone.org/issues/3539

7519 03/22/2013 12:29 PM ben leinfelder

include xml_revisions.
do not allow removal of server_location = 1 documents (these are not replicas).
https://redmine.dataone.org/issues/3539

7506 03/08/2013 03:49 PM ben leinfelder

class for removing failed/invalid replicas from target nodes that previously held replicated content (KNB/LTER/PISCO/etc). https://redmine.dataone.org/issues/3539

7392 09/24/2012 01:09 PM ben leinfelder

shorten the systemmetadata* table names for Oracle's 30 character limit. move version to 2.0.5. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5717

7331 07/26/2012 04:26 PM ben leinfelder

check for null archived flag in ORE SM
https://redmine.dataone.org/issues/3046

7322 07/19/2012 02:38 PM ben leinfelder

utility methods to update/reserialize existing ORE maps that were generated with older foresite (and included bad dateTime strings).
https://redmine.dataone.org/issues/3046

7299 07/10/2012 02:12 PM ben leinfelder

include dataone.ore.downloaddata as a configurable property in case MNs (like LTER) want to have the process download externally-stored data files described in an EML data package.

7175 05/18/2012 12:09 PM ben leinfelder

instead of generating SM and ORE maps during dataone configuration/MN registration, moved this all to the replication admin screen where we can target generation for specific nodes. That way it's more controlled as to when and where we generate DataONE required content....

7174 05/18/2012 12:00 PM ben leinfelder

include all EML versions (had been only eml 2.1 for testing)

7169 05/09/2012 03:45 PM ben leinfelder

expose serverLocation parameter to run GenerateSystemMetadata for different replication parters as needed.
https://redmine.dataone.org/issues/2740

7168 05/08/2012 04:30 PM ben leinfelder

only generate system metadata for original objects.
https://redmine.dataone.org/issues/2721

7124 04/06/2012 01:53 PM ben leinfelder

do not download data at this point

7121 04/06/2012 01:35 PM ben leinfelder

only 2.1.0 EML docs for ORE generation right now...

7105 03/29/2012 11:19 AM ben leinfelder

handle last group of ids (oops)

7104 03/29/2012 10:33 AM ben leinfelder

use range of the list for test system metadata

7103 03/29/2012 08:48 AM ben leinfelder

use non-random list for generating system metadata in test mode

7081 03/19/2012 12:40 PM ben leinfelder

use scope prefix (knb-lter-%) for random test doc population

7013 02/16/2012 02:47 PM ben leinfelder

do not subset the list for MS generation testing -- at least not as the default in svn!

7007 02/09/2012 04:11 PM ben leinfelder

only run ORE generation for EML docs -- no need to run this for all documents (yikes!)

7006 02/09/2012 03:48 PM ben leinfelder

use IdMan method to find docids that do not already have system metadata records -- this lets us re-run without re computing system metadata for every entry (in case the process is interrupted). I haven' been using this option because I wanted to continually regenerate all SM for everything in my test DBs, but we are so close to release that I want to get this in there.

7005 02/09/2012 02:39 PM ben leinfelder

for testing: limit and randomize the docs to generate metadata for

7004 02/09/2012 08:39 AM ben leinfelder

FOR TESTING ONLY: limit number of records to 100 so that we can get an estimate

6999 02/08/2012 10:58 AM ben leinfelder

refactor D1-specific upgrade utilities into their own package

6996 02/08/2012 10:44 AM ben leinfelder

generate SystemMetadata during D1 registration (not 2.0.0 upgrade). This process runs in a thread and updates a metacat.properties value when it is complete.

6993 02/07/2012 03:18 PM ben leinfelder

use correct Collections import

6981 02/06/2012 11:40 AM ben leinfelder

do not wait for SM generation to complete during the upgrade -- this way the web UI wont hang for days. the process sets a metacat property when it is complete.

6980 02/06/2012 11:38 AM ben leinfelder
6977 02/02/2012 05:15 PM ben leinfelder

do not shutdown hazelcast -- it needs to be running after the upgrade process so that Metacat actually works.
I think the newer version of HZ makes it so the threads are all released as needed.
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5572

6976 02/02/2012 03:17 PM Matt Jones

Commenting out the parts of the upgrade script that started to refer to EXID. At this point, the registration of EZID identifiers will be done out-of-band with respect to the upgrade.

6966 01/30/2012 02:49 PM ben leinfelder

upgrade to hazelcast 1.9.4.6 so that threadpools are released when not needed (http://code.google.com/p/hazelcast/issues/detail?id=765).
include ant target to run a specific main class (mostly for debugging)

6964 01/27/2012 05:15 PM ben leinfelder

multithreaded implementation for processing docids for system metadata generation.
need to investigate ant/junit running that deadlocks hazelcast (config?)

6957 01/26/2012 12:48 PM ben leinfelder

only attempt to update date-like nodedata values.

6955 01/26/2012 10:03 AM ben leinfelder

include generate system metadata upgrade in the success flag

6954 01/26/2012 10:02 AM ben leinfelder

more clean up - reuse prepared statement for data update

6953 01/26/2012 08:40 AM ben leinfelder

look up nodedata values first, then update each one - trying to avoid out of memory exception.

6933 01/20/2012 10:46 PM Matt Jones

Added DOI generation to the 2.0.0 upgrade process. To succeed, this script must be run on a fresh 2.0.0 database, or on a 1.9.5 version database, as those are the only ways to get the needed foreign keys to be marked as deferrable. The identifier conversion must be turned on by setting correct properties in metacat.properties. See the comments in GenerateGlobalIdentifiers for details. By default, conversion is set to false in the properties file. If you want to convert an instance to use DOIs, be sure to set metacat.properties up BEFORE running through the Metacat configuration and database upgrade.

6919 01/17/2012 04:21 PM Chris Jones

Use the Collections class from java.util.

6912 01/17/2012 12:06 PM ben leinfelder

-generate system meta for all docids, even those not originating on the server (replicas from the past)
-generate ORE docs and download remote data only for those documents that originated on this server being upgraded.
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522

6911 01/17/2012 11:43 AM ben leinfelder

refactor generate system meta loop to the factory class -- to be reused in sysmeta and ORE generation
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522

6909 01/13/2012 04:57 PM ben leinfelder

do NOT generate ORE maps or download data when we do the initial System Metadata generation -- this is deferred until D1 registration.

6908 01/13/2012 02:25 PM ben leinfelder

make more generic so that a custom list of IDs can be passed in.

6906 01/13/2012 01:31 PM ben leinfelder

insert OR update system metadata -- no need to do an update right after initial insert...

6905 01/13/2012 01:05 PM ben leinfelder

call the System Metadata generator during upgrade to 2.0.0

6904 01/13/2012 11:17 AM Chris Jones

In IdentifierManager.updateSystemMetadata(), add a check for invalid system metadata (fields that throw a NullPointerException on access) to ensure that system metadata is populated correctly. Updated calling classes to handle the exception.

6856 01/05/2012 02:49 PM ben leinfelder

only delete replicated data files (server_location != 1)

6852 01/05/2012 12:06 PM ben leinfelder

download remote data and save locally when it is referenced by an EML package, then include it in the ORE map.
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5522

6826 01/03/2012 10:56 AM ben leinfelder

upgrade routine to purge empty replicated data files so that they can be re-replicated
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5536

6806 12/20/2011 11:19 AM ben leinfelder

sort the docids so that "old" revisions are processed before newer ones

6732 12/02/2011 11:27 AM ben leinfelder

close prepared statement only if not null
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5562

6721 11/30/2011 05:31 PM ben leinfelder

generating ORE maps and creating/updating system metadata now. There are some Permission conversion issues to be worked out yet

6712 11/29/2011 02:15 PM ben leinfelder

optionally include ORE generation/insertion into Metacat when generating SystemMetadata
https://redmine.dataone.org/issues/2056

6705 11/29/2011 12:31 PM ben leinfelder

refactor SystemMetadata creation into separate class from the MetacatHandler -- this will be shared by upgrade code and normal metacat api.

6704 11/29/2011 11:00 AM ben leinfelder

include all document revisions when generating "missing" system metadata
TODO: revision graph captured in obsoletes/obsoletedBy

6701 11/28/2011 04:09 PM ben leinfelder

simplify SystemMetadata generation -- will be done during Metacat upgrade for D1 features/support.

6551 10/27/2011 01:00 PM ben leinfelder

use logging, not system.out

6550 10/27/2011 12:40 PM ben leinfelder

change upgrade scripts/routines to use 2.0.0 version number instead of 1.10.0

6549 10/27/2011 11:35 AM ben leinfelder

make sure we close the prepared statement always

6384 08/31/2011 02:36 PM Chris Jones

Update classes to use the DataONE 0.6.4 schema and types. Major changes involve using BigInteger vs long in SystemMetadata.size, and using ObjectFormatIdentifier rather than Object format.

6366 07/27/2011 04:25 PM ben leinfelder

use new "v1" types from DataONE

6362 07/20/2011 03:00 PM ben leinfelder

remove CrudService -- replaced by MNodeService and CNodeService

6130 06/07/2011 02:56 PM ben leinfelder

organize imports so that it is clearer what dependencies exist on the D1 jars

6099 05/25/2011 11:59 AM ben leinfelder

-remove system metadata guid -> local id mapping (there is no document for system metadata now)
-include system metadata elements when replicating data objects (TODO: transfer all system metadata structures with the docinfo request).
TODO: remove docid+rev from the systemMetadata table definition

6097 05/24/2011 04:18 PM ben leinfelder

do not use XML files for storing SystemMetadata - use DB tables only.

6025 03/29/2011 11:23 AM ben leinfelder

do each table separately with it's own connection - running into memory issues on dev.nceas running this.

6020 03/24/2011 03:10 PM ben leinfelder

use the jaxb date parser for ISO 8601 formats. the numeric and date node values are now calculated after the document has been successfully inserted in the db so any sql exceptions do not prevent the raw node data from being saved.
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=2084

6012 03/16/2011 10:56 PM ben leinfelder

add support for temporal element query in pathquery
http://bugzilla.ecoinformatics.org/show_bug.cgi?id=2084