Project

General

Profile

Statistics
| Revision:

# Date Author Comment
9833 07/06/2016 04:52 PM Jing Tao

Replaced the JiBXException by MarshallingException.

9490 01/14/2016 01:28 PM ben leinfelder

add warning when exception encountered loading SM into map.

9116 02/12/2015 03:32 PM Jing Tao

Add the timestamp for the printing out the start and end of hazelcast synchronization.

9078 01/27/2015 04:30 PM Jing Tao

Add the log messages to indicate the hazelcast synchronization starting and ending.

8850 09/12/2014 03:56 PM Jing Tao

Add the code to delete systemmetadata.

8810 07/23/2014 04:19 PM ben leinfelder

add support for v2 DataONE API.

8464 01/07/2014 01:56 PM ben leinfelder

Unify solr indexing with an IndexTask that is added to the queue -- allows us to send more than just the systemMetadata to the indexer. Initially this is for READ event counts for each document. https://projects.ecoinformatics.org/ecoinfo/issues/6346

7842 07/03/2013 12:28 AM ben leinfelder

only use MapStore/MapLoader for saving/loading IndexEvent objects. No need to use a listener since there is only the single node -- all entries are persisted to DB using the hazelcast.xml config we have for the map. https://projects.ecoinformatics.org/ecoinfo/issues/5944

7841 07/03/2013 12:03 AM ben leinfelder

add MapStore/Loader test for the IndexEvents -- adding and removing events in the DB table through hazelcast. https://projects.ecoinformatics.org/ecoinfo/issues/5944

7829 06/25/2013 01:23 PM ben leinfelder

stub for storing IndexEvent objects in Metacat (from metacat-index processing). https://projects.ecoinformatics.org/ecoinfo/issues/5944

7827 06/25/2013 01:17 PM ben leinfelder

do not force a get() during refresh (causing EML-defined data access rules to be lost when inserting EML docs about data files). note that this reverses a change that was meant to trigger indexing, but now we are using a new queue to share index events with metacat-index and so should not be necessary.

7812 06/20/2013 04:49 PM ben leinfelder

use an independent ISet<SystemMetadata> structure to communicate objects that should be indexed by metacat-index. https://projects.ecoinformatics.org/ecoinfo/issues/5943

7755 05/22/2013 04:44 PM ben leinfelder

load the evicted SM back into the map on a "Refresh" so that listeners hear the update. (metacat-index, for example)

7421 11/10/2012 03:34 PM Chris Jones

In migrating to Hazelcast 2.4.x, replace deprecated methods. Use Hazelcast.newHazelcastInstance() rather than Hazelcast.init(). For other deprecated static methods, use the HazelcastInstance equivalent calls.

7419 11/09/2012 08:56 AM Chris Jones

To attempt to address performance and stability WRT Hazelcast communication, we're upgrading to the 2.x series of Hazelcast. remove the 1.9.x jar files, and add the 2.4.1-SNAPSHOT jars. Modify HazelcastService to handle the minor change in the ItemListener interface (now passes ItemEvent<Identifier> as an argument)....

7411 10/26/2012 09:08 AM ben leinfelder

add count for the total processed pids (from ISet iterator)

7346 08/03/2012 02:27 PM ben leinfelder

allow SM resynch to be executed any time, not just during start up.
https://redmine.dataone.org/issues/3116

7345 08/03/2012 01:01 PM ben leinfelder

change to debug log level when processing shared/local pids)

7344 08/03/2012 10:41 AM ben leinfelder

only lock the missing pid event if we know we have it locally to contribute.
https://redmine.dataone.org/issues/3117

7343 08/03/2012 09:26 AM Chris Jones

Add locking to the itemAdded() method so ideally only one CN will respond to the request for a 'wanted' pid from the cluster. The lock is on a string, not the pid, and so won't conflict with system metadata locking. The string is based on the pid, with "missing-" as a prefix.

7342 08/03/2012 08:53 AM ben leinfelder

only publish to the missing pid "wanted list" when resynching system metadata. we were seeing redundant entry added/updated events when looking up the shared systemmetadata first.

7341 08/02/2012 10:18 PM ben leinfelder

print the missing pid count, not the total shared pid count so we know how many will be processed.

7340 08/02/2012 05:50 PM ben leinfelder

change the system metadata resynch approach: nodes will publish PIDs that they are missing after inspecting the shared identifier set. other nodes will be listening for the "wanted" pids and will put their local copy of SystemMetadata on the shared SM map. This should dramatically decrease the hazelcast chatter during a resynch and targets only the pids that are missing from any of the various nodes.

7339 08/01/2012 10:40 PM ben leinfelder

logging for processing identifier set on restart.

7326 07/23/2012 11:55 AM ben leinfelder

use local Set processing to determine which pids (if any) should be contributed to the shared set by this node during the resync. Should save time rather than checking each and every pid against the shared set.

7325 07/20/2012 03:44 PM ben leinfelder

move the hzIdentifiers initialization into the resync thread so that it does not affect start up time. cleaned up unused methods and superfluous code.

7323 07/20/2012 10:51 AM ben leinfelder

only load local pids into hzIdentifiers if t hey do not already exist in the shared set. increase logging severity and detail of messages emitted during this process to get a better sense of what is taking so long.

7319 07/17/2012 03:57 PM Chris Jones

On the coordinating Nodes, we often get McdbDocNotFoundExceptions for data (doctype == 'BIN') documents because they are not synchronized to the CNs. Change the logging to only print the stack trace during load() and loadAll() when log debug is enabled.

7295 07/09/2012 04:23 PM ben leinfelder

log error when looking up non-existent local SM rather than completely bombing out of the resynch thread.

7221 05/31/2012 06:16 PM ben leinfelder

put(sm) for every pid we have a SM value for so that all members receive the entry event and can save locally.

7217 05/31/2012 10:53 AM ben leinfelder

ignore partition owner -- always attempt to look up form local store if we were unable to get the SM from the shared map.

7216 05/31/2012 10:13 AM ben leinfelder

do not check if this CN has a "perfect" copy of the SM identifiers -- we need any CN coming online to contribute the records that they have locally so that in the event that all three CNs have a partial view of things they all eventually share each others' SM entries.

7213 05/31/2012 09:19 AM ben leinfelder

push SystemMetadata entries from the CN that has them all to the shared map where other nodes may not have all entries. The CN with the complete copy only pushes SM entries that it does not own and that return as null because those are the ones that are missing on the other, non-complete CNs....

7212 05/30/2012 10:00 PM ben leinfelder

trace level log for looping over EVERY pid in the system.

7211 05/30/2012 09:47 PM ben leinfelder

meant to log the guids (source) not the pids (target)

7210 05/30/2012 08:51 PM ben leinfelder

trace level log for looping over EVERY pid in the system.

7209 05/30/2012 08:18 PM ben leinfelder

logging for each step of shared identifiers loading.

7208 05/30/2012 08:07 PM ben leinfelder

remove pause/resume - seemed to make metacat just hang on SM retrieval. Add more logging when returned SM is null -- want to make sure it is becuase the local node "owns" the pid key even though there is no value for it.

7207 05/30/2012 06:12 PM ben leinfelder

due to hudson build issue, did not actually end up testing pause/resume -- trying that again

7206 05/30/2012 05:53 PM ben leinfelder

pause/resume was not enough. trying shutdown/restart

7205 05/30/2012 05:02 PM ben leinfelder

experiment with lifecycle pause/resume. hopefully it prevents our node from taking ownership of any keys before we are sure we have them all.

7204 05/30/2012 08:29 AM ben leinfelder

increase logging and add back in the call to saveLocally() in case the SM object has already been loaded into the shared map but before this node came back online.

7203 05/29/2012 11:21 PM ben leinfelder

no need to call saveLocally explicitly since loading from the shared store triggers that behavior locally because of the configured listeners.
use an iterator over the shared identifiers in case this set is constantly changing.

7202 05/29/2012 10:10 PM ben leinfelder

make only one DB call to look up local pids - no need to do a pstmt for every single shared pid.

7201 05/29/2012 09:05 PM ben leinfelder

on init (start up) launch a synchronization thread that ensures all shared identifier entries have a corresponding local System Metadata entry.

7197 05/29/2012 10:31 AM ben leinfelder

fix NPE (logMetacat object was not initialized) that was occurring during store()

7188 05/23/2012 04:41 PM ben leinfelder

share the same dbConnection when inserting and then updating SystemMetadata objects in the backing store.
any errors encountered during the update will rollback the entire transaction and the SM record will not exist, even in part.

7187 05/23/2012 03:28 PM ben leinfelder

Do not loadAllKeys() for SystemMetadataMap when Metacat first starts up. hzIdentifiers will be populated with a simple SQL statement rather than the serial loading of every single SystemMetadata object. It will remain in synch using the usual entryXXX() methods as before....

7168 05/08/2012 04:30 PM ben leinfelder

only generate system metadata for original objects.
https://redmine.dataone.org/issues/2721

7117 04/04/2012 04:55 PM ben leinfelder

add comment about returning early when no system metadata can be found.
removed extraneous check on the content type of the SM -- was unused.
formatted indenting

7116 04/04/2012 04:49 PM ben leinfelder

for SystemMetadata events we first check the event for the SM value. If it returns null, we look it up from the shared map. It seems as if we don't always get a value with our events.

7115 04/04/2012 03:35 PM ben leinfelder

comment out: synchronize local system metadata on cn restart

7114 04/03/2012 01:31 PM ben leinfelder

synchronize local system metadata on cn restart

7111 04/02/2012 04:11 PM ben leinfelder

log calls to store() system metadata to the backing store

7108 03/30/2012 05:24 PM ben leinfelder

Add the listener for LifecycleEvent state changes

7107 03/30/2012 05:23 PM ben leinfelder

synchronizeLocalStore() when the cluster has a LifecycleEvent state change to RESUMED.

7106 03/29/2012 02:48 PM ben leinfelder

refactor memberAdded code to separate method - synchronizeLocalStore for possible reuse

7098 03/27/2012 02:25 PM ben leinfelder

-use MembershipListener to keep new members' backing store for system metadata synchronized with the shared system metadata map.
-remove the unused InstanceListener interface

7089 03/26/2012 02:10 PM Chris Jones

Add a few more debugging statements to HazelcastService for troubleshooting hazelcast map concurrency.

7055 03/12/2012 12:12 PM ben leinfelder

add an alternative method for loading system metadata identifiers but leave it commented out. We may find that using the ObjectList method is too much overhead, but it will always be consistent with what metacat reports for listObjects().

7054 03/12/2012 12:09 PM ben leinfelder

add note about long-running load for shared system metadata map

6997 02/08/2012 10:50 AM ben leinfelder

refactor IdentityManager.createSystemMetadata(sm) to be insertSystemMetadata(sm) so that it is clear that this method inserts the SM object into the backing store. This differentiates it from the "generation" methods we use when we need to create SM about pre-existing objects or objects we get from non-D1 api calls.

6963 01/27/2012 05:12 PM ben leinfelder

additional logging of the config file being used - seem to have thread locking on the xmlConfig use when running under ant/junit

6904 01/13/2012 11:17 AM Chris Jones

In IdentifierManager.updateSystemMetadata(), add a check for invalid system metadata (fields that throw a NullPointerException on access) to ensure that system metadata is populated correctly. Updated calling classes to handle the exception.

6894 01/12/2012 01:56 PM Chris Jones

Handle SQLExceptions when trying to save system metadata locally.

6893 01/12/2012 01:56 PM Chris Jones

Convert SQLExceptions to RuntimeExceptions for Hazelcast MapStore operations.

6889 01/12/2012 07:53 AM Chris Jones

Keep the hzIdentifiers set in sync with the Metacat systemmetadata table. If entries are added/updated in the hzSystemMetadata map, make sure the identifier is in the set. If (for some administrative reason) the entry is removed, remove the identifier from the set. This usually doesn't happen.

6888 01/12/2012 07:47 AM Chris Jones

When loading all keys from Metacat into the hzSystemMetadata map, also load identifiers into the hzIdentifiers set if they are not already there. Although entries may be evicted from the map, the list of identifiers will remain. The list will have a fairly small memory footprint since it's just identifiers.

6887 01/12/2012 07:44 AM Chris Jones

Add support for the distributed Set of unique identifiers in the storage cluster called 'hzIdentifiers'. This set is a persistent total list of all identifiers (even when entries in the hzSystemMetadata map are evicted). It reflects the state of the identifiers in the postgresql systemmetadata table, but is distributed across the cluster. Add the getIdentifiers() method, which returns the ISet of identifiers.

6868 01/09/2012 04:59 PM Chris Jones

Use a Logger instead of System.out for SystemMetadataMap.

6859 01/06/2012 07:23 AM Chris Jones

Use Lock instead of ILock to be consistent across classes.

6714 11/29/2011 03:43 PM ben leinfelder

evict the HazelCast SystemMetadata entry if we update the access control rules via Metacat's legacy API, otherwise stale SystemMetadata stays in memory instead of being looked up from the backing table store.

6711 11/29/2011 01:44 PM Chris Jones

Set a default HazelcastInstance after init() is called, and use this instance in getLock() to acquire a lock in the cluster.

6708 11/29/2011 01:05 PM ben leinfelder

use shared method for looking up "docInfo" map -- both in Metacat replication and in D1 system metadata generation

6703 11/29/2011 10:10 AM Chris Jones

When using ILock.lock(), get a lock on the string value of the Identifier, not the Identifier object itself. Hazelcast locking won't work otherwise.

6702 11/29/2011 08:55 AM Chris Jones

Use the Hazelcast ILock mechanism to lock the system metadata identifier rather than using IMap.lock(pid).

6648 11/14/2011 03:34 PM ben leinfelder

delete system metadata when MN.delete() is called.

6596 11/02/2011 09:58 PM ben leinfelder

make MNodeServiceTest pass JUnit testing

6496 09/22/2011 10:12 AM ben leinfelder

return null instead of throwing an exception when pid is not found in store

6490 09/21/2011 05:45 PM ben leinfelder

comment out resynch() method until errors are resolved

6489 09/21/2011 05:25 PM ben leinfelder

use default hazelcast config when not configured to use an external one

6483 09/21/2011 04:51 PM Chris Jones

For now, remove the hzClient code connecting to the DataONE process cluster to get the hzNodes map. This will be moved into the storage cluster, but use D1Client to get the node list for now.

6481 09/21/2011 10:54 AM rnahf

going back to using IDentifier as the key for the ObjectPAthMap.

6471 09/20/2011 02:08 PM Matt Jones

Reverting previous @Overrides chanrge from r6470, as that is the desired
behavior under Java 1.6 -- previous versions of Java (e.g., 1.5) will not
comile with this usage of the @Overrides annotation, but the currently
supported version will. So reverting to the 1.6 convention.

6470 09/20/2011 01:36 PM Matt Jones

Removing incorrect @Override annotations that were preventing compilation. The methods marked did not actually override a method in the superclass, so they were not compiling. I think @Overrides was being mistaken for methods that implement an interface but aren't actually in the superclass.

6465 09/20/2011 07:18 AM Chris Jones

Remove references to CNReplicationTask.

6464 09/20/2011 07:17 AM Chris Jones

Remove the CNReplicationTask (for now). We will be using Metacat's ForceReplicationHandler to replicate science metadata across CNs, and may explore the use of a 100% evicted hzScienceMetadata map. Either way, the distributed task design won't be needed. When a dropped CN comes back online, we'll catch it up based on last modified dates for PIDs in the hzSystemMetadata map.

6462 09/20/2011 06:42 AM Chris Jones

Add getNodesMap() to return the hzNodes map from the process cluster. Remove getPendingReplicationTasks since that structure is being removed. Add minor documentation.

6459 09/19/2011 03:26 PM ben leinfelder

lookup latest system metadata update date for use in synchronizing CN-CN when an offline nodes comes back online

6458 09/19/2011 12:17 PM rnahf

changed the key type from Identifier to String for ObjectPathMap. (need a Comparable key).

6457 09/19/2011 11:48 AM ben leinfelder

rework this to be MN->MN replication. Should be fleshed out more.

6456 09/19/2011 11:43 AM ben leinfelder

throw RuntimeExceptions when store() methods throw declared exceptions -- we want callers to put() to be alerted if there are errors.

6455 09/19/2011 10:18 AM ben leinfelder

move CNReplicationTask to the hazelcast package

6451 09/16/2011 04:42 PM ben leinfelder

check if system metadata exists rather than just the id mapping (before creating the entry)

6446 09/16/2011 02:24 PM ben leinfelder

move bulk of the Hazelcast code into HazelcastService from CNodeService so that it is centrall located - easier to manage and configure

6442 09/16/2011 10:42 AM rnahf

removing unneeded class (never used)

6440 09/15/2011 01:46 PM rnahf

cleaned up mock tests on hzObjectPathMap. split out code for mocking a datastore into MockObjectPathMap.

6437 09/15/2011 12:57 PM ben leinfelder

initialize Hazelcast from the custom configuration when initializing the Metacat service.

6434 09/15/2011 10:38 AM ben leinfelder

handle entryAdded (to hzSystemMetadata) to store sysmeta to local store when it is not already present

6426 09/14/2011 03:16 PM ben leinfelder

use HashMap, HashSet instead of the Tree* classes that require Identifier objects implement Comparable