/src/edu/ucsb/nceas/metacat/DocumentImpl.java - Changes - Metacat - Ecoinformatics Redmine

metacat/src/edu/ucsb/nceas/metacat/DocumentImpl.java @ 8745

svn:eol-style: native
svn:executable: *
svn:keywords: Author Date Id Revision

#	Date	Author	Comment
8647	02/25/2014 04:14 PM	ben leinfelder	recursively submit obsoleted objects for indexing when instructed. https://projects.ecoinformatics.org/ecoinfo/issues/6424
8579	02/07/2014 12:40 PM	Peter Slaughter	sync pids of <distribution><online> data objects with CN when their access rules change in EML 2.0.* <additionalMetadata>
8560	02/04/2014 02:24 PM	Peter Slaughter	Sync access policy between MN -> CN when access rules are updated in EML 2.1+ for data object
8464	01/07/2014 01:56 PM	ben leinfelder	Unify solr indexing with an IndexTask that is added to the queue -- allows us to send more than just the systemMetadata to the indexer. Initially this is for READ event counts for each document. https://projects.ecoinformatics.org/ecoinfo/issues/6346
8454	12/20/2013 07:46 AM	Chris Jones	On changes to system metadata in CNodeService and DocumentImpl, increment the serialVersion.
8299	10/09/2013 01:47 PM	Matt Jones	Refactor to use IOUtils.closeQuietly() which handles nulls and streams that are already closed.
8298	10/09/2013 01:26 PM	Matt Jones	Added close() to finally block for readFromFileSystem() call.
8297	10/09/2013 12:44 PM	Matt Jones	Closing FileOutputStream handles so that the OS limits on filehandles are not exceeded.
8164	08/26/2013 04:30 PM	Jing Tao	If the pathquery engine is disabled, the xml path index queue will be disabled as well.
7840	07/02/2013 04:47 PM	ben leinfelder	support a "force replication delete all action" during replication. This is used when we want Metacat to remove the content from the other target replicas because the DataONE delete() action was called (more powerful than just "archive").
7824	06/24/2013 12:05 PM	ben leinfelder	do not use tmp file to return an inputstream on read() operations - just read from the file we already have. https://projects.ecoinformatics.org/ecoinfo/issues/6009
7812	06/20/2013 04:49 PM	ben leinfelder	use an independent ISet<SystemMetadata> structure to communicate objects that should be indexed by metacat-index. https://projects.ecoinformatics.org/ecoinfo/issues/5943
7477	12/18/2012 05:33 PM	ben leinfelder	remove indexing task from the queue when we are updating the document
7475	12/12/2012 02:38 PM	ben leinfelder	move DocInfo parsing into utilities project so that it can be used by Morpho as well as Metacat. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5737
7445	11/30/2012 02:53 PM	ben leinfelder	rollback the delete() when there is an error performing part of it -- don't want to end up with partial delete.
7444	11/30/2012 02:27 PM	ben leinfelder	use Identifier object not String when retrieving SM from the HZ map to set archived during delete()
7433	11/26/2012 02:25 PM	ben leinfelder	remove document from the indexing queue when delete is called. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5750
7429	11/23/2012 10:00 AM	ben leinfelder	mark documents as archived=true when they are deleted using the Metacat API. https://redmine.dataone.org/issues/3406
7367	08/31/2012 03:05 PM	ben leinfelder	correct the number of prepared statement parameters when inserting to xml_revisions table. Errors like the following were showing in the replication log file: knb 20120831-19:42:38: [ERROR]: DocumentImpl.writeReplication - Failed to create access rule for package: john.15950.1 because The column index is out of range: 12, number of columns: 11. [ReplicationLogging]
7350	08/06/2012 10:47 PM	ben leinfelder	when updating a document on a remote server, we still need to use the previous docid to check that the user has permissions to do so (rather than the new id that is obsoleting the old id). This was discovered by M Servilla at LTER.
7151	05/01/2012 09:18 AM	ben leinfelder	[optionally] do not archive the xml_documents and xml_nodes to *_revisions when 'deleting' a document. This will effectively guarantee that the document/data cannot be retrieved after delete. NOTE: D1 system metadata will persist (for now) so that the ID cannot be reused with the DataONE API but Metacat calls may allow the ID to be reused -- may need to reconsider this behavior....
7150	04/30/2012 04:03 PM	ben leinfelder	optionally remove the document/data file from the filesystem completely when 'deleting' it. https://redmine.dataone.org/issues/2677
7128	04/09/2012 03:18 PM	ben leinfelder	add a parameter for optionally writing EML-embedded access control rules to the Metacat DB. https://redmine.dataone.org/issues/2584 https://redmine.dataone.org/issues/2583
6746	12/07/2011 05:04 PM	ben leinfelder	check previous revision for permissions to update (includes data described by EML)
6744	12/07/2011 12:18 PM	ben leinfelder	refactor Metacat access handling to be on a per-revision basis so that it more closely aligns with the DataONE approach http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5560
6606	11/04/2011 02:45 PM	ben leinfelder	uses prepared statement instead of plain old statement. deprecated the DBConnection.createStatement() method to discourage direct parameter value use in favor of parameter binding. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5527
6595	11/02/2011 08:40 PM	ben leinfelder	http://bugzilla.ecoinformatics.org/show_bug.cgi?id=5527
6562	10/28/2011 01:16 PM	ben leinfelder	include clearer error message when UPDATE action is requested on a replicated document and we fail to successfully get a lock from the source Metacat server http://bugzilla.ecoinformatics.org/show_bug.cgi?id=4907
6197	06/23/2011 05:16 PM	ben leinfelder	do not delete the access rules when we "archive" the document on "delete" (commented out for now)
6034	04/08/2011 10:22 AM	ben leinfelder	remove System.out statements in favor of logging
6020	03/24/2011 03:10 PM	ben leinfelder	use the jaxb date parser for ISO 8601 formats. the numeric and date node values are now calculated after the document has been successfully inserted in the db so any sql exceptions do not prevent the raw node data from being saved. http://bugzilla.ecoinformatics.org/show_bug.cgi?id=2084
6012	03/16/2011 10:56 PM	ben leinfelder	add support for temporal element query in pathquery http://bugzilla.ecoinformatics.org/show_bug.cgi?id=2084
6001	03/02/2011 02:12 PM	Chris Jones	DocumentImpl.delete() now throws finer grained exceptions (not a general exception). Consequently, the classes that call it have been updated to handle the thrown exceptions, including CrudService, ReplicationHandler, and ReplicationService.
5752	12/21/2010 02:26 PM	ben leinfelder	use detected XML encoding when reading/writing files; use UTF-8 as default when performing queries in the DB (assume DB is using UTF-8); remove as many PrintWriters (uses system default character encoding only) as possible and construct OutputStreamWriters where explicit encoding can be given....
5709	12/08/2010 04:59 PM	ben leinfelder	add support for EML 2.1.1
5673	11/30/2010 04:40 PM	berkley	made delete serialize the identifier
5621	10/19/2010 09:33 AM	berkley	fixed a bunch of small errors, did some reformatting, and fixed a bug that I thought was fixed last week
5448	07/23/2010 04:35 PM	Matt Jones	Fixed spelling error.
5422	07/02/2010 01:08 PM	ben leinfelder	Correct log warning message - not "Account" but "Accession" number
5352	05/12/2010 10:55 AM	berkley	removed system.outs
5350	05/11/2010 04:49 PM	berkley	almost have update working. still need to get unit test squared away.
5340	05/10/2010 01:15 PM	berkley	refactored XMLSchemaService to not have static methods. made the CrudServiceTest more robust.
5339	05/07/2010 04:30 PM	berkley	removed system.outs
5338	05/07/2010 04:28 PM	berkley	fixed schema location bug. the dataone schemas are now correctly found
5337	05/07/2010 03:15 PM	berkley	removed CrudService dependency on servlet params. CrudService is now a singleton. I'm getting an error from metacat saying it can't find the systemmetadata schema, even though it is, in fact, registered with metacat. need to identify why this is happening.
5319	04/19/2010 04:51 PM	Matt Jones	Modifications to support the DataONE service API version 0.1.0. For DataONE, the get() and create() services are partially complete. Several more functions and checks need to be added to create() before it is viable. This DataONE support is not complete, and the current support breaks the MetacatRestClientTest for the time being (this client will eventually be removed).
5311	04/14/2010 11:31 AM	daigle	Merge 1.9.2 changes back into the trunk
5298	04/02/2010 11:29 AM	Matt Jones	Modified readFromMetacat() to pass most exceptions up the call stack, which allows creation of new entry points for calling reads. Still need to continue factoring out the HTTPServletResponse that is passed in in order to make entrypoints that are not servlet based possible. Problem now is in...
5207	02/03/2010 01:16 PM	daigle	Beef up log messages
5198	01/21/2010 03:02 PM	daigle	do a quick retry if building path index fails with a SQL error
5197	01/20/2010 02:55 PM	daigle	Make sure buildIndex throws an exception if it has a sqlexception. That way the indexing object will be added to the indexing queue and reprocessed.
5195	01/19/2010 10:25 AM	daigle	Pass the doc xml as a string to docImpl.write and writeReplication. This is so a reader can be created for the parsing and for the write to disk. Also created a db access class for xml query result deletion.
5186	01/06/2010 04:18 PM	Jing Tao	Fix bug http://bugzilla.ecoinformatics.org/show_bug.cgi?id=4645. handleGetRevisionAndDocTypeAction will search both the xml_documents and xml_revisions tables. It also changes some constraints in AccessionNumber for when a user updates a document whose previous versions are all in the xml_revisions table.
5162	12/09/2009 11:54 AM	daigle	Change log levels where appropriate and add class/method name to output.
5160	12/08/2009 11:40 AM	daigle	beef up error log messages
5122	11/19/2009 08:19 AM	ben leinfelder	special handling for RDF namespace documents (semtools project)
5090	10/16/2009 11:10 AM	daigle	Move access control source to its own directory.
5030	08/24/2009 02:34 PM	daigle	Change location of PropertyService to properties directory
5025	08/14/2009 02:22 PM	daigle	Move document specific utilities to DocumentUtil from MetacatUtil. This makes it easier to define a layer between the core metacat services and the rest of the code.
5015	08/04/2009 02:32 PM	daigle	Create database and shared directories for database management code and shared code respectively.
4950	06/12/2009 04:39 PM	daigle	Add archival read functionality (jar/kar/war files)
4856	03/24/2009 10:17 AM	daigle	Introduce replication user. Use the fileutil writer methods instead of writing directly.
4854	03/23/2009 02:56 PM	daigle	Beef up exception handling from file utilities. Move UtilException to MetacatUtilException to eliminate conflict with similar exception in utility package.
4851	03/19/2009 10:25 AM	daigle	Do not read or write zero length documents to/from disk
4812	02/18/2009 04:30 PM	daigle	Format indexPaths in metacat.properties. Remove from build.properties and build.xml. Move indexPath list getter from MetacatUtil to SystemUtil.
4743	01/13/2009 11:11 AM	walbridge	Fix typos in error messages.
4698	12/26/2008 01:07 PM	daigle	Renamed MetaCatUtil to MetacatUtil
4677	12/17/2008 02:15 PM	daigle	move readerToString function to StringUtil class in utilities module.
4661	12/09/2008 02:56 PM	daigle	Add debug statements
4589	11/19/2008 03:25 PM	daigle	Rename LDAPUtil to AuthUtil
4500	11/03/2008 11:00 AM	daigle	Move the code to write metadata to disk into documentImpl
4467	10/21/2008 03:20 PM	daigle	Add some generics typing. Separate the code that strips inline data from document files to have a different strategy for 2.0.X versus 2.1.X documents.
4426	10/09/2008 09:49 AM	daigle	Look for schemaLocations in the document while initializing parser. If full schema validation is turned on in metacat.properties, and at least one schema is not registered locally, then turn on full schema validation in the parser.
4359	09/19/2008 08:29 AM	daigle	Fix workgroup description
4346	09/09/2008 09:41 AM	daigle	add closing </distribution> tag to regex for inline data.
4341	09/03/2008 09:24 AM	daigle	Created a method in DocumentImpl to read a document file from disk. Created a support method to strip inline data for users that don't have read access.
4335	08/29/2008 10:20 AM	daigle	Move the DBAdaptor accessor into a DatabaseService class
4213	08/05/2008 05:50 PM	daigle	qualify xml and eml properties with an xml. prefix
4212	08/05/2008 05:33 PM	daigle	Continue to qualify property names
4178	07/29/2008 11:32 AM	daigle	Fully qualify spatial and replication properties
4173	07/28/2008 05:05 PM	daigle	Use qualified properties
4140	07/18/2008 09:30 AM	daigle	Add sql debug statements
4123	07/15/2008 09:58 AM	daigle	Append context url onto system id instead of server url.
4080	07/06/2008 09:25 PM	daigle	Merge 1.9 changes into Head
4011	06/19/2008 01:33 PM	berkley	fixed bug 3403
3753	02/29/2008 03:16 PM	Jing Tao	Remove an obsolete private method getPartNodeRecordList.
3735	02/25/2008 05:16 PM	Jing Tao	Use dev=1 instead of dev like '1', since postgres 8.3 doesn't support it.
3731	02/22/2008 03:52 PM	Jing Tao	Add debug info for special character.
3507	10/08/2007 11:44 AM	Jing Tao	Modified a sql command from "like" to "=". It dramatically improves the performance of build index.
3468	09/24/2007 09:59 PM	Jing Tao	Add debug information.
3343	08/03/2007 03:52 PM	Jing Tao	Add code to expire cached query results when metacat has an insert, update or delete action.
3338	07/31/2007 04:10 PM	Jing Tao	Fix a bug in the code that moves nodes from xml_nodes to xml_nodes_revisions.
3240	04/17/2007 03:25 PM	Matt Jones	Decreased the debug message priority to 'debug' level for messages when reading an XML document.
3230	04/12/2007 06:34 PM	Jing Tao	Somehow the change went to the head rather than the branch, so I rolled back the change in head.
3229	04/12/2007 06:13 PM	Jing Tao	This commit is for the branch. In this commit the correct ip and user name will be stored in the access_log table in a replication event. However, it is only for Forcereplication, and the test isn't completed yet. This commit is only for future use.
3198	03/07/2007 05:00 PM	Jing Tao	Change some log info.
3182	02/15/2007 02:59 PM	Chris Jones	One more patch for bug 2469: Although the correct parentid values were being indexed in xml_path_index for leaf node xpaths, they were still incorrect for relative and absolute paths. This patch modifies traverseParents() and changes the parent node id to be indexed to that of the leaf node, no matter if the path is a leaf,...
3181	02/15/2007 10:55 AM	Chris Jones	As a continued fix for http://bugzilla.ecoinformatics.org/show_bug.cgi?id=2469, I've fixed the indexing implementation in both buildIndex() and traverseParents(). Duane pointed out that the incorrect parent node ids were being indexed in xml_path_index, causing some stylesheets to render...
3152	01/25/2007 09:41 AM	Matt Jones	Fixed the implementation of the buildIndex function which was not working for new document insertions. A previous fix in updatePathIndex for ATTRIBUTE data inadvertently caused a foreign key duplication exception for insertions of ELEMENT nodes when multiple relative paths...
3149	01/18/2007 04:33 PM	Chris Jones	I'm fixing a compile problem under jdk 1.4.2, where the get() method in HashMap needs an Object as a parameter, not a primitive data type. I changed the long to a Long as the lookup key.
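
Revisions 8297 to 8299 describe closing file handles in a finally block and then switching to IOUtils.closeQuietly(), which tolerates nulls and streams that are already closed. The pattern might look roughly like the sketch below. This is not the actual DocumentImpl code: the class name, the method names, and the local closeQuietly() helper are all hypothetical, and the helper is a hand-written stand-in for the Apache Commons IO call so the example needs only the JDK.

```java
import java.io.ByteArrayOutputStream;
import java.io.Closeable;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

// Hypothetical illustration only; not taken from DocumentImpl.java.
class StreamCloseSketch {

    // Stand-in for IOUtils.closeQuietly(): accepts null and ignores errors from close().
    static void closeQuietly(Closeable c) {
        if (c == null) {
            return;
        }
        try {
            c.close();
        } catch (IOException ignored) {
            // Deliberately swallowed: nothing useful can be done at this point.
        }
    }

    // Writes bytes to a file and always releases the file handle.
    static void writeToFile(File target, byte[] content) throws IOException {
        OutputStream out = null;
        try {
            out = new FileOutputStream(target);
            out.write(content);
        } finally {
            closeQuietly(out); // safe even if the constructor threw and out is still null
        }
    }

    // Reads a whole file into memory and always releases the file handle.
    static byte[] readFromFile(File source) throws IOException {
        InputStream in = null;
        try {
            in = new FileInputStream(source);
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            return buffer.toByteArray();
        } finally {
            closeQuietly(in);
        }
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("docimpl-sketch", ".xml");
        try {
            writeToFile(tmp, "<eml/>".getBytes("UTF-8"));
            System.out.println(new String(readFromFile(tmp), "UTF-8"));
        } finally {
            tmp.delete();
        }
    }
}
```

Without the finally block, an exception thrown during the write or read would leave the handle open until garbage collection, which is how a long-running server can run into the OS file-handle limit mentioned in revision 8297.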
