/metacat-index - Changes - Metacat - Ecoinformatics Redmine

metacat/metacat-index @ 10160

#	Date	Author	Comment
10155	12/21/2016 06:28 PM	Jing Tao	Remove the referenced bean rdfXmlSubprocessor which was on the removed file application-context-oa.xml.
10154	12/21/2016 05:11 PM	ben leinfelder	remove unused OA indexing file and reference
10153	12/21/2016 04:15 PM	Jing Tao	Add the provenance subprocessor.
10148	12/21/2016 02:34 PM	ben leinfelder	extract "metadata" fields from result output now rather than dedicated fields in the model
10142	12/21/2016 09:15 AM	ben leinfelder	include group lookup field in mdq indexing
10126	12/15/2016 12:36 PM	ben leinfelder	use text() in xpath for multi-valued funder extraction
10119	12/13/2016 05:20 PM	ben leinfelder	correct xpath and solr settings for funder field
10116	12/13/2016 01:09 PM	ben leinfelder	make funder a multivalued field.
10115	12/13/2016 11:36 AM	ben leinfelder	extract check.echo.funder.1 output value for the "funder" index field
10114	12/12/2016 11:57 AM	Jing Tao	Add the code to handle the merge-needed fields having multiple pairs.
10112	12/09/2016 03:39 PM	Jing Tao	During index merging process, if a field is a system metadata field and current document doesn't have the field, we don't need to merge it from the existing solr document.
10094	12/06/2016 03:58 PM	ben leinfelder	re-include mdq context file after merge
10093	12/06/2016 03:58 PM	ben leinfelder	include mdq composite score
10092	12/06/2016 12:48 PM	ben leinfelder	remove root node in xpath (for some reason this wasn't working on mn-demo-8 but was locally).
10081	12/05/2016 05:27 PM	Jing Tao	Increase the version to 2.8.1
10071	12/05/2016 11:51 AM	Jing Tao	Merge the code for rdfxml subprocessor from d1_cn_index_processor to metacat.
10070	12/05/2016 11:44 AM	ben leinfelder	include v1 in the run mdq formatId
10068	12/02/2016 04:17 PM	Jing Tao	Add the prefix "prov_" to the solr fields.
10066	12/02/2016 09:07 AM	ben leinfelder	adjust color-based score calculations to match UI and add up to total result count
10065	12/02/2016 08:21 AM	ben leinfelder	add indexing for scores based on successful checks by check.type
10064	12/02/2016 08:04 AM	ben leinfelder	use dateConverter for run timestamp indexing
10063	12/01/2016 06:19 PM	ben leinfelder	include initial MDQ run processing in metacat-index
10062	12/01/2016 11:20 AM	Jing Tao	Use a new class to overwrite the class RdfXmlSubprocessor in d1-processor since that one has a method to use solr http server directly.
10060	12/01/2016 10:24 AM	ben leinfelder	merge changes from d1 indexing lib
9990	09/14/2016 10:00 PM	Jing Tao	Add the method PropertyConfigurator.configureAndWatch to monitor changes to the log4j.properties file.
9984	09/14/2016 11:05 AM	Jing Tao	Add more log information.
9982	09/14/2016 11:02 AM	Jing Tao	Add more log information.
9832	07/06/2016 04:36 PM	Jing Tao	Remove the import of JiBXException.
9831	07/06/2016 03:39 PM	Jing Tao	Replace the JiBXException with our own MarshallingException.
9829	07/06/2016 03:11 PM	Jing Tao	Centralize the version which will be modified. Bump the d1_cn_index_processor version to 2.3.0 snapshot.
9792	05/19/2016 04:11 PM	Jing Tao	Change it to 2.8.0 version.
9698	04/29/2016 02:15 PM	Jing Tao	Process the noaa variant of isotc211.
9697	04/29/2016 11:34 AM	Jing Tao	Add the file from d1_cn_index_processor.
9696	04/29/2016 11:31 AM	Jing Tao	Add a new copy from d1_cn_index_processor.
9582	03/23/2016 03:50 PM	Jing Tao	Add a statement to help diagnose issues.
9569	03/21/2016 01:21 PM	Jing Tao	Change it to 2.7.0 snapshot.
9549	03/08/2016 04:29 PM	Jing Tao	Change the version of d1_cn_index_processor to 2.2
9527	02/19/2016 03:41 PM	Jing Tao	Add beans for the iso index.
9526	02/19/2016 03:40 PM	Jing Tao	Add a bean file for the iso index.
9525	02/19/2016 03:39 PM	Jing Tao	Add a bean file for the iso index.
9425	11/19/2015 11:44 AM	ben leinfelder	add checks on archived flag to avoid NPE.
9424	11/19/2015 08:44 AM	ben leinfelder	only consult fields to merge if there was an existing referenced doc
9420	11/18/2015 04:41 PM	ben leinfelder	subclass AnnotatorSubprocessor for use in metacat-index (uses embedded solr server and solrj for retrieving/merging existing documents).
9419	11/18/2015 04:03 PM	ben leinfelder	bump trunk to 2.6.0-SNAPSHOT and pull in d1 dependencies at 2.1.0-SNAPSHOT to continue trunk development.
9411	11/18/2015 10:33 AM	ben leinfelder	add fileName, mediaType and mediaTypeProperties to solr schema and v2 system metadata processor
9406	11/16/2015 02:24 PM	ben leinfelder	refactor v2 context bean to use the v1 pattern used in metacat
9405	11/16/2015 02:21 PM	ben leinfelder	include seriesId in solr schema and context file (v2 system metadata)
9368	10/16/2015 08:22 PM	Jing Tao	Add the code to print the exception.
9341	09/29/2015 10:14 AM	ben leinfelder	add missing quotation mark
9332	09/24/2015 01:35 PM	ben leinfelder	fix xpath from CN changes for isPublic. https://redmine.dataone.org/issues/7374
9330	09/24/2015 10:34 AM	ben leinfelder	include hierarchical permissions when evaluating isPublic during indexing. https://redmine.dataone.org/issues/7374
9214	05/15/2015 10:19 AM	Lauren Walker	Index science metadata fields for the Dublin Core Extended metadata format. - Use d1_cn_index_processor 1.4.5 in metacat-index and update beans with new dcx subprocessor and xsi namespace
9155	03/19/2015 11:20 AM	ben leinfelder	merge CN annotation context files to metacat (MN) to support semantic index fields.
9060	01/07/2015 12:02 PM	ben leinfelder	use new IDocumentDeleteSubprocessors to handle clean-up of annotation index fields when annotations are removed.
9058	01/07/2015 12:01 PM	ben leinfelder	moved RDF XML subprocessor to cn-index project.
9055	01/06/2015 02:41 PM	ben leinfelder	move RDF/XML subprocessor and example configuration with SPARQL query to the cn index project from metacat so that it can be used by prov team when indexing ProvONE models in ORE documents
9054	01/06/2015 09:45 AM	ben leinfelder	add fieldsToMerge property for annotation updates
9053	01/02/2015 11:01 AM	ben leinfelder	add stub merge method to match Skye's recent refactoring to support reindexing when deletes are performed.
9028	12/18/2014 04:34 PM	ben leinfelder	use refactored classes from cn-index-processor. still in flux, but improving to better handle non-xml files
9022	12/12/2014 02:12 PM	ben leinfelder	add accessors for bean property
9021	12/12/2014 01:53 PM	ben leinfelder	use ISolrDataField because RDF subprocessor uses the SparqlField subclass.
9020	12/12/2014 01:30 PM	ben leinfelder	use input stream instead of Document for resource map processing test
9018	12/12/2014 01:12 PM	ben leinfelder	update classes and context files that use cn-index-processor classes. allowing document subprocessors to be less tied to XML.
8973	11/18/2014 02:39 PM	ben leinfelder	use sparql field and triple store from cn-index_processor (refactor). include annotatorSubprocessor for testing in metacat-index
8972	11/17/2014 09:29 PM	ben leinfelder	let metacat-index lookup annotations for indexing rather than the metacat "reindex" action.
8965	11/14/2014 12:03 PM	ben leinfelder	remove dev-testing in favor of maven.dataone.org repo
8893	10/10/2014 03:04 PM	Jing Tao	When we remove a solr index of a resource map, we don't need to know the content of the resource map. Instead, we will search the solr index to get information.
8864	09/16/2014 05:07 PM	Jing Tao	Add the code to handle removing the resource map index.
8862	09/16/2014 04:48 PM	Jing Tao	Add code to handle removing a resource map solr index.
8857	09/15/2014 02:32 PM	Lauren Walker	Create a valid URI by using all lowercase letters when creating a name for the triple model in the Rdf Xml Subprocessor. See bug: https://projects.ecoinformatics.org/ecoinfo/issues/6595
8847	09/10/2014 02:56 PM	Jing Tao	Change the d1_cn_index_processor version from 1.3.0 snapshot to 2.0.0 snapshot.
8840	08/19/2014 11:38 AM	Lauren Walker	When indexing annotations from RDFs, use the doc id to access the system metadata, not the model name since they are not always the same.
8839	08/18/2014 03:25 PM	Lauren Walker	Add PROV relationships to the Solr schema. Populate the fields using the RdfXmlSubprocessor
8837	08/18/2014 10:03 AM	Lauren Walker	Add wasDerivedFrom field to the Solr schema and use Sparql query to retrieve the value from the RDF
8826	08/06/2014 01:09 PM	ben leinfelder	update to use v2 types for indexing
8810	07/23/2014 04:19 PM	ben leinfelder	add support for v2 DataONE API.
8805	05/30/2014 05:06 PM	ben leinfelder	handle multiple subprocessors (RDF and ORE) before the object gets indexed by making sure to merge the solr doc map before submitting to the index.
8799	05/28/2014 11:48 AM	ben leinfelder	use default "metacat" context name for metacat-index testing.
8781	05/15/2014 10:27 AM	ben leinfelder	include ORE formatId as handled by the RDF subprocessor and index prov:wasDerivedFrom field where it exists in the RDF model. https://projects.ecoinformatics.org/ecoinfo/issues/6548
8774	05/13/2014 03:36 PM	ben leinfelder	use a non-public rightsHolder for both EML and Annotation test documents now that the RDF subprocessor checks each annotation to see that it came from a user that has write permission for the object being annotated.
8773	05/13/2014 03:34 PM	ben leinfelder	test for update using the updated EML file, not the original. Also add the SM to the shared map so that the indexing process can consult SM.accessPolicy when indexing annotations that assert things about those test documents.
8772	05/13/2014 03:32 PM	ben leinfelder	ignore the metacat/solr comparator tests - they are one-offs.
8766	05/07/2014 11:16 PM	ben leinfelder	pass around the object file path rather than the data stream so that multiple subprocessors can index the same object and not consume the stream before it gets to the next one. In preparation for extending the assertions stored in OREs. https://projects.ecoinformatics.org/ecoinfo/issues/6548
8761	04/30/2014 03:46 PM	ben leinfelder	only allow multiple values for multi-valued fields....
8758	04/29/2014 04:55 PM	ben leinfelder	use newer httpclient library so that Jena's dependency is met - this goes all the way back to d1_common/libclient needing to pull in the newer library.
8756	04/29/2014 04:52 PM	ben leinfelder	allow multivalued fields to be indexed using the "fields" pass through.
8752	04/28/2014 04:54 PM	Jing Tao	Localized the file which doesn't have the bean for dataUrl.
8741	04/18/2014 05:29 PM	Jing Tao	Remove the reference to the bean eml.fileID.
8736	04/15/2014 09:36 AM	Jing Tao	Remove the bean named eml.fileID which used the ResolveSolrField class.
8735	04/14/2014 02:58 PM	ben leinfelder	calculate geohash_3 to three places (typo)
8734	04/14/2014 02:55 PM	ben leinfelder	use NSEW for the bounding box geohash calculation from EML - all versions
8732	04/14/2014 02:07 PM	Peter Slaughter	Using 1.3.0-SNAPSHOT from d1_cn_index_processor
8730	04/14/2014 01:33 PM	Peter Slaughter	Add beans to support geohashes
8729	04/14/2014 01:33 PM	Peter Slaughter	Add beans to support geohashes
8722	04/02/2014 03:35 PM	ben leinfelder	handle null Boolean in SM.archived field
8720	03/31/2014 03:25 PM	ben leinfelder	use Matthew Jones for test creator since he has an ORCID in their staging environment.
8719	03/31/2014 12:31 PM	ben leinfelder	augment annotation indexing test/sample to include orcid annotation. https://projects.ecoinformatics.org/ecoinfo/issues/6267 https://projects.ecoinformatics.org/ecoinfo/issues/6423
8708	03/21/2014 05:07 PM	ben leinfelder	include characteristic_sm field with SPARQL query
8701	03/14/2014 10:55 AM	ben leinfelder	switch to index standard since it is more likely we will be able to determine this from our existing EML attribute information. https://projects.ecoinformatics.org/ecoinfo/issues/6253
8693	03/03/2014 04:33 PM	ben leinfelder	Do a more thorough check that the characteristic annotation was successfully indexed as expected (semtools)
