
# Date Author Comment
10063 12/01/2016 06:19 PM ben leinfelder

include initial MDQ run processing in metacat-index

10062 12/01/2016 11:20 AM Jing Tao

Use a new class to override the class RdfXmlSubprocessor in d1-processor, since that one has a method that uses the solr http server directly.

10060 12/01/2016 10:24 AM ben leinfelder

merge changes from d1 indexing lib

9990 09/14/2016 10:00 PM Jing Tao

Add a call to PropertyConfigurator.configureAndWatch to monitor changes to the log4j.properties file.

9984 09/14/2016 11:05 AM Jing Tao

Add more log information.

9982 09/14/2016 11:02 AM Jing Tao

Add more log information.

9832 07/06/2016 04:36 PM Jing Tao

Remove the import of JiBXException.

9831 07/06/2016 03:39 PM Jing Tao

Replace the JiBXException with our own MarshallingException.

9829 07/06/2016 03:11 PM Jing Tao

Centralize the version which will be modified. Bump the d1_cn_index_processor version to 2.3.0 snapshot.

9792 05/19/2016 04:11 PM Jing Tao

Change it to 2.8.0 version.

9698 04/29/2016 02:15 PM Jing Tao

Process the noaa variant of isotc211.

9697 04/29/2016 11:34 AM Jing Tao

Add the file from d1_cn_index_processor.

9696 04/29/2016 11:31 AM Jing Tao

Add a new copy from d1_cn_index_processor.

9582 03/23/2016 03:50 PM Jing Tao

Add a statement to help diagnose issues.

9569 03/21/2016 01:21 PM Jing Tao

Change it to 2.7.0 snapshot.

9549 03/08/2016 04:29 PM Jing Tao

Change the version of d1_cn_index_processor to 2.2

9527 02/19/2016 03:41 PM Jing Tao

Add beans for the iso index.

9526 02/19/2016 03:40 PM Jing Tao

Add a bean file for the iso index.

9525 02/19/2016 03:39 PM Jing Tao

Add a bean file for the iso index.

9425 11/19/2015 11:44 AM ben leinfelder

add checks on archived flag to avoid NPE.

9424 11/19/2015 08:44 AM ben leinfelder

only consult fields to merge if there was an existing referenced doc

9420 11/18/2015 04:41 PM ben leinfelder

subclass AnnotatorSubprocessor for use in metacat-index (uses embedded solr server and solrj for retrieving/merging existing documents).

9419 11/18/2015 04:03 PM ben leinfelder

bump trunk to 2.6.0-SNAPSHOT and pull in d1 dependencies at 2.1.0-SNAPSHOT to continue trunk development.

9411 11/18/2015 10:33 AM ben leinfelder

add fileName, mediaType and mediaTypeProperties to solr schema and v2 system metadata processor

9406 11/16/2015 02:24 PM ben leinfelder

refactor v2 context bean to use the v1 pattern used in metacat

9405 11/16/2015 02:21 PM ben leinfelder

include seriesId in solr schema and context file (v2 system metadata)

9368 10/16/2015 08:22 PM Jing Tao

Add the code to print the exception.

9341 09/29/2015 10:14 AM ben leinfelder

add missing quotation mark

9332 09/24/2015 01:35 PM ben leinfelder

fix xpath from CN changes for isPublic. https://redmine.dataone.org/issues/7374

9330 09/24/2015 10:34 AM ben leinfelder

include hierarchical permissions when evaluating isPublic during indexing. https://redmine.dataone.org/issues/7374

9214 05/15/2015 10:19 AM Lauren Walker

Index science metadata fields for the Dublin Core Extended metadata format. - Use d1_cn_index_processor 1.4.5 in metacat-index and update beans with new dcx subprocessor and xsi namespace

9155 03/19/2015 11:20 AM ben leinfelder

merge CN annotation context files to metacat (MN) to support semantic index fields.

9060 01/07/2015 12:02 PM ben leinfelder

use new IDocumentDeleteSubprocessors to handle clean-up of annotation index fields when annotations are removed.

9058 01/07/2015 12:01 PM ben leinfelder

moved RDF XML subprocessor to cn-index project.

9055 01/06/2015 02:41 PM ben leinfelder

move RDF/XML subprocessor and example configuration with SPARQL query to the cn index project from metacat so that it can be used by prov team when indexing ProvONE models in ORE documents

9054 01/06/2015 09:45 AM ben leinfelder

add fieldsToMerge property for annotation updates

9053 01/02/2015 11:01 AM ben leinfelder

add stub merge method to match Skye's recent refactoring to support reindexing when deletes are performed.

9028 12/18/2014 04:34 PM ben leinfelder

use refactored classes from cn-index-processor. still in flux, but improving to better handle non-xml files

9022 12/12/2014 02:12 PM ben leinfelder

add accessors for bean property

9021 12/12/2014 01:53 PM ben leinfelder

use ISolrDataField because RDF subprocessor uses the SparqlField subclass.

9020 12/12/2014 01:30 PM ben leinfelder

use input stream instead of Document for resource map processing test

9018 12/12/2014 01:12 PM ben leinfelder

update classes and context files that use cn-index-processor classes. allowing document subprocessors to be less tied to XML.

8973 11/18/2014 02:39 PM ben leinfelder

use sparql field and triple store from cn-index_processor (refactor). include annotatorSubprocessor for testing in metacat-index

8972 11/17/2014 09:29 PM ben leinfelder

let metacat-index look up annotations for indexing rather than the metacat "reindex" action.

8965 11/14/2014 12:03 PM ben leinfelder

remove dev-testing in favor of maven.dataone.org repo

8893 10/10/2014 03:04 PM Jing Tao

When we remove the solr index of a resource map, we don't need to know the content of the resource map. Instead, we search the solr index to get the information.

8864 09/16/2014 05:07 PM Jing Tao

Add the code to handle removing the resource map index.

8862 09/16/2014 04:48 PM Jing Tao

Add code to handle removing a resource map solr index.

8857 09/15/2014 02:32 PM Lauren Walker

Create a valid URI by using all lowercase letters when creating a name for the triple model in the Rdf Xml Subprocessor. See bug: https://projects.ecoinformatics.org/ecoinfo/issues/6595

8847 09/10/2014 02:56 PM Jing Tao

Change the d1_cn_index_processor version from 1.3.0 snapshot to 2.0.0 snapshot.

8840 08/19/2014 11:38 AM Lauren Walker

When indexing annotations from RDFs, use the doc id to access the system metadata, not the model name since they are not always the same.

8839 08/18/2014 03:25 PM Lauren Walker

Add PROV relationships to the Solr schema. Populate the fields using the RdfXmlSubprocessor

8837 08/18/2014 10:03 AM Lauren Walker

Add wasDerivedFrom field to the Solr schema and use Sparql query to retrieve the value from the RDF

8826 08/06/2014 01:09 PM ben leinfelder

update to use v2 types for indexing

8810 07/23/2014 04:19 PM ben leinfelder

add support for v2 DataONE API.

8805 05/30/2014 05:06 PM ben leinfelder

handle multiple subprocessors (RDF and ORE) before the object gets indexed by making sure to merge the solr doc map before submitting to the index.

8799 05/28/2014 11:48 AM ben leinfelder

use default "metacat" context name for metacat-index testing.

8781 05/15/2014 10:27 AM ben leinfelder

include ORE formatId as handled by the RDF subprocessor and index prov:wasDerivedFrom field where it exists in the RDF model. https://projects.ecoinformatics.org/ecoinfo/issues/6548

8774 05/13/2014 03:36 PM ben leinfelder

use a non-public rightsHolder for both EML and Annotation test documents now that the RDF subprocessor checks each annotation to see that it came from a user that has write permission for the object being annotated.

8773 05/13/2014 03:34 PM ben leinfelder

test for update using the updated EML file, not the original. Also add the SM to the shared map so that the indexing process can consult SM.accessPolicy when indexing annotations that assert things about those test documents.

8772 05/13/2014 03:32 PM ben leinfelder

ignore the metacat/solr comparator tests - they are one-offs.

8766 05/07/2014 11:16 PM ben leinfelder

pass around the object file path rather than the data stream so that multiple subprocessors can index the same object and not consume the stream before it gets to the next one. In preparation for extending the assertions stored in OREs. https://projects.ecoinformatics.org/ecoinfo/issues/6548

8761 04/30/2014 03:46 PM ben leinfelder

only allow multiple values for multi-valued fields....

8758 04/29/2014 04:55 PM ben leinfelder

use newer httpclient library so that Jena's dependency is met - this goes all the way back to d1_common/libclient needing to pull in the newer library.

8756 04/29/2014 04:52 PM ben leinfelder

allow multivalued fields to be indexed using the "fields" pass through.

8752 04/28/2014 04:54 PM Jing Tao

Localized the file which doesn't have the bean for dataUrl.

8741 04/18/2014 05:29 PM Jing Tao

Remove the reference to the bean eml.fileID.

8736 04/15/2014 09:36 AM Jing Tao

Remove the bean named eml.fileID which used the ResolveSolrField class.

8735 04/14/2014 02:58 PM ben leinfelder

calculate geohash_3 to three places (typo)

8734 04/14/2014 02:55 PM ben leinfelder

use NSEW for the bounding box geohash calculation from EML - all versions

8732 04/14/2014 02:07 PM Peter Slaughter

Using 1.3.0-SNAPSHOT from d1_cn_index_processor

8730 04/14/2014 01:33 PM Peter Slaughter

Add beans to support geohashes

8729 04/14/2014 01:33 PM Peter Slaughter

Add beans to support geohashes

8722 04/02/2014 03:35 PM ben leinfelder

handle null Boolean in SM.archived field

8720 03/31/2014 03:25 PM ben leinfelder

use Matthew Jones for test creator since he has an ORCID in their staging environment.

8719 03/31/2014 12:31 PM ben leinfelder

augment annotation indexing test/sample to include orcid annotation. https://projects.ecoinformatics.org/ecoinfo/issues/6267
https://projects.ecoinformatics.org/ecoinfo/issues/6423

8708 03/21/2014 05:07 PM ben leinfelder

include characteristic_sm field with SPARQL query

8701 03/14/2014 10:55 AM ben leinfelder

switch to index standard since it is more likely we will be able to determine this from our existing EML attribute information. https://projects.ecoinformatics.org/ecoinfo/issues/6253

8693 03/03/2014 04:33 PM ben leinfelder

Do a more thorough check that the characteristic annotation was successfully indexed as expected (semtools)

8692 03/03/2014 03:56 PM ben leinfelder

switch to the OpenAnnotation (OA) model for annotating datapackages with measurements/characteristics (semtools)

8691 03/03/2014 03:55 PM ben leinfelder

switch to the OpenAnnotation (OA) model for annotating datapackages with measurements/characteristics (semtools)

8690 03/03/2014 03:54 PM ben leinfelder

switch to the OpenAnnotation (OA) model for annotating datapackages with measurements/characteristics (semtools)

8688 03/03/2014 09:30 AM ben leinfelder

bump the poms to 2.4.2

8648 02/25/2014 04:32 PM ben leinfelder

test that obsoleted objects remain indexed, but are marked as obsoleted. https://projects.ecoinformatics.org/ecoinfo/issues/6424

8603 02/14/2014 11:49 AM ben leinfelder

use rangeOfDates | singleDateTime to populate the beginDate and endDate index fields. https://projects.ecoinformatics.org/ecoinfo/issues/6285

8580 02/07/2014 02:20 PM ben leinfelder

include ID field as a minimum for indexing additional fields.

8554 02/03/2014 02:53 PM ben leinfelder

correctly include stacktrace for error debugging.

8552 02/03/2014 02:51 PM ben leinfelder

return null if there is no existing SolrDoc for the given pid.

8545 01/31/2014 10:41 AM ben leinfelder

index singleDateTime value into both begin and end date fields in solr. https://projects.ecoinformatics.org/ecoinfo/issues/6285

8537 01/29/2014 01:41 PM ben leinfelder

uncomment the original tests now that the "field" test is working.

8503 01/15/2014 09:55 AM ben leinfelder

check for existing index document before trying to use existing fields.

8498 01/14/2014 02:57 PM ben leinfelder

allow indexing of RDF documents - provide a sparql query that will return values for the field name. Using measurement_sm initially (a dynamic multivalued solr field). https://projects.ecoinformatics.org/ecoinfo/issues/6253

8496 01/14/2014 02:14 PM ben leinfelder

check for existing documents - don't assume it exists.

8464 01/07/2014 01:56 PM ben leinfelder

Unify solr indexing with an IndexTask that is added to the queue -- allows us to send more than just the systemMetadata to the indexer. Initially this is for READ event counts for each document. https://projects.ecoinformatics.org/ecoinfo/issues/6346

8458 12/20/2013 11:18 AM ben leinfelder

move metacat trunk to 2.4.0-SNAPSHOT

8440 12/19/2013 08:53 AM ben leinfelder

prep for 2.3.1 release

8372 11/08/2013 01:56 PM ben leinfelder

use 2.3.0 without SNAPSHOT pre-release.

8353 10/30/2013 10:37 AM Jing Tao

Renamed the test class.

8352 10/30/2013 10:19 AM Jing Tao

Rename the IndexGenerator to IndexGeneratorTimerTask.

8345 10/28/2013 08:21 PM Jing Tao

Fixed a bug that when a data file was archived, the solr index for the metadata object still kept the "documents" element.