Profile

  • svn:eol-style: native
  • svn:keywords: Author Date Id Revision

# Date Author Comment
9463 12/18/2015 03:14 PM Jing Tao

Close some prepared SQL statements in the summarize method.

8910 10/17/2014 05:02 PM ben leinfelder

use http://tools.ietf.org/rfc/rfc3023 spec for conformsTo property. use the full xpath for EML dataTable and attribute selectors

8788 05/19/2014 02:21 PM ben leinfelder

use separate surName and givenNames to lookup ORCIDs.

8784 05/15/2014 03:17 PM ben leinfelder

allow full-text queries for ORCID, but it isn't that great because we might have a "PISCO" creator that shows up in many different ORCID profiles... false matches.

8769 05/09/2014 01:48 PM ben leinfelder

cache the imported models to avoid timeouts from remote hosts (or being locked out for too many requests in a given time period).
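
A minimal sketch of the caching idea in this commit, not the code that was committed: each model is loaded at most once per URI and reused afterwards, so repeated runs do not hit the remote host again. The `ModelCache` class and the loader function are hypothetical names introduced for illustration; the real change may cache differently.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical helper: caches one loaded model per URI.
final class ModelCache<M> {
    private final Map<String, M> cache = new ConcurrentHashMap<>();
    private final Function<String, M> loader; // assumed to perform the remote fetch

    ModelCache(Function<String, M> loader) {
        this.loader = loader;
    }

    // Returns the cached model, invoking the loader only on the first request for a URI.
    M get(String uri) {
        return cache.computeIfAbsent(uri, loader);
    }
}
```

The loader is passed in so the cache stays independent of whichever RDF library performs the actual import.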

8767 05/08/2014 04:23 PM ben leinfelder

use in-memory TDB dataset for querying annotations for indexing -- this comes with the same reasoning capabilities as the directory-based one, but has the benefit of not filling the directory with triples that will not be used again. prepping for d1 AHM

8765 05/07/2014 11:12 PM ben leinfelder

when indexing annotations directly, just use an in-memory triple store rather than TDB since we remove each graph as it is processed (and my TDB instance would get into the multi-GB range with a few runs, even if I removed the old models)

8763 05/02/2014 04:39 PM ben leinfelder

simplify lookup for classes and orcid. remove the "random" annotation code branches -- just too confusing to look at those bogus classes especially now that we have "real" generated annotations.

8757 04/29/2014 04:54 PM ben leinfelder

first pass at direct EML->semantic index method. Still produces an RDF model, but does not persist it in Metacat, only in the triplestore. Allows us to re-run without adding stale RDF to the MN store.

8743 04/22/2014 11:43 AM ben leinfelder

include BioPortal lookup for Entity matches using the data table description. TODO: only associate measurements to the entity observation if they apply.

8724 04/02/2014 03:36 PM ben leinfelder

check for null entities and/or attributes (typically when otherEntity is being used in EML).

8718 03/31/2014 11:11 AM ben leinfelder

attribute the datapackage to the creator (using orcid if we can find it). https://projects.ecoinformatics.org/ecoinfo/issues/6267
https://projects.ecoinformatics.org/ecoinfo/issues/6423

8717 03/31/2014 10:31 AM ben leinfelder

add test for BioPortal annotator service.

8716 03/28/2014 03:51 PM ben leinfelder

refactor web service calls to bioportal and orcid outside of the annotator class. test with orcid sandbox server. include orcid uri for the annotations being generated (we can index these and drive our searches on these values down the road). related to this: https://projects.ecoinformatics.org/ecoinfo/issues/6423 and also some semtools tasks.

8714 03/26/2014 04:05 PM ben leinfelder

Use OBOE-SBC ontology for looking up concepts (it contains subclasses of our OBOE Characteristic and Standard superclasses). Restrict annotations to only subclasses that fit the OBOE model. Correct the xpointer and individual naming conventions so they are unique, but express the exact entity/attribute being annotated.

8713 03/26/2014 03:10 PM ben leinfelder

remove my api key. oops

8712 03/26/2014 03:02 PM ben leinfelder

add comment/pointer to BioPortal annotation service.

8711 03/26/2014 03:00 PM ben leinfelder

Include method to look up annotation classes from BioPortal. We still have OBOE-SBC in there, and they have the SWEET ontology. The suggestions returned are not perfect, but they can be better than nothing. Ideally, we'd only query a few ontologies so we don't end up using terms from medical ontologies that aren't really appropriate for our domain. https://projects.ecoinformatics.org/ecoinfo/issues/6256

8710 03/24/2014 04:17 PM ben leinfelder

Add xpointer FragmentSelectors to each annotation.
Split attribute label into tokens to attempt matching to OBOE concepts.
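
The token splitting mentioned above could look roughly like the following. This is a sketch under assumptions, not the committed implementation: the `LabelTokenizer` class is hypothetical, and the splitting rules (camelCase boundaries plus any non-alphanumeric delimiter, lower-cased output) are a guess at what makes attribute labels comparable to concept names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper: breaks an EML attribute label into lower-case tokens.
final class LabelTokenizer {
    static List<String> tokenize(String label) {
        List<String> tokens = new ArrayList<>();
        if (label == null) {
            return tokens;
        }
        // Insert a space at lower-to-upper camelCase boundaries, e.g. "waterTemp" becomes "water Temp".
        String spaced = label.replaceAll("([a-z0-9])([A-Z])", "$1 $2");
        // Split on runs of anything that is not a letter or digit.
        for (String part : spaced.split("[^A-Za-z0-9]+")) {
            if (!part.isEmpty()) {
                tokens.add(part.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("waterTemp_surface")); // prints [water, temp, surface]
    }
}
```

Each token can then be tried separately against concept labels, which gives a label like "waterTemp_surface" a chance to match a temperature concept even though the full string would not.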

8709 03/23/2014 12:11 AM ben leinfelder

include code to generate random annotations for UI testing. Effective, but can be confusing to see so many unrelated concepts on duplicate EML packages.

8702 03/14/2014 10:59 AM ben leinfelder

first pass at generating annotations from EML attribute information. uses the OpenAnnotation model that the metacat-index tests assume which allows us to populate dynamic index fields for the annotation class[es]. There is still much to be done with finding appropriate concepts for each attribute. https://projects.ecoinformatics.org/ecoinfo/issues/6256

8689 03/03/2014 03:41 PM ben leinfelder

support content from all serverLocations when summarizing entity info (semtools)

8646 02/24/2014 04:30 PM ben leinfelder

First pass at a class for summarizing attribute information for analysis. (semtools) https://projects.ecoinformatics.org/ecoinfo/issues/6256