metacat / src / doi_registration.sql @ 8427

  • svn:eol-style: native
  • svn:keywords: Author Date Id Revision
# Date Author Comment
7416 11/05/2012 09:15 AM ben leinfelder

update pub_date when the length of that field is != 4 (use date_created in this scenario). There were 2 entries that had "193" as the pub_date.

7415 11/01/2012 09:03 AM ben leinfelder

replace new lines in creator with spaces. set blank " " titles and creators to "unknown". use "Baltimore Ecosystem Study LTER" for publisher on all BES objects.

7414 10/26/2012 06:50 PM ben leinfelder

include John Kunze's latest suggestions for improved metadata -- a lot of clean-up, especially on characters in the file. Note UTF-8 encoding of the script.

7365 08/23/2012 10:41 PM ben leinfelder

use resourceMapLocation (resolve url for the ore map) as the datacite_relatedIdentifier_isPartOf property

7364 08/23/2012 10:38 PM ben leinfelder

use lowercase 'metadata' and 'data' for the resourceType

7363 08/23/2012 10:36 PM ben leinfelder

set publisher to the source system when publisher == creator (we want them to be different, even if just for appearances)

7362 08/23/2012 10:25 PM ben leinfelder

only include public (readable) DOIs in the final output

7361 08/23/2012 10:24 PM ben leinfelder

use "lastname, firstname" convention throughout

7360 08/23/2012 10:18 PM ben leinfelder

include more descriptive data file name for title of data records

7359 08/23/2012 10:04 PM ben leinfelder

include publisher given name correctly

7336 07/31/2012 07:12 AM ben leinfelder

use correct EZID account names for the three different nodes.

7335 07/30/2012 10:12 PM ben leinfelder

align the final column headers with the datacite schema, as applicable.

7333 07/30/2012 01:46 PM ben leinfelder

use DataCite isNewVersionOf/isPreviousVersionOf for revision history

7328 07/23/2012 05:13 PM ben leinfelder

not every EML file has an ORE datapackage descriptor -- join only to those when setting the resourceMapId

7327 07/23/2012 04:29 PM ben leinfelder

correctly use document revision for object format and resource map joins.

7324 07/20/2012 02:28 PM ben leinfelder

use correct children of 'publisher' element

7321 07/18/2012 10:11 AM ben leinfelder

include the resourceMapId for the metadata objects, not just the data files.

7320 07/18/2012 08:56 AM ben leinfelder

updated LDAP dump and corrected missing entries that had been removed from LDAP.

7306 07/11/2012 05:05 PM ben leinfelder

handle null givenNames from the LDAP dump.

7305 07/11/2012 04:38 PM ben leinfelder

make sure we only get the publisher text content (not attribute value)

7303 07/11/2012 03:05 PM ben leinfelder

DOI registration:
-include more revision history based on the identifier table not just the generated SM metadata
-include ecogrid data urls for revisions (long query in xml_nodes_revisions table)

7296 07/09/2012 04:58 PM ben leinfelder

update creator and publisher using LDAP dump. unfortunately LDAP has shifted over the years and not all identities are still active in LDAP...but we did get quite a few creator names updated!

7290 07/05/2012 04:13 PM ben leinfelder

save point - adding more columns for access, data packaging, revision history

7288 07/03/2012 03:45 PM ben leinfelder

update the table to indicate which DOI account we are targeting

7284 06/22/2012 08:55 AM ben leinfelder

use production cn url for the resolve url

7193 05/27/2012 09:03 AM ben leinfelder

encode '/' and ':' in the DOI used for the resolve URL

7191 05/25/2012 04:23 PM ben leinfelder

include revisions table in the initial temp table population.
use the "first" creator listed in the EML (either org or person).
use other reasonable default values as needed to fully populate the spreadsheet columns

7190 05/25/2012 02:30 PM ben leinfelder

add columns: publisher and pub_date. include default values for all columns - even data files should have title.
still a few todos but closer.

7189 05/25/2012 12:07 PM ben leinfelder

script to generate DOI registration spreadsheet