Feature #6289

closed

Create usage statistics service

Added by Matt Jones almost 11 years ago. Updated almost 11 years ago.

Status: Rejected
Priority: Normal
Assignee: -
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Bugzilla-Id:

Description

Currently, the only usage statistics available in Metacat are the raw logs. This new service would provide several statistical reports in a machine-readable format, so that clients can efficiently build user interface displays that show those statistics.

The service should report the following statistics, and be extensible so that other tracked statistics can be added as needed:

  1. Number of views (defined as number of times the metadata has been viewed on the web)
  2. Number of package downloads (needs definition)
  3. Size in bytes of package downloads
  4. Number of citations (implement in a second phase)

For each of these statistics, calling apps should be able to constrain the results to records pertaining to:

  1. a PID or list of PIDs
  2. a user or list of users (DN, or ORCID, or some amalgam -- to be discussed)
  3. a time range
  4. a spatial range
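As a starting point for discussion, the four constraints above could be modeled as one optional filter object. This is only a sketch: the `Event` and `StatsFilter` names, their fields, the sample identifiers, and the choice of a point location per event and a bounding box for the spatial range are all assumptions, not Metacat's log schema or an agreed design.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Event:
    """One logged access event (hypothetical shape, not Metacat's schema)."""
    pid: str
    user: str
    timestamp: datetime
    lat: float
    lon: float


@dataclass(frozen=True)
class StatsFilter:
    """Each constraint is optional; None means 'do not constrain'."""
    pids: Optional[Sequence[str]] = None
    users: Optional[Sequence[str]] = None
    # Half-open interval [start, end).
    time_range: Optional[Tuple[datetime, datetime]] = None
    # Assumed bounding box: (west, south, east, north) in degrees.
    bbox: Optional[Tuple[float, float, float, float]] = None

    def matches(self, e: Event) -> bool:
        """Return True only if the event satisfies every constraint that is set."""
        if self.pids is not None and e.pid not in self.pids:
            return False
        if self.users is not None and e.user not in self.users:
            return False
        if self.time_range is not None:
            start, end = self.time_range
            if not (start <= e.timestamp < end):
                return False
        if self.bbox is not None:
            west, south, east, north = self.bbox
            if not (west <= e.lon <= east and south <= e.lat <= north):
                return False
        return True


# Made-up sample events for illustration only.
events = [
    Event("pid-A", "user-1", datetime(2014, 1, 15), 34.4, -119.7),
    Event("pid-B", "user-2", datetime(2014, 2, 3), 64.8, -147.7),
]

january = StatsFilter(time_range=(datetime(2014, 1, 1), datetime(2014, 2, 1)))
january_pids = [e.pid for e in events if january.matches(e)]
```

Constraints combine with AND here; whether a list of PIDs or users should instead support other combinations is left open.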

For each of these statistics, calling apps should be able to request the statistic aggregated by several specific facets, including the following (in order of importance):

  1. PID
  2. User (DN, or ORCID, or some amalgam -- to be discussed)
  3. Time range, aggregated to requested unit (day, week, month, year)
  4. Spatial range, aggregated to requested unit (to be discussed)
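For the time facet, one possible approach is to map each event date to the first day of its bucket and group on that value. The sketch below assumes weeks start on Monday (ISO convention); the function name `bucket_start` is hypothetical, and the week convention would need to be agreed on.

```python
from datetime import date, timedelta


def bucket_start(d: date, unit: str) -> date:
    """Return the first day of the day/week/month/year bucket containing d."""
    if unit == "day":
        return d
    if unit == "week":
        # date.weekday() is 0 for Monday, so this steps back to Monday.
        return d - timedelta(days=d.weekday())
    if unit == "month":
        return d.replace(day=1)
    if unit == "year":
        return d.replace(month=1, day=1)
    raise ValueError(f"unsupported unit: {unit!r}")


sample = date(2014, 3, 19)
buckets = {unit: bucket_start(sample, unit) for unit in ("day", "week", "month", "year")}
```

Using the bucket's start date as the grouping key keeps the key sortable and independent of the display format chosen by the client.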

Intersections of these aggregated facets should also be possible, but are a lower priority than the facets alone. For example, when finished, one should be able to request the following reports, among others:

  1. Number of downloads by PID
  2. Number of downloads by user (aggregates across PIDs)
  3. Number of downloads by month (aggregates across PIDs)
  4. Number of downloads by spatial location (aggregates across PIDs)
  5. Number of downloads by PID by month
  6. Number of downloads by user by month
  7. etc.
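The reports listed above, including the facet intersections, could all come from one grouping routine that takes an ordered list of facet names. A minimal sketch, assuming download events are available as simple records; the `FACETS` registry, the `count_by` function, and the record fields are hypothetical names used only for illustration.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, Sequence

DownloadEvent = Dict[str, str]

# Registry of facet extractors; adding an entry adds a new aggregating facet.
FACETS: Dict[str, Callable[[DownloadEvent], str]] = {
    "pid": lambda e: e["pid"],
    "user": lambda e: e["user"],
    "month": lambda e: e["date"][:7],  # "YYYY-MM" prefix of an ISO date
}


def count_by(events: Iterable[DownloadEvent], facets: Sequence[str]) -> Counter:
    """Count events per combination of the requested facets."""
    counts: Counter = Counter()
    for e in events:
        key = tuple(FACETS[name](e) for name in facets)
        counts[key] += 1
    return counts


# Made-up download events for illustration only.
downloads = [
    {"pid": "pid-A", "user": "user-1", "date": "2014-01-15"},
    {"pid": "pid-A", "user": "user-2", "date": "2014-01-20"},
    {"pid": "pid-B", "user": "user-1", "date": "2014-02-03"},
]

by_pid = count_by(downloads, ["pid"])                   # report 1
by_user_month = count_by(downloads, ["user", "month"])  # report 6
```

A single facet and an intersection of facets go through the same code path, so supporting intersections later would not require a separate mechanism.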

The download format (JSON? XML?) should allow for an extensible set of response variables and an extensible set of aggregating facets. This needs discussion.
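To make the extensibility requirement concrete, here is one possible JSON shape, offered only as input to that discussion. The envelope keys (`metrics`, `facets`, `results`, `values`) and the `build_report` function are assumptions; nothing here is an agreed or existing DataONE format.

```python
import json
from typing import Any, Dict, Iterable, Sequence, Tuple

Row = Tuple[Sequence[str], Sequence[int]]


def build_report(metrics: Sequence[str],
                 facets: Sequence[str],
                 rows: Iterable[Row]) -> Dict[str, Any]:
    """Build a self-describing report: the metric and facet names are
    declared once, and every result row is keyed by those names."""
    return {
        "metrics": list(metrics),
        "facets": list(facets),
        "results": [
            {
                "facets": dict(zip(facets, facet_values)),
                "values": dict(zip(metrics, metric_values)),
            }
            for facet_values, metric_values in rows
        ],
    }


# Made-up numbers for illustration only.
report = build_report(
    metrics=["downloads", "bytes"],
    facets=["pid", "month"],
    rows=[
        (("pid-A", "2014-01"), (2, 2048)),
        (("pid-B", "2014-02"), (1, 512)),
    ],
)
report_json = json.dumps(report, indent=2)
```

Because rows are keyed by name rather than by position, a client that does not know a newly added metric or facet can ignore it without breaking.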

The REST API for this service should be developed in the DataONE space, with the intention that it be implementable by both other Member Nodes (MNs) and Coordinating Nodes (CNs) in DataONE.


Related issues

Is duplicate of Metacat - Feature #5989: Track data download, view and citation statistics (In Progress, Peter Slaughter)
