Project

General

Profile

Bug #2044

R actor for means and error by group

Added by Matt Jones over 14 years ago. Updated over 14 years ago.

Status:
Resolved
Priority:
Immediate
Assignee:
Category:
actors
Target version:
Start date:
03/11/2005
Due date:
% Done:

0%

Estimated time:
Bugzilla-Id:
2044

Description

This bug is to create an actor that wraps an R script (using the generic R actor
described in bug #1342) for calculating the mean, standard deviation, and
standard error of a numeric variable that is passed into the script. Optionally
the stats should be calculated using a grouping variable (by group), with one
mean/std/se for each group.

There are two inputs:
1) dataValue (type double)
2) group (type string)

The actor will accululate all data passed in as a series of {dataValue, group}
tuples and pass this along to R to calculate the mean, stdev, and stderr by
group. If group is not present then the tuples will be {dataValue}. When the R
script is finished, a matrix of {group, mean, stdev, stderr} will be passed back
to the R actor and then emitted on the output port. So for N tokens that come
in on the input port the actor will output on 1 token on the output port after
all input data has been received (this is a grouping operation).


Related issues

Blocked by Kepler - Bug #2043: create suite of common statistical actors using RResolved03/11/2005

History

#1 Updated by Dan Higgins over 14 years ago

An RExpression actor with this functionality has been created and is present in
the Kepler CVS distribution as an example workflow.

#2 Updated by Redmine Admin over 6 years ago

Original Bugzilla ID was 2044

Also available in: Atom PDF