Requirement	StatCalc
Section	3.2.6
JIRA Task	EIR-56

Reviewed For	Date
Conventional spacing between sections	2016-08-31

Introduction

The StatCalc component of Epi Info™ 7 enables the user to evaluate the performance of different study designs and statistical tests by supplying high-level information on the properties of hypothetical data sets and the criteria used for evaluation. StatCalc tools can be divided into three broad categories: 1) sample size and power calculations for unmatched case-control studies, population surveys, and cohort or cross-sectional studies, together with chi-square for trend (the Mantel extension of the Mantel-Haenszel summary odds ratio and chi-square), which tests for the presence of a trend in dose-response or other case-control studies where a series of increasing or decreasing exposures is being studied; 2) analysis of 2×2 tables to produce odds ratios and risk ratios (relative risks) with confidence limits, Fisher exact tests, and 1- and 2-tailed p-values, with Mantel-Haenszel summary odds ratios, chi-square tests, and associated p-values for stratified data; and 3) distribution-based event probabilities, 2-tailed p-values, and confidence intervals for deviations from binomial (proportions) and Poisson (rare events) distributions given the number of observed and expected events. The StatCalc tools can be accessed as an independent module from the main menu or as part of the Visual Dashboard.

Accessing StatCalc

Overview

StatCalc appears on the Epi Info™ 7 main menu, middle row, right column, and is captioned "Statistical calculators for sample size, power, and more." Clicking on StatCalc in the main menu opens the StatCalc menu, which is similar to the main menu in appearance and contains buttons for the eight (8) StatCalc calculators listed in StatCalc Calculator Properties, below. The individual calculators are also part of Visual Dashboard (VD). In VD, the calculators can be accessed from the Options menu, under the "Add StatCalc calculator" submenu, and added to the dashboard Canvas in the same manner as other gadgets. As StatCalc calculators do not access data records and process only data entered manually by the user, calculators can be added to a canvas without having to first attach a data source. However, calculators placed on the dashboard cannot be saved with the canvas, nor can their output be exported or printed with the other analysis gadgets.

Functional Requirements

Epi Info™ 7 shall enable the user to open the StatCalc menu from the Epi Info™ main menu.
The StatCalc menu shall enable the user to start the eight (8) StatCalc calculators from an array of buttons bearing the names of the calculators.
When the user selects a calculator from the StatCalc menu, the selected calculator shall open in a window containing the components specified in StatCalc Calculator Properties, below.
The VD shall enable the user to select StatCalc calculators from the Options menu.
The VD shall enable the user to place the selected calculators on the Canvas.
Calculators placed on the dashboard Canvas shall have all the relevant properties of other gadgets including the ability to:
1. be repositioned on the Canvas,
2. anchor to other gadgets,
3. stack vertically with other gadgets in response to the user pressing the Vertical Arrange button on the VD status bar,
4. hide borders in response to the user pressing the Hide Gadget Borders button on the VD status bar^[1],
5. collapse and expand the calculator window in response to the user pressing the Arrow toggle button on the gadget's title bar^[2], and
6. close and be removed from the Canvas in response to the user pressing the Close button (red box with a white × inside).

Notes:

[1] The behavior of this function in StatCalc calculators differs from that of other gadgets. For most gadgets, the Hide Gadget Borders button removes not only the outer rectangle but also the title bar and associated buttons. As a consequence, those gadgets can no longer be repositioned, reconfigured, collapsed, expanded, or closed. For a StatCalc calculator, the Hide Gadget Borders button removes only the border.
[2] The collapse and expand functions have not been implemented for StatCalc calculators as of Epi Info™ version 7.2.0.1. See also Future Development.

StatCalc Calculators

Overview

The StatCalc calculators can be divided into three general categories: 1) sample size and power calculations, 2) two-way table calculations, and 3) distribution-based event probabilities. Each category has distinct input parameters and analyses, but they all provide the ability to enter hypothetical data and evaluate their statistical significance. In doing so, the user may determine important study parameters, such as the number of subjects required to test a hypothesis given specific assumptions about variables such as rates of exposure or the ratios of cases to controls.
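
As an illustration of the first category, the sketch below estimates the sample size for a population survey from the inputs listed for that calculator (population size, expected frequency, acceptable margin of error, design effect, and clusters). It assumes the widely used finite-population formula n = deff × N × p(1 − p) / [(d² / z²)(N − 1) + p(1 − p)]. This document does not state which formula StatCalc uses, so the formula, the rounding rule, and the function name `survey_sample_size` are assumptions made for illustration only.

```python
import math
from statistics import NormalDist


def survey_sample_size(population, expected_pct, margin_pct,
                       confidence=0.95, design_effect=1.0, clusters=1):
    """Return (cluster_size, total_sample) for a population survey.

    Hypothetical helper: the formula and the rounding are assumptions,
    not taken from the Epi Info source code.
    """
    p = expected_pct / 100.0   # expected frequency as a proportion
    d = margin_pct / 100.0     # acceptable margin of error as a proportion
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    # Finite-population sample size, inflated by the design effect.
    n = (design_effect * population * p * (1.0 - p)
         / ((d * d) / (z * z) * (population - 1) + p * (1.0 - p)))
    # Assumed rounding: round each cluster up, then multiply back.
    cluster_size = math.ceil(n / clusters)
    return cluster_size, cluster_size * clusters
```

Under these assumptions, a population of 1,000,000 with an expected frequency of 50% and a 5% margin of error gives a total sample of 384 at the 95% confidence level when design effect and clusters are both left at 1.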

Functional Requirements

The requirements for StatCalc are best described in a tabular format, with a row for each calculator and a column for each feature. The features are: calculator name, title (used when the calculator is open as a window or gadget), description, input parameters, and derived values.

StatCalc Calculator Properties

Calculator	Title	Description	Input Parameters	Derived Values
Sample Size & Power
Population Survey	Sample Size and Power	Population survey or descriptive study (For simple random sampling, leave design effect and clusters equal to 1.)	Population size; Expected frequency; Acceptable margin of error; Design effect; Clusters	For confidence levels of 80%, 90%, 95%, 97%, 99%, 99.9%, and 99.99%: Cluster size; Total sample
Cohort or Cross-Sectional	Sample Size and Power	Unmatched cohort and cross-sectional studies (exposed and unexposed)	Two-sided confidence level (80%, 90%, 95%, 97%, 99%, 99.9%, 99.99%); Power (%); Ratio (Unexposed: Exposed); Outcome in unexposed group (%); Risk ratio; Odds ratio; Outcome in exposed group (%)	A table consisting of: Rows: Cases, Controls, Totals; Columns: Kelsey, Fleiss, Fleiss w/CC Models
Unmatched Case-Control	Sample Size and Power	Unmatched case-control study (Comparison of ILL and NOT ILL)	Two-sided confidence level (80%, 90%, 95%, 97%, 99%, 99.9%, 99.99%); Power (%); Ratio of controls to cases; Controls exposed (%); Odds ratio; Cases with exposure (%)^[1]	A table consisting of: Rows: Cases, Controls, Totals; Columns: Kelsey, Fleiss, Fleiss w/CC Models
Chi-Square for Trend	Chi-Square for Trend	Analysis for Linear Trends in Proportions	Series of records containing: Exposure score, N cases, N controls	Odds ratio (for each record); Chi-square for linear trend^[2]; p-value
2×2 Table Calculations
Calculator	Title	Input Parameters	Derived Values (By Stratum)	Derived Values (Summary Results)
Tables (2×2×N) (Stratified Two-way Tables)	2×2 Tables	By stratum, 1 - 9: Subject count: Exposure , Outcome; Subject count: Exposure , Outcome; Subject count: Exposure , Outcome; Subject count: Exposure , Outcome	[Odds-based Parameters] Odds Ratio: Estimate, Lower, Upper; Maximum Likelihood Estimate Odds Ratio (Mid-P): Estimate, Lower, Upper; Fisher's Exact Test: Lower, Upper. [Risk-based Parameters] Risk ratio: Estimate, Lower, Upper; Risk difference: Estimate, Lower, Upper. [Statistical Tests] Uncorrected: chi-square, 2-tailed p-value; Mantel-Haenszel: chi-square, 2-tailed p-value; Mid-P exact test: 1-tailed p-value; Fisher's exact test: 1-tailed p-value, 2-tailed p-value	[Odds ratio] Crude (Cross Product): Estimate, Lower, Upper; Crude (MLE): Estimate, Lower, Upper; Fisher's Exact test: Lower, Upper; Adjusted (MH): Estimate, Lower, Upper; Adjusted (MLE): Estimate, Lower, Upper. [Risk ratio] Crude: Estimate, Lower, Upper; Adjusted: Estimate, Lower, Upper. [Chi-square] Uncorrected (MH): chi-square, 1-tailed p-value, 2-tailed p-value; Corrected (MH): chi-square, 1-tailed p-value, 2-tailed p-value
Matched-Pair Case Control	Pair-Matched Case-Control Study	Cases: Exposure; Cases: Exposure; Controls: Exposure; Controls: Exposure		[Odds-based parameters] Odds Ratio: Estimate, Lower, Upper; Exact: Lower, Upper. [Statistical Tests] McNemar: chi-square, 2-tailed p-value; Corrected: chi-square, 2-tailed p-value; Fisher's exact test: 1-tailed p-value, 2-tailed p-value
Distribution-Based Event Probabilities
Calculator	Title	Description	Input Parameters	Derived Values
Binomial	Binomial	Binomial - Proportion vs. Standard	Numerator (cases); Total observations; Expected percentage	Probability that the number of cases is <, <=, =, >=, > the numerator; 2-tailed p-value; 95% confidence interval
Poisson	Poisson	Rare Event vs. Standard	Observed number of events; Expected number of events	Probability that the number of cases is <, <=, =, >=, > the observed number; 2-tailed p-value; 95% confidence interval
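
The event probabilities in the Binomial and Poisson rows follow directly from the two probability mass functions. The following is a minimal sketch using only the Python standard library. The function names are hypothetical and are not taken from Epi Info™. The 2-tailed p-value and the 95% confidence interval are omitted because this document does not specify how StatCalc computes them.

```python
import math


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p ** k * (1.0 - p) ** (n - k)


def poisson_pmf(k, mean):
    """P(X = k) for X ~ Poisson(mean)."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def event_probabilities(observed, pmf):
    """Probabilities that the count is <, <=, =, >=, > the observed value."""
    below = sum(pmf(i) for i in range(observed))  # P(X < observed)
    equal = pmf(observed)                         # P(X = observed)
    return {
        "<": below,
        "<=": below + equal,
        "=": equal,
        ">=": 1.0 - below,
        ">": 1.0 - below - equal,
    }


def binomial_probabilities(numerator, total, expected_pct):
    """Inputs mirror the Binomial row: numerator, total, expected percentage."""
    p = expected_pct / 100.0
    return event_probabilities(numerator, lambda k: binomial_pmf(k, total, p))


def poisson_probabilities(observed, expected):
    """Inputs mirror the Poisson row: observed and expected event counts."""
    return event_probabilities(observed, lambda k: poisson_pmf(k, expected))
```

Both calculators share `event_probabilities` because the five derived probabilities differ only in the underlying distribution, which is passed in as a function.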

Notes:

[1] Not all input parameters are independent.
[2] Extended Mantel-Haenszel
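
The first note above can be made concrete for the Unmatched Case-Control calculator. The sketch below assumes that the dependency in question is the standard relationship among the percentage of controls exposed, the odds ratio, and the percentage of cases with exposure, in which any two values determine the third. This document does not spell out which inputs depend on which, so this reading and the function names are assumptions.

```python
def cases_exposed_pct(controls_exposed_pct, odds_ratio):
    """Percentage of cases exposed implied by controls exposed (%) and odds ratio.

    Hypothetical helper: derived from odds(cases) = odds_ratio * odds(controls).
    """
    p0 = controls_exposed_pct / 100.0
    p1 = odds_ratio * p0 / (1.0 + p0 * (odds_ratio - 1.0))
    return 100.0 * p1


def implied_odds_ratio(controls_exposed_pct, cases_pct):
    """Odds ratio implied by the two exposure percentages (inverse of the above)."""
    p0 = controls_exposed_pct / 100.0
    p1 = cases_pct / 100.0
    return (p1 * (1.0 - p0)) / (p0 * (1.0 - p1))
```

For example, with 40% of controls exposed and an odds ratio of 2, the implied percentage of cases with exposure is about 57.1%, so a calculator that accepts all three fields has to recompute one of them whenever another changes.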

Future Development

Overview

The StatCalc tools are well implemented from the mathematical and performance points of view. The primary problem involves the labeling of fields and the limited documentation of specific tools. Generally speaking, the calculators are designed with a very specific application in mind, while the underlying calculations may be applied to a much broader set of problems. This can be addressed in a number of ways. Making the labels more general may be helpful for some users. However, a better approach might be to add a pull-down menu to tools such as "Cohort or Cross-Sectional" that offers a number of possible scenarios. In this case, the options might include "Unmatched Cohort Study" and "Cross-Sectional Study". Choosing an option would change the labeling of the input (and some output) fields to use terminology traditionally associated with that study type. The options may also reflect the methodological context of the study. In an infectious disease study, for example, one examines the ratio of exposed to unexposed individuals and the corresponding numbers who are ill or unaffected. In contrast, a genetic study examines individuals who do or do not possess a particular allele of a gene or marker, and the outcome is expressed in terms of phenotypes. The goal is to express the statistical variables in a manner that is familiar to the researcher using a particular study design in a particular discipline. Doing so will help ensure that the user enters the proper information in the correct fields and can accurately interpret the results.

The remaining issues concern consistency in how the tools are named and referred to in different parts of Epi Info™. The label on the button in the StatCalc menu or VD Options menu does not always match the labeling of the resulting window or gadget (when run in stand-alone mode or as part of Visual Dashboard, respectively). There is also inconsistent hyphenation of compound adjectives (e.g., "case-control"). Finally, the StatCalc gadgets in VD do not follow the pattern for the control widgets (located in the upper-right corner) used in other VD gadgets (they cannot be collapsed, for example).
