Rel 4.1 Real Statistics software | Real Statistics Using Excel

I am pleased to announce Release 4.1 of the Real Statistics Resource Pack. The new release is now available for free download (Download Resource Pack) for Excel 2007, 2010 and 2013 environments.

The Real Statistics Examples Workbook has been updated for compatibility with the new release and can be downloaded for free.

The Real Statistics website is in the process of being updated to reflect the new features. These changes will be made over the next few days.

Thanks to all of you who have made suggestions for new capabilities and have identified bugs.

Release 4.1 contains the following new features:

Fligner Killeen test for homogeneity of variances

The FKTEST function outputs the p-value for the Fligner Killeen test for homogeneity of variances.

Two-sample Runs Test

The RUNS2TEST array function is now available. This function performs the two-sample runs test. A similar capability has been added to the Non-parametric Tests data analysis tool. A special option is included to handle the case where there are many tied values.

Enhancement to the One-sample Runs Test

A new capability has been added to the one sample runs test. When range R1 contains only two distinct values and these values are characters then RUNSTEST(R1, lab, tails) is equivalent to in RUNSTEST(s, lab, tails) where s = the string consisting of the values contained in R1.

ICC Power and Sample Size Requirements

The ICC_POWER and ICC_SIZE functions are now available. These functions calculate the statistical power and sample size required for the ICC(1,1) version of the intraclass correlation. A similar capability has been added to the Power and Sample Size data analysis tool.

Jarque-Barre test for normality

The JBTEST and JARQUE functions have been added. JBTEST outputs the p-value for the Jarque-Barre test for normality and JARQUE calculates the test statistic.

Identification of potential outliers

Two commonly used criteria for identifying potential outliers are to flag data elements x such that (1) x > x̄ + 2.5*s or x < x̄ − 2.5*s or (2) x > Q3 + 1.5*IQR or x < Q1 − 1.5*IQR.

Using the new STANDARD function, a potential outlier occurs when (1) STANDARD(x,R1) > 2.5 or STANDARD(x,R1) < -2.5 or (2) STANDARD(x, R1,1,exc) > 1.5 or STANDARD(x, R1,1,exc) < -1.5, where STANDARD is defined as follows.

if type = 0 (default) then

STANDARD(x, R1, type, exc) = STANDARDIZE(x, x̄, s)

while if type = 1 then

STANDARD(x, R1, type, exc) = (x − Q3)/IQR if x > Q3, (x − Q1)/IQR if x < Q1 or 0 otherwise (i.e. when Q1 ≤ x ≤ Q3)

where

x̄ = AVERAGE(R1)
s = STDEV(R1)
Q1 = QUARTILE.EXC(R1,1) if exc = TRUE and Q1 = QUARTILE(R1,1) otherwise
Q3 = QUARTILE.EXC(R1,3) if exc = TRUE and Q3 = QUARTILE(R1,3) otherwise
IQR = IQR(R1, exc)

New Logistic Regression function

The resource pack adds a new array function LogitCoeff2 for the calculation of the logistic regression coefficients and related parameters. The function takes the form

LogitCoeff2(R1, R2, lab, head, alpha, iter)

This function is similar to the existing LogitCoeff function. In particular, the lab, head, alpha and iter arguments are the same, as is the output of the new function. The differences are as follows:

(1) In LogitCoeff(R1, lab, raw, head, alpha, iter), R1 contains all the data, where the data for the independent variables precede the data for the dependent variable. In LogitCoeff2, R1 contains the data for the independent variables and R2 contains the data for the dependent variable. If R2 contains one column then this indicates that the data is in raw format, while if R2 contains two columns then the data is in summary format

(2) The data in R1 and R2 for LogitCoeff2 can contain non-numeric values. These are treated as missing data and are not included in the analysis

Delete error values data analysis tool

A Remove error cells option has been added to the Reformatting a Data Range data analysis tool. When this option is selected any cell in the input data range which contains an error value (#N/A, #DIV/0!, etc.) is replaced by an empty cell. This is the same functionality provided by the DELErr function.

Changes to the noncentral distribution functions

All the functions that calculate or use the noncentral t, F and chi-square distributions (NT_DIST, NF_INV, ANOVA1_POWER, etc.) have been revised so that if you specify a value for the parameter m (which limits the number of terms in the infinite sum used to estimate the distribution value) that is higher than the limit for this parameter, then the software will automatically reduce the value to the highest usable value.

The default value for m has also been revised from 40 to 80.

Bug fix for the Mann-Whitney and Signed Ranks tests with ties

A bug has been fixed in the SRANK_TEST and MANN_TEST functions as well as in the equivalent capabilities in the data analysis tools. The bug resulted in a negative variance when the ties option is chosen.