Comparing the slopes for two independent samples

Basic Concepts

On this webpage, we show how to test whether the slopes for two independent populations are equal, i.e. we test the following null and alternative hypotheses:

H₀: β₁ = β₂ i.e. β₁ – β₂ = 0

H₁: β₁ ≠ β₂ i.e. β₁ – β₂ ≠ 0

The test statistic is

If the null hypothesis is true then

where

If the two error variances are equal, then as for the test for the differences in the means, we can pool the estimates of the error variances, weighing each by their degrees of freedom, and so

Now

Since we can replace the numerators of each by the pooled value $s_{Res}^2$ , we have

Note that while the null hypothesis that β = 0 is equivalent to ρ = 0, the null hypothesis that β₁ = β₂ is not equivalent to ρ₁ = ρ₂.

Example

Example 1: We have two samples, each comparing life expectancy vs. smoking. The first sample is for males and the second for females. We want to determine whether there is any significant difference in the slopes for these two populations. We assume that the two samples have the values in Figure 1 (for men the data is the same as that in Example 1 of Regression Analysis):

Figure 1 – Data for Example 1

As can be seen from the scatter diagrams in Figure 1, it appears that the slope for women is less steep than for that for men. In fact, as can be seen from Figure 2, the slope of the regression line for men is -0.6282 and the slope for women is -0.4679, but is this difference significant?

As can be seen from the calculations in Figure 2, using both pooled and unpooled values for s_Res, the null hypothesis, H₀: the slopes are equal, cannot be rejected. And so we cannot conclude that there is any significant difference between the life expectancy of males and females for any incremental amount of smoking.

Figure 2 – t-test to compare slopes of regression lines

Worksheet Function

Real Statistics Function: The following array function is provided by the Real Statistics Resource Pack. Here Rx1, Ry1 are ranges containing the X and Y values for one sample and Rx2, Ry2 are the ranges containing the X and Y values for a second sample.

SlopesTest(Rx1, Ry1, Rx2, Ry2, b, lab): outputs the standard error of the difference in slopes s_b1–b2, t, df and p-value for the test described above for comparing the slopes of the regression lines for the two samples.

If b = True (the default) then the pooled standard error s_b1–b2 is used (as in cell T10 of Figure 2); otherwise the non-pooled standard error is used (as in cell N10 of Figure 2).

If lab = True then the output is a 4 × 2 range where the first column contains labels and the second column contains the values described above and if lab = False (the default) only the data is outputted (in the form of a 4 × 1 range).

The SlopesTest function only produces the correct results if there are no missing data elements in Rx1, Ry1, Rx2, Ry2.

Observation

For Example 1, the formula

=SlopesTest(A5:A19,B5:B19,D5:D20,E5:E20,FALSE,TRUE)

generates the output in range M30:N33 of Figure 3, while the formula

=SlopesTest(A5:A19,B5:B19,D5:D20,E5:E20)

generates the output in range O30:O33.

Comparing slopes Excel formulas

Figure 3 – Comparing slopes using SlopesTest function

192 thoughts on “Comparing the slopes for two independent samples”

Dexter D Roberts

April 27, 2018 at 9:23 pm

The t-test compares the sample slope (or regression coefficient) to what value indicating the null hypothesis?

The Y-intercept

0

1

p-value
Reply
- Charles
  
  April 28, 2018 at 7:46 am
  
  Dexter,
  See https://real-statistics.com/correlation/dichotomous-variables-t-test/
  Charles
  Reply
Olagunju Solomon

February 25, 2018 at 3:17 pm

Hi Charles, thank you for this tutorial. I would like to ask if there is way one can test difference in slope of a regression from 1 as done in reduced major axis regression using excel.
Thanks.
Reply
- Charles
  
  February 26, 2018 at 2:01 pm
  
  Hi Olagunju,
  Sorry, but I don’t understand your question. What do you mean by “1 as done in reduced major axis regression”?
  Charles
  Reply
Amanda

January 25, 2018 at 4:20 pm

I was wondering about that as well.
Reply
Beth

December 13, 2017 at 10:59 pm

Thanks for this post, it is a great help – especially with the step-by-step walkthrough. One question though: What should be done when there are more than 2 samples to be compared?
Reply
- Charles
  
  December 14, 2017 at 8:18 am
  
  Beth,
  In this case you use ANOVA. See
  One-way ANOVA
  Charles
  Reply
Clement

October 18, 2017 at 6:02 pm

Hi Charles, many thanks for this useful tutorial.
I’m not sure about how much the maths explained here rely on the assumption that the correlation between x and y is actually linear… How (in)accurate would it be to use this test for comparing the slopes of linear regression lines fitted on an exponential distribution? Intuitively, I would think that it is not proper usage of this test.
The reason I’m asking is that I’m reviewing someone’s work who used this test for comparing the slopes of 2 linear regression lines, but with 2 data series that are poorly modeled by a linear regression in the first place (R2 is ~0.2 for both). In fact, their dataset is best modeled as an exponential decay (R2 is ~0.8 for both). So I would suggest they first transform the y values with x->ln(x) and repeat the test. Is that sensible ?
PS: sorry for poor synthax of original post
Reply
- Charles
  
  October 19, 2017 at 7:39 am
  
  Clement,
  The test result should be valid regrading the two straight lines that best fit the two samples. If an exponential curve is a better fit for the data, then this test won’t say anything about the two exponential curves.
  You can transform data so that an exponential regression model is transformed into a linear regression model. In this case the slopes test should be useful. Slopes in linear regression become exponents in exponential regression. See the following webpage regrading this type of transformation:
  Exponential regression.
  Charles
  Reply
Carol

October 13, 2017 at 12:00 am

Dear Charles,

Thank you for the post, your website is so helpful!
I’d like to know if you can help me in a problem I have. I want to know if women are faster than men in a specific setting. The plots I have are time x distance, so in this case I have several points for each person (woman or man). I plotted them all together (all points I had for all women and all points for all men). I then ran an ANCOVA analysis to see if the slopes were different. But I was told that I should instead calculate a slope for each person (man or woman) and then compare the slopes them with a t-test. I am very confused as in my case distance x time are always related to velocity, that I want to measure, so is that one of the reasons I should not use ANCOVA for my comparisons? Do you have an opinion about what analysis should I do? Thank you very much!
Reply
- Charles
  
  October 14, 2017 at 5:38 pm
  
  Carol,
  It seems like you can use a t test to test whether women are faster than men. I would need more information to understand why this wouldn’t be the correct approach.
  Charles
  Reply
  - Carol
    
    October 19, 2017 at 12:02 am
    
    Thank you, Charles!!!
    I think I understand I can use the t-test to compare the slopes. But I don’t understand why I cannot use ANCOVA, since I am comparing the slopes?
    Also, I hope it’s ok I ask you another question. To run the t-test, I will have several slopes with their respective SD. Should I combine the SD and then run the t-test? Because if I don’t combine them, I will be only comparing the slopes and ignoring the variation of each slope, correct? Does that make sense? Thank you!!!
    Reply
    - Charles
      
      October 19, 2017 at 7:51 am
      
      Carol,
      1. In ANCOVA you compare the slopes to determine whether ANCOVA is a suitable test. You should be able to use this test as well to compare the slopes.
      2. I don’t know which t test you are referring to. In general, a t test is only used with a pair of samples: you have two slopes and not several. If you have multiple samples, then you could use ANOVA instead of the t test.
      Charles
      Reply
      - Carol
        
        October 19, 2017 at 11:56 pm
        
        Hi, Charles!
        Thank you! I will look more into the ANCOVA test to understand if it applies to my problem.
        In regards to the t-test, I didn’t explain myself well. I have only two groups, but inside each group I have a number of slopes that belong to each specific group. Each slope has a SE, so my question was – when I do the t-test, should I combine these SE into one (one for each group)? Because if I don’t do that, each SE for each slope will be ignored. I hope I explained myself well! Thank you!!!
      - Charles
        
        October 20, 2017 at 7:51 am
        
        Carol,
        I don’t know what this means “inside each group I have a number of slopes that belong to each specific group.”
        Charles
Henning

September 22, 2017 at 9:16 am

Hi Charles,
I would realy like to use this tool, it would help a lot!
I also think my installation worked fine (=VER() returns “5.2 Excel 2007/2010 and I am using Excel 2010). But the function SlopesTest(R1, R2, R3, R4) returns only a single numeric value in a single cell. Do you have any idea what could be the problem?
Reply
- Charles
  
  September 22, 2017 at 9:40 am
  
  Henning,
  This is an array function and so you can’t simply press the Enter key. See the following webpage:
  Array Formulas and Functions
  Charles
  Reply
Heather

July 26, 2017 at 9:15 am

Thank you for your information!
I am wondering what is the difference between using this method or using a group coefficient in the fitting slope. for example, using y = a*group*x+group+b
The second question is, what if I want to compare two quadratic curve or other like power-law curve fitting result? Can I still use the same way to compare between groups?
Reply
- Charles
  
  July 26, 2017 at 1:47 pm
  
  Heather,
  1. I don’t quite understand the method using a group coefficient in the fitting slope (e.g. using y = a*group*x+group+b), that you are referring to. You would have two such equations; how would you compare them?
  2. I don’t know whether this method would work for quadratic or power-law curves.
  Charles
  Reply
Vickie

May 11, 2017 at 2:04 pm

Hi Charles,

thank you for this excellent explanation. I have checked the calculations in Howell’s book that you refer to (although, in the 8th edition) and I was wondering why you in your calculations for Sb1-b2, exchange s^2y*x with 1?
Also, when deciding wether to use the pooled or unpooled estimate, is it the values for Sy*x that should be compared?

Thank you!
Reply
- Charles
  
  May 12, 2017 at 10:41 am
  
  Vickie,
  In Howell’s book he puts the pooled variance (s_y.x)^2 in the numerator of both fractions inside the square root symbol. By simple algebra these numerators can be replaced by 1 provided you place s_y.x in front of the square root symbol, as I have done.
  Charles
  Reply
Michael Szczepaniak

October 27, 2016 at 4:09 am

Hi Charles – Appreciate the derivations as I’ve implemented this test in python and used your data and results to test my code.

One question/suggestion: Would it be a good idea to explicitly state what the alternative hypothesis is? I can tell from your doubling of the p-value that H1: β1 ≠ β2, but this may not be obvious to everyone visiting this site.
Reply
- Charles
  
  October 27, 2016 at 8:00 am
  
  Michael,
  Ok, I have made your suggested change.
  Charles
  Reply
Graham Bird

October 22, 2016 at 9:37 pm

Hi Charles,

=VER() gives 4.11 Excel 2010/2013/2016

Graham
Reply
- Charles
  
  October 23, 2016 at 8:27 am
  
  Graham,
  This means that the Real Statistics software was installed properly and so you should be able to use the SlopesTest function. When you type the function name into a cell you won’t see it reflected in a tooltip (as for standard Excel functions), but it should work.
  If you are still having problems, you can send me your data and I will tell you exactly what you should do. You can find my email address at Contact Us.
  Charles
  Reply
Graham Bird

October 21, 2016 at 10:56 pm

Hello,
I have finally managed to find this site (and EXCEL Add-in) because comparing regression lines is a common need for my work in taxonomy. However, I cannot see in the Real Statistics package any reference/procedure for doing the comparison or the Slopetest. Can anyone provide a simple step-by-step procedure (for someone like me who is not a statistician/mathematician)? Many thanks.
GrahamB
Reply
- Charles
  
  October 22, 2016 at 7:56 am
  
  Graham,
  This capability has been implemented as a function (not as one of the data analysis tools), namely the SlopesTest function. Just use it as you would any other Excel function. The arguments are as described on the referenced webpage.
  Charles
  Reply
  - Graham Bird
    
    October 22, 2016 at 9:51 am
    
    Thank you Charles, I’ll try and find may way through this. It will be a very useful tool for me when I can get it to work.
    Graham
    Reply
    - Graham Bird
      
      October 22, 2016 at 11:03 am
      
      Hi Charles,
      sorry to be a pain, but I cannot find the Real Statistics functions anywhere. Should it be obvious or hasn’t the resource pack downloaded properly?
      
      Many thanks
      GrahamB
      Reply
      - Charles
        
        October 22, 2016 at 12:39 pm
        
        Graham,
        What do you see when you enter the following formula? =VER()
        Charles
Vasilis Andriopoulos

October 4, 2016 at 11:22 am

Dear Dr. Zaiontz,

To perform the same test for the intercept of the slopes does someone simply have to replace b1 and b2 with intercept 1 and intercept 2?

Thank you in advance,
Vasilis
Reply
- Charles
  
  October 4, 2016 at 11:45 am
  
  No.
  Reply
Joshua

September 12, 2016 at 3:48 pm

Hi,

Thanks forthe helpgful guide! Exactly what I was looking for! However, when I run the Real Statisitics function (Slopestest), I only get the Sb1+b2 for the data sets, no t-stat, df or p-value. This is regardless of whether lab = TRUE or not.

I tried to find this test under the ‘Data Analysis Tools’ menu, but can’t find which heading it is under. Any ideas where I can find it, or how to fix this problem?
Reply
- Charles
  
  September 13, 2016 at 11:40 am
  
  Joshua,
  The function is an array function and so you need to follow Excel’s rules for array functions, as described on the following webpage
  Excel array functions and formulas
  Charles
  Reply
- Charles
  
  September 23, 2016 at 10:29 am
  
  Joshua,
  SlopesTest is what Excel refers to as an array function and so you can’t simply press the Enter key to use it. See the following webpage for how to use array functions:
  Array formulas and functions.
  Charles
  Reply
Reza

August 27, 2016 at 2:42 am

Charles,

Suppose we have a monovariate regression analysis with X being discrete independent variable and Y the dependent variable. If the parent population is normally distributed how many samples I need to use to make sure my Slope and Intercept resemble the population slope and intercept with %5 error rate? What else I need to know from the parent population to answer this?
Thanks,
Reza
Reply
- Charles
  
  August 27, 2016 at 9:41 am
  
  Reza,
  The website tells how to calculate the sample size required for regression. Probably this is the value that you need. See
  https://real-statistics.com/multiple-regression/multiple-regression-analysis/sample-size-multiple-regression/
  Charles
  Reply
Keehan Teixeira

August 26, 2016 at 4:38 pm

Hi Charles,

Thank you very much for this solution. It is a simple way to check the significance on a difference in slope. I have compared it to another method, outlined here: http://socserv.socsci.mcmaster.ca/jfox/Courses/SPIDA/dummy-regression-notes.pdf

In summary, the method pools the data and creates a dummy factor that identifies the original grouping of the data (1 for group A and 0 for group B). It then uses that to derive an interaction factor, and the significance of that factor is examined. If the interaction coefficient is significant, the null hypothesis can be rejected.

I’ve used both methods on my data and have found that the two conclusions are different. Since the calculated standard error on the coefficient is different, and since the coefficient itself is the same, the t-statistic is different. The null hypothesis can be rejected in only one of the two methods (not yours).

Can you attempt to rationalize why the two methods might be giving different results? Am I perhaps interpreting the results incorrectly?

Thank you,
Keehan
Reply
- Charles
  
  August 26, 2016 at 5:51 pm
  
  Keehan,
  I provide an option for the calculation of the standard error (the second to last argument of the function). Perhaps the other option yields the same result as in the other website you referenced.
  Charles
  Reply
  - Keehan Teixeira
    
    August 29, 2016 at 4:12 pm
    
    Thank you Charles,
    
    The pooled value for standard error yields the same conclusion as the non-pooled error, that is the null hypothesis cannot be rejected.
    
    I’ve actually just discovered that my data is very slightly non-linear.
    Can I still use the method you’ve outlined here for the second order coefficient?
    
    Thanks!
    Keehan
    Reply
    - Charles
      
      September 6, 2016 at 9:43 pm
      
      Keehan,
      You are very unlikely to find data that is exactly linear, and so it really depends on how “slightly” your data is “slightly non-linear”. The test that I describe in the referenced webpage assumes linearity.
      Charles
      Reply
Mac Reynolds

August 20, 2016 at 1:54 am

Charles,
I have paired samples in a regression model. The independent samples t test of the slopes, I believe, is inappropriate where the paired samples would actually be a dependent samples test. Do you have a test to compare slopes of paired samples. The results are measuring the same sample of subjects under different conditions. I need to know if the two different conditions create a different relationship between variables for the same subjects. Is this a change in how the slopes test is calculated or is a different test needed?
Reply
Mac Reynolds

August 18, 2016 at 5:05 pm

I have downloaded Real Statistics and have installed it in Excel 2016. It was not easy given your instructions, but I can start your addin with ctl m. However, I want to run a slopes test and “SlopesTest” is not an option. The dialog brings up “Choose a selection from the following” and there is no SlopesTest. Do I use some other command and how do I get to a manual for help.
Reply
- Charles
  
  August 18, 2016 at 6:23 pm
  
  Mac,
  SlopesTest is a function and so can’t be accessed via Ctrl-m. This function is described on the referenced webpage.
  To get information about any Real Statistics function or analysis tool, see the Tools menu on the website.
  Charles
  Reply
  - Mac Reynolds
    
    August 19, 2016 at 2:04 am
    
    Charles,
    Thanks. I downloaded the Real-Statistics Examples-Part 2 and found the template for SlopesTest. Excellent value for me–I want the test results, not the mechanics of the test. I will eventually create a macro for your template. Right now, I just need to test for difference in slopes. I greatly appreciate all the work you have put into this add in for Excel. Real-Statistics makes Excel an extremely useful tool. Great Work!
    Mac
    Reply
Steve Shaha

July 21, 2016 at 6:15 pm

Citations? References
Reply
- Charles
  
  July 21, 2016 at 7:31 pm
  
  Steve,
  See Howell’s book as shown in the Bibliography
  Charles
  Reply
  - Alice
    
    August 29, 2016 at 7:46 pm
    
    Dear Charles,’=
    
    Thank you for your most interesting post.
    
    Its the following reference the one you are used?:
    
    Howell, D. C. (2010). Statistical methods for psychology (7th ed.).
    Wadsworth, Cengage Learning.
    
    I checked the book’s table of content, and do not see anything related with slope comparison. I just want to make sure I am citing the right reference in my work.
    
    Thank you!
    Alice
    Reply
    - Charles
      
      August 30, 2016 at 12:31 am
      
      Alice,
      Yes. It is in section 9.11 on page 273 in my book.
      Charles
      Reply
David

June 26, 2016 at 10:55 pm

Dear Charles,
what is the name of your method you described here?
How can I cite this in my paper?
David
Reply
- Charles
  
  June 27, 2016 at 9:22 am
  
  Dear David,
  I don’t know of any other name besides Comparing the Slopes of Two Independent Samples. The test is described in Howell’s textbook (see Bibliography).
  Charles
  Reply
Kirsten

June 16, 2016 at 3:53 pm

Is it possible to compare the rate of growth to two exponential regression lines? I’ve tried to make the lines linear by taking ln and using the slopes in that way but I’m not sure you are allowed to do that.

If it helps, the two equations are y=24.356e^(.0437x) and y=30.101e^(.0235x)
Reply
- Charles
  
  June 16, 2016 at 6:52 pm
  
  Kirtsen,
  Taking the log of both sides and using the two slope comparison test seems like a reasonable approach.
  Charles
  Reply
Maria

May 30, 2016 at 11:56 am

Dear Charles Zaiontz,

Thank you for you useful post. I wonder if the hypothessis testing is 95% or 90%, due to the alpha value=o.05. I got a bot confused. Hope you can help me.

Thanks in advance
Reply
- Charles
  
  May 30, 2016 at 1:37 pm
  
  Maria,
  Yes, a 95% confidence interval corresponds to alpha = .05.
  Charles
  Reply
  - Stig
    
    June 1, 2016 at 12:31 pm
    
    Dear Charles.
    
    Is alpha=.05 also used if we want to use a two-tailed test with 95 % confidence?
    Reply
    - Charles
      
      June 1, 2016 at 1:25 pm
      
      Yes, although effectively you are computing the distribution value when alpha = .05/2
      Charles
      Reply
Florian Schaefer

May 16, 2016 at 3:01 pm

I created five scenarios of how inbreeding in a population increases over time. The linear regressions of these five scenarios show a similiar slope.

I want to test wether the similarity between all five scenarios is significant. How can I do this?
Reply
- Charles
  
  May 16, 2016 at 10:28 pm
  
  Florian,
  Do you want to compare the slopes or do you want to compare the means of the five scenarios (or something else)?
  Charles
  Reply
Sid

May 15, 2016 at 4:07 am

Dear Charles, Your pack appears to be very useful for my application. When I installed for Vista/Office 2010 combination, I got an error and request for password which did not show up again. When I run SlopesTest, I get ‘std err’ or a single value but not a matrix. The value I get matches neither value in your example – I am getting a value of 0.528. Any guidance will be greatly appreciated.
Reply
- Charles
  
  May 15, 2016 at 10:53 am
  
  Sid,
  1. If you install the Real Statistics Resource Pack as described on the webpage from where you downloaded the software you should never receive a request for the password. You don-t need a password to use the software.
  2. SlopesTest is an array function and so you need to highlight a range and then press Crtl-Shft-Enter instead of Enter to get the full results. See the following webpage for details
  Array Functions and Formulas
  Charles
  Reply
Glenn Ramsey

May 11, 2016 at 2:08 pm

Step 1: How do I access this function SlopesTest?
Reply
- Charles
  
  May 11, 2016 at 8:22 pm
  
  Glenn,
  It is provided by the Real Statistics Resource Pack, which you can download for free at
  https://real-statistics.com/free-download/real-statistics-resource-pack/
  Charles
  Reply
Brenda L.

April 19, 2016 at 12:04 am

I need to compare two slopes of scores on an annual English acquisition test for English language learners. I’ll be comparing the results of one program over three years with results either from a sequence of three years previous to the new program for other students (probable) or previous scores for the same students (since many of the students began the program in kindergarten or the early elementary grades and won’t have three years previous, this will really reduce my sample size). The students will be tracked over three years, so there will be three points to construct the slopes. I know that the scores will improve over time, so all the slopes will be upward. I realize this is a time series, but all the time series analyses I’ve seen look like they are for something much more high-powered than this. I am not a statistics whiz, and I’m looking for something I can get my head around. Will the technique above be appropriate for this analysis? Thanks.
Reply
- Charles
  
  April 19, 2016 at 2:43 pm
  
  Brenda,
  First of all, why do you need to compare slopes at all? Do you really need to compare slopes or are you interested in some other statistic (e.g. mean)?
  The referenced test will compare two slopes. If you have three or more you could use the technique described for ANCOVA. If the data sets are not indpendent, but are dependent then you need to use the dependent version of the test (which is also decribed on the website).
  Charles
  Reply
  - Brenda L.
    
    April 20, 2016 at 5:48 am
    
    Thanks for your response. I’m interested in seeing whether the program has an effect on how quickly the children acquire English. I know I could just compare the three time points and see if they are different, but I wonder if I wouldn’t gain more understanding by also looking at the slopes. This seems especially important if I’m comparing scores for the same child from years 1, 2, 3 (old program) with scores for years 4, 5, 6 (new program) – the scores will presumably improve from year to year, but I’m interested if they are improving more quickly in years 4-6. From what I’ve read of it, it doesn’t seem like ANCOVA would do this for me. Am I missing something? (Sorry if this is a little dense, I am not a great statistician but always willing to learn more.) Thanks.
    Reply
    - Charles
      
      April 22, 2016 at 10:50 am
      
      Brenda,
      You don’t use ANCOVA, only the test to see whether the slopes are equal. This test can be found at ANCOVA Assumptions
      Charles
      Reply
Edward Leader

March 30, 2016 at 4:38 pm

Hi Dr. Zaiontz,

Can this test be used to compare the time constants of the fit of an exponential function to two sets of normalized data? I want to test whether or not the time constant of the fit to one set of data is significantly different from the fit to another set (actually, before drug vs. after drug).

Thanks in advance,

Ed
Reply
- Charles
  
  March 30, 2016 at 6:23 pm
  
  Edward,
  No, you can’t use this test to compare constants. Are you assuming that the slopes are equal? If not, what is the purpose of the comparison?
  Charles
  Reply
  - Edward Leader
    
    March 31, 2016 at 3:48 pm
    
    Charles,
    
    Thanks very much for your reply.
    
    I have fit an exponential function (y=exp(-t/tau)) to two sets of normalized data points, first in control and then in the presence of a drug. I observe that the tau in the presence of drug is 10% smaller. I am asking how to tell if the two taus representing each condition are significantly different. What is the appropriate test for this case?
    
    Thanks,
    
    Ed
    Reply
Camila

February 18, 2016 at 4:00 am

Hi Charlie,

I am measuring the bacterial biodegradation of a chemical overtime using different treatments to enhance its degradation (First order reaction). Once I obtained the biodegradation rate constants (slopes) and the half life of the chemical, I want to test weather or not there is a statistical significance between the degradation of the chemical and the degradation of the chemical under the different treatments (6 treatments). Which statistical test would you recommend?
Reply
MrEction

February 12, 2016 at 10:40 pm

Wow, this is phenomenally helpful. Most of the other examples for this I have seen are focused on ANOVA which I can’t use since my points are all over the place (on the x axis). Gonna use this shortly for checking some models between different systems.
Reply
Toni

February 11, 2016 at 10:40 am

Dear Professor Charles Zaiontz,

I have one question. I try to compare four different slopes. I use your tool for pairwise comparisons. Do I have to use also Bonfferoni correction for p-value?
Best regards,
Toni
Reply
- Charles
  
  February 22, 2016 at 8:05 pm
  
  Dear Toni,
  If you do multiple pairwise tests, then you would need to handle experimentwise/familywise error. Bonferroni is one method for doing this.
  There may be some tests for homogeneity of slopes that handle more than two lines (e.g. to test this assumption for ANCOVA), but I don’t have time now to identify such a test.
  Charles
  Reply
  - Toni
    
    February 22, 2016 at 10:42 pm
    
    Thanks,
    
    Toni
    Reply
    - david
      
      March 29, 2016 at 11:01 am
      
      an F test can handle this. with it you can compare an infinite array of slopes or lines.
      Reply
Ali

January 31, 2016 at 3:15 pm

Dear Professor Charles Zaiontz,

Thank you indeed for very helpful website.
I would be pleased if you could introduced ant books or articles that I can address this procedure for comparing two regression slope.

Best regards
Ali
Reply
- Charles
  
  February 1, 2016 at 8:02 am
  
  Ali,
  One reference is the following book:
  Howell, D. C. (2010). Statistical methods for psychology (7th ed.). Wadsworth, Cengage Learning
  Charles
  Reply
Grant Simpson

December 18, 2015 at 1:58 pm

First, as a recently graduated undergrad, let me say, this array function is so awesome!

My question is, do you have a similar function to compare the b-intercepts of two sets of data?

Using your your =slopestest on my data, I get a p-value=.9205, which makes sense as my plotted data looks like two parallel lines. However, the plot shows two lines that have vastly different intercepts, and I want to know if the lines have *statistically* different intercepts.

To give you a bit of context regarding the data, I am looking an intervention study, and I am comparing the frequency of hospital visits before and after intervention. Before and after intervention, the average visits are flat lines (slightly negative slopes, m =~ -0.1), but the after intervention the flat line drops down considerably.

Any help would be appreciated. I’m glad to have such a cool add-in on my excel.
Reply
- Charles
  
  December 22, 2015 at 11:37 am
  
  Hi Grant,
  
  Here is my first thought as to how to address this problem. If the two lines have significantly equal/different intercepts, then if you subtract the equation of one line from the other you should be able to test whether the resulting line has zero intercept.
  
  To test whether a line has zero intercept, please look at Example 2 on the following webpage:
  https://real-statistics.com/regression/confidence-and-prediction-intervals/
  
  Charles
  Reply
Marieke

December 10, 2015 at 3:17 pm

Hi all,

I used this test and was wondering what the exact name of that test is, or how to call that type of test. Concretely, in a text normally you say “…were significant (P < 0.05; T-test, ANOVA, etc…)" so what is it here ? "P < 0.05; xxx"? On the internet I found it called a MANCOVA, or just a T-test but then how to indicate concretely that the slopes were tested?

Thanks for your help!
Marieke
Reply
- Charles
  
  December 22, 2015 at 10:32 pm
  
  Hi Marieke,
  I don’t know how to call this test except comparison of the slopes of regression lines from two independent samples.
  I wouldn’t call it MANCOVA or T-test.
  Charles
  Reply
  - Katharine
    
    April 3, 2016 at 11:48 pm
    
    Hello!
    Was there any additional thought on a name for this test? My PI doesn’t accept the notion that this analysis has no name associated with it, and I don’t know what to tell him. Is there a name given, perhaps in the Howell (2010) book referenced above?
    
    Thanks,
    Katharine
    Reply
    - Charles
      
      April 4, 2016 at 7:23 am
      
      The test is called “Testing the difference between two independent slopes”
      Charles
      Reply
Javier

September 22, 2015 at 9:21 am

I was looking for the reference in which the unpooled comparing of slopes is based.
The pooled is based on the Kleinbaum book?
But, I was searching in Armitage book, and I can not find the unpooled formulaes when the residual variances of the two groups are quite different.
Can you help me?
Best regards,
Javier
Reply
- Charles
  
  September 23, 2015 at 8:16 am
  
  David Howell, Statistical Methods for Psychology, Wadsworth.
  Charles
  Reply
  - Javier
    
    September 23, 2015 at 9:13 am
    
    Good day,
    Thank-you for your fast answer! I will look in.
    Best regards,
    Javier
    Reply
Dirk

August 20, 2015 at 9:33 am

Good day,

i was trying the formula for slopestest, but first it wants rather semicolons to separate the different ranges and then it outputs me only either “stderr” (if lab = true) and or number if i do not precise b and lab. I use Excel 2010. Any help would be appreciated!

Thanks
Dirk
Reply
- Charles
  
  August 20, 2015 at 10:06 am
  
  Hi Dirk,
  You are probably using the German version of Excel, which uses semi-colons instead of commas to separate arguments in a function. The SlopesTest function is what Excel calls an array function. In this case you must highlight the range where the output will go (not just a single cell) and then press Ctrl-Shift-Enter. See the webpage Array Formulas and Functions for more details about how to use array functions.
  Chrales
  Reply
  - Dirk
    
    August 20, 2015 at 12:43 pm
    
    Thanks for the superfast response! I struggled a bit, but now managed. Thanks for your help!
    Dirk
    Reply
Chris

July 6, 2015 at 8:46 pm

Hello,

I just made it through all of this to compare the slopes for two different sites. The difference between the slopes is so small, I know this probably sounds like a silly question. I got 0.023049 for my unpooled slope, and 0.023765 for my pooled slope. They round to 0.023 and 0.024. Does this violate the null hypothesis because it equals 0.001 and thus make the differences between slopes significantly different?

Thanks
Reply
- Charles
  
  July 7, 2015 at 8:07 am
  
  Chris,
  If the p-value = .001 then I would conclude there is a significant difference, even if the difference between the slopes seems rather small. Remember that significantly different doesn’t mean large. You use the effect size to characterize whether an effect is small or large.
  Charles
  Reply
JJ

July 2, 2015 at 11:13 am

Hihi,
I have a question, also for a non-expert.
I have 10 values for x and y plotted as a scatter with a trend line (which gave me an intercept and slope).
I want to test if the slope is sig. different from slope x=y.
can the test =SlopesTest be used in this example?
( i have the dutch excel and for some reason cannot find the corresponding dutch command, to try for myself)
thanks, JJ
Reply
- Charles
  
  July 3, 2015 at 8:19 am
  
  JJ,
  
  SlopesTest is not a standard Excel function. It is part of the Real Statistics Resource Pack. You need to download and install the resource pack to use this function. This is free.
  
  The Slopes Test requires two samples of xy points. You only have one such sample. Here are a few possible options for how to conduct the test you want. I prefer choice 3. It is the easiest to implement.
  
  1. You could try to create a second sample consisting of points whose x and y values are equal and then use the SlopesTest function. I’m not sure this approach is completely sound, though.
  
  2. You can use the Testing significance of slope approach. This method tests whether the slope is equal zero based on testing the correlation coefficient. You want to test whether the slope equals 1 (the slope of y = x). You would need to modify this test probably using the Fisher transformation in some way (see testing correlation coefficient).
  
  3. You take all the xy points in your sample and create a new sample consisting of the points xy’ where y’ = y-x. If the points xy in your original sample have slope which is not significantly different from 1 (the slope of y=x) then the points xy’ in the new sample should have slope which is not significantly different from zero. The converse is also true. Fortunately there is a test to see whether the slope of a regression line is significantly different from zero, namely the test described on the webpage Testing significance of slope.
  
  In any case, since your sample is so small, the power of any of these tests will likely be low.
  
  Charles
  Reply
Greg

May 2, 2015 at 4:19 am

Hi Charles,

Can you give a little more information on the citation for this work? Meaning I see you said Statistical Methods for Psychology”, Wadsworth CENGAGE Learning, 2010 however what Chapter and/or pages where you using?

Thanks,

Greg
Reply
- Charles
  
  May 2, 2015 at 7:22 am
  
  Greg,
  This is a good suggestion. I’ll try to do this in the future, since it would quite difficult for me to do this for all the previous references.
  Charles
  Reply
Jordan Chill

April 28, 2015 at 8:44 am

Hi Dr. Zaiontz,

Thanks for the illumniating example.
What happens if the two functions are NOT linear and CANNOT be transformed easily into a linear function?
In my case, I am fitting two sets of time-dependent data to the function f(t), and f(t) is a1*cos(pi*a2*t)cos(pi*a3*t)*exp(-a4*t), where a1…a4 are the fitted parameters. I am interested in whether the a3 parameter obtained for two different fits is indeed significantly different. We have 18 measurements in each fit (so 14 degrees of freedom).
Happy to hear your opinion or get a link to a helpful textbook!
Jordan
Reply
- Charles
  
  April 28, 2015 at 5:17 pm
  
  Jordan,
  You need to perform non-linear regression. Probably the easiest way to do this is with Excel’s Solver. I give some examples on the following webpages:
  Exponential Regression
  Logistic Regression
  Charles
  Reply
Rebecca

April 27, 2015 at 8:30 pm

Hi,
I was just wondering why the degrees of freedom are n-4?
Thanks,
Rebecca
Reply
- Charles
  
  April 28, 2015 at 7:13 am
  
  Rebecca,
  As you can see from the webpage https://real-statistics.com/regression/hypothesis-testing-significance-regression-line-slope/ df = n-2 for a one slope test. With two slopes the n-2 becomes n-4 (minus 2 for each slope).
  Charles
  Reply
  - Rebecca
    
    April 28, 2015 at 11:22 am
    
    Thank you very much.
    Also I was wondering what are the assumptions that are made about the data to conduct the t-test? That it was normally distributed for both groups?
    I was a bit confused about whether to use the pooled or non-pooled standard error, could you explain what you meant by “if b = true”?
    And finally, would you report these results in a lab report the same as you would report the results of any t-test, except reporting the values of b and standard deviations of b instead of means?
    Sorry about all the questions, I’m very new to stats and using excel!
    Best wishes,
    Rebecca
    Reply
    - Charles
      
      April 28, 2015 at 5:39 pm
      
      Rebecca,
      The assumptions are similar to those for the usual t test, including both samples are normally distributed (although such tests are usually pretty robust to violations of this assumption)
      Use the pooled standard error if the variances are approximately equal; otherwise use the non-pooled standard errors.
      b = True means that the fifth argument in the function takes the value True.
      Yes, you should report the results in a manner similar to the usual t test.
      Charles
      Reply
MJ

April 10, 2015 at 9:03 am

Any chance of providing the t-test to compare two intercepts? Thanks.
Reply
- Charles
  
  April 15, 2015 at 4:31 pm
  
  MJ,
  For this test, see Example 2 of the webpage https://real-statistics.com/regression/confidence-and-prediction-intervals/.
  Alternatively you can use the Linear Regression data analysis tool.
  Charles
  Reply
Mattt

March 10, 2015 at 10:59 pm

Sir,
in Figure 2, cell V10 the code cites: “= Sres…” does that refer to the sqrt of cell V9? Because cell S9 refers to the parameter “Sres^2” and elsewhere small details such these are called out quite explicitly, I don’t know for certain which way to go [i.e., use the “Sres^2” or sqrt(Sres^2)].

Thank you in advance for your assistance.
Matt
Reply
- Charles
  
  March 11, 2015 at 10:59 am
  
  Matt,
  The formula in cell T10 (which corresponds to V10) is =SQRT(T9*(1/(N7^2*(N4-1))+1/(O7^2*(O4-1))))
  The formula in cell T9 is =((N4-2)*N6^2+(O4-2)*O6^2)/(N4+O4-4)
  Charles
  Reply
  - Matt
    
    August 15, 2015 at 10:13 pm
    
    thank you so much.
    Reply
Gina W

November 30, 2014 at 3:20 pm

Hey,

I’ve got a question: Does this comparison also work if I have 3 samples and not only 2?
Reply
- Charles
  
  November 30, 2014 at 11:01 pm
  
  Gina,
  No, you need to run ANCOVA. See Chapter 18 of J Zar, Biostatistical Analysis, 2nd edition, Prentice-Hall, 1984 for more details.
  Charles
  Reply
  - Gina W
    
    December 3, 2014 at 11:01 am
    
    Thanks!
    
    I used a Mediationmodel.
    
    Is it right, that in ANCOVA I use “Group” as fixed, Outcome as dependent and Mediator as Covariate?
    
    Greetings
    Reply
    - Charles
      
      January 13, 2015 at 12:14 pm
      
      Gina,
      Sorry that it has taken me so long to respond to your question. I seemed to have missed your response. I am not very familiar with Mediation models and so I am reluctant to answer your question. I plan to look into these sorts of models later this year.
      Charles
      Reply
Patricia Olson

October 27, 2014 at 10:07 pm

Charles
Never mind… I figured it out finally. I was entering the array data incorrectly.

Thanks
Patricia
Reply
Patricia Olson

October 27, 2014 at 9:49 pm

Hi Charles
Your RealStatistics Resource Pack for Excel is great. Thank you for providing it. I have been using R but am still learning the language. Your tool is much more time saving for some statistical analyses than R. However, I am having some problems accessing some of the functions such as SlopesTest. I’m using Excel 7. I tried to access it through the example worksheet and still just get the #VALUE! message.
Thank you for your help on this.
Cheers
Patricia
Reply
David

September 8, 2014 at 10:49 am

Hi Charles,
Thank you very much for this great post!
I have a small question. What if each one of my data (y) is actually a mean over a lager data set, how can I account for it? should I expect a different result?
Thanks.
Reply
- Charles
  
  September 8, 2014 at 4:32 pm
  
  David,
  I’m not sure how you would account for this (or if you could account for this). I would think that this would change things considerably.
  I suggest that you try a few examples where you create some data (i.e. the larger data sets) and have the y values be the means over the larger data set that you have created. Then run the test using the means and run it again using the larger data set and see what sort of differences there are.
  Charles
  Reply
  - David
    
    September 8, 2014 at 6:46 pm
    
    Thanks Charles for your quick reply.
    I’ll try your suggestion.
    David
    Reply
Hammad

June 24, 2014 at 5:08 am

Hi.

How can I do this on excel 2010? I was trying TDIST formula but this function is available only with Excel 2007 or earlier versions and I am unable to understand the 2010 version. Or post a picture using Excel 2010 please.

Cheers
Reply
- Charles
  
  June 26, 2014 at 10:35 am
  
  Hi Hammad,
  
  I am using Excel 2010 and have no problem using the Excel 2007 functions such as TDIST. In any case here are substitutions for Excel 2010:
  
  Replace TDIST(x,df,2) by T.DIST.2T(x,df)
  Replace TINV(p,df) by T.INV.2T(p,df)
  
  Charles
  Reply
Ramiro

June 20, 2014 at 12:13 am

Hi Charles,

I noticed that if there are holes in the data the result of SlopesTest is different. Belos are the numbers I tried, they are the same but some points are missing one or the other piece of data. Since only data with x and y should count, I thought the SlopesTest would give me the same result. Should I always remove missing data before doing the SlopesTest?
Thanks you,
Ramiro

This gave me p=0.131431
1 3 1 1
2 3 2 2
3 3 3 5
4 4 6
5 4 8
4 5 9
6 5 6
7 6 7 13

and this gave me p=0.14889

1 3 1 1
2 3 2 2
3 3 3 5
5 4 4 6
6 5 5 9
7 6 7 13
Reply
- Charles
  
  June 29, 2014 at 10:36 am
  
  Hi Ramiro,
  In the current implementation of the SlopesTest function the correct values are generated only if there is no missing data. You need to remove any missing data before using the function.
  Charles
  Reply
Carl

May 15, 2014 at 10:43 am

Hi, I tried testing your SlopesTest using your Example1 data. When I input it I only get the following “result”: std err. This appears to be only the label as in your Figure 3. When I repeat the formula by excluding the “false,true” part I get the result 0.23271.
Any ideas?
Reply
- Charles
  
  May 20, 2014 at 9:26 am
  
  Carl,
  SlopesTest is an array formula. Try entering the formula and then pressing Ctrl-Shift-Enter. The full results should be displayed. If you press Enter instead, then only the first cell in the output will appear.
  Charles
  Reply
  - Laura
    
    June 14, 2014 at 1:40 am
    
    I have the same problem as Carl, except I have tried ‘Ctrl-Shift-Enter’ and it makes no difference to the result. It’s either ‘std err’ or a number (in my case 28.16). Please let me know if you are aware of any other factors that might be stopping this formula from working.
    Reply
    - Charles
      
      June 14, 2014 at 6:55 pm
      
      Laura,
      
      Since this is an array function, you need to first highlight a 4 x 1 column range, then enter a formula of form SlopesTest(R1, R2, R3, R4, b) where R1, R2. R3 and R4 are ranges and b is either TRUE or FALSE, and finally press Ctl-Shft-Enter. This will fill the highlighted range with the following values: std err, t, df, p-value.
      
      Alternatively you can first highlight a 4 x 2 range, then enter a formula of form SlopesTest(R1, R2, R3, R4, b, TRUE) and finally press Ctrl-Shift-Enter. This will fill the second highlighted column with the same values as described above and fill the first column with the appropriate labels.
      
      The key is that you must first highlight an output range of sufficient size to contain all the output. It can even be larger than necessary (the extra cells will be filled with #N/A.
      
      Charles
      Reply
      - Laura
        
        June 15, 2014 at 7:17 am
        
        Thank you for the advice Charles. The problem is now fixed thanks to your suggestion!
        
        Cheers
Johnathan Clayborn

April 13, 2014 at 11:18 pm

Hi Dr. Zaiontz,

This is exactly the type of information that I was looking for to complete a study that I was working on. I was wondering two questions;

1st) can you explain more about about I would go about finding the X/Y values of the lines in order to perform these calculations? I’m using a trendline in time-series line graph and I can see that there is definitely statistical significance, but I need to express it mathematically.

2nd) Do you know if this method is possible using SPSS?
Reply
- Charles
  
  April 15, 2014 at 8:32 pm
  
  Hi Johnathan,
  
  I am very pleased that the information is relevant for your needs. Regarding your questions:
  
  1) I am not sure what you mean. The X/Y values are the data that you are testing.
  
  2) I don’t use SPSS, but I believe that the answer is yes. E.g. the following webpage references doing this in SPSS: http://core.ecu.edu/psyc/wuenschk/MV/multReg/Potthoff.pdf
  
  Charles
  Reply

Basic Concepts

Example

Worksheet Function

Observation

192 thoughts on “Comparing the slopes for two independent samples”

Leave a Comment Cancel reply