<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<meta content="text/html; charset=UTF-8" http-equiv="content-type" />
<meta http-equiv="Content-Language" content="en" />
<meta name="generator" content="Pressbooks 5.20.1" />
<meta name="pb-authors" content="Jeremy Boettinger" />
<meta name="pb-editors" content="Nett Smith, Kristin Conlin, and Harvey Sky" />
<meta name="pb-translators" content="" />
<meta name="pb-reviewers" content="" />
<meta name="pb-illustrators" content="" />
<meta name="pb-contributors" content="" />
<meta name="pb-title" content="Math and Statistics Guides from UB's Math &amp; Statistics Center" />
<meta name="pb-language" content="en" />
<meta name="pb-cover-image" content="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/MSC_cover-scaled.jpg" />
<meta name="pb-primary-subject" content="PB" />
<meta name="pb-short-title" content="Math and Statistics Guides" />
<meta name="pb-book-license" content="cc-by-nc-sa" />
<meta name="pb-copyright-holder" content="Jeremy Boettinger" />
<meta name="pb-additional-subjects" content="PBC, PBCD, PBD, PBK, PBW, PBT, UFM" />
<title>Math and Statistics Guides from UB's Math & Statistics Center</title>
</head>
<body lang='en' >
<div id="half-title-page"><h1 class="title">Math and Statistics Guides from UB's Math & Statistics Center</h1></div>
<div id="title-page"><h1 class="title">Math and Statistics Guides from UB's Math & Statistics Center</h1><h2 class="subtitle"></h2><h3 class="author">Jeremy Boettinger</h3><h3 class="author"></h3><h4 class="publisher"></h4><h5 class="publisher-city"></h5></div>
<div id="copyright-page"><div class="ugc">
<div class="license-attribution"><p><img src="https://ubalt.pressbooks.pub/app/themes/pressbooks-book/packages/buckram/assets/images/cc-by-nc-sa.svg" alt="Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License" /></p><p>Math and Statistics Guides from UB's Math &amp; Statistics Center by Jeremy Boettinger is licensed under a <a rel="license" href="https://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a>, except where otherwise noted.</p></div>
</div></div>
<div id="toc"><h1>Contents</h1><ul><li class="front-matter introduction"><a href="#front-matter-introduction"><span class="toc-chapter-title">Introduction</span> <span class="chapter-author">Jeremy Boettinger</span></a></li><li class="part"><a href="#part-main-body">Using SPSS</a></li><li class="chapter standard"><a href="#chapter-using-data-and-variable-view"><span class="toc-chapter-title">Using Data and Variable View</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-descriptive-statistics"><span class="toc-chapter-title">Descriptive Statistics</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-comparing-means-single-sample-t-test"><span class="toc-chapter-title">Comparing Means: Single Sample t-test</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-comparing-means-independent-samples-t-test"><span class="toc-chapter-title">Comparing Means: Independent Samples t-test</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-comparing-means-repeated-measures-t-test"><span class="toc-chapter-title">Comparing Means: Repeated Measures t-test</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-comparing-means-repeated-measures-one-way-anova"><span class="toc-chapter-title">Comparing Means: Repeated Measures One-Way ANOVA</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-independent-measures-one-way-anova"><span class="toc-chapter-title">Comparing Means: Independent Measures One-Way ANOVA</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-spss-simple-linear-regression"><span class="toc-chapter-title">Simple Linear Regression</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-spss-correlations"><span class="toc-chapter-title">Correlations</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="part"><a href="#part-opre">Statistics</a></li><li class="chapter standard"><a href="#chapter-introduction-to-statistics-basics"><span class="toc-chapter-title">Introduction to Statistics Basics</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-frequency-distributions"><span class="toc-chapter-title">Frequency Distributions</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-measures-of-central-tendency"><span class="toc-chapter-title">Measures of Central Tendency</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-basics-of-variability"><span class="toc-chapter-title">Basics of Variability</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-z-score-basics"><span class="toc-chapter-title">Z-score Basics</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-probability-and-statistics"><span class="toc-chapter-title">Probability and Sampling</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-distribution-of-sample-means"><span class="toc-chapter-title">Distribution of Sample Means</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-introduction-to-hypothesis-testing"><span class="toc-chapter-title">Introduction to Hypothesis Testing</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-introduction-to-the-t-statistic"><span class="toc-chapter-title">Introduction to the t-statistic</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-independent-t-tests"><span class="toc-chapter-title">Independent t-tests</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-repeated-measures-t-test"><span class="toc-chapter-title">Repeated Measures t-test</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-independent-one-way-anova"><span class="toc-chapter-title">Independent One-Way ANOVA</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-repeated-measures-anova"><span class="toc-chapter-title">Repeated Measures ANOVA</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-two-factor-anovas"><span class="toc-chapter-title">Two-Factor ANOVAs</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-introduction-to-correlation-and-regression"><span class="toc-chapter-title">Introduction to Correlation and Regression</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-introduction-to-linear-regression"><span class="toc-chapter-title">Introduction to Linear Regression</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-discrete-probability-distributions"><span class="toc-chapter-title">Discrete Probability Distributions</span> <span class="chapter-author">Jason Green</span></a></li><li class="chapter standard"><a href="#chapter-statistics-and-excel-evaluating-normality"><span class="toc-chapter-title">Statistics and Excel: Evaluating Normality</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-statistics-formula-glossary"><span class="toc-chapter-title">Statistics Formula Glossary</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="part"><a href="#part-math-82-86-153-163">Algebra and Concepts</a></li><li class="chapter standard"><a href="#chapter-introduction-to-exponents-and-polynomials"><span class="toc-chapter-title">Introduction to Exponents and Polynomials</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-factoring-polynomials"><span class="toc-chapter-title">Factoring Polynomials</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-rational-expressions-and-equations"><span class="toc-chapter-title">Rational Expressions and Equations</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-mathematical-ideas-problem-solving-techniques"><span class="toc-chapter-title">Mathematical Ideas: Problem-Solving Techniques</span> <span class="chapter-author">Jenna Lehmann</span></a></li><li class="chapter standard"><a href="#chapter-mathematical-ideas-basic-concepts-of-set-theory"><span class="toc-chapter-title">Mathematical Ideas: Basic Concepts of Set Theory</span> <span class="chapter-author">Jenna Lehmann</span></a></li></ul></div>
<div class="front-matter introduction" id="front-matter-introduction" title="Introduction"><div class="front-matter-title-wrap"><h3 class="front-matter-number">1</h3><h1 class="front-matter-title">Introduction</h1><h2 class="chapter-author">Jeremy Boettinger</h2></div><div class="ugc front-matter-ugc"> <p>This book contains content originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/" data-url="https://blogs.ubalt.edu/mathsupportcenter/">Math Support Center Resources page</a>, a blog run by student tutors and staff at the University of Baltimore. The chapters are mostly organized according to the tagging system of the source blog and may include references to specific math and statistics courses offered by the university.</p> </div></div>
<div class="part " id="part-main-body"><div class="part-title-wrap"><h3 class="part-number">I</h3><h1 class="part-title">Using SPSS</h1></div><p>This section contains chapters about how to use SPSS Statistics, a software package used in statistical analysis. A link to the original blog post is included at the bottom of each chapter.</p><div class="ugc part-ugc"></div></div>
<div class="chapter standard" id="chapter-using-data-and-variable-view" title="Using Data and Variable View"><div class="chapter-title-wrap"><h3 class="chapter-number">1</h3><h2 class="chapter-title">Using Data and Variable View</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>When getting started with SPSS, you may initially be confused about how to input your data in such a way that it’s easy to read and will allow you do to do the analyses that you would like to do.</p> <p>When first opening a new dataset on SPSS, you will be greeted with this blank screen on the Data View tab.</p> <p><img class="wp-image-72 size-full alignnone" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a.png" alt="Screenshot of blank SPSS document" width="624" height="525" data-wp-editing="1" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a-300x252.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a-225x189.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-a-350x294.png 350w" /></p> <p>The Data View tab is where you will be inputting the actual data. The way the data should be organized is that all data corresponding to a variable is lined up by column. Data for more than one variable which corresponds to an individual should be organized by row. The picture shows data for three variables organized by column.</p> <p><img class="alignnone wp-image-76 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b.png" alt="Screenshot of SPSS document" width="624" height="525" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b-300x252.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b-225x189.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-b-350x294.png 350w" /></p> <p>But I want to know what it is I’m looking at; which column corresponds to which variable? This is when you would click on the Variable view tab to name your variables. All you have to do is click on the variable you want to rename and type in the name like an Excel spreadsheet.</p> <p><img class="alignnone wp-image-77 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c.png" alt="Screenshot of SPSS document" width="624" height="452" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c-300x217.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c-65x47.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c-225x163.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-c-350x254.png 350w" /></p> <p>If you’re working with interval or ratio variables, skip this next step, but because I want to try working with nominal variables, I’m going to reenter my data on the Data View so that my condition 1 is full of only 0’s, 1’s, and 2’s. I want to make my condition 2 full of only 1’s, 2’s, 3’s, 4’s, and 5’s. Finally, I want my condition 3 to be full of 0’s and 1’s. This is because I want each number to relate back to a certain response. I want my condition 1 to be the answer to the question “Do you study for tests?” with the possible answers being “No,” “Sometimes,” and “Yes.” I want my condition 2 to be participant letter grades: A, B, C, D, and E. Finally, I want my condition 3 data to the question “Do you, on average, get 8 hours of sleep a night?” with the answers being “Yes” and “No.” Again, if you’re working with data which is not nominal or categorical, don’t bother with labeling.</p> <p><img class="alignnone size-full wp-image-78" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d.png" alt="Screenshot of SPSS document" width="624" height="509" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d-300x245.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d-65x53.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d-225x184.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-d-350x285.png 350w" /></p> <p>In order to make this easier to understand when I’m actually analyzing the data, I’m going to name each of these numbers using the Values section which is highlighted in the Variable View picture two pictures up. Simply double click it to make the blue button appear and click the blue button for a pop-up to appear.</p> <p><img class="alignnone size-full wp-image-79" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e.png" alt="Screenshot of SPSS document" width="768" height="595" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e-300x232.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e-225x174.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SPSS1-e-350x271.png 350w" /></p> <p>To name a number, simply type the value into the Input box, type the name you would like to give it into the Label box, and click the Add button. When you’re done naming all of your numbers, you can hit the OK button. We won’t be going over what good this does in this post, but it will be important for reading your output data later on, which will be discussed in other posts.</p> <p><img class="alignnone size-full wp-image-73" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1.png" alt="Screenshot of SPSS document" width="624" height="556" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1-300x267.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1-65x58.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1-225x200.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro6-624x556-1-350x312.png 350w" /></p> <p>Going back to the Data View, as you can see, changing the names of your data digits does not affect the Data View. I also went ahead and changed the names of the variables in Variable View so that the data have some more context. There are some other things in Variable View which may be important to consider moving forward. Like I said, we’re working with nominal values and SPSS gives us the option of defining them as such for analysis purposes. All you have to do is go back to Variable View, click the button under Measure which corresponds with the variable you would like to change the scale for, and select what you would like from the drop-down menu.</p> <p><img class="alignnone size-full wp-image-74" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1.png" alt="Screenshot of SPSS document" width="624" height="495" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1-300x238.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1-65x52.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1-225x178.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro7-624x495-1-350x278.png 350w" /></p> <p>Finally, when I’m working with whole numbers, I find the two decimals at the end of each number to be rather annoying. To get rid of those, just click the box you want to change under Decimals and change the number of decimals you would like to see next to each number in that variable column.</p> <p><img class="alignnone size-full wp-image-75" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1.png" alt="Screenshot of SPSS document" width="624" height="479" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1-300x230.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/intro8-624x479-1-350x269.png 350w" /></p> <p>These are all the basics of inputting data. It is possible to copy and paste Excel data into SPSS if you already have a data set ready. Please just keep in mind that unlike working in Excel, variable names cannot simply be put in the first rows of the sheet, they must be logged in Variable View.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/30/using-spss-using-data-and-variable-view/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/30/using-spss-using-data-and-variable-view/">Math Support Center blog</a> at the University of Baltimore on June 30, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-descriptive-statistics" title="Descriptive Statistics"><div class="chapter-title-wrap"><h3 class="chapter-number">2</h3><h2 class="chapter-title">Descriptive Statistics</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>Sometimes you’ll want to get some basic information on the data you have. Running these descriptive statistics is pretty straight forward. First, click the Analyze button, hover over the Descriptive Statistics tab, and then you’ll be able to choose a few different options. I prefer just clicking the frequencies button because it gives you the option to look at frequencies as well as other kinds of descriptives.</p> <p><img class="alignnone wp-image-83 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1.png" alt="Screenshot of SPSS document" width="624" height="513" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1-300x247.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1-65x53.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1-225x185.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies1-624x513-1-350x288.png 350w" /></p> <p>After clicking that, a pop-up will appear. Highlight the groups you would like descriptives and frequencies on and move them over to the right.</p> <p><img class="alignnone size-full wp-image-84" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1.png" alt="Screenshot of SPSS document" width="768" height="454" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1-300x177.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1-65x38.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1-225x133.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies2-768x454-1-350x207.png 350w" /></p> <p>To get the descriptive data we want, click the Statistics button. You can choose from a variety of different descriptives. I like getting all the measures of central tendency and everything related to variability, but as you can see there are other options as well.</p> <p><img class="alignnone size-full wp-image-85" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1.png" alt="Screenshot of SPSS document" width="768" height="434" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1-300x170.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1-65x37.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1-225x127.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies3-768x434-1-350x198.png 350w" /></p> <p>After clicking out of that, you can then have SPSS make you a chart or a graph by clicking the Charts button. I decided not to go ahead with that, but I wanted to point out that option. There are some other buttons to click, but I personally have never needed to mess with those.</p> <p><img class="alignnone size-full wp-image-86" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1.png" alt="Screenshot of SPSS document" width="768" height="441" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1-300x172.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1-65x37.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1-225x129.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies4-768x441-1-350x201.png 350w" /></p> <p>Once you finish with the pop-up, the output should appear on a separate window. These are screenshots of what my data looked like. This output is pretty easy to read because it’ll just tell you what you asked to know. Other outputs may be more difficult to read so in future posts I’ll go into detail about what it is you’ll be looking at.</p> <p><img class="alignnone size-full wp-image-87" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1.png" alt="Screenshot of SPSS document" width="768" height="576" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1-300x225.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1-65x49.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1-225x169.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies5-768x576-1-350x263.png 350w" /></p> <p><img class="alignnone size-full wp-image-88" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6.png" alt="Screenshot of SPSS document" width="995" height="1010" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6.png 995w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6-296x300.png 296w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6-768x780.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6-65x66.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6-225x228.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/frequencies6-350x355.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-descriptive-statistics/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-descriptive-statistics/">Math Support Center blog</a> at the University of Baltimore on July 1, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-comparing-means-single-sample-t-test" title="Comparing Means: Single Sample t-test"><div class="chapter-title-wrap"><h3 class="chapter-number">3</h3><h2 class="chapter-title">Comparing Means: Single Sample t-test</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>With a one-sample t-test, we only need to worry about working with one sample. When starting, you should already know the population mean you’ll be comparing the sample to. So in this first picture, we have one column of data lined up and ready to go.</p> <p><img class="alignnone size-full wp-image-91" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1.png" alt="Screenshot of SPSS document" width="624" height="528" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1-300x254.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1-225x190.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample1-624x528-1-350x296.png 350w" /></p> <p>The next step is to click the Analyze button, hover over Compare Means, and click One-Sample T-Test.</p> <p><img class="alignnone size-full wp-image-93" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1.png" alt="Screenshot of SPSS document" width="624" height="516" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1-300x248.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1-65x54.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1-225x186.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample2-624x516-1-350x289.png 350w" /></p> <p>A pop-up should appear. Simply move the name of your test variable to the right. In the Test Variable, type in the population mean you would like to compare the sample to.</p> <p><img class="alignnone size-full wp-image-94" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1.png" alt="Screenshot of SPSS document" width="624" height="500" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1-300x240.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1-65x52.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1-225x180.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample3-624x500-1-350x280.png 350w" /></p> <p>This is what my output ended up looking like. In the first row of boxes, N is the number of data points you have. In the second row of boxes, t is the t-statistic, df is the degrees of freedom, and Sig. is the p-value. The p-value will tell you if the difference is significant. Usually, we look for a p-value less than or equal to .05 before we state that the difference between the means is significant. In this case, the difference is significant.</p> <p><img class="alignnone size-full wp-image-95" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1.png" alt="Screenshot of SPSS document" width="1024" height="536" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1-300x157.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1-768x402.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1-65x34.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1-225x118.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/onesample4-1024x536-1-350x183.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-comparing-means-single-sample-t-test/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-comparing-means-single-sample-t-test/">Math Support Center blog</a> at the University of Baltimore on July 1, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-comparing-means-independent-samples-t-test" title="Comparing Means: Independent Samples t-test"><div class="chapter-title-wrap"><h3 class="chapter-number">4</h3><h2 class="chapter-title">Comparing Means: Independent Samples t-test</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>So far we’ve talked about creating independent variables, but what about levels? This may seem strange at first, but levels of a condition need to be spelled out by numbers. Usually, I just assign condition 1 a 1 and condition 2 a 2. You can see in the picture below how this looks. You can’t see this, but there are 30 individuals in total, half in condition 1, and half in condition 2.</p> <p><img class="alignnone size-full wp-image-98" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1.png" alt="Screenshot of SPSS document" width="624" height="530" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1-300x255.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1-225x191.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples1-624x530-1-350x297.png 350w" /></p> <p>For this tutorial, we’ll pretend that this data has been collected by a struggling baker who wants to try out a new cookie recipe to see if customers would like it better. She asks 15 people to sample her old recipe and 15 people to sample her new recipe and rate how they liked the cookie on a scale of 1-5. To make this easier to read in the output later, I’m going to label the conditions in Variable View.</p> <p><img class="alignnone size-full wp-image-102" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1.png" alt="Screenshot of SPSS document" width="624" height="465" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1-300x224.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1-65x48.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1-225x168.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples7-624x465-1-350x261.png 350w" /></p> <p>Now it’s time for the actual analysis. Click the Analyze button, hover over Compare Means, and click Independent Samples T-Test.</p> <p><img class="alignnone size-full wp-image-99" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1.png" alt="Screenshot of SPSS document" width="624" height="515" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1-300x248.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1-65x54.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1-225x186.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples3-624x515-1-350x289.png 350w" /></p> <p>A pop-up will appear. Put your dependent variable in the Testing Variable box and your conditions in the Grouping Variable box. Then, click the Define Groups button.</p> <p><img class="alignnone size-full wp-image-100" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1.png" alt="Screenshot of SPSS document" width="624" height="500" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1-300x240.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1-65x52.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1-225x180.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples4-624x500-1-350x280.png 350w" /></p> <p>Another pop-up will appear. Simply fill in each box with the number that corresponds to your groups (again, in my case it’s just 1 and 2).</p> <p><img class="alignnone size-full wp-image-101" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1.png" alt="Screenshot of SPSS document" width="624" height="496" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1-300x238.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1-65x52.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1-225x179.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples5-624x496-1-350x278.png 350w" /></p> <p>Your output will look something like this. The first row of boxes will simply give you descriptives about your data. The second box get’s a little tricker. The boxes under Levene’s Test for Equality of Variances simply allows the user to see if the two samples have equal variances. In this case, we want the p-value to be less than .05 because we don’t want any differences in the variances. It looks like we’re in the clear for this data set. The next few boxes give us the t-statistic, the degrees of freedom, and the p-value. It looks like our conditions are significantly different. Remember to report a .000 p-value as p&lt;0.01, because there is no such thing as a p-value of 0. So we know that there’s a difference, but which cookie got the higher score overall? Simply look at the means. The new recipe has a higher mean rating than the old recipe.</p> <p><img class="alignnone size-full wp-image-103" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1.png" alt="Screenshot of SPSS document" width="768" height="274" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1-300x107.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1-65x23.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1-225x80.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/indsamples8-768x274-1-350x125.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-comparing-means-independent-samples-t-test/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/using-spss-comparing-means-independent-samples-t-test/">Math Support Center blog</a>&nbsp; at the University of Baltimore on July 1, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-comparing-means-repeated-measures-t-test" title="Comparing Means: Repeated Measures t-test"><div class="chapter-title-wrap"><h3 class="chapter-number">5</h3><h2 class="chapter-title">Comparing Means: Repeated Measures t-test</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>In this section, we’ll be talking about how to properly conduct a repeated measures t-test on SPSS. Before, when we were working on independent t-tests, we needed to create a list of numbers which represented group categories so that the corresponding continuous data was grouped properly. In this kind of t-test though, each “Variable” actually becomes a level. In this case of this example, we’re looking at the data from a before and after. The “Before” consists of the number of alcoholic drinks 30 college students are consuming a week. The “After” consists of the number of alcoholic drinks the same college students were drinking after having taken a Wellness class which focused on the effects of drug and alcohol on the mind and body. If you’re confused as to how this differs from an independent samples t-test, I suggest looking at the <a href="#chapter-comparing-means-independent-samples-t-test" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/comparing-means-independent-samples-t-test/">Independent Samples t-test</a> and <a href="#chapter-repeated-measures-t-test" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/repeated-measures-t-test/">Repeated Measures t-test</a> chapters.</p> <p><img class="alignnone size-full wp-image-106" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1.png" alt="Screenshot of SPSS document" width="624" height="614" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1-300x295.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1-65x64.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1-225x221.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest1-624x614-1-350x344.png 350w" /></p> <p>Just for reference, I have already labeled my columns Before and After in the Variable View section.</p> <p><img class="alignnone size-full wp-image-107" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1.png" alt="Screenshot of SPSS document" width="1024" height="791" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1-300x232.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1-768x593.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1-225x174.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest2-1024x791-1-350x270.png 350w" /></p> <p>To conduct the test, click the Analyze button, hover over Compare Means, and click Paired Samples t-test.</p> <p><img class="alignnone size-full wp-image-108" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1.png" alt="Screenshot of SPSS document" width="768" height="590" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1-300x230.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest3-768x590-1-350x269.png 350w" /></p> <p>Simply drag your before and after into the correct slots. These are usually done in chronological order from left to right.</p> <p><img class="alignnone size-full wp-image-109" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1.png" alt="Screenshot of SPSS document" width="768" height="597" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1-300x233.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1-65x51.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1-225x175.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest4-768x597-1-350x272.png 350w" /></p> <p>Finally, you’ll get your output. Based on this particular test, we can see that we got a t score of 11.4, which is already a pretty good indicator that the results will be significant. Typically, anything above a 3 or 4 will be significant. Just to be sure, let’s look at our p value. It’s less than 0.05, which is our typical alpha level, which means that there was a significant difference between the before and after. To see in which direction there is a difference, we go up to the means. Which one is smaller or bigger than the other? We can see that the mean drinks before the intervention was higher on average than after the intervention. In this case, we would say participants drank significantly fewer drinks per week after the intervention than before the intervention.</p> <p><img class="alignnone size-full wp-image-110" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1.png" alt="Screenshot of SPSS document" width="1024" height="515" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1-300x151.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1-768x386.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1-65x33.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1-225x113.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasttest5-1024x515-1-350x176.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/16/using-spss-comparing-means-repeated-measures-t-test/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/16/using-spss-comparing-means-repeated-measures-t-test/">Math Support Center blog</a> at the University of Baltimore on July 16, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-comparing-means-repeated-measures-one-way-anova" title="Comparing Means: Repeated Measures One-Way ANOVA"><div class="chapter-title-wrap"><h3 class="chapter-number">6</h3><h2 class="chapter-title">Comparing Means: Repeated Measures One-Way ANOVA</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>This post will be about finding a difference in means when it comes to repeated measures in research designs with a factor with more than 2 levels. Just like with the <a href="#chapter-comparing-means-repeated-measures-t-test" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/comparing-means-repeated-measures-t-test/">Repeated Measures t-test</a>, we’ll be lining our levels up in columns. For this example, we’ll pretend that we’ve collected data on self-reported depression. Participants were asked to rate on a scale from 1-9 how severe they felt their depression is. They were then given medication to take which is known to reduce depressive symptoms. Participants were asked again after 6 months how high they rated their depression. They were asked one last time at the end of 12 months.</p> <p><img class="alignnone size-full wp-image-115" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1.png" alt="Screenshot of SPSS document" width="768" height="602" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1-300x235.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1-65x51.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1-225x176.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova1-768x602-1-350x274.png 350w" /></p> <p>I went ahead and named the levels in the Variable view.</p> <p><img class="alignnone size-full wp-image-116" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1.png" alt="Screenshot of SPSS document" width="1024" height="794" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1-300x233.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1-768x596.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1-225x174.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova2-1024x794-1-350x271.png 350w" /></p> <p>To run the actual test, simply go up to Analyze, scroll over General Linear Model, and click Repeated Measures.</p> <p><img class="alignnone size-full wp-image-117" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3.png" alt="Screenshot of SPSS document" width="1303" height="1002" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3.png 1303w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-300x231.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-1024x787.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-768x591.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova3-350x269.png 350w" /></p> <p>A pop-up will appear. In the first box, create the name of your factor. In this case, I’ve named it time, because we’re doing comparisons across time. In the second box, I typed in 3 because we have 3 levels and then I pressed Add. In the third box, I named our dependent variable and clicked Add. Next, we need to Define our factors.</p> <p><img class="alignnone size-full wp-image-118" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1.png" alt="Screenshot of SPSS document" width="1024" height="789" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1-300x231.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1-768x592.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova4-1024x789-1-350x270.png 350w" /></p> <p>Another pop up will appear. Move the levels over into the top, right box. I prefer doing this in chronological order from top to bottom. Then, click Options.</p> <p><img class="alignnone size-full wp-image-119" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1.png" alt="Screenshot of SPSS document" width="1024" height="789" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1-300x231.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1-768x592.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova5-1024x789-1-350x270.png 350w" /></p> <p>I would recommend getting means for everything, so move OVERALL and time over to the box on the right. I also recommend clicking the Descriptive Statistics and Estimate of Effect Size boxes. Finally, click the Compare Means checkbox; it’s located under the big, white box on the right. Click all the Continues and OK’s that follow.</p> <p><img class="alignnone size-full wp-image-120" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1.png" alt="Screenshot of SPSS document" width="1024" height="795" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1-300x233.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1-768x596.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1-225x175.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova6-1024x795-1-350x272.png 350w" /></p> <p>We can see from the means that the average for Start is greater than at 6 months is greater than at 12 months. This is important to know, but this does not prove significance.</p> <p><img class="alignnone size-full wp-image-121" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1.png" alt="Screenshot of SPSS document" width="1024" height="760" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1-300x223.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1-768x570.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1-65x48.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1-225x167.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova7-1024x760-1-350x260.png 350w" /></p> <p>Go to the Tests of Within-Subjects Effects box and find “time” on the left, scroll over to Greenhouse-Geisser, and then scroll all the way to F and significance. We have a huge F score of 68.5 and a significance which is less than 0.05 and so we can say that somewhere there is a significant difference in the groups. If you need to report the effect size, you can find it under Partial Eta Squared.</p> <p><img class="alignnone size-full wp-image-122" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1.png" alt="Screenshot of SPSS document" width="1024" height="768" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1-300x225.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1-768x576.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1-65x49.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1-225x169.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova8-1024x768-1-350x263.png 350w" /></p> <p><img class="alignnone size-full wp-image-123" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1.png" alt="Screenshot of SPSS document" width="1024" height="789" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1-300x231.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1-768x592.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova9-1024x789-1-350x270.png 350w" /></p> <p>This last box shows us the post-hoc under Pairwise Comparison. As you can see, all the comparisons are significantly different with a significance less than 0.05. This means that we can say that there was a significant difference in times since treatment began with participants expressing the most depression before the treatment started, less depression 6 months after the treatment started, and the least depression after 12 months of treatment.</p> <p><img class="alignnone size-full wp-image-124" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10.png" alt="Screenshot of SPSS document" width="1052" height="671" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10.png 1052w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-300x191.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-1024x653.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-768x490.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-65x41.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-225x144.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/repmeasanova10-350x223.png 350w" /></p> <p>&nbsp;</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/16/using-spss-comparing-means-repeated-measures-one-way-anova/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/16/using-spss-comparing-means-repeated-measures-one-way-anova/">Math Support Center blog</a> at the University of Baltimore on July 16, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-independent-measures-one-way-anova" title="Comparing Means: Independent Measures One-Way ANOVA"><div class="chapter-title-wrap"><h3 class="chapter-number">7</h3><h2 class="chapter-title">Comparing Means: Independent Measures One-Way ANOVA</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>Just like an independent samples t-test, an Independent measures one-way ANOVA uses independent subjects for each level/condition within an independent variable. In this example, we’re growing plants. In Variable View, I’ve made the independent variable Condition (in this case the amount of water I’ll be giving to the plants) and the dependent variable Height.</p> <p><img class="alignnone size-full wp-image-127" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1.png" alt="Screenshot of SPSS document" width="1024" height="796" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1-300x233.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1-768x597.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1-65x51.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1-225x175.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA1-1024x796-1-350x272.png 350w" /></p> <p>Next, it’s important to label the levels of your independent variable, so I clicked the cell under Values and assigned the numbers 1, 2, and 3 a condition: no water, some water, and a lot of water.</p> <p><img class="alignnone size-full wp-image-128" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1.png" alt="Screenshot of SPSS document" width="1024" height="795" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1-300x233.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1-768x596.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1-225x175.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA2-1024x795-1-350x272.png 350w" /></p> <p>Then, I just put my data in Data View, with the Condition column full of the numbers representing the different conditions and the Height column full of the measured heights of each plant.</p> <p><img class="alignnone size-full wp-image-129" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA3-300x285-1.png" alt="Screenshot of SPSS document" width="300" height="285" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA3-300x285-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA3-300x285-1-65x62.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA3-300x285-1-225x214.png 225w" /></p> <p>To conduct the test, click Analyze at the top, hover over Compare means, and then click One-Way ANOVA.</p> <p><img class="alignnone size-full wp-image-130" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1.png" alt="Screenshot of SPSS document" width="768" height="720" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1-300x281.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1-65x61.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1-225x211.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA4-768x720-1-350x328.png 350w" /></p> <p>You’ll be greeted with a pop-up asking you to arrange the variables you would like to test. The dependent variable goes in the top box and the independent variable goes in the bottom box. Then, click the Post Hoc box.</p> <p><img class="alignnone size-full wp-image-131" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1.png" alt="Screenshot of SPSS document" width="1024" height="798" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1-300x234.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1-768x599.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1-65x51.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1-225x175.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA5-1024x798-1-350x273.png 350w" /></p> <p>You will be greeted by another pop-up. Here you can click the kind of post hoc test you would like to run. Tukey seems to be pretty popular, so that’s the one I chose. Then, click Continue.</p> <p><img class="alignnone size-full wp-image-132" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA6-300x231-1.png" alt="Screenshot of SPSS document" width="300" height="231" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA6-300x231-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA6-300x231-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA6-300x231-1-225x173.png 225w" /></p> <p>Once back to the main pop-up, then click Options. Here you’ll be able to add in descriptive statistics (like mean, SD, etc.) and a Homogeneity of Variance test, which may be important to report depending on what your professor is asking for. Then, click continue and finally OK.</p> <p><img class="alignnone size-full wp-image-133" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA7-300x231-1.png" alt="Screenshot of SPSS document" width="300" height="231" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA7-300x231-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA7-300x231-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA7-300x231-1-225x173.png 225w" /></p> <p>Your output will look something like this. First is always the descriptives. The next box is the results of the test of homogeneity of variances. Remember, this is the one that we don’t want to be significant; we want there to be no difference between the groups. Looking under Sig, we can see that our p-value is greater than 0.05 so we’re in the clear! The third box shows us the result of our analysis overall. Here, our F-value is 21.4 and we have a p-value of less than 0.01, which means that there is definitely a difference somewhere between these three conditions.</p> <p><img class="alignnone size-full wp-image-134" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8.png" alt="Screenshot of SPSS document" width="943" height="679" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8.png 943w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8-300x216.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8-768x553.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8-65x47.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8-225x162.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA8-350x252.png 350w" /></p> <p>Moving on the Post Hoc, each box compares one group with the other group. So as well can see, no water and some water are significantly different, and no water and lots of water are significantly different (remember that stars indicate significance). We’ve compared conditions 1 and 2 and conditions 1 and 3, but we still need to compare 2 and 3, so we move down to the next box and see that some water and lots of water are also significantly different. Make sure to report all of these differences. To know which groups are significantly less than or greater than others, refer to the descriptive statistics at the top (specifically the means).</p> <p><img class="alignnone size-full wp-image-135" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9.png" alt="Screenshot of SPSS document" width="794" height="743" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9.png 794w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9-300x281.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9-768x719.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9-65x61.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9-225x211.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/IndependentANOVA9-350x328.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/09/10/using-spss-comparing-means-independent-measures-one-way-anova/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/09/10/using-spss-comparing-means-independent-measures-one-way-anova/">Math Support Center blog</a> at the University of Baltimore on September 10, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-spss-simple-linear-regression" title="Simple Linear Regression"><div class="chapter-title-wrap"><h3 class="chapter-number">8</h3><h2 class="chapter-title">Simple Linear Regression</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>A regression can be seen as a kind of extension of a correlation. When doing a regression, you find a lot of the same outputs, like Pearson’s r and r-squared. The difference is that the point of a regression is to also construct a model (usually linear) that will help us predict values using a line of best fit. In the case of this example, we will be looking at average hours of sleep students get and comparing it to their GPA. A regression will also give us a model (y=mx+b) that would allow us to predict the GPA of a hypothetical student if we knew the average amount of sleep they get a night.</p> <p>First, we need to create our variables in Variable View.</p> <p><img class="alignnone size-full wp-image-138" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1.png" alt="Screenshot of SPSS document" width="1024" height="105" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1-300x31.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1-768x79.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1-65x7.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1-225x23.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression1-1024x105-1-350x36.png 350w" /></p> <p>Then, we need to input our data into Data View. You can’t see it in this photo, but I have 25 participants total.</p> <p><img class="alignnone size-full wp-image-139" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression2.png" alt="Screenshot of SPSS document" width="274" height="405" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression2.png 274w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression2-203x300.png 203w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression2-65x96.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression2-225x333.png 225w" /></p> <p>To start our regression, we need to go to Analyze &gt; Regression &gt; Linear.</p> <p><img class="alignnone size-full wp-image-140" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1.png" alt="Screenshot of SPSS document" width="624" height="455" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1-300x219.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1-65x47.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1-225x164.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression3-624x455-1-350x255.png 350w" /></p> <p>Once we click that, this pop-up will appear. Make sure to make your <em>predictor </em>the independent variable and the <em>predicted</em> variable the dependent variable.</p> <p><img class="alignnone size-full wp-image-141" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4.png" alt="Screenshot of SPSS document" width="563" height="487" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4.png 563w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4-300x260.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4-65x56.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4-225x195.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression4-350x303.png 350w" /></p> <p>It may be important for you to then click the Statistics button and make sure to check what you need to include in your report. Descriptives and Confidence Intervals never hurt.</p> <p><img class="alignnone size-full wp-image-142" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5.png" alt="Screenshot of SPSS document" width="550" height="466" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5.png 550w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5-300x254.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5-225x191.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression5-350x297.png 350w" /></p> <p>Your output will look something like the following two pictures. Descriptives are at the top, so this will help you report your means and standard deviations if you need to. Next are your correlation matrices. In my case, it looks like Sleep and GPA have a somewhat strong positive correlation and it appears to be significant with a p- value of less than .001. Ignoring the variables entered/removed section, the model summary shows us once again our r value and it also gives us an r-squared value. The ANOVA table gives us an F value and significance if we choose to report that. Finally (and the part we’ve been waiting for) is the model. This part is like in Algebra when you needed to learn about linear functions. We’re constructing a line using y=mx+b where the m is the slope and the b is the y-intercept. For this example, the model we’re working with is y=0.11x+.418 and I found these numbers from the coefficients table.</p> <p><img class="alignnone size-full wp-image-143" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6.png" alt="Screenshot of SPSS document" width="371" height="542" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6.png 371w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6-205x300.png 205w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6-65x95.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6-225x329.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression6-350x511.png 350w" /></p> <p><img class="alignnone size-full wp-image-144" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7.png" alt="Screenshot of SPSS document" width="758" height="530" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7.png 758w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7-300x210.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7-65x45.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7-225x157.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression7-350x245.png 350w" /></p> <p>Now let’s say you need a graph of this. Go to Graphs &gt; Legacy Dialogues &gt; Scatter Plot.</p> <p><img class="alignnone size-full wp-image-145" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1.png" alt="Screenshot of SPSS document" width="624" height="473" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1-300x227.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1-65x49.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1-225x171.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression8-624x473-1-350x265.png 350w" /></p> <p>This pop-up should appear. Just click Simple Scatter.</p> <p><img class="alignnone size-full wp-image-146" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9.png" alt="Screenshot of SPSS document" width="385" height="247" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9.png 385w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9-300x192.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9-65x42.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9-225x144.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression9-350x225.png 350w" /></p> <p>Another pop-up should appear. Make sure your <em>predictor</em> is on the x-axis and the <em>predicted</em> variable is in the y-axis.</p> <p><img class="alignnone size-full wp-image-147" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10.png" alt="Screenshot of SPSS document" width="530" height="579" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10.png 530w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10-275x300.png 275w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10-65x71.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10-225x246.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression10-350x382.png 350w" /></p> <p>You should end up with this basic scatterplot of your points.</p> <p><img class="alignnone size-full wp-image-148" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1.png" alt="Screenshot of SPSS document" width="624" height="384" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1-300x185.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1-65x40.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1-225x138.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression11-624x384-1-350x215.png 350w" /></p> <p>To get the line of best fit, right-click the graph &gt; Edit Content &gt; In Separate Window.</p> <p><img class="alignnone size-full wp-image-149" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1.png" alt="Screenshot of SPSS document" width="624" height="408" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1-300x196.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1-65x43.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1-225x147.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression12-624x408-1-350x229.png 350w" /></p> <p>A new editing pop-up will appear. Click the linear equation button at the bottom of the bar. It will say Add Fit Line at Total when you hover over it.</p> <p><img class="alignnone size-full wp-image-150" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1.png" alt="Screenshot of SPSS document" width="624" height="440" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1.png 624w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1-300x212.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1-65x46.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1-225x159.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression13-624x440-1-350x247.png 350w" /></p> <p>Once that’s done, your graph should look something like this. If you exit out of the pop-up to go back to the output, the output graph should represent the changes you made in the editor pop-up.</p> <p><img class="alignnone size-full wp-image-151" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1.png" alt="Screenshot of SPSS document" width="768" height="343" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1-300x134.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1-65x29.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1-225x100.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/linear-regression14-768x343-1-350x156.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2020/02/11/using-spss-simple-linear-regression/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2020/02/11/using-spss-simple-linear-regression/">Math Support Center blog</a> at the University of Baltimore on February 11, 2020.</em></p> </div></div>
<div class="chapter standard" id="chapter-spss-correlations" title="Correlations"><div class="chapter-title-wrap"><h3 class="chapter-number">9</h3><h2 class="chapter-title">Correlations</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>A correlation requires at least 2 continuous variables. We need to first define our variables in Variable View. In this case, we’re looking at how number of absences relates to grade point average.</p> <p><img class="alignnone size-full wp-image-153" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1.png" alt="Screenshot of SPSS document" width="1020" height="75" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1.png 1020w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1-300x22.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1-768x56.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1-65x5.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1-225x17.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation1-350x26.png 350w" /></p> <p>Next, we type in our data points in Data View.</p> <p><img class="alignnone size-full wp-image-154" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation2.png" alt="Screenshot of SPSS document" width="257" height="470" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation2.png 257w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation2-164x300.png 164w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation2-65x119.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation2-225x411.png 225w" /></p> <p>To run the analysis, go to Analyze &gt; Correlate &gt; Bivariate.</p> <p><img class="alignnone size-full wp-image-155" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3.png" alt="Screenshot of SPSS document" width="696" height="584" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3.png 696w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3-300x252.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3-65x55.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3-225x189.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation3-350x294.png 350w" /></p> <p>A pop-up should appear. Put both of your variables in the Variables column.</p> <p><img class="alignnone size-full wp-image-156" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4.png" alt="Screenshot of SPSS document" width="443" height="399" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4.png 443w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4-300x270.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4-65x59.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4-225x203.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation4-350x315.png 350w" /></p> <p>Then, go to options and click means and standard deviations if that is something you need to report.</p> <p><img class="alignnone size-full wp-image-157" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation5.png" alt="Screenshot of SPSS document" width="283" height="228" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation5.png 283w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation5-65x52.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation5-225x181.png 225w" /></p> <p>Your output should look something like this. The correlations matrix looks a little redundant, but what’s important here is the Pearson Correlation value and the significance. You should have everything you need here to report a correlation. For information about how to do a scatter plot, please visit the <a href="#chapter-spss-simple-linear-regression" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/spss-simple-linear-regression/">SPSS Regression chapter</a>.</p> <p><img class="alignnone size-full wp-image-158" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6.png" alt="Screenshot of SPSS document" width="394" height="410" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6.png 394w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6-288x300.png 288w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6-65x68.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6-225x234.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/correlation6-350x364.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2020/02/13/using-spss-correlations/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2020/02/13/using-spss-correlations/">Math Support Center blog</a> at the University of Baltimore on February 13, 2020.</em></p> </div></div>
<div class="part " id="part-opre"><div class="part-title-wrap"><h3 class="part-number">II</h3><h1 class="part-title">Statistics</h1></div><p>This section contains chapters about statistics-related topics. A link to the original blog post is included at the bottom of each chapter.</p><div class="ugc part-ugc"></div></div>
<div class="chapter standard" id="chapter-introduction-to-statistics-basics" title="Introduction to Statistics Basics"><div class="chapter-title-wrap"><h3 class="chapter-number">10</h3><h2 class="chapter-title">Introduction to Statistics Basics</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>Whether this is your first statistics class or whether you’re just in need of a refresher, there are a few basic statistical principles which are necessary for one to understand before moving forward.</p> <p>&nbsp;</p> <h5><strong>Understanding Populations and Samples</strong></h5> <p><strong>Populations</strong> are the groups of people that we are interested in studying. This can be the entirety of people with depression, an entire town, or dog-owners. Populations can vary in size but are typically very large. They are almost always impossible to study in their entirety. Therefore, we select <strong>samples</strong> from a population. Although they’re never as diverse as the population, they are generally representative. However, they provide limited information and introduce sampling error.</p> <p><strong>Samples</strong> are a subset of the population which as been selected by various means. A sample is representative when it accounts for the variability and diversity of the population. For example, a representative sample of “individuals who attend the University of Baltimore” would include a diversity of age groups, race, educational background, students from different programs, faculty from multiple departments, staff, etc., in their appropriate percentages in the population. A non-representative sample in that case would not account for the various differences that exist among the individuals in a population, or would over-represent/under-represent a specific group. The figure below illustrates a hypothetical population, two examples of non-representative samples, and one representative sample of that population.</p> <div class="wp-caption aligncenter" id="attachment_22" aria-describedby="caption-attachment-22" style="width: 618px"><img class="wp-image-22 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling.png" alt="Diagram of simple random sampling in a populaton" width="618" height="476" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling.png 618w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling-300x231.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling-225x173.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Simple_random_sampling-350x270.png 350w" /><div class="wp-caption-text" id="caption-attachment-22">Created by Dan Kernler and shared under a CC BY-SA 4.0 License</div></div> <p>Why do we care about these distinctions? What we really care about is getting an answer that most closely represents a population. A non-representative sample introduces bias and error, and precludes researchers from making sound interpretations. But since we can’t study entire populations, we want to take samples and study them as best as we can to generalize the results to the population. Samples are not going to be exactly representative of a population so it’s best to know the distinction.</p> <p>Why can’t researchers study entire populations? As we mentioned before, populations are usually extremely large, and it would require a lot of time and resources (e.g. financial resources) to study it in its entirety. Furthermore, in a hypothetical world where a researcher possesses the resources necessary to do so, they will not be able to include every single person from the population in their research study.</p> <p>&nbsp;</p> <h5><strong>Parameters vs Statistics and Sampling Error</strong></h5> <p>A <strong>parameter</strong> is a value that describes a population, while a <strong>statistic</strong> is a value that describes a sample. A good way to remember it is: Parameter – Population, Statistic – Sample. If the sample was a good enough sample (completely random and preferably large), then these values should be very similar. <strong>Sampling error</strong> is the discrepancy that exists between a sample statistic and the corresponding population parameter. Every sample will have sampling error simply because a sample cannot possibly be as diverse as a whole population, but there are measures of preventing a larger one.</p> <p>How do these things relate to one another? These things all relate to each other because we select participants from a population which become a sample, on which we run tests and analysis, and then we can determine if the results are then generalizable to the general population we’re studying.</p> <p>&nbsp;</p> <h5><strong>Sampling Methods</strong></h5> <p>There are a number of ways to collect sample data. There are pros and cons to each, but simple random sampling reduces sampling error the most.</p> <ul><li><strong>Simple Random Sampling</strong>: <ul><li>Everyone has an equal chance of being selected.</li> <li>The selection method is completely random.</li> <li>One individual’s selection does not impact the likelihood that someone else is subsequently selected.</li> </ul> </li> <li><strong>Systematic Sampling</strong>: <ul><li>In a lost of all the individual, every nth individual is picked.</li> </ul> </li> <li><strong>Convenience Sample</strong>: <ul><li>Using the first individuals a surveyor comes across for the sample.</li> <li>Least likely to get a representative sample.</li> </ul> </li> <li><strong>Cluster Sampling</strong>: <ul><li>Dividing the population into groups (usually geographically).</li> <li>The clusters themselves are randomly selected while the people in them are not.</li> </ul> </li> <li><strong>Stratified Sampling</strong>: <ul><li>Divides the population into groups based on characteristics.</li> <li>A sample is taken from each of these groups so that characteristics that are important are accounted for.</li> </ul> </li> </ul> <p>&nbsp;</p> <h5><strong>Variable Types and Individuals</strong></h5> <p>A <strong>variable</strong> is a characteristic or condition that changes or has different values for individuals. In other words, it’s something that can be manipulated, categorized, or measured. An <strong>independent variable</strong> is a variable that is manipulated or decided on by the researcher. A <strong>dependent variable</strong> is a variable which is not to be manipulated, but instead observed. For example, if one is trying to see whether plants grow faster depending on the type of fertilizer is used, then the independent variable is the type of fertilizer and the dependent variable is the growth of the plant.</p> <p>A <strong>categorical variable</strong> is a variable which is measured by its name or category. This could be color (red, green, blue, etc.), gender (man, woman, nonbinary, etc.), or in the case of our coffee example, whether the coffee is meant to be served hot or cold. Although we might assign each of these categories a number in SPSS or excel, these numbers have no quantitative value and are just replacements for the names. Here is a Khan Academy video which may be helpful to you in understanding this concept:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/EqeVXI4WNHM/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Identifying individuals, variables and categorical variables in a data set | Khan Academy&quot;" title="Identifying individuals, variables and categorical variables in a data set | Khan Academy" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=56#pb-interactive-content" title="Identifying individuals, variables and categorical variables in a data set | Khan Academy" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=56#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=56</a> </p> </div> <p>An <strong>individual</strong> is an object or person that is described by a set of data. So if we were measuring the height and weight of 15 participants, each of those participants would be an individual in the study. If we were looking at the different coffees on a menu and we gathered data on whether each drink is hot or cold, how many calories is in each drink, how much sugar is in each drink, and how much caffeine is in each drink, then each of the different kinds of coffee would be considered individuals in this study.</p> <p>&nbsp;</p> <h5><strong>Types of Statistics &amp; Types of Studies</strong></h5> <p><strong>Descriptive statistics</strong> are used to summarize, organize, and simplify data (basically it lets us turn data sets into something legible). <strong>Inferential statistics</strong> are techniques that allow us to make generalizations about the population from a sample (so this is actually comparing groups to see if there are statistical differences between them or comparing variables to see if there are relationships between them). There are two types of data structures that make use of these kinds of statistics.</p> <p><span style="text-decoration: underline">Data Structure I: Measuring two variables for each individual</span></p> <p><strong>Correlational method</strong>: Measuring two variables for each individual in order to determine if there is a significant relationship between the two. A limitation of this method is that it can show a relationship, but not an explanation for the relationship. A correlation does not necessarily mean a causation and is never enough to draw such an inference.</p> <p><span style="text-decoration: underline">Data Structure II: Comparing two or more groups of scores</span></p> <p><strong>Experimental Method:</strong> The goal is to demonstrate a cause and effect relationship between two variables. The experiment attempts to show that changing the value of one variable causes changes to occur in the second variable. This requires:</p> <ul><li><strong>Manipulation</strong>: The researcher manipulates one variable by changing its value from one level to another. A second variable is observed to determine whether the manipulation causes changes to occur</li> <li><strong>Control</strong>: The researcher must exercise control over the research situation to ensure that other, extraneous variables do not influence the relationship being examined. These variables that need to be controlled can be participant variables (characteristics such as age, gender, and intelligence that vary from one individual to the other) or environmental variables (lighting, time of day, weather, etc.). Researchers should control for as many variables as they can, and so spend a lot of time designing an experiment about what variables are important to control for and how to go about doing that.</li> </ul> <p><strong>Non-experimental Method</strong> (Nonequivalent groups and pre-post studies): This is when the experimenter is unable to fully manipulate the independent variable. For example, when gender is studied, one can’t assign participants to be a random gender. Researchers also have no control over time, and so pre-post tests are also not true experiments. What is meant by this is that a variable is measured twice (pre and post), and researchers can’t control which they measure first – it must be the pre.</p> <p>&nbsp;</p> <h5><strong>Constructs</strong></h5> <p><strong>Constructs</strong> are internal attributes or characteristics that can’t be directly observed but are useful for describing and explaining behavior. The construct is a proposed attribute of a person that often cannot be measured directly, but can be assessed using a number of indicators or manifest variables (for example, depression). We tend to use an operational definition for constructs, which describe a set of operations for measuring the construct and defines a construct in terms of the resulting measurement. Here is a helpful YouTube video for explaining this concept:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/Qhz_z4bFqEs/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Constructs and Operational Definitions (Module 1 1c 2,2)&quot;" title="Constructs and Operational Definitions (Module 1 1c 2,2)" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=56#pb-interactive-content" title="Constructs and Operational Definitions (Module 1 1c 2,2)" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=56#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=56</a> </p> </div> <p>&nbsp;</p> <h5><strong>Scale Types</strong></h5> <p>A <strong>scale</strong> is a way in which to categorize and/or quantify variables. Each type of scale may have a combination of magnitude, equal intervals, absolute 0, or none. Magnitude means that the scale specifies if each marker has relative value to the other markers. Equal intervals means that a one point difference carries the same weight throughout the scale and that there is a linear relationship among the variables. Absolute 0 just means that the 0 on the scale means the complete absence of that thing. The different types of scales are as follows:</p> <ul><li>Nominal: Set of categories; no quantitative distinction (Ravens, Steelers, etc.)</li> <li>Ordinal: Categories in an ordered sequence ( 1st place vs. 4th place. We don’t know the differences between each race time, only that this is the order that they came in.)</li> <li>Interval: Ordered categories with equal intervals. Arbitrary zero point (ex. Celsius, 0 could have been placed anywhere but we decided to place it at the freezing point of water)</li> <li>Ratio: Ordered categories with equal intervals. Absolute zero point. (ex. Height or weight)</li> </ul> <p><img class="wp-image-434 size-full aligncenter" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10.png" alt="A table that explains different types of scale" width="547" height="272" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10.png 547w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10-300x149.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10-65x32.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10-225x112.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch10-350x174.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/introduction-to-statistics-basics/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/introduction-to-statistics-basics/">Math Support Center blog</a> at the University of Baltimore on June 4, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-frequency-distributions" title="Frequency Distributions"><div class="chapter-title-wrap"><h3 class="chapter-number">11</h3><h2 class="chapter-title">Frequency Distributions</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>In statistics, a lot of tests are run using many different points of data and it’s important to understand how those data are spread out and what their individual values are in comparison with other data points. A <strong>frequency distribution</strong> is just that–an outline of what the data look like as a unit. A <strong>frequency table</strong> is one way to go about this. It’s an organized tabulation showing the number of individuals located in each category on the scale of measurement. When used in a table, you are given each score from highest to lowest (X) and next to it the number of times that score appears in the data (f). A table in which one is able to read the scores that appear in a data set and how often those particular scores appear in the data set. Here’s a Khan Academy video we found to be helpful in explaining this concept:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/gdE46YSedvE/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy&quot;" title="Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content" title="Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=66</a> </p> </div> <p>&nbsp;</p> <p><span style="text-decoration: underline">Organizing Data into a Frequency Distribution</span></p> <ol><li>Find the range</li> <li>Order the table from highest score to lowest score, not skipping scores that might not have shown up in the data set</li> <li>In the next column, document how many times this score shows up in the data set</li> </ol> <p>&nbsp;</p> <p><span style="text-decoration: underline">Organizing data into a group frequency table</span></p> <ol><li>The grouped frequency table should have about 10 intervals. A good strategy is to come up with some widths according to Guideline 2 and divide the total range of numbers by that width to see if there are close to 10 intervals.</li> <li>The width of the interval should be a relatively simple number (like 2, 5, or 10)</li> <li>The bottom score in each class interval should be a multiple of the width (0-9, 10-19, 20-19, etc.)</li> <li>All intervals should be the same width.</li> </ol> <p>&nbsp;</p> <h5><strong>Proportions and Percentages</strong></h5> <p><strong>Proportions</strong> measure the fraction of the total group that is associated with each score (they’re called relative frequencies because they describe the frequency in relation to the total number of scores). For example, if I have 10 pieces of fruit and 3 of them are oranges, 3/10 is the proportion of oranges. On the other hand, <strong>percentages</strong> express relative frequency out of 100, but essentially report the same values. Keeping in line with our fruit example, 30% of my fruit is oranges. Here’s a YouTube video which might be helpful:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/zPAdUkh7Vm8/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Calculating Relative Frequency&quot;" title="Calculating Relative Frequency" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content" title="Calculating Relative Frequency" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=66</a> </p> </div> <h5><strong>Real Limits</strong></h5> <p><strong>Real limits</strong> are continuous variables require a calculation of a real limit. They can be calculated by taking the apparent limit and subtracting and then separately adding half the value of the smallest digit available or presented. For example, I have a value of 50 and I want the real limits. To make it easier to see, I make the number 50.0. The smallest digit shown is the 1 digit, so I subtract half of one (49.5) and add half of one (50.5). Sometimes one isn’t the smallest digit. If I have a value of 34.5, I add another digit to the end to make 34.50, and the smallest value is the 0.5, so we divide by 2 to get 0.25. So the limits are 34.75 and 34.25. Finally, sometimes the smallest value of measurement is given. If the smallest unit a scale can measure is 0.2 pounds, and you have a value of 80 pounds, you add and subtract half of 0.2 pounds and get 80.1 and 79.9. This can be a difficult concept two grasp, so here are two YouTube videos we found helpful.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content" title="Real limits" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=66</a> </p> </div> <p></p> <h5><strong>Frequency Distribution Graphs</strong></h5> <p>A frequency distribution is often best grasped conceptually though the use of graphs. These graphs are like the tables in that they represent the same data, but graphs show it in a different way. This can be done with bar graphs (discrete), histograms (continuous), or polygons (continuous). Here are two Khan Academy videos we found helpful.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content" title="Frequency distribution graphs" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=66</a> </p> </div> <p></p> <p>These graphs can come in a multitude of shapes, but here are just a few important descriptive words generally used in statistics:</p> <ul><li><strong>Symmetrical</strong>: When the shape of the distribution is, at least for the most part, mirrored on both sides if you were to view the mean as the flipping point.</li> <li><strong>Asymmetrical</strong>: When the shape of the distribution is not mirrored on both sides for whatever reason (usually because of skew).</li> <li><strong>Positively Skewed</strong>: This is when there is what looks like a tail of data trailing off to the right. I like to remember this is as the P in Positive having fallen on its back.</li> <li><strong>Negatively Skewed</strong>: This is when there is what looks like a tail of data trailing off to the left.</li> <li><strong>Unimodal</strong>: This literally means having a buildup of data around what looks to be one number, so one mode. Your typical bell curve is unimodal.</li> <li><strong>Bimodal</strong>: This is when there is data clustering around two different numbers or spots on the distribution, so having two modes. This can often look like camel humps.</li> <li><strong>Multimodal</strong>: When a distribution has two or more “humps” in the graph.</li> </ul> <p>Here’s a video which may be helpful in teaching you how to interpret data presented in a table and organizing data into a frequency distribution graph.</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/gdE46YSedvE/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy&quot;" title="Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content" title="Frequency tables and dot plots | Data and statistics | 6th grade | Khan Academy" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=66#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=66</a> </p> </div> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/statistics-frequency-distributions/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/statistics-frequency-distributions/">Math Support Center blog</a> at the University of Baltimore on on June 4, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-measures-of-central-tendency" title="Measures of Central Tendency"><div class="chapter-title-wrap"><h3 class="chapter-number">12</h3><h2 class="chapter-title">Measures of Central Tendency</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p><strong>Central tendency</strong> is a statistical measure; a single score to define the center of a distribution. It is also used to find the single score that is most typical or best represents the entire group. No single measure is always best for both purposes. There are three main types:</p> <p>&nbsp;</p> <ul><li><strong>Mean</strong>: sum of all scores divided by the number of scores in the data, also referred to as the average.</li> <li><strong>Median</strong>: the midpoint of the scores in a distribution when they are listen in order from smallest to largest. It divides the scores into two groups of equal size. With an even number of scores, you compute the average of the two middle scores.</li> <li><strong>Mode</strong>: the most frequently occurring number(s) in a data set.</li> </ul> <p>&nbsp;</p> <p>Here is a variety of videos to help you understand the concepts of these measures, finding the median using a histogram, and finding a missing value given the mean.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content" title="Central tendency measures" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=180</a> </p> </div> <p></p> <p>There are properties that will change in the mean depending on how scores are modified. When every score has a number added to it, the mean also gets the same number added to it (ex. if the mean is 8 and every score within the distribution as a 3 added to is, the new mean will be 11). When all the numbers are multiplied by a something, the mean is also multiplied by that something (ex. if the mean is 2 and all the numbers in the distribution were multiplied by 3, the new mean would be 6). When only a few scores are greater or lower, the mean value follows with it but it needs to be recalculated.</p> <p>The following videos detail what happens to the mean and median when increasing the highest value, the impact that removing the lowest value has on the mean and median, and estimating means and medians when given a graph.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content" title="Estimating data" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=180</a> </p> </div> <p></p> <p>&nbsp;</p> <h5><strong>Computing Central Tendency Measures</strong></h5> <p><span style="text-decoration: underline">Computing the mean:</span> The mean is pretty straightforward. One should add up all the values and divide that sum by the number of values. For example, if I have a data set of 5 (2, 6, 3, 2, 2), I would add all the numbers up (15) and divide that by 5 to get a mean of 3.</p> <p><span style="text-decoration: underline">Computing the median:</span> Calculating the median involves lining up all the scores from smallest to biggest. The middle one is the median. If there are an even amount of numbers, the average of the 2 middle numbers is considered the median. Remember that the purpose of a median is to divide the data in half. When working with a discrete frequency distribution, please refer to the first video below. When working with a grouped or continuous frequency distribution, there are extra steps. Please refer to the second video included below.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content" title="Computing the median" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=180</a> </p> </div> <p></p> <p><span style="text-decoration: underline">Computing the mode:</span> Mode is the most frequent number which comes up. Whatever shows up the most in your frequency table, that’s the mode. There may be more than one mode, so keep this in mind.</p> <p><span style="text-decoration: underline">Computing weighted means:</span> Overall mean is the sum of all the scores of group one plus the sum of all the scores in group two. All of this is then divided by n1+n2. In some cases you’ll get something like “group 1 consists of 5 people with an average score of 10 and group 2 consists of 8 people with an average score of 7.” In this case you would multiply 5 and 10 and add that to 8 times 7. You would then divide that number by the total number of people to get the weighted mean. Here is a helpful video:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/kA4Jvl3_G7U/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;How to calculate Weighted Mean and Weighted Average&quot;" title="How to calculate Weighted Mean and Weighted Average" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content" title="How to calculate Weighted Mean and Weighted Average" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=180#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=180</a> </p> </div> <p>&nbsp;</p> <h5><strong>Central Tendency and How they Relate to Distribution Shape</strong></h5> <p>The shape of a distribution can help you determine which measure of central tendency is greatest.</p> <p>&nbsp;</p> <ul><li><strong>Normal</strong>: The mean, median, and mode are all in the same spot</li> <li><strong>Bimodal</strong>: The mean and median are together in the middle, while the two modes are on either side, represented by the two humps</li> <li><strong>Skewed</strong>: The mean is going to be closest to the tail, median is between mean and mode (closer to the tail than in a normal distribution, but not as close as the mean), and the mode is found by the hump. This means that a positively skewed distribution will have a mean larger than its median and a median larger than its mode, while a negatively skewed distribution will have a mode larger than its median and a median lager than its mean.</li> </ul> <p>&nbsp;</p> <h5><strong>When to Use Each Measure</strong></h5> <p>In regards to the mean, no situation precludes it, but it shouldn’t be used when there are extreme scores, skewed distributions, undetermined values, open-ended distributions, ordinal scales, or nominal scales. With the median, it’s appropriate to use when there are extreme scores, skewed distributions, undetermined values, open-ended distributions, or ordinal scales. It is not to be used when there is a nominal scale. The mode is good to use with nominal scales, discrete variables, and in describing shape, but it shouldn’t be used with interval or ratio data, except to accompany the mean or median.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/measures-of-central-tendency/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/measures-of-central-tendency/">Math Support Center blog</a> at the University of Baltimore on June 4, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-basics-of-variability" title="Basics of Variability"><div class="chapter-title-wrap"><h3 class="chapter-number">13</h3><h2 class="chapter-title">Basics of Variability</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p><strong>Variability</strong> is often a difficult topic for newcomers to statistics to grasp. Essentially it is the spread of the scores in a frequency distribution. If you have a bell curve which is pretty flat, you would say that it has high variability. If you have a bell curve which is pointy, you would say that it has low variability. Variability is really a quantitative measure of the differences between scores and describes the degree to which the score are spread out or clustered together. The purpose of measuring variability is to be able to describe the distribution and measure how well an individual score represents the distribution.</p> <p>There are three main types of variability:</p> <ul><li><strong>Range</strong>: The distance between the lowest and the highest score in a distribution. Can be described as one number or represented by writing out the lowest and highest number together (ex. values 4-10). Calculated by subtracting the highest score from the lowest score. If you’re working with continuous variables, it’s the upper real limit for Xmax minus the lower real limit for Xmin.</li> <li><strong>Standard deviation</strong>: The average distance between the scores in a data set and the mean. This value is also the square root of the variance. Here’s a video to help you conceptualize this.</li> </ul> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/-iCMQoCJFBY/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Visually assessing standard deviation | AP Statistics | Khan Academy&quot;" title="Visually assessing standard deviation | AP Statistics | Khan Academy" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=188#pb-interactive-content" title="Visually assessing standard deviation | AP Statistics | Khan Academy" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=188#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=188</a> </p> </div> <ul><li><strong>Variance</strong>: Measures the average squared distance from the mean. This number is good for some calculations, but generally we want the standard deviation to determine how spread out a distribution is. Calculated with this equation:</li> </ul> <p class="ql-center-displayed-equation" style="line-height: 47px;"><span class="ql-right-eqno">&nbsp; </span> <span class="ql-left-eqno">&nbsp; </span><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2ba7bd692ca0e359be1345f84dcf9457_l3.svg" height="47" width="151" class="ql-img-displayed-equation quicklatex-auto-format" alt="\[ {\sigma^2}=\Sigma \frac{(X - \mu)^2}{N} \]" title="Rendered by QuickLaTeX.com" /></p> <p>&nbsp;</p> <h5><strong>Sample Variance and Degrees of Freedom</strong></h5> <p><strong>Sample variance</strong> is just the variance that needs to be calculated as a substitute sometimes when the population variance is unavailable (this will be talked about more later). See the first slide below for a video explaining this more. The degrees of freedom determine the number of scores in the sample that are independent and free to vary. This is important because in a sample, all the data points are allowed to be whatever score, but the last score needs to be such that the mean we calculated stays that mean. So if we have 3 scores in a set, and we know the mean is 5, the first two scores can be any numbers, in this case it’s 9 and 2. Because we calculated that the mean is 5, the last number has to be 4 to add up to 15 and divide by 3 to get 5. The last score is dependent on the other scores. Al this means practically is that the equation of sample variance differs from population variance in that the denominator is n-1. So n-1 literally means that all the scores except the “last” one are allowed to be whatever they want. See the second slide below for a video with some more explanation.</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=188#pb-interactive-content" title="Sample variance" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=188#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=188</a> </p> </div> <p></p> <h5><strong>Biased vs. Unbiased</strong></h5> <p>An <strong>unbiased estimate</strong> of a population parameter is when the average value of a statistic is equal to the parameter and the average value uses all possible samples of a particular size n. A <strong>biased estimate</strong> of a population parameter systematically overestimates or underestimates the population parameter. In this case, we know that sample variability tends to underestimate the variability of the corresponding population. We correct this by using degrees of freedom and we account for this when we use standard error.</p> <h5><strong>Inferring Patterns in Data</strong></h5> <p>Variability in the data influences how easy it is to see patterns. High variability obscures patterns in comparing two sets of data that would be visible in low variability samples. It can’t tell you if there’s a significant difference between groups, though. You have to run an analysis of variance or t-test to determine that.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/statistics-variability/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/04/statistics-variability/">Math Support Center blog</a> at the University of Baltimore on June 4, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-z-score-basics" title="Z-score Basics"><div class="chapter-title-wrap"><h3 class="chapter-number">14</h3><h2 class="chapter-title">Z-score Basics</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Standardized Distributions</strong></h5> <p>Sometimes when working with data sets, we want to have the scores on the distribution <strong>standardized</strong>. Essentially, this means that we convert scores from a distribution so that they fit into a model that can be used to compare and contrast distributions from different works. For example, if you have a distribution of scores that show the temperature each day over the summer in Boston, it may be recorded in Fahrenheit. Someone else in Paris may have recorded their summer temperatures as well but in Celcius. If we wanted to compare these distributions of scores based on their descriptive statistics, we may want to convert them to the same standardized unit of measurement.</p> <p>Standardized distributions have one single unit of measurement. Raw scores are transformed into this standardized unit of measurement to be compared to one another. Ultimately, they should look just like the original distribution, the only difference is that the scores have been placed on a different unit of measurement.</p> <h5><strong>Z-Scores</strong></h5> <p>Z-scores are the most common standardized score. They are used to describe score location in a distribution (descriptive statistics) and because we can compare scores across distributions, we can look at the relative standing of a score in a sample or a sample in a population (inferential statistics). The equation is</p> <p class="ql-center-displayed-equation" style="line-height: 45px;"><span class="ql-right-eqno">&nbsp; </span> <span class="ql-left-eqno">&nbsp; </span><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-92f2619360b40d1f4d5bb312cbee9981_l3.svg" height="45" width="119" class="ql-img-displayed-equation quicklatex-auto-format" alt="\[ Z=\frac{(X - \mu)}{\sigma} \]" title="Rendered by QuickLaTeX.com" /></p> <p>In this equation, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b2bc1264b576ae2e5edd1c7d2950eec4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Z" title="Rendered by QuickLaTeX.com" height="15" width="15" style="vertical-align: 0px;" /> is the z-score, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> is the variable you want to convert, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" /> is the mean of the original distribution, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-855d3e797a310a631af06f7b8257a07f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\sigma" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is the standard deviation of the original distribution.</p> <p>So, what are the characteristics of a z-score/distributions? In a z-score the mean is placed at 0 and each number below or above is a representation of how many standard deviations away a score is. A 1 represents one standard deviation above the mean and -1 represents one standard deviation below the mean. For example, if I know that my original mean is 10 and my original standard deviation is 2, I know that a z-score of 1 would mean 12 and a z-score of -1 would mean 8. For the purposes of your class, all z-score distributions are normal distributions. Z-scores aren’t used on other kinds of distributions because the charts and proportions are designed to describe normal distributions.</p> <p><img class="alignnone wp-image-359 size-large" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-1024x445.png" alt="Example of a standard normal distribution (z-score)" width="1024" height="445" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-1024x445.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-300x130.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-768x334.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-1536x668.png 1536w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-2048x890.png 2048w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-65x28.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-225x98.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/SND-350x152.png 350w" /></p> <p>What’s nice about z-scores is that they can also be used to find proportions, which will be talked about even more in the next post. This requires the <strong>Unit Normal Table</strong> which is a table designed to help one translate z-scores into proportions of the population on either side of the score or compared to the mean score. There are 4 columns: one with the z-scores, one with the proportion of the population in the body of the distribution with the z-score as the starting point, one with the proportion of the population in the tail of the distribution with the z-score as the starting point, and one with the proportion of the population between the z-score and the mean. It can usually be found in the back of any statistics textbook. If I have a z-score of -1.5 and I wanted to know the proportion of the scores which are lower than -1.5, I could go to the back of my textbook, find -1.50 in the margins, and get the proportion .06681, meaning that 6.6881% of the data is less than a z-score of -1.5. The numbers in this table show the reader the proportion of everything to the left of the z-score in question. If I wanted to know everything to the right, the proportion would be 1 – 0.06681, which is .93319 or 93.319% of the data.</p> <p><img class="alignnone wp-image-196 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example.png" alt="Example of Unit Normal Table" width="472" height="478" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example.png 472w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example-296x300.png 296w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example-65x66.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example-225x228.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/zscore-example-350x354.png 350w" /></p> <p>Z-scores can be used in inferential statistics. Interpretation of research results depends on determining if the (treated) sample is noticeably different from the population. The distribution of the general population would describe the average untreated person, so this allows researchers to compare that distribution to their treated sample. Z-scores are one technique for defining “noticeably different”, but it more like borders on inferential statistics, because we can’t actually tell if there’s a statistical difference without running the right test. Z-tests and their purpose in inferential statistics will be discussed in other posts.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-z-score-basics/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-z-score-basics/">Math Support Center blog</a> at the University of Baltimore on June 6, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-probability-and-statistics" title="Probability and Sampling"><div class="chapter-title-wrap"><h3 class="chapter-number">15</h3><h2 class="chapter-title">Probability and Sampling</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Probabilities</strong></h5> <p>A <strong>probability</strong> is a fraction or a <strong>proportion</strong> of all the possible outcomes. So it’s the number of classified outcomes classified as X divided by the total number of possible outcomes (N). It’s generally reported as a decimal, but it can also be reported as a fraction or a percentage.</p> <p>What is the role of probability in populations, samples, and inferential statistics? As we discussed before, because it’s usually impossible for researchers to draw data from the entirety of a population, they draw samples. The size of the sample affects how comparable the sample population is to the general population. Probability is used to predict what kind of samples are likely to be obtained from a population. Thus, probability establishes a connection between samples and populations; we know from looking at the population how likely it is for a specific sample to be drawn. We also use proportions that exist within samples to infer the probabilities that exist within a population. Inferential statistics rely on this connection when they use sample data as the basis for making conclusions about populations.</p> <p>&nbsp;</p> <h5><strong>Random Sampling</strong></h5> <p><strong>Random sampling</strong> is a process by which researchers pool together a sample in such a way that it is most likely to be representative of the population as a whole. While this will never be entirely the case – since (1) there is always a chance that a sample will be entirely different from the population and (2) samples inherently always have less variability than the population – it’s good practice to follow certain random sampling requirements:</p> <p>&nbsp;</p> <ul><li><strong>Independent random sampling</strong>: Probabilities must stay constant from one selection to the next if more than one individual is selected. In other words, selecting one individual shouldn’t affect the probability of another person being selected; their chances are independent of one another.</li> <li><strong>Random sampling with replacement</strong>: Each individual in the population has an equal chance of being selected, meaning that to keep the denominator of the probability equation (X/N) the same for each draw, the first draw needs to be returned to the population pool.</li> </ul> <p>&nbsp;</p> <h5><strong>Proportions in Frequency Distributions</strong></h5> <p>Proportions can be represented in frequency distributions, and this was briefly touched on in another blog post about z-scores. A selected section of a frequency distribution represents a proportion of the population; the selected area under the curve represents a proportion of the population. Because normal distributions are symmetrical and the same shape, just stretched out differently, we can use z-scores to standardize the scores and use a unit normal table to determine what proportion of the population is on either side of that score. The area under the curve literally becomes a proportion. We also know that in a normal distribution, more extreme scores are less likely to occur, since most scores will build up near the mean. The proportions of ranges of scores closer to the mean are greater than the proportions of scores in the ranges near the tails of the distribution.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-probability-and-sampling/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-probability-and-sampling/">Math Support Center blog</a> at the University of Baltimore on on June 6, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-distribution-of-sample-means" title="Distribution of Sample Means"><div class="chapter-title-wrap"><h3 class="chapter-number">16</h3><h2 class="chapter-title">Distribution of Sample Means</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>Up until this point, as far as distributions go, it’s been about being able to find individual scores on a distribution. Moving into hypothesis testing, we’re going to switch from working with very concrete distributions with scores to hypothetical distributions of sample means. In other words, we’re still working with normal distributions, but the points that make up the distribution will no longer be individual scores, but all possible sample means which can be drawn from a population with a given <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a13b200485b8e78d6d1986a266856286_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="N" title="Rendered by QuickLaTeX.com" height="15" width="19" style="vertical-align: 0px;" /> or number of scores in them.</p> <p>We use these kinds of distributions because with inferential statistics we’re going to want to find the probability of acquiring a certain sample mean to see if it’s common or very rare and therefore perhaps significantly different from another mean.</p> <p>There are some concepts you will have to keep in mind for this shift including sampling error, the central limit theorem, and standard error.</p> <p>&nbsp;</p> <h5><strong>Sampling Error</strong></h5> <p><strong>Sampling error</strong> is the natural discrepancy, or amount of error, between a sample statistic and its corresponding population parameter. So each sample is different because you’re likely drawing separate samples from the same population; you hope to get a diverse group, but you don’t really get to pick what you get, and you’re not likely to get the exact same group twice. Samples can’t be entirely representative of a population and always have less variability than the population. So even though we take a sample in order to run statistics that can be generalized back to the population, there is always going to be some error.</p> <p>&nbsp;</p> <h5><strong>Central Limit Theorem</strong></h5> <p>The <strong>central limit theorem</strong> is a set of rules that dictate how a distribution of sample means will look given certain criteria. For any population with mean <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" /> and standard deviation <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-855d3e797a310a631af06f7b8257a07f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\sigma" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" />, the distribution of sample means for sample size <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> will have a mean of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" /> and a standard error of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4fe6f94921ba3213da781dd58298c0e4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\sigma\sqrt{n}" title="Rendered by QuickLaTeX.com" height="21" width="43" style="vertical-align: -5px;" /> (which we will talk about more in a minute) and will approach a normal distribution as n approaches infinity. So this practically means that the distribution of sample means is almost perfectly normal in either of two conditions: the population from which the samples are selected is a normal distribution or the number of scores in each sample (also known as sample size) is relatively large (around 30 or more). The central limit theorem also mentions that as n increases, variability decreases. In other words, the greater the sample n, the pointer your distribution.</p> <p>These videos will help your understanding:</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=202#pb-interactive-content" title="Central limit theorem" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=202#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=202</a> </p> </div> <p></p> <p>&nbsp;</p> <h5><strong>Standard Error</strong></h5> <p>The <strong>standard error</strong> provides a measure of how much distance is expected on average between a sample mean <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1e7499c3ce9b5856bf3730e3e654eec8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(M)" title="Rendered by QuickLaTeX.com" height="22" width="36" style="vertical-align: -6px;" /> and the population mean <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0698b2f275d5257ab83023c604dbfafd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(\mu)" title="Rendered by QuickLaTeX.com" height="22" width="26" style="vertical-align: -6px;" />. Essentially, it’s the standard deviation of sample means from the mean of sample means. It specifies precisely how well a sample mean estimates its population mean. The magnitude of the standard error is determined by two factors: the size of the sample and the standard deviation of the population from which the sample is selected. We can see by the equation <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4762ba1ed6032ce77209b136e59fe9e6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="M=n" title="Rendered by QuickLaTeX.com" height="15" width="63" style="vertical-align: 0px;" /> that the greater <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is, the greater its square root and the more that the standard deviation will have to be divided by, making the standard error smaller. But if the population standard deviation is already small, that will make the standard error small too.</p> <p>&nbsp;</p> <h5><strong>Inferential Statistics</strong></h5> <p><strong>Inferential statistics</strong> are methods that use sample data as a basis for drawing general conclusions about populations, and as mentioned before, are the reason why we’re learning about distributions of sample means. It’s important to know how much a sample differs from the population because we can’t draw many conclusions about the population from a sample that is very different. The error is important to keep in mind too when creating a control group. If a study with treated and untreated patients is to be generalized to the general population, you don’t just want to know if there was a significant difference between the two groups, but you want to make sure that the untreated group represents the general population.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-distribution-of-sample-means/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/06/statistics-distribution-of-sample-means/">Math Support Center blog</a> at the University of Baltimore on June 6, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-introduction-to-hypothesis-testing" title="Introduction to Hypothesis Testing"><div class="chapter-title-wrap"><h3 class="chapter-number">17</h3><h2 class="chapter-title">Introduction to Hypothesis Testing</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>What is Hypothesis Testing?</strong></h5> <p><strong>Hypothesis testing</strong> is a big part of what we would actually consider testing for inferential statistics. It’s a procedure and set of rules that allow us to move from descriptive statistics to make inferences about a population based on sample data. It is a statistical method that uses sample data to evaluate a hypothesis about a population.</p> <p>This type of test is usually used within the context of research. If we expect to see a difference between a treated and untreated group (in some cases the untreated group is the parameters we know about the population), we expect there to be a difference in the means between the two groups, but that the standard deviation remains the same, as if each individual score has had a value added or subtracted from it.</p> <p>&nbsp;</p> <h5><strong>Steps of Hypothesis Testing</strong></h5> <p>The following steps will be tailored to fit the first kind of hypothesis testing we will learn first: single-sample z-tests. There are many other kinds of tests, so keep this in mind.</p> <ul><li>Step 1: State the Hypothesis <ul><li><strong>Null Hypothesis (H0):</strong> states that in the general population there is no change, no difference, or no relationship, or in the context of an experiment, it predicts that the independent variable has no effect on the dependent variable.</li> <li><strong>Alternative Hypothesis (H1):</strong> states that there is a change, a difference, or a relationship for the general population, or in the context of an experiment, it predicts that the independent variable has an effect on the dependent variable.</li> </ul> </li> <li>Step 2: Set the Criteria for a Decision <ul><li><strong>Alpha Level:</strong> Also known as <strong>Level of Significance</strong>, is a probability value that is used to define the concept of “very unlikely” in a hypothesis test. We chose an alpha level in order to separate the most unlikely sample means from the most likely sample means. Ex. <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c28bf28a51f354ad0a2acef86f0dddf6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\alpha = 0.05," title="Rendered by QuickLaTeX.com" height="19" width="85" style="vertical-align: -4px;" /> that means that we’re separating the most unlikely 5% from the most likely 95%. The largest permissible value of alpha is 0.05, although some researchers like to use more conservative alpha levels to reduce the risk that a false report is published. But you don’t want the value to be too conservative because otherwise, you might run the risk of a Type II error, in which case the hypothesis test demands more evidence from the research results, in which case you might be throwing out evidence that a treatment will work.</li> <li><strong>Critical Region:</strong> Composed of the extreme sample values that are very unlikely to be obtained if the null hypothesis is true. Determined by alpha level. If sample data fall in the critical region, the null hypothesis is rejected, because it’s very unlikely they’ve fallen there by chance.</li> </ul> </li> <li>Step 3: Collect Data and Compute Sample Statistics <ul><li>After collecting the data, we find the sample mean. Now we can compare the sample mean with the null hypothesis by computing a z-score that describes where the sample mean is located relative to the hypothesized population mean. We use the z-score formula.</li> </ul> </li> <li>Step 4: Make a Decision <ul><li>We decided previously what the two z-score boundaries are for a critical score. If the z-score we get after plugging the numbers in the aforementioned equation is outside of that critical region, we reject the null hypothesis. Otherwise, we would say that we failed to reject the null hypothesis.</li> </ul> </li> </ul> <p>&nbsp;</p> <h5><strong>Regions of the Distribution</strong></h5> <p>Because we’re making judgments based on probability and proportion, our normal distributions and certain regions within them come into play.</p> <p>As mentioned before, <strong>Alpha Level</strong>, also known as <strong>Level of Significance</strong>, is a probability value that is used to define the concept of “very unlikely” in a hypothesis test. We chose an alpha level in order to separate the most unlikely sample means from the most likely sample means. Ex. <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c28bf28a51f354ad0a2acef86f0dddf6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\alpha = 0.05," title="Rendered by QuickLaTeX.com" height="19" width="85" style="vertical-align: -4px;" /> that means that we’re separating the most unlikely 5% from the most likely 95%</p> <p>The <strong>Critical Region</strong> is composed of the extreme sample values that are very unlikely to be obtained if the null hypothesis is true. Determined by alpha level. If sample data fall in the critical region, the null hypothesis is rejected, because it’s very unlikely they’ve fallen there by chance.</p> <p>These regions come into play when talking about different errors.</p> <p>A <strong>Type I Error</strong> occurs when a researcher rejects a null hypothesis that is actually true; the researcher concludes that a treatment has an effect when it actually doesn’t. This happens when a researcher unknowingly obtains an extreme, non-representative sample. This goes back to alpha level: it’s the probability that the test will lead to a Type I error if the null hypothesis is true.</p> <p>A <strong>Type II Error</strong> occurs when a researcher fails to reject the null hypothesis that is really false; this means that the hypothesis test has failed to detect a real treatment effect. This happens when the sample mean is not in the critical region even though the treatment has had an effect on the sample. Usually, this means that the effect of the treatment was small, but it’s still there. The probability of a Type II error is represented by beta <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7c4679ddaa85d1b9df1cc4e1c777d16e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(\beta)" title="Rendered by QuickLaTeX.com" height="22" width="26" style="vertical-align: -6px;" /></p> <p><img class="alignnone size-full wp-image-436" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a.png" alt="An example of hypothesis testing" width="420" height="370" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a.png 420w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a-300x264.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a-65x57.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a-225x198.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch17a-350x308.png 350w" /></p> <p><img class="alignnone size-full wp-image-437" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart.png" alt="A chart of hypothesis testing results" width="570" height="473" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart.png 570w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart-300x249.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart-65x54.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart-225x187.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/hypochart-350x290.png 350w" /></p> <p>A result is said to be <strong>significant</strong> or <strong>statistically significant</strong> if it is very unlikely to occur when the null hypothesis is true. That is, the result is sufficient to reject the null hypothesis. For instance, two means can be significantly different from one another.</p> <p>&nbsp;</p> <h5><strong>Factors that Influence and Assumptions of Hypothesis Testing</strong></h5> <p>Assumptions of Hypothesis Testing:</p> <ul><li><strong>Random sampling:</strong> it is assumed that the participants used in the study were selected randomly so that we can confidently generalize our findings from the sample to the population.</li> <li><strong>Independent observation:</strong> two observations are independent if there is no consistent, predictable relationship between the first observation and the second.<br /> The value of σ is unchanged by the treatment; if the population standard deviation is unknown, we assume that the standard deviation for the unknown population (after treatment) is the same as it was for the population before treatment. There are ways of checking to see if this is true in SPSS or Excel.</li> <li><strong>Normal sampling distribution:</strong> in order to use the unit normal table to identify the critical region, we need the distribution of sample means to be normal (which means we need the population to be distributed normally and/or each sample size needs to be 30 or greater based on what we know about the central limit theorem).</li> </ul> <p>Factors that influence hypothesis testing:</p> <ul><li>The variability of the scores, which is measured by either the standard deviation or the variance. The variability influences the size of the standard error in the denominator of the z-score.</li> <li>The number of scores in the sample. This value also influences the size of the standard error in the denominator.</li> </ul> <p><strong>Test statistic:</strong> indicates that the sample data are converted into a single, specific statistic that is used to test the hypothesis (in this case, the z-score statistic).</p> <p>&nbsp;</p> <h5><strong>Directional Hypotheses and Tailed Tests</strong></h5> <p>In a <strong>directional hypothesis test</strong>, also known as a one-tailed test, the statistical hypotheses specify with an increase or decrease in the population mean. That is, they make a statement about the direction of the effect.</p> <p>The Hypotheses for a Directional Test:</p> <ul><li>H0: The test scores are not increased/decreased (the treatment doesn’t work)</li> <li>H1: The test scores are increased/decreased (the treatment works as predicted)</li> </ul> <p>Because we’re only worried about scores that are either greater or less than the scores predicted by the null hypothesis, we only worry about what’s going on in one tail meaning that the critical region only exists within one tail. This means that all of the alpha is contained in one tail rather than split up into both (so the whole 5% is located in the tail we care about, rather than 2.5% in each tail). So before, we cared about what’s going on at the 0.025 mark of the unit normal table to look at both tails, but now we care about 0.05 because we’re only looking at one tail.</p> <p>A one-tailed test allows you to reject the null hypothesis when the difference between the sample and the population is relatively small, as long as that difference is in the direction that you predicted. A two-tailed test, on the other hand, requires a relatively large difference independent of direction. In practice, researchers hypothesize using a one-tailed method but base their findings off of whether the results fall into the critical region of a two-tailed method. For the purposes of this class, make sure to calculate your results using the test that is specified in the problem.</p> <p>&nbsp;</p> <h5><strong>Effect Size</strong></h5> <p>A measure of <strong>effect size</strong> is intended to provide a measurement of the absolute magnitude of a treatment effect, independent of the size of the sample(s) being used. Usually done with Cohen’s d. If you imagine the two distributions, they’re layered over one another. The more they overlap, the smaller the effect size (the means of the two distributions are close). The more they are spread apart, the greater the effect size (the means of the two distributions are farther apart).</p> <p>&nbsp;</p> <h5><strong>Statistical Power</strong></h5> <p>The <strong>power</strong> of a statistical test is the probability that the test will correctly reject a false null hypothesis. It’s usually what we’re hoping to get when we run an experiment. It’s displayed in the table posted above. Power and effect size are connected. So, we know that the greater the distance between the means, the greater the effect size. If the two distributions overlapped very little, there would be a greater chance of selecting a sample that leads to rejecting the null hypothesis.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-introduction-to-hypothesis-testing/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-introduction-to-hypothesis-testing/">Math Support Center blog</a> at the University of Baltimore on June 11, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-introduction-to-the-t-statistic" title="Introduction to the t-statistic"><div class="chapter-title-wrap"><h3 class="chapter-number">18</h3><h2 class="chapter-title">Introduction to the t-statistic</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Z-tests vs. t-tests</strong></h5> <p><strong>Z-tests</strong> compare the means between a population and a sample and require information that is usually unavailable about populations, namely the variance/standard deviation. <strong>Single sample t-tests</strong> compare the population mean to a sample mean, but only require one variance/standard deviation, and that’s from the sample. This is where <strong>estimated standard error</strong> comes in. It’s used as an estimate of the real standard error, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8c304e35d37377f01ffe5ca96a098317_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\sigmaM," title="Rendered by QuickLaTeX.com" height="7" width="4" style="vertical-align: -4px;" /> when the value of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-855d3e797a310a631af06f7b8257a07f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\sigma" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is unknown. It is computed using the sample variance or sample standard deviation and provides an estimate of the standard distance between a sample mean, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0d7710e737d506e11fefb5325c7bd4c7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="M" title="Rendered by QuickLaTeX.com" height="15" width="22" style="vertical-align: 0px;" />, and the population mean, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" />, (or rather, the mean of sample means). It’s an “error” because it’s the distance between what the sample mean is and what it would ideally be since we would rather have the population standard deviation. The formula for estimated standard error is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fc59f6ef6d9477cb8e4d1f4f4e6e49ed_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{s}{\sqrt{n}}" title="Rendered by QuickLaTeX.com" height="27" width="24" style="vertical-align: -12px;" />.</p> <p>The formula for the t-test itself is:</p> <p class="ql-center-displayed-equation" style="line-height: 59px;"><span class="ql-right-eqno">&nbsp; </span> <span class="ql-left-eqno">&nbsp; </span><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f3cec57eaf49d5465e93d24c28b50357_l3.svg" height="59" width="96" class="ql-img-displayed-equation quicklatex-auto-format" alt="\[ t = \frac{{\overline{X}} - {\mu}}{{\frac{s}{\sqrt{n}}}} \]" title="Rendered by QuickLaTeX.com" /></p> <p>with the bottom portion referring to the estimated standard error. You may see this written as <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d35b24fd8e33119991f68a04094aea4e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="sM" title="Rendered by QuickLaTeX.com" height="15" width="31" style="vertical-align: 0px;" /> instead.</p> <p>&nbsp;</p> <h5><strong>Degrees of Freedom</strong></h5> <p><strong>Degrees of freedom</strong> describe the number of scores in a sample that are independent and free to vary. Because the sample mean places a restriction on the value of one score in the sample, there are degrees of freedom for a sample with n scores. For a single sample t-test, the degrees of freedom are calculated using the following formula: <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-762f8bf704fd3e3e1ffeacacbea957da_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df = n-1" title="Rendered by QuickLaTeX.com" height="19" width="95" style="vertical-align: -4px;" />.</p> <p>&nbsp;</p> <h5><strong>Shape of a t-distribution</strong></h5> <p>One general rule of t-distribution is that it’s always slightly flatter than it’s corresponding normal distribution. This is because t-statistics are always working from a sample size, which is relatively small, rather than a population, which are generally large.</p> <p>There are some factors which also influence the shape of each individual t-distribution:</p> <ul><li><span style="text-decoration: underline">The degrees of freedom:</span> The greater n is, and subsequently df, the more like a normal distribution the t-distribution will begin to look (this also follows the 30 rule)</li> <li><span style="text-decoration: underline">The sample variance:</span> The bottom half of the equation deals with the estimated standard error, which changes when the standard deviation changes. Because the ESE is dependent on the sample variance, each sample can create a different ESE</li> </ul> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with a Single Sample t-test</strong></h5> <p>The written null and alternative hypotheses for a single sample t-test are as follows:</p> <p>H0 : <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" /> <em>= population mean</em></p> <p>H1 : <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404da1b818d307897c96a781d5ee7d1e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\mu" title="Rendered by QuickLaTeX.com" height="14" width="13" style="vertical-align: -5px;" /> \neq population mean</p> <p>The stops for a single sample t-test are as follows:</p> <ol><li>State the hypotheses and select an alpha level</li> <li>Locate the critical region</li> <li>Calculate the test statistic</li> <li>Make a decision regarding the null hypothesis</li> </ol> <p>The following are some assumptions one makes when doing a single sample t-test:</p> <ul><li>The values in the sample must consist of independent observations.</li> <li>The population that is sampled must be normal.</li> </ul> <p>&nbsp;</p> <h5><strong>Effect Size for Single Sample t-tests</strong></h5> <p>Effect size for a single sample t-test is calculated using Cohen’s d. The formula for this is the mean difference over the standard deviation, or <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9a87e952e55751ca5d492209496303a3_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{(M - \mu)}{s}" title="Rendered by QuickLaTeX.com" height="29" width="53" style="vertical-align: -7px;" />. Effect size is important because it’s a way of quantifying the difference between two groups, rather than just saying that there is a significant difference. Essentially, it’s also important to know how much of a difference there is, not just the likelihood that the group differences you’re seeing are a fluke because the null hypothesis is actually true. This can also be important for practical significance. If your treatment is statistically significant, but has a small effect size, is it worth using this treatment on clients?</p> <p>For Cohen’s d, 0.2 would be considered a small effect size, 0.5 is medium, and 0.8 is large. Some people will mix words together like “small to medium effect size” but some professors will want you to just pick a side.</p> <p>&nbsp;</p> <h5><strong>Confidence Intervals</strong></h5> <p><strong>Confidence intervals</strong> are a range of values which is likely to encompass the true value you’re looking for. More specifically, it’s a range we create using a sample that we can say with X% confidence that the population mean falls within that range. Confidence intervals are constructed at a confidence level, such as 95%, selected by the user. It means that if the same population is sampled on numerous occasions and interval estimates are made on each occasion, the resulting intervals would bracket the true population parameter in approximately 95% of the cases. Confidence intervals and any kind of interval estimation are used in the same situations that you would use hypothesis testing. There is an estimation procedure for every kind of hypothesis test.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-introduction-to-the-t-statistic/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-introduction-to-the-t-statistic/">Math Support Center blog</a> at the University of Baltimore on June 11, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-independent-t-tests" title="Independent t-tests"><div class="chapter-title-wrap"><h3 class="chapter-number">19</h3><h2 class="chapter-title">Independent t-tests</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>We have talked about single sample t-tests, which is a way of comparing the mean of a population with the mean of a sample to look for a difference. With two-sample t-tests, we are now trying to find a difference between two different sample means. More specifically, <strong>independent t-tests</strong> involve comparing the means of two samples which are distinctly different from one another in regards to the individuals within each sample. For example, a group of pet owners vs. a group of folks who don’t own pets. These two groups are completely independent of one another. This distinction will be important in a later post.</p> <p>A more technical explanation of the difference between a single sample and two-sample is that a single sample t-test revolves around drawing conclusions about a treated population based on a sample mean and an untreated population mean (no standard deviation). An independent sample t-tests are all about comparing the means of two samples (usually a control group/untreated group and a treated group) to draw inferences about how there might be differences between those two groups in the broader population</p> <p>There are some distinct advantages and disadvantages to this approach when compared to other approaches. To avoid confusion, we won’t describe the other approaches here but will just mark the advantages and disadvantages of this one here for your consideration:</p> <p>Advantages:</p> <ul><li>Gives the opportunity to conduct an experiment with very little contamination by extraneous factors.</li> <li>Lowers the chance of participants suffering from boredom after a long series of tests as well as skewing the results by becoming more accomplished through practice experience.</li> </ul> <p>Disadvantages:</p> <ul><li>Can be complex.</li> <li>Requires a large number of participants.</li> <li>Needs a new group for every treatment and manipulation.</li> <li>Confounding variables brought in by the individuals in the study can weaken results.</li> </ul> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with Independent t-tests</strong></h5> <p>The null and alternative hypotheses for this kind of test are as follows:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-77acf617d6113a98f0236c2eec9df2e2_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0: \mu1 - \mu2 = 0" title="Rendered by QuickLaTeX.com" height="20" width="158" style="vertical-align: -5px;" /> (no difference in the population means)</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e9fdb35114452d883324b858023a0142_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1: \mu1 - \mu2 \neq 0" title="Rendered by QuickLaTeX.com" height="20" width="158" style="vertical-align: -5px;" /> (there is a mean difference)</p> <p>Steps of calculating an independent samples t-test (from this point forward, if there is a larger formula you’re looking for, see our <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">formula guide glossary</a>):</p> <ol><li>Calculate the estimated standard error by calculating pooled variance and figuring out the degrees of freedom for each group.</li> <li>Subtract the two means from one another (we assume that the difference between the population means will be 0 given the null hypothesis) and then divide by the standard error.</li> <li>Determine the critical region based on your alpha level and whether you’re running a one or two-tailed test. Then decide whether your calculated t-test falls within the critical region or not.</li> <li>Make a decision about the null hypothesis based on this comparison.</li> </ol> <p>Assumptions of independent sample t-tests:</p> <ul><li>The observations within each sample must be independent.</li> <li>The two populations from which the samples are selected must be normal.</li> <li>To justify using the pooled variance, the two populations from which the samples are selected must have equal variances (homogeneity of variance); essentially the standard deviation of after treatment should be very similar to the standard deviation presented before treatment. This can be confirmed using SPSS or Excel. This can also be done using Hartley’s F-max test, which is described later on in this chapter.</li> </ul> <p>&nbsp;</p> <h5><strong>Estimated Standard Error and Pooled Variance</strong></h5> <p>To calculate the estimated standard error, you need to first calculate pooled variance, especially because not all treatment or non-treatment groups will have the same number of scores, and so you need to weight in both groups before coming to terms with the overall estimated standard error. Remember that the estimated standard error is how we calculate the standard error when there’s no population mean to go off of.</p> <p>In essence, the steps for calculating a t-test by hand are:</p> <ol><li>Find the sum of squares of each sample.</li> <li>Calculate the pooled variance given the sums of squares you just found and the degrees of freedom (<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fb332ed392ff5806add7efa26a07d18c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n-1" title="Rendered by QuickLaTeX.com" height="14" width="47" style="vertical-align: 0px;" /> for each sample).</li> <li>Calculate the estimated standard error using that pooled variance.</li> <li>Plug the estimated standard error into the t-test formula and solve for <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ecf078806dd38e5b014c0894cfaf74a7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="t" title="Rendered by QuickLaTeX.com" height="13" width="7" style="vertical-align: 0px;" />.</li> </ol> <p>&nbsp;</p> <h5><strong>Effect Size of Independent Samples t-test</strong></h5> <p>We use Cohen’s d to get effect size. For this particular test, it’s mean 1 minus mean 2 all divided by the square root of the pool variance calculated earlier. In this case, instead of comparing the effects of a sample to the population (asking, is this practically significant rather than just statistically significant?), we’re comparing the effects of two different samples.</p> <p>&nbsp;</p> <h5><strong>Hartley’s F-Max Test</strong></h5> <p><strong>Hartley’s F-max test</strong> is a statistical test to evaluate the homogeneity assumption. To compute, you need to compute the sample variance of each sample individually. Then, you need to make a fraction with the biggest variance on top and the smallest one on the bottom. Finally, compute. The F-max value computed for the sample data is compared with the critical value found in an F-max table. If the sample value is larger than the table value, then you can conclude that the homogeneity assumption is not valid.</p> <p>&nbsp;</p> <p>If you’re looking for more help on learning the concept of the independent samples t-test or how to calculate it, check out this series of videos (each one about five minutes long):</p> <p><code></code></p> <div class="textbox interactive-content"><span class="interactive-content__icon"></span> <p>An interactive or media element has been excluded from this version of the text. You can view it online here:<br /> <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=216#pb-interactive-content" title="Independent samples t-test" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=216#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=216</a> </p> </div> <p></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-independent-t-tests/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-independent-t-tests/">Math Support Center blog</a> at the University of Baltimore on June 11, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-repeated-measures-t-test" title="Repeated Measures t-test"><div class="chapter-title-wrap"><h3 class="chapter-number">20</h3><h2 class="chapter-title">Repeated Measures t-test</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>A <strong>repeated measures</strong> or <strong>paired samples</strong> design is all about minimizing confounding variables like participant characteristics by either using the same person in multiple levels of a factor or pairing participants up in each group based on similar characteristics or relationship and then having them take part in different treatments. <strong>Matched subjects</strong> is another word used to describe this kind of test and it is used specifically to refer to designs in which different people are matched up by their characteristics. Participants are often matched by age, gender, race, socioeconomic status, or other demographic features, but can also be matched up on other characteristics the researchers might consider possible confounds. Twin studies are a good example of this kind of design; one twin has to be matched up with the other – they can’t be matched to someone else’s twin.</p> <p>To reiterate the differences between a repeated measures t-test and the other kinds of tests you may have learned up to this point, a single sample t-test revolves around drawing conclusions about a treated population based on a sample mean and an untreated population mean (no standard deviation). An independent sample t-tests are all about comparing the means of two samples (usually a control group/untreated group and a treated group) to draw inferences about how there might be differences between those two groups in the broader population. Different, randomly assigned participants are used in each group. Related samples t-tests are like independent sample t-tests except they use the same person for multiple test groups or they match people based on their characteristics or relationships to cut down on extraneous variables which may interfere with the data.</p> <p>&nbsp;</p> <h5><strong>Mean Difference and Estimated Standard Error of the Mean Difference</strong></h5> <p>The <strong>mean difference</strong> is calculated by subtracting the two scores collected from each person (because there are two testing groups), adding all of those differences up, and then dividing that number by the number of scores. This is done because rather than just compare means between the two samples, like in an independent samples t-test, we have the opportunity to first calculate the difference between each individual to see how the treatment affected them.</p> <p>The <strong>estimated standard error of the mean difference</strong> is a measure of how much the mean difference might vary from one occasion to the next. This is different from independent measures because instead of pooling variance between two samples, you base your sum of squares on the difference between the two scores and then calculate the estimated standard error like you would a single sample t test.</p> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with Repeated Measures t-tests</strong></h5> <p>The null and alternative hypothesis are written as follows:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-bd84c132652f72d731d606ba16818d4b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0 : \muD = 0" title="Rendered by QuickLaTeX.com" height="15" width="74" style="vertical-align: 0px;" /> or that there is no difference between the two conditions</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3ed086808a579584410161505a2e4ea0_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1 : \mu1 \neq 0" title="Rendered by QuickLaTeX.com" height="20" width="109" style="vertical-align: -5px;" /> or that there is a significant difference between the two conditions</p> <p>Steps for calculating a repeated measures t-test (all formulas needed can be found in the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary</a>):</p> <ol><li>State the null and alternative hypothesis</li> <li>Locate the critical region (remember that the <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ea2ad6d7e9d827d315c0fc9d0f3a9d24_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df" title="Rendered by QuickLaTeX.com" height="19" width="19" style="vertical-align: -4px;" /> is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fb332ed392ff5806add7efa26a07d18c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n-1" title="Rendered by QuickLaTeX.com" height="14" width="47" style="vertical-align: 0px;" />)</li> <li>Calculate the t statistic using the t formula after calculating the estimated standard error of the mean difference.</li> <li>Make a decision.</li> </ol> <p>Once again, there are some advantages and disadvantages to using this approach.</p> <p>Advantages:</p> <ul><li>Fewer subjects needed</li> <li>Is well-suited for studying changes over time (developmental, learning, studying)</li> <li>Reduces or eliminates caused by individual differences within the participants by either linking participants up based on characteristics or by using the same person twice.</li> </ul> <p>Disadvantages:</p> <ul><li>Increases the likelihood that outside factors that change over time may be responsible for changes in the participants’ scores.</li> <li>Participation in the first treatment could affect scores in the second treatment (practice, fatigue, etc.).</li> </ul> <p>&nbsp;</p> <h5><strong>Effect Size for Repeated Measures t-tests</strong></h5> <p>Once again, Cohen’s d is the effect size measurement of choice. In this case, it’s the sample mean difference over the sample mean deviation (so whatever you found as the variance, square root that to get the sample mean deviation).</p> <p>&nbsp;</p> <h5><strong>Variability as a Measure of Consistency</strong></h5> <p>If a treatment consistently adds a few points to each individual’s score, then the set of difference scores are clustered together on a normal distribution curve with relatively small variability. In this situation, with small variability, it is easy to see the treatment effect and it is likely to be significant. High variability means that there’s no consistency with a treatment effect, meaning that it’s harder to see that there’s any difference between groups and it’s unlikely that a significant difference will be found.</p> <p>&nbsp;</p> <h5><strong>Degrees of Freedom</strong></h5> <p>Before, when we were working with independent t-tests, the degrees of freedom was <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fb332ed392ff5806add7efa26a07d18c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n-1" title="Rendered by QuickLaTeX.com" height="14" width="47" style="vertical-align: 0px;" /> for each sample, so in the end, it was <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b5485bca46835d6cbb29af033c1f1789_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n-2" title="Rendered by QuickLaTeX.com" height="14" width="48" style="vertical-align: 0px;" />. However, for a repeated measures t-test, we’re only needing degrees of freedom for the mean difference. Therefore, the total degrees of freedom is simply <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fb332ed392ff5806add7efa26a07d18c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n-1" title="Rendered by QuickLaTeX.com" height="14" width="47" style="vertical-align: 0px;" />.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-repeated-measures-t-test/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-repeated-measures-t-test/">Math Support Center blog</a> at the University of Baltimore on June 11, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-independent-one-way-anova" title="Independent One-Way ANOVA"><div class="chapter-title-wrap"><h3 class="chapter-number">21</h3><h2 class="chapter-title">Independent One-Way ANOVA</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>An <strong>ANOVA (ANalysis Of VAriance)</strong> is a test that is run either to compare multiple independent variables with two or more levels each, or one independent variable with more than 2 levels. You can technically also run an ANOVA in the same cases you would run a t-test and come up with the same results, but this isn’t common practice, as t-tests are easier to compute by hand.</p> <p>For the purposes of this post, a One-way ANOVA is a test which compares the means of multiple samples (more than 2) which are connected by the same independent variable. An example of this might be comparing the growth of plans who receive no water (Group 1) a little water (Group 2), a moderate amount of water (Group 3), and a lot of water (Group 4).</p> <p>A <strong>factor</strong> is another name for an independent variable. As mentioned earlier, ANOVAs can sometimes have more than one factor, but for now we’re only working with one, just like we have before. A <strong>level</strong> is a group within that independent variable. Using the example from before, the groups in which the plants are put in are the levels (no water, little water, some water, a lot of water) and the independent variable itself is just water amount.</p> <p>&nbsp;</p> <h5><strong>Experiment-wise Alpha</strong></h5> <p>A question you might be asking yourself is, why bother doing an ANOVA when I can just do multiple t-tests? This is because the risk of a Type I error that accumulates as you do more and more separate tests. Doing multiple t-tests would result in greater experiment-wise alpha and therefore experiment-wise error. In a lot of ways its better to just do one big ANOVA to look for differences and then decipher those differences later using a post-hoc, which will be discussed later.</p> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with One-way ANOVAs</strong></h5> <p>The null and alternative hypotheses for a one-way ANOVA are as follows (please keep in mind that 3 is not the maximum number of means that can be compared so write your hypotheses accordingly):</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-493f750951dc5e73d15430da80f0190e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0 : \mu1 = \mu2 = \mu3" title="Rendered by QuickLaTeX.com" height="20" width="173" style="vertical-align: -5px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a761d820e3c91b31ed0710046f1a4899_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1: \mu1 \neq \mu2 \neq \mu3" title="Rendered by QuickLaTeX.com" height="20" width="173" style="vertical-align: -5px;" /></p> <p>Essentially, the point is whether there will or won’t be a significant difference between the groups, or at least two of them.</p> <p>Steps for calculating a one-way ANOVA (please refer to the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary</a> for actual formulas):</p> <p><img class="alignnone size-full wp-image-446" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps.png" alt="A chart detailing the steps of ANOVA" width="927" height="468" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps.png 927w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps-300x151.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps-768x388.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps-65x33.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps-225x114.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-steps-350x177.png 350w" /></p> <h5><strong>Variability in One-Way ANOVAs</strong></h5> <p>There are multiple kinds of variability found within the calculation of an ANOVA. There is <strong>between-treatment variance</strong> and <strong>within-treatment variance</strong>. The between-treatment variance can be further broken down into <strong>systematic treatment effects</strong> and <strong>random, unsystematic factors</strong>. The within-treatment variance only accounts for random, unsystematic factors in this case.</p> <p><img class="alignnone size-full wp-image-447" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability.png" alt="A diagram of the different variance in ANOVA" width="845" height="480" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability.png 845w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability-300x170.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability-768x436.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability-65x37.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability-225x128.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ANOVA-variability-350x199.png 350w" /></p> <p>The purpose of calculating within-treatments variance is to determine how much of the between-treatment variance was due to random, unsystematic factors and how much was due to treatment effects.</p> <p>There is a conceptual meaning underlying the ANOVA formula. The numerator is meant to represent the differences between sample means and the denominator is meant to represent the differences between samples expected with no treatment effect. This is basically between-treatments variance (the general differences between treatment conditions) and within-treatment variance (the variability within each sample).</p> <p>Assumptions of a one-way ANOVA</p> <ul><li>The observations within each sample must be independent.</li> <li>The population from which the samples are selected must be normal.</li> <li>The populations from which the samples are selected must have equal variance (homogeneity of variance).</li> </ul> <p>&nbsp;</p> <h5><strong>Effect Size in One-Way ANOVAs</strong></h5> <p>Effect size is now calculated with something called partial eta squared. The formula for this is: <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4561b94da1458adc07d3def1df46ee39_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\eta2 = \frac{SSbetweentreatments}{SStotal}" title="Rendered by QuickLaTeX.com" height="26" width="213" style="vertical-align: -7px;" />, or the sum of squares of the between treatments over the sum of squares total.</p> <p>&nbsp;</p> <h5><strong>Post-Hoc Test</strong></h5> <p>A <strong>post-hoc test</strong> allows one to figure out which groups are significantly different from one another once a significant F-ratio has been established. This is better than just running individual t-tests because post hoc still reduce experiment-wise error. There are several options for conducting a post-hoc, but two more popular options are Tukey’s and Scheffe’s tests. Tukey’s test calculates a single value that determines the minimum difference between treatment means that is necessary for significance. Scheffe’s test uses an F-ratio to evaluate the significance of the difference between the two treatment conditions. Formulas for both of these tests are in the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary</a>.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-independent-one-way-anova/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/11/statistics-independent-one-way-anova/">Math Support Center blog</a> at the University of Baltimore on June 11, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-repeated-measures-anova" title="Repeated Measures ANOVA"><div class="chapter-title-wrap"><h3 class="chapter-number">22</h3><h2 class="chapter-title">Repeated Measures ANOVA</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>Just like when we talked about independent samples t-tests and repeated measures t-tests, ANOVAs can have the same distinction. Independent one-way ANOVAs use samples which are in no way related to each other; each sample is completely random, uses different individuals, and those individuals are not paired in any meaningful way. In a <strong>repeated measures one-way ANOVA</strong>, individuals can be in multiple treatment conditions, be paired with other individuals based on important characteristics, or simply matched based on a relationship to one another (twins, siblings, couples, etc.). What’s important to remember that in a repeated measures one-way ANOVA, we are still given the opportunity to work with multiple levels, not just two like with a t-test.</p> <p>Advantages:</p> <ul><li>Individual differences among participants do not influence outcomes or influence them very little because everyone is either paired up on important participant characteristics or they are the same person in multiple conditions.</li> <li>A smaller number of subjects needed to test all the treatments.</li> <li>Ability to assess an effect over time.</li> </ul> <p>Disadvantages:</p> <ul><li>Increases the likelihood that outside factors that change over time may be responsible for changes in the participants’ scores.</li> <li>Participation in the first treatment could affect scores in the second treatment (practice, fatigue, etc.).</li> </ul> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with Repeated Measures One-Way ANOVA</strong></h5> <p>The null and alternative hypotheses for a repeated measures ANOVA are as follows:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-493f750951dc5e73d15430da80f0190e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0 : \mu1 = \mu2 = \mu3" title="Rendered by QuickLaTeX.com" height="20" width="173" style="vertical-align: -5px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a761d820e3c91b31ed0710046f1a4899_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1: \mu1 \neq \mu2 \neq \mu3" title="Rendered by QuickLaTeX.com" height="20" width="173" style="vertical-align: -5px;" /></p> <p>Assumptions of repeated measures one-way ANOVAs are as follows:</p> <ul><li>The observations within each treatment condition must be independent.</li> <li>The population distribution within each treatment must be normal</li> <li>The variances of the population distribution for each treatment should be equivalent</li> </ul> <p>The steps to calculating a repeated measures one-way ANOVA are explained in this chart.</p> <p><img class="alignnone wp-image-454 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix.png" alt="Chart about repeated measures ANOVA" width="470" height="370" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix.png 470w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix-300x236.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix-65x51.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix-225x177.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22a-fix-350x276.png 350w" /></p> <p>See below for a useful video. Please remember that different disciplines use different versions of the same equations; don’t let this intimidate you. Just use what you have been given by your book or professor.</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/VPB3xrsFl4o/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Repeated-Measures ANOVA&quot;" title="Repeated-Measures ANOVA" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=231#pb-interactive-content" title="Repeated-Measures ANOVA" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=231#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=231</a> </p> </div> <p>There is a conceptual meaning underlying the process of calculating this. There are more sums of squares to consider because we’re doing our best to separate within differences from between differences but also distinguishing which within differences are due to individual differences between the subjects and what error can’t be accounted for by individual differences. So instead of basing an F-ratio on the balance of between treatment differences and any error that could ever take place, we reduce the error being used to calculate whether there’s a significant difference by getting rid of the kind of error we can measure with a repeated measures design: participant differences. See the graphic below for a visual representation of this concept.</p> <p><img class="alignnone wp-image-455 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone.jpg" alt="ANOVA concept map" width="506" height="402" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone.jpg 506w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone-300x238.jpg 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone-65x52.jpg 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone-225x179.jpg 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch22b-redone-350x278.jpg 350w" /></p> <h5><strong>Effect Size</strong></h5> <p>Effect size, in this case, is calculated once again using partial eta squared:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ff0de34e512c2fe725159a47ec156ecf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\eta2 = \frac{SSBetweenTreatments}{SStotal} - SSBetweenSubjects" title="Rendered by QuickLaTeX.com" height="26" width="444" style="vertical-align: -7px;" /></p> <p>Be careful not to accidentally plug in the wrong value, as these names all sound similar to one another. The numerator should be the between treatments sum of squares gotten in the first step of the calculation. The denominator is the total sum of squares minus the between-subjects sum of squares found in the second round of calculations.</p> <p>&nbsp;</p> <h5><strong>Post Hoc Tests</strong></h5> <p>Like we mentioned in the previous post, post hoc tests are tests run to determine which groups are significantly different from one another after determining through the ANOVA that there’s a significant difference somewhere. Is the significant difference between A and B, B and C, C and A, or all three? For repeated measures one-way ANOVAs, Tukey’s HSD and Scheffe can be used, just substitute SSerror and dferror in the formulas. These formulas can be found on the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary post</a>.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-repeated-measures-anova/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-repeated-measures-anova/">Math Support Center blog</a> at the University of Baltimore on June 18, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-two-factor-anovas" title="Two-Factor ANOVAs"><div class="chapter-title-wrap"><h3 class="chapter-number">23</h3><h2 class="chapter-title">Two-Factor ANOVAs</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>So far we’ve talked about tests which are used if there is one independent variable, either with two levels or more. This is not the limit of how much we can include in a single analysis. In a <strong>two-factor ANOVA</strong>, there is more than one independent variable and each of those variables can have two or more levels. Take this example into consideration:</p> <p style="padding-left: 40px;">A farmer wants to know the best combination of products to use to maximize her crop yield. She decides to test out three different fertilizer brands (A, B, and C) and two different kinds of seeds (Y and Z). Each product is paired once with another for a total of 6 conditions: AY, BY, CY, AZ, BZ, CZ.</p> <p>A two-factor ANOVA considers more than one factor and considers the joint impact of factors. This means that instead of running a new study every time you want to see how an independent variable affects a specific dependent variable, you can run an experiment with two different independent variables and seeing how they each impact the dependent variable and you get to see if the two independent variables do anything together to affect the dependent variable. These are called <strong>main effects</strong> and <strong>interactions</strong>. Keeping the example going, if we find that no matter what the seed type is that fertilizers A, B, and C resulted in different crop yields from one another, we would say there is a main effect for fertilizer type. If no matter what the fertilizer type is there is a difference between the crop yields of seeds Y and Z, we would say that there is a main effect for seed type. If there are times that the two factors influence each other (for example, let’s say that fertilizer worked much better specifically when paired with Y seeds), we would say there’s an interaction. The defining characteristic of an <strong>interaction</strong> is when the effect of one factor depends on the different levels of a second factor or the impact of another factor, either amplifying or reducing the effect based on the level.</p> <p>&nbsp;</p> <h5><strong>Meaning of the Equation Before the ANOVA</strong></h5> <p>Often times when reading a paper which uses a multiple factor ANOVA, there is a little equation before it like 2×2 or 4x5x2 or something along those lines. These equations may look confusing and intimidating, but there is a simple way to read these. The number of numbers in the equation tells you how many factors there are. For instance, a 2×3 ANOVA simply has two factors because there are only two numbers presented. A 4x5x7x2x3x4 ANOVA, although this equation looks ridiculous, simply has six factors in it because there are six numbers present. The actual values of each number tell you how many levels are in each factor. A lot of papers make sure to define which factors they’re considering first, but simply put, a 2×3 ANOVA has two factors and the first factor has two levels while the second has three. A 4x5x7x2x3x4 ANOVA has six factors, the first has four levels, the second has five, the third has seven, the fourth has two, the fifth has three, and the sixth has four. This author has personally never come across an ANOVA so convoluted in a paper, but this example was just meant to show that although it’s easy to get caught up in the sheer volume of numbers, interpreting them is not so complicated. Also remember that it’s the author’s job in an article to interpret the results for you, so that should help.</p> <p>&nbsp;</p> <h5><strong>Hypothesis Testing with Two-Factor ANOVAs</strong></h5> <p>For an ANOVA with only two factors (which is all you’ll likely need to master), there are three different null and alternative hypotheses to consider. One is for the first main effect, one is for the second main effect, and one is for an interaction. Don’t forget to include as many means as there are levels.</p> <p>Null:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b64c5940abe326893b6cb2c08e7efbdd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0: \mu1 = \mu2 = \cdots \mu A" title="Rendered by QuickLaTeX.com" height="20" width="207" style="vertical-align: -5px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-47483de7aa0a1aa4f6f5045b297153c3_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H0: \mu1 = \mu2 = \cdots \mu B" title="Rendered by QuickLaTeX.com" height="20" width="207" style="vertical-align: -5px;" /></p> <p>H0: There is not an A X B interaction</p> <p>Alternative:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9d0b3b8054dea2aa0445f5b896e7fdb8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1: \mu1 \neq \mu2 \neq \cdots \mu A" title="Rendered by QuickLaTeX.com" height="20" width="207" style="vertical-align: -5px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a5f1064b4ec1c1716aa862b54fda6543_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="H1: \mu1 \neq \mu2 \neq \cdots \mu B" title="Rendered by QuickLaTeX.com" height="20" width="207" style="vertical-align: -5px;" /></p> <p>H1: There is an A X B interaction</p> <p>The following are the steps and stages needed to calculate a two-factor ANOVA. Please keep in mind that the formulas needed for these calculations exist in the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary post</a>.</p> <p>&nbsp;</p> <ul><li>First Stage: <ul><li>Is identical to independent samples ANOVA</li> <li>Compute the <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a87a39df86cc8f85b9f3d3f46e6acd2a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{total}" title="Rendered by QuickLaTeX.com" height="18" width="60" style="vertical-align: -3px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-79dde6483d4f2f16b367a2524459a154_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{between treatments}" title="Rendered by QuickLaTeX.com" height="18" width="165" style="vertical-align: -3px;" />, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-65a09232439685dbe2f556281e27b59b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{within treatments}" title="Rendered by QuickLaTeX.com" height="18" width="156" style="vertical-align: -3px;" /></li> </ul> </li> <li>Second Stage: <ul><li>Calculate <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5353526f39fcda04592b9d403da40c31_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_{total}" title="Rendered by QuickLaTeX.com" height="19" width="51" style="vertical-align: -4px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4cf6c243e879558b5df1a5994da06046_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_{within treatments}" title="Rendered by QuickLaTeX.com" height="19" width="148" style="vertical-align: -4px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b93a1d157b3aee5e4c4f7417c919cd05_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_{between treatments}" title="Rendered by QuickLaTeX.com" height="19" width="157" style="vertical-align: -4px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a747bad83330dae3c2d62adace3513cd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_A" title="Rendered by QuickLaTeX.com" height="19" width="29" style="vertical-align: -4px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0a770f9e98e54702f0e7398054b8a5df_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_B" title="Rendered by QuickLaTeX.com" height="19" width="30" style="vertical-align: -4px;" />, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a95cae919f06a13005be01051ea50577_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="df_{error}" title="Rendered by QuickLaTeX.com" height="19" width="57" style="vertical-align: -4px;" />.</li> </ul> </li> <li>Third Stage: <ul><li>The goal is to partition the <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-79dde6483d4f2f16b367a2524459a154_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{between treatments}" title="Rendered by QuickLaTeX.com" height="18" width="165" style="vertical-align: -3px;" /> into three separate components</li> <li>Calculate <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ecc97aa5e4cb927dc323c3c8bff32092_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_A" title="Rendered by QuickLaTeX.com" height="18" width="38" style="vertical-align: -3px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-cf4b6f151f631e4e00473960b2f31baf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_B" title="Rendered by QuickLaTeX.com" height="18" width="39" style="vertical-align: -3px;" />, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8f3e6b836cbceae36b1d1427e2fd09af_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{AxB}" title="Rendered by QuickLaTeX.com" height="18" width="60" style="vertical-align: -3px;" />. In other words, calculate the parts of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-79dde6483d4f2f16b367a2524459a154_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="SS_{between treatments}" title="Rendered by QuickLaTeX.com" height="18" width="165" style="vertical-align: -3px;" /> which can be attributed to main effect one, main effect two, and an interaction.</li> </ul> </li> <li>Fourth Stage: <ul><li>Calculate <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2d0272a9fd8a32277ac95482962fb3ae_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="MS_A" title="Rendered by QuickLaTeX.com" height="18" width="48" style="vertical-align: -3px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0b068ee3c33b20c9d642cf2985f6b5f7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="MS_B" title="Rendered by QuickLaTeX.com" height="18" width="49" style="vertical-align: -3px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-59458120cb067b6761e8df9f263c26fb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="MS_{AxB}" title="Rendered by QuickLaTeX.com" height="18" width="70" style="vertical-align: -3px;" />, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-92c5811e575120b0fae71f26cb61d5e0_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="MS_{within treatments}" title="Rendered by QuickLaTeX.com" height="18" width="166" style="vertical-align: -3px;" />.</li> </ul> </li> <li>Fifth Stage: <ul><li>Calculate <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e0248d38760976294d009ad2ca40c3cb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="F_A" title="Rendered by QuickLaTeX.com" height="18" width="26" style="vertical-align: -3px;" />, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-cea343ff226a6fc89f520898d87eb095_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="F_B" title="Rendered by QuickLaTeX.com" height="18" width="27" style="vertical-align: -3px;" />, and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a7adb7f147301267fd21dfe4b71ceb6f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="F_{AxB}" title="Rendered by QuickLaTeX.com" height="18" width="48" style="vertical-align: -3px;" /></li> <li>These are the numbers which you use to determine if there is a main effect one <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-56a69b9a19cf11db8d6ce42810b3d514_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(F_A)" title="Rendered by QuickLaTeX.com" height="22" width="40" style="vertical-align: -6px;" />, main effect two <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-efa973c5b9d27e4db3248d51ae12e8ac_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(F_B)" title="Rendered by QuickLaTeX.com" height="22" width="41" style="vertical-align: -6px;" />, and/or an interaction <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-982555de3441c055867cc14803957a51_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(F_{AxB})" title="Rendered by QuickLaTeX.com" height="22" width="62" style="vertical-align: -6px;" />.</li> </ul> </li> <li>Sixth Stage: <ul><li>If there is a significant main effect that has three or more levels, this is when you would conduct a post-hoc analysis for that factor alone. This would be no different than one done for a one-way ANOVA.</li> </ul> </li> </ul> <p>The conceptual meaning behind these calculations is that we’re dividing up the variance between the treatments so that we know the differences between the levels of factor A, the differences between the levels of factor B, and then the differences due to an interaction between the factors.</p> <p>The assumptions of a two-factor ANOVA are as follows:</p> <ul><li>The observations within each sample must be independent of each other.</li> <li>The populations from which the samples are selected must be normally distributed.</li> <li>The populations from which the samples are selected must have equal variances.</li> </ul> <p>&nbsp;</p> <h5><strong>Effect Size</strong></h5> <p>Once again we’re using partial eta squared, but this time we’re calculating it thrice – once for main effect one, once for main effect two, and once for an interaction. The formulas for these are relatively simple and can be found in the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">statistics formula glossary post</a>.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-two-factor-anova/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-two-factor-anova/">Math Support Center blog</a> at the University of Baltimore on June 18, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-introduction-to-correlation-and-regression" title="Introduction to Correlation and Regression"><div class="chapter-title-wrap"><h3 class="chapter-number">24</h3><h2 class="chapter-title">Introduction to Correlation and Regression</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>So far we’ve been talking about analyses which involve variables which are split up into categorical or discrete variables (ex. treatment A, B, C) compared to a dependent variable which is continuous (ex. plant height). However, there is a way to look at two variables which have continuous data: <strong>correlation</strong>. A correlation will tell you the characteristics of a relationship such as <strong>direction</strong> (either positive or negative), <strong>form</strong> (we often work with linear relationships), and <strong>strength</strong> of the relationship. Strength and direction can be understood with the number which is given at the end of an analysis <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-bc28395d2212807d339bd3a661195b21_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(r)" title="Rendered by QuickLaTeX.com" height="22" width="23" style="vertical-align: -6px;" />.</p> <p>A <strong>positive correlation</strong> is one in which the increased value of one variable results in the increased value of another. For example, height and weight – as height increased, weight also tends to increase. A <strong>negative correlation</strong> is one in which the increased value of one variable results in the decrease of another. For example, as the temperature outside increases, hot chocolate sales will decrease. This is what is meant by the direction of a correlation. An r-value with a negative sign in front of it means a negative correlation and one without a negative sign means a positive correlation.</p> <p>R-values exist on a plane between -1 and 1. The closer a number is to 1, the stronger its positive relationship. The closer a number is to -1, the stronger its negative relationship. The closer a number is to 0, the weaker its relationship, no matter if its negative or positive.</p> <div class="wp-caption alignnone" id="attachment_449" aria-describedby="caption-attachment-449" style="width: 862px"><img class="wp-image-449 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation.png" alt="Three graphs showing strong, weak, and correlation" width="862" height="274" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation.png 862w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation-300x95.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation-768x244.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation-65x21.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation-225x72.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Strong-weak-no-correlation-350x111.png 350w" /><div class="wp-caption-text" id="caption-attachment-449">Created by Spiritia and shared under a CC BY-SA 3.0 License</div></div> <p>&nbsp;</p> <h5><strong>Pearson Correlation</strong></h5> <p>The most common type of correlation used is the <strong>Pearson Correlation</strong>. It measures the degree and direction of the linear relationship between two variables. It will measure a perfect linear relationship. Every change in variable <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> has a corresponding change in variable <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-289636edca4c123175b5d84dbf63cd0c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Y" title="Rendered by QuickLaTeX.com" height="15" width="16" style="vertical-align: 0px;" />. The possible range of an r-value is between -1 and 1. <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9a82f992e7c853c0b75156342ed1edbf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="R" title="Rendered by QuickLaTeX.com" height="15" width="16" style="vertical-align: 0px;" /> is calculated in the following way: <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ed18b841732d371cf14ab9562762b984_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="r" title="Rendered by QuickLaTeX.com" height="9" width="10" style="vertical-align: 0px;" /> = covariability / variability of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-289636edca4c123175b5d84dbf63cd0c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Y" title="Rendered by QuickLaTeX.com" height="15" width="16" style="vertical-align: 0px;" /> separately.</p> <p>There are some important factors to take into consideration when using and interpreting the Pearson Correlation.</p> <p>&nbsp;</p> <ol><li><span style="text-decoration: underline"><em>Correlation does not demonstrate causation.</em></span> This is something very important to remember; just because two variables have a correlation doesn’t mean that one is causing the other. There may be another factor <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d32eab5e280d447f37936fc67389acfa_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(Z)" title="Rendered by QuickLaTeX.com" height="22" width="29" style="vertical-align: -6px;" /> that we haven’t measured which may be the real reason. Take into account that when ice cream sales go up, so do the number of drownings in an area. Does that mean that ice creams are causing people to drown? Consider that maybe increased temperatures result in more ice cream consumption as well as an increase in the number of people who are going out swimming. You can never know if there’s a third lingering factor with just a correlation.</li> <li>The value of the correlation is affected by the range of scores in the data. For example, if you’re looking at how height and age correlate, if your sample is just made up of people who are 20 or older, you probably will get a weak correlation, as most adults no longer grow. However, if your sample is 17 or younger, you’re likely to find a decent positive correlation.</li> <li>Extreme points (outliers) have an impact. Data points which vary greatly from the others may sometimes need to be removed as their presence affects the correlation.</li> <li>Correlation cannot be interpreted as a proportion.</li> </ol> <p>&nbsp;</p> <h5><strong>Coefficient of Determination</strong></h5> <p>The <strong>coefficient of determination</strong> is a measurement of the proportion of variability in one variable that can be determined from the relationship with the other variable (r squared). In other words, it’s used to analyze how differences in one variable can be explained by a difference in a second variable. The example given by Statistics How To is that when you get pregnant has a direct relation to when they give birth. Link to the <a href="https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/coefficient-of-determination-r-squared/" data-url="https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/coefficient-of-determination-r-squared/">whole article here</a>. This measure is usually reported along the lines of this: “75% of the variation in <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-289636edca4c123175b5d84dbf63cd0c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Y" title="Rendered by QuickLaTeX.com" height="15" width="16" style="vertical-align: 0px;" /> can be explained by the variation in <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" />.”</p> <p>&nbsp;</p> <h5><strong>Other Types of Correlation</strong></h5> <p>While Pearson Correlation is the most commonly used, there are times when the data one collects warrants the use of a different kind of correlation. Some are listed below:</p> <ul><li><strong>Partial correlation:</strong> A partial correlation measures the relationship between two variables while controlling the influence of a third variable by holding it constant.</li> <li><strong>Spearman correlation:</strong> Used when both variables are measured on an ordinal scale; Used when the relationship is consistently directional but may not be linear.</li> <li><strong>Point-biserial correlation:</strong> Measures relationship between two variables when one variable has only two values (dichotomous value)</li> <li><strong>Phi-coefficient:</strong> Both variables are dichotomous. Both variables are re-coded to values 0 and 1</li> </ul> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-correlation/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-correlation/">Math Support Center blog</a> at the University of Baltimore on June 18, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-introduction-to-linear-regression" title="Introduction to Linear Regression"><div class="chapter-title-wrap"><h3 class="chapter-number">25</h3><h2 class="chapter-title">Introduction to Linear Regression</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p><strong>Linear regression</strong> is a method for determining the best-fitting line through a set of data. In a lot of ways, it’s similar to a correlation since things like <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ed18b841732d371cf14ab9562762b984_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="r" title="Rendered by QuickLaTeX.com" height="9" width="10" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-76ddf28f7bfd9fac72aa9a5edb23418d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="r^2" title="Rendered by QuickLaTeX.com" height="19" width="18" style="vertical-align: 0px;" /> are still used. The one difference is that the purpose of regression is prediction. The best-fitting line is calculated through the minimization of total squared error between the data points and the line.</p> <p>The equation used for regression is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-52444de9329c9c2b8e3cee3683f77947_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Y = a + bx" title="Rendered by QuickLaTeX.com" height="17" width="103" style="vertical-align: -2px;" /> or some variation of that. If you remember from algebra class, this formula is like <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2654f22b59150cf979bfea3e69c5a0d7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="Y=mx + b" title="Rendered by QuickLaTeX.com" height="17" width="110" style="vertical-align: -2px;" />. This is because they are both the linear equation. Although you may be asked to report <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ed18b841732d371cf14ab9562762b984_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="r" title="Rendered by QuickLaTeX.com" height="9" width="10" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-76ddf28f7bfd9fac72aa9a5edb23418d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="r^2" title="Rendered by QuickLaTeX.com" height="19" width="18" style="vertical-align: 0px;" />, the purpose of regression is to be able to find values for the slope <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-363420616fb4f9ce87f5c5da17ce0a61_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(b)" title="Rendered by QuickLaTeX.com" height="22" width="22" style="vertical-align: -6px;" /> and the y-intercept <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-70c4e01a4c344526d9cee69f07183d58_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(a)" title="Rendered by QuickLaTeX.com" height="22" width="24" style="vertical-align: -6px;" /> that creates a line that best fits through the data.</p> <p>&nbsp;</p> <h5><strong>Standard Error of the Estimate</strong></h5> <p>Regression equations make a prediction, and the precision of the estimate is measured by the <strong>standard error of the estimate</strong>. The standard error of the estimate is a measure of the accuracy of predictions made with a regression line and has to do with how wide the data points are scattered (strength of the correlation). In other words, it tells you how far away the points tend to be from the prediction line.</p> <p>Here is a playlist of videos that may be helpful<span class="footnote"><span class="footnote-indirect" data-fnref="244-1"></span></span><span class="footnote"><span class="footnote-indirect" data-fnref="244-2"></span></span>:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/zPG4NjIkCjc/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Linear Regression.&quot;" title="Linear Regression." /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=244#pb-interactive-content" title="Linear Regression." data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=244#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=244</a> </p> </div> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-regression/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/18/statistics-regression/">Math Support Center blog</a> at the University of Baltimore on June 18, 2019.</em></p> <div class="footnotes"><div id='244-1'>Longstreet, D. [statisticsfun]. (2012, February 5). An Introduction to Linear Regression Analysis<em> </em>[Video]. YouTube. https://www.youtube.com/watch?v=zPG4NjIkCjc&amp;list=PLF596A4043DBEAE9C&amp;index=1</div><div id='244-2'>Longstreet's resources available through his "statistics fun" channel is extensive. While the title may be off-putting, "My Book Sucks" is an incredibly useful CC-BY licensed resource: https://www.youtube.com/user/statisticsfun/about</div></div></div></div>
<div class="chapter standard" id="chapter-discrete-probability-distributions" title="Discrete Probability Distributions"><div class="chapter-title-wrap"><h3 class="chapter-number">26</h3><h2 class="chapter-title">Discrete Probability Distributions</h2><h2 class="chapter-author">Jason Green</h2></div><div class="ugc chapter-ugc"> <h5><strong>Discrete versus Continuous Variables</strong></h5> <p>A <strong>discrete variable</strong> typically originates from a counting process while a <strong>continuous variable</strong> usually comes from a measuring process. An easy way to make the distinction between a discrete and a continuous variable is that discrete variables are usually whole numbers with no decimals. Continuous variables on the other hand frequently take the form of decimals. For instance, the number of people which exist within a group is a discrete variable because it’s always a whole number, while a person’s weight would be continuous since it can typically be measured to multiple decimal places.</p> <p>&nbsp;</p> <h5><strong>The Probability Distribution for a Discrete Variable</strong></h5> <p>A <strong>probability distribution</strong> for a discrete variable is simply a compilation of all the range of possible outcomes and the probability associated with each possible outcome. Since, probability in general, by definition, must sum to 1, the summation of all the possible outcomes must sum to 1. For example, if you’re flipping a coin once, there’s a 1 in 2 chance it will land on heads, and a 1 in 2 chance it will land on tails; 1/2 + 1/2 = 1. In this way, measuring probability is similar to the use of percentages. Percentages are always measured out of 100 while probability is always measured out of 1. This is true for all probability measurements.</p> <p>&nbsp;</p> <h5><strong>Expected Value for a Discrete Variable</strong></h5> <p>The expected value for a discrete variable is essentially the same as the population mean. In this way, the expected value is calculated simply by finding the product of each possible outcome and its associated probability and doing a summation at the end.</p> <p>&nbsp;</p> <h5><strong>Standard Deviation and Variance of a Discrete Variable</strong></h5> <p><strong>Standard deviation</strong> is basically a measure of how much each data point varies away from the mean; it’s also often described as the spread of the distribution. Quantitatively, the standard deviation is simply the square root of variance. This quantitative definition confirms the fact that variance must always be a positive number since numerically the evaluation of standard deviation would be impossible otherwise. Conversely, the standard deviation can be both positive and negative as each data point can be both above and below the mean value. Standard deviation and variance as concepts are also discussed in an earlier post called Basics of Variability.</p> <p>&nbsp;</p> <h5><strong>Binomial Distribution</strong></h5> <p>The <strong>binomial distribution</strong> is a type of mathematical model. Mathematical models allow us to easily calculate the probability of occurrence of any specific value of the variable of interest. The binomial distribution is used in situations where the discrete variable is the number of occurrences in a sample of n observations.</p> <p>There are 4 properties of the Binomial Distribution:</p> <ol><li>The sample must consist of a fixed number of observations, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /></li> <li>Each and every observation can be categorized into one of two mutually exclusive and collectively exhaustive categories</li> <li>The probability of an event of interest, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-de7f58c02f2ca852481500881db58dd4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="p" title="Rendered by QuickLaTeX.com" height="13" width="12" style="vertical-align: -4px;" />, is constant across all observations. Therefore the probability of a non-event of interest, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-503c933d70142f98e15135b6e8482853_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="1 - p" title="Rendered by QuickLaTeX.com" height="18" width="46" style="vertical-align: -4px;" /> (sometimes called <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8512264da9dbe7a3fb6978588d542fbf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="q" title="Rendered by QuickLaTeX.com" height="13" width="10" style="vertical-align: -4px;" />) is constant for all observations.</li> <li>Observations are all independent. This simply means the probability of occurrence of any observation is not dependent on any other observation.</li> </ol> <p>The Binomial distribution formula:</p> <p class="ql-center-displayed-equation" style="line-height: 50px;"><span class="ql-right-eqno">&nbsp; </span> <span class="ql-left-eqno">&nbsp; </span><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-538b26fb6f051a160d34dc02eb4ed0c3_l3.svg" height="50" width="369" class="ql-img-displayed-equation quicklatex-auto-format" alt="\[ P (X = x | n, p) = \frac{n!}{x! (n - x)!} p^x {(1-p)}^{n-x} \]" title="Rendered by QuickLaTeX.com" /></p> <p>And</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b1bb26e9edb3fe938cef636ee1377473_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P(X = x | n, p)" title="Rendered by QuickLaTeX.com" height="22" width="130" style="vertical-align: -6px;" /> = probability that <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1b491b036fbd776b45da9aeefc1b384b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X = x" title="Rendered by QuickLaTeX.com" height="15" width="59" style="vertical-align: 0px;" /> events of interest, where <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-de7f58c02f2ca852481500881db58dd4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="p" title="Rendered by QuickLaTeX.com" height="13" width="12" style="vertical-align: -4px;" /> are as follows:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> = number of observations</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-de7f58c02f2ca852481500881db58dd4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="p" title="Rendered by QuickLaTeX.com" height="13" width="12" style="vertical-align: -4px;" /> = probability of an event of interest (prob.of success)</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-952a52d39458bce9067dd83209654a42_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="I - p = q" title="Rendered by QuickLaTeX.com" height="19" width="85" style="vertical-align: -4px;" /> = probability of not having an event of interest (prob. of failure)</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> = number of events of interest (no. of successes) in the sample (<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> = 0,1,2, …, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" />)</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fa1e44c58f2d0c88aa0d531afb116435_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{n!}{x! (n - x)!}" title="Rendered by QuickLaTeX.com" height="31" width="63" style="vertical-align: -12px;" /> = The number of combinations of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> events of interest out of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> observations. This calculation does not take into account the order in which the events actually occur. If the order was important, that would involve calculating a permutation, not a combination. Here is a video depicting the calculation of combinations:</p> <div class="textbox interactive-content interactive-content--oembed"><img src="https://i.ytimg.com/vi/WWv0RUxDfbs/hqdefault.jpg" alt="Thumbnail for the embedded element &quot;Binomial distribution | Probability and Statistics | Khan Academy&quot;" title="Binomial distribution | Probability and Statistics | Khan Academy" /> <p>A YouTube element has been excluded from this version of the text. You can view it online here: <a href="https://ubalt.pressbooks.pub/mathstatsguides/?p=264#pb-interactive-content" title="Binomial distribution | Probability and Statistics | Khan Academy" data-url="https://ubalt.pressbooks.pub/mathstatsguides/?p=264#pb-interactive-content">https://ubalt.pressbooks.pub/mathstatsguides/?p=264</a> </p> </div> <p>Another version of this formula which may be easier to read can be found in the <a href="#chapter-statistics-formula-glossary" data-url="https://ubalt.pressbooks.pub/mathstatsguides/chapter/statistics-formula-glossary/">Statistics Formula Glossary</a>.</p> <p>When conducting calculations for binomial distributions, there are three distinct possibilities that may be encountered.</p> <p style="padding-left: 40px"><em>Example: There are 10 golf balls in a bag, consisting of 6 orange balls and 4 yellow balls. If we define success as the likelihood of picking an orange ball and therefore failure as not picking an orange ball (and therefore picking a yellow ball), we can illustrate the three distinct possibilities that may be encountered in calculations.</em></p> <p style="padding-left: 40px"><em>If 6 golf balls are to be selected at random (without replacement):</em></p> <p>&nbsp;</p> <ul><li><span style="text-decoration: underline"><em>What is the probability of picking exactly 4 orange balls?</em></span></li> </ul> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-429c25a544bbf37de6737f2807bdb3f0_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = x | n, p) = \frac{n!}{x! (n - x)!} p^x {(1-p)}^{n-x}" title="Rendered by QuickLaTeX.com" height="31" width="342" style="vertical-align: -12px;" /></p> <p style="padding-left: 40px">And</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b1bb26e9edb3fe938cef636ee1377473_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P(X = x | n, p)" title="Rendered by QuickLaTeX.com" height="22" width="130" style="vertical-align: -6px;" /> = probability that <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1b491b036fbd776b45da9aeefc1b384b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X = x" title="Rendered by QuickLaTeX.com" height="15" width="59" style="vertical-align: 0px;" /> events of interest, when <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-de7f58c02f2ca852481500881db58dd4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="p" title="Rendered by QuickLaTeX.com" height="13" width="12" style="vertical-align: -4px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a13b200485b8e78d6d1986a266856286_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="N" title="Rendered by QuickLaTeX.com" height="15" width="19" style="vertical-align: 0px;" /> = number of observations = 6</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-bcc4b1f28438dadaa8af496853458cfb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P" title="Rendered by QuickLaTeX.com" height="15" width="16" style="vertical-align: 0px;" /> = probability of an event of interest = 6/10 = 0.6</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f50b0b57ec2d0d16217222221ce8f8dc_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="I - p(q)" title="Rendered by QuickLaTeX.com" height="22" width="73" style="vertical-align: -6px;" /> = prob. of not having an event of interest = 0.4</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> = number of events of interest (no. of successes) in the sample (<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-597f8460a306ca51ecdbf4ccd022b48c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X" title="Rendered by QuickLaTeX.com" height="15" width="18" style="vertical-align: 0px;" /> = 0,1,2, …, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" />) = 4</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0bc0b057bbad496231cb8a5f23e12f70_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = x | n, p) = \frac{6!}{4! (6 - 4)!} 0.6^4 \ast (1-0.6)^{(6-4)} = 0.3110" title="Rendered by QuickLaTeX.com" height="32" width="487" style="vertical-align: -12px;" /></p> <p>&nbsp;</p> <ul><li><em><span style="text-decoration: underline">What is the probability of picking at least 4 orange balls?</span></em></li> </ul> <p style="padding-left: 40px">This equates to: prob. of 4 orange + prob. of 5 orange + prob. of 6 orange</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-07442a74d8f323d673ef88f262922760_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = x | n, p) = \frac{6!}{4! (6 - 4)!} 0.6^4 \ast (1-0.6)^{(6-4)}" title="Rendered by QuickLaTeX.com" height="32" width="399" style="vertical-align: -12px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6468c5a35dcad005563aae47e96c3c58_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="+ \frac{6!}{5! (6 - 5)!} 0.6^5 \ast (1-0.6)^{(6-5)}" title="Rendered by QuickLaTeX.com" height="32" width="255" style="vertical-align: -12px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f3b49bb2f6a420a313eef362978a1061_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="+ \frac{6!}{6! (6 - 6)!} 0.6^6 \ast (1-0.6)^{(6-6)}" title="Rendered by QuickLaTeX.com" height="32" width="255" style="vertical-align: -12px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ade841e712d4b86588b4e80e1d0a8aba_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 0.31104 + 0.186624 + 0.046656" title="Rendered by QuickLaTeX.com" height="17" width="301" style="vertical-align: -2px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-262e5a8812d42f6e32f0f51647b6bfa4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 0.5443" title="Rendered by QuickLaTeX.com" height="15" width="80" style="vertical-align: 0px;" /></p> <p>&nbsp;</p> <ul><li><em><span style="text-decoration: underline">What is the probability of picking less than 4 orange balls?</span></em></li> </ul> <p style="padding-left: 40px">This equates to: prob. of 0 orange + prob. of 1 orange + prob. of 2 orange + prob. of 3 orange</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-dd46740aa540037440e6f970ecc02af2_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 1 -" title="Rendered by QuickLaTeX.com" height="14" width="47" style="vertical-align: 0px;" /> Prob. of at least 4 orange balls <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-865f4d2fa3d47e0c0f97e317721a3fd8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 1 - 0.5443 = 0.4557" title="Rendered by QuickLaTeX.com" height="15" width="204" style="vertical-align: 0px;" /></p> <p>&nbsp;</p> <h5><strong>Mean of the Binomial Distribution</strong></h5> <p>The mean, ℳ, of the binomial distribution is the product of the sample size, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" />, and the probability of an event of interest (success), <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-de7f58c02f2ca852481500881db58dd4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="p" title="Rendered by QuickLaTeX.com" height="13" width="12" style="vertical-align: -4px;" />.</p> <p>ℳ<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e59a841b94d1416bd89cf41ee4ae88c7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= E (X) = np" title="Rendered by QuickLaTeX.com" height="22" width="126" style="vertical-align: -6px;" /></p> <p>This is the value that is statistically most likely to occur. For instance, consider the example of tossing two unbiased dice, the range of values that may result extends from 2 to 12. The mean value is actually 7. This is because there are six distinct ways to get a value of 7. They are 1&amp; 6, 6 &amp; 1, 2 &amp; 5, 5 &amp; 2, 3 &amp; 4, and 4 &amp; 3. This represents 6 distinct possibilities out of a total of 36 possibilities, which is the most likely result to occur from all the distinct possibilities.</p> <p>&nbsp;</p> <h5><strong>Standard Deviation of the Binomial Distribution</strong></h5> <p>The standard deviation of the binomial distribution, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-dfd5b8899b6bf52736e755584b39f360_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="?" title="Rendered by QuickLaTeX.com" height="15" width="8" style="vertical-align: 0px;" />, is the square root of the variance.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-54ea05ce622dca2348a114282c51ca3e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="? = \sqrt{Var (X)} = \sqrt{np (1-p)}" title="Rendered by QuickLaTeX.com" height="25" width="268" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Poisson Distribution</strong></h5> <p>The <strong>Poisson distribution</strong> is another type of mathematical model. The Poisson distribution applies when we want to determine the number of occurrences of a particular event in some fixed interval of time and space. This fixed interval of time and space is often called an area of opportunity. Within the area of opportunity, there can be multiple occurrences of an event.</p> <p>There are 4 properties of the Poisson Distribution:</p> <p>&nbsp;</p> <ol><li>The area of opportunity must be defined by time, length, surface area etc. Per the Poisson distribution, we can determine the number of times a particular event occurs in a given area of opportunity.</li> <li>The probability that an event occurs in a given area of opportunity must be the same for all the areas of opportunity.</li> <li>The number of events that occur in each and every area of opportunity is independent of the number of events that occur in any area of opportunity</li> <li>The probability that two or more events will occur in any area of opportunity approximates to zero as the area of opportunity becomes smaller.</li> </ol> <p>&nbsp;</p> <p>The Poisson distribution formula:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-393f1dc6eb623a2294ca92f5aa86538a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = x | \lambda) = \frac{e^{-\lambda}\lambda^{x}!}{x!}" title="Rendered by QuickLaTeX.com" height="29" width="193" style="vertical-align: -7px;" /></p> <p>where</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-404507ab0a4148ecd0c9e2c4004c2453_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = x | \lambda)" title="Rendered by QuickLaTeX.com" height="22" width="109" style="vertical-align: -6px;" /> = probability that <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1b491b036fbd776b45da9aeefc1b384b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="X = x" title="Rendered by QuickLaTeX.com" height="15" width="59" style="vertical-align: 0px;" /> events in an area of opportunity given <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-875e63e39950adf9338deae5f4902424_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\lambda" title="Rendered by QuickLaTeX.com" height="15" width="11" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-875e63e39950adf9338deae5f4902424_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\lambda" title="Rendered by QuickLaTeX.com" height="15" width="11" style="vertical-align: 0px;" /> = expected number of events per unit</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3cdd4c6005b6c22008aa257e640b2f95_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="e" title="Rendered by QuickLaTeX.com" height="9" width="10" style="vertical-align: 0px;" /> = mathematical constant approximated by 2.71828</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> = number of events (<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9e2977c7298afb4055ac8209bab80419_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 0,1,2, \cdots, n" title="Rendered by QuickLaTeX.com" height="18" width="153" style="vertical-align: -4px;" />)</p> <p>&nbsp;</p> <p style="padding-left: 40px"><em>Example: Imagine that the mean number of cars that pass an intersection in a 1-minute interval is 5.0.</em></p> <p>&nbsp;</p> <ul><li><em><span style="text-decoration: underline">What is the probability that in a given minute, exactly four cars will arrive?</span></em></li> </ul> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-901fa33be2a4c861e5c5083eab790a33_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="P (X = 4 | \lambda = 5) = \frac{e^{-5} \ast 5.0^{4}!}{4!} = 0.1755" title="Rendered by QuickLaTeX.com" height="29" width="335" style="vertical-align: -7px;" /></p> <p>&nbsp;</p> <ul><li><em><span style="text-decoration: underline">What is the probability that more than four cars will arrive in a given minute?</span></em></li> </ul> <p style="padding-left: 40px">The probability that more than four cars will arrive:</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-995df12d623d9b49eb59e837dd8a4f25_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="image" /> 4) = P (X = 4) + P (X = 5) + P (X = 6) + \cdots +" title="Rendered by QuickLaTeX.com" height="22" width="512" style="vertical-align: -6px;"&gt;</p> <p style="padding-left: 40px">Since all probabilities in a distribution sum to 1:</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d6b2e6a61df84412a54f52a40cc1b92e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="image" /> 4) = 1 - P(X &lt;= 4)" title="Rendered by QuickLaTeX.com" height="22" width="261" style="vertical-align: -6px;"&gt;</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7434aed4d1431961f0b44e0130f6372c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 1 - [P(X=0) + P(X=1) + P(X=2) + P(X=3) + P(X=4)]" title="Rendered by QuickLaTeX.com" height="22" width="623" style="vertical-align: -6px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-674d89bf4163620ac371eee3fbe5fc64_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 1 - (0.0067 + 0.0337 + 0.0842 + 0.1404 + 0.1755)" title="Rendered by QuickLaTeX.com" height="22" width="469" style="vertical-align: -6px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-842aa3c274b1820010ac07573cbd648f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 1 - 0.4405" title="Rendered by QuickLaTeX.com" height="15" width="116" style="vertical-align: 0px;" /></p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-64b3ea3aff3c5cc4f78faaf54cb60add_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="= 0.5595" title="Rendered by QuickLaTeX.com" height="15" width="80" style="vertical-align: 0px;" /></p> <p>&nbsp;</p> <p><span style="background-color: #ffff00">Some of the material in this post was obtained from Statistics for Managers: Using Microsoft Excel, Eighth Edition.</span></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/statistics-discrete-probability-distributions/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/07/01/statistics-discrete-probability-distributions/">Math Support Center blog</a> at the University of Baltimore on July 1, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-statistics-and-excel-evaluating-normality" title="Statistics and Excel: Evaluating Normality"><div class="chapter-title-wrap"><h3 class="chapter-number">27</h3><h2 class="chapter-title">Statistics and Excel: Evaluating Normality</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Evaluating Normalcy</strong></h5> <p>Many statistical tests run on the assumption that the data with which you are working is normally distributed, so it’s important to check. There are several different ways to go about this. This post will explain a few different methods for testing normalcy as well as provide some instructions about how to run these tests in Excel.</p> <p>&nbsp;</p> <h5><strong>Mean vs. Median</strong></h5> <p>An important rule to note about distribution is that in a normal distribution, the mean, median, and mode are approximately equal. What it looks like visually is that the mean, median, and mode are all sitting at the top of the hump of the bell curve. When a distribution is skewed, these values become different. The mode will always sit around the hump of a distribution (because this is where most of the values have accumulated). The mean is the measure of central tendency most affected by extreme variables and outliers, so it will follow the longest tail. The median, in this case, will always fall somewhere between the median and the mode. Put another way, if the distribution is positively skewed, the mean will be the greatest value, the median will be the second greatest value, and the mode will be the smallest value. If the distribution is negatively skewed, the mean will be the smallest value, the median will be the second smallest value, and the mode will be the greatest value. So when you’re looking at a data set, you may be able to get an idea of the skew of the distribution by comparing the mean and the median.</p> <p><img class="alignnone wp-image-465 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone.png" alt="Diagram showing different distributions" width="606" height="306" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone.png 606w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone-300x151.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone-65x33.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone-225x114.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29a-redone-350x177.png 350w" /></p> <p>The easiest way to get all of the descriptive statistics you need in Excel is to download Analysis ToolPak. To do this go to: <strong>File&gt;Options&gt;Add-ins&gt;Analysis ToolPak</strong>. Make sure to then hit GO in the bottom right, and then click the check-mark next to Analysis ToolPak before hitting OK. These directions are good for PC, but Mac users may need to find a different route for including Add-ins.</p> <p>Once there, go to Data&gt;Data Analysis. From there, you should see this pop-up.</p> <p><img class="alignnone wp-image-284 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen.png" alt="Screenshot of Excel pop-up window" width="582" height="253" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen.png 582w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen-300x130.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen-65x28.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen-225x98.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/2-data-analysis-screen-350x152.png 350w" /></p> <p>From there, click Descriptive Statistics, select your input range, select an output range, click Summary Statistics, and then click OK.</p> <p><img class="alignnone size-full wp-image-285" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/3-descriptives-box-300x229-1.png" alt="Pop-up of Excel spreadsheet" width="300" height="229" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/3-descriptives-box-300x229-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/3-descriptives-box-300x229-1-65x50.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/3-descriptives-box-300x229-1-225x172.png 225w" /></p> <p>Your output should look something like this. You should be able to see the mean, standard deviation, median, mode, range, minimum, maximum, etc. These will all be helpful in future normalcy tests.</p> <p><img class="alignnone size-full wp-image-286" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/4-descriptives-190x300-1.png" alt="Screenshot of Excel data" width="190" height="300" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/4-descriptives-190x300-1.png 190w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/4-descriptives-190x300-1-65x103.png 65w" /></p> <p>&nbsp;</p> <h5><strong>Range and Interquartile Range vs Standard Deviation</strong></h5> <p>Another two questions which should be asked when comparing mean and median are, “Is the range approximately six times the standard deviation?” and “Is the interquartile range approximately 1.33 times the standard deviation?” Given the descriptive statistics we just ran, the first question should be easy to answer. Simply multiply the standard deviation by 6 and check to see if it is close to the range. Interquartile range is the difference between the first quartile and the third quartile, which are not given to you when you run the descriptive statistics. One way to find the first and third quartiles is by using the =QUARTILE function. When using this function, highlight the data, punch in a coma, and then put in the number quartile that you are interested in (1 or 3). It should look like this: =QUARTILE(A2:A:20,1). Find the difference between the two and compare this to the standard deviation multiplied by 1.33. If either of these results show wildly different results than expected, you should consider that the data may be skewed.</p> <p>&nbsp;</p> <h5><strong>Boxplot</strong></h5> <p>A boxplot is a quick way to see if the data you’re working with are symmetrical. To create a boxplot in Excel, highlight your data and go to Insert &gt;Recommended Charts &gt; All Charts &gt; Box &amp; Whisker. If the plot looks to be symmetrical, your data are likely normal. If one of your box sides or whiskers stretch out farther than the rest, your data may be skewed. More specifically, if your whisker extends out in the direction of the larger numbers, your data are positively skewed. If your whisker extends out to the smaller numbers, your data are negatively skewed. Below is an example of each.</p> <p><img class="alignnone wp-image-464 size-full" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone.png" alt="Diagram of skewed bloxplots" width="969" height="716" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone.png 969w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone-300x222.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone-768x567.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone-65x48.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone-225x166.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/ch29b-redone-350x259.png 350w" /></p> <p>&nbsp;</p> <h5><strong>Histogram</strong></h5> <p>Histograms are a good way to visually check the normalcy of large amounts of data. When evaluating it, the shape of the bars should represent a bell-curve. If not, then the data may be skewed or multimodal. To create a histogram, highlight your data and go to Insert &gt; Recommended Charts &gt; All Charts &gt;Histogram. When you first create the histogram, it may look a little wonky. To fix this, click the new chart, and a green plus sign should appear. Click this, hover over Axes, and move your cursor to the right until a black arrow appears. Click this, then click More Axis Options, and a bar should appear on the right side of your screen which will allow you to adjust your bin width and bin number.</p> <p><img class="alignnone wp-image-288 size-large" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-1024x274.png" alt="Diagram of a histogram in Excel" width="1024" height="274" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-1024x274.png 1024w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-300x80.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-768x205.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-65x17.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-225x60.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram-350x94.png 350w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/6-histogram.png 1366w" /></p> <p>&nbsp;</p> <h5><strong>Empirical Rule</strong></h5> <p>Although there is no simple way to use the empirical rule to test to see if your data are normally distributed, if you did want to take that route, the rules are as follows:</p> <p>&nbsp;</p> <ul><li>Two-thirds of the data lie within ±1 standard deviations of the mean</li> <li>Four-fifths of the data lie within ±1.28 standard deviations of the mean</li> <li>19 of every 20 data points lie within ± 2 standard deviations of the mean</li> </ul> <p>If one were to do this in Excel, one might create a frequency distribution with these standard deviations as the bins and compare those frequencies with these rules. This can be done using the Histogram feature of Data Analysis.</p> <p>&nbsp;</p> <h5><strong>Normal Probability Plot (Q-Q Plot)</strong></h5> <p>A normal probability plot (otherwise known as a quantile-quantile or Q-Q plot) is a way to visualize data normalcy. Steps on how to create one in Excel are as follows:</p> <p>&nbsp;</p> <ul><li>Step 1: Order the data from least to greatest</li> <li>Step 2: Find the expected quantile z-scores. This is done using the following equation: <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5f4ab19b562da722e42855f130250f43_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{1}{n+1} \ast q" title="Rendered by QuickLaTeX.com" height="28" width="63" style="vertical-align: -9px;" /> where <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fbdaefcfe46d44fc6862c7ef0b98f40f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="n" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is the number of data points and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8512264da9dbe7a3fb6978588d542fbf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="q" title="Rendered by QuickLaTeX.com" height="13" width="10" style="vertical-align: -4px;" /> is the ordered value. The smallest number would have an ordered value of 1, the second smallest would have an ordered value of 2, and so on and so forth.</li> <li>Step 3: Finally, you can highlight the data and the z-scores (make sure the two columns are next to each other), go to Insert, and create a scatterplot with the data. If the line is straight, then the data is normally distributed. If the line curves upward, the data is positively skewed. If the line curves downward, the data is negatively skewed.</li> </ul> <p><img class="alignnone size-full wp-image-289" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo.png" alt="Screenshot of Excel data and graph" width="909" height="430" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo.png 909w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo-300x142.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo-768x363.png 768w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo-65x31.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo-225x106.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/7-qqplo-350x166.png 350w" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2020/02/04/statistics-and-excel-evaluating-normality/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2020/02/04/statistics-and-excel-evaluating-normality/">Math Support Center blog</a> at the University of Baltimore on February 4, 2020.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-statistics-formula-glossary" title="Statistics Formula Glossary"><div class="chapter-title-wrap"><h3 class="chapter-number">28</h3><h2 class="chapter-title">Statistics Formula Glossary</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <p>This chapter includes a glossary of formulas that may be helpful to keep around when practicing statistical problems for homework or studying for an upcoming test.</p> <p>PDF version: <a href="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Stats-Formula-Glossary-7_16_2019.pdf" data-url="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Stats-Formula-Glossary-7_16_2019.pdf">Stats-Formula-Glossary-7_16_2019</a></p> <p>Word (.docx) version: <a href="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Stats-Formula-Glossary-7_16_2019.docx" data-url="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Stats-Formula-Glossary-7_16_2019.docx">Stats-Formula-Glossary-7_16_2019</a></p> <p>Please keep in mind that although these formulas work, they may not be the versions that your professors have taught you to use. It may also be that this formula sheet has formulas for problems you don’t need to know how to solve for the purposes of your class. If this is the case, we encourage you to download the Word version so that you may add to, subtract from, or edit the glossary to fit your own individual needs.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/06/30/statistics-formula-glossary/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/06/30/statistics-formula-glossary/">Math Support Center blog</a> at the University of Baltimore on June 30, 2019.</em></p> </div></div>
<div class="part " id="part-math-82-86-153-163"><div class="part-title-wrap"><h3 class="part-number">III</h3><h1 class="part-title">Algebra and Concepts</h1></div><p>This section contains chapters about algebra and basic mathematical concepts. A link to the original blog post is included at the bottom of each chapter.</p><div class="ugc part-ugc"></div></div>
<div class="chapter standard" id="chapter-introduction-to-exponents-and-polynomials" title="Introduction to Exponents and Polynomials"><div class="chapter-title-wrap"><h3 class="chapter-number">29</h3><h2 class="chapter-title">Introduction to Exponents and Polynomials</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Evaluating Exponential Expressions</strong></h5> <p>When working with exponents, it might be more helpful to think of them as multiple instances of multiplication. Some exponents are going to be more straight-forward, but be careful of the writing of some exponents.</p> <p>Let’s take a look at some examples of evaluating exponential expressions:</p> <p>In the expression below, this is an illustration of what we mean when we say that an exponent is like multiple multiplications. The exponents signify the number of times that the number 2 should be multiplied by itself.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ccfdb5ff108227fd1738e7cde7662d76_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2^5 = 2 \ast 2 \ast 2 \ast 2 \ast 2" title="Rendered by QuickLaTeX.com" height="19" width="178" style="vertical-align: 0px;" /></p> <p>In the next expression, the -3 is in parentheses. This means that the exponent outside of the parentheses needs to be applied to the number as a whole, including its being negative.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c8555fa04886ab03ce0a0057f720a474_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(-3)^2 = (-3)(-3) = 9" title="Rendered by QuickLaTeX.com" height="25" width="203" style="vertical-align: -6px;" /></p> <p>In this last expression, the exponent is sitting right next to the 3 without any parentheses holding the negative and the 3 together. In this case, think of it as though a -1 needs to be multiplied by 3 which is then multiplied by 3. No matter what the exponent is in this case, the answer will always be negative because a -1 is being multiplied to whatever number comes out of evaluating the exponent.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b99ac23f94d03b0a4da2a7118c829d6f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="-3^2 = -(3)(3) = -9" title="Rendered by QuickLaTeX.com" height="25" width="187" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Using the Product Rule</strong></h5> <p>When multiplying two or more of the same variable with exponents (meaning they have the same base number or letter), it’s as if you’re adding the exponents together. Below is a visualization of why that is. If we stretch out the expression so that each x is being multiplied by itself the proper number of times, it’s as if we added the 2 and 3 exponents to one another.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5f1faf70c7de9722a83345ba1770abc0_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 \ast x^2 = (x \ast x)(x \ast x \ast x) = x^5" title="Rendered by QuickLaTeX.com" height="25" width="291" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a2ffad6c5f6b96251c786584d7ab81a4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 \ast x^3 = x^{2+3} = x^5" title="Rendered by QuickLaTeX.com" height="19" width="180" style="vertical-align: 0px;" /></p> <p>It’s important to remember, however, that you can only simplify expressions this way if the base number or variable is the same. In the example below, although it looks very complex, if we separate all numbers without exponents and all the different variables by type, we can easily achieve a simplified version of the expression.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fc678ac97933c9c5e4a083e9c30baf58_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(-a^7b^4)(3ab^9)" title="Rendered by QuickLaTeX.com" height="25" width="124" style="vertical-align: -6px;" /></p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-027df331d1ecd5368ffa0f61263fba24_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(-1 \ast 3)(a^7 \ast a^1)(b^4 \ast b^9)" title="Rendered by QuickLaTeX.com" height="25" width="220" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-99812c35728ff0eeb2888c4bbacdc8c1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(-3)(a^8)(b^{13})" title="Rendered by QuickLaTeX.com" height="25" width="119" style="vertical-align: -6px;" /></li> </ul> <p>So the answer is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-66b233268537a8fbfad8837fe3d1c17f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3a^8b^{13}" title="Rendered by QuickLaTeX.com" height="19" width="56" style="vertical-align: 0px;" /></p> <p>&nbsp;</p> <h5><strong>Power of a Quotient Rule</strong></h5> <p>The power of a quotient rule is that when a fraction exists within parentheses and is met with an exponent, everything within the parentheses is affected by the exponent. This includes any numbers which are attached to variables as well.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-69e7bfc433c6b9ea6a0caf4b12bf95d6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(\frac{x}{y})^n = (\frac{x^n}{y^n})" title="Rendered by QuickLaTeX.com" height="29" width="106" style="vertical-align: -10px;" /></p> <p>&nbsp;</p> <h5><strong>Quotient Rule for Exponents</strong></h5> <p>If multiplying numbers with exponents is like adding the exponents together, then dividing is like subtracting the exponents.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-aa4b0a71763a37c277ab2f9b86d2fc0b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x^6}{x^2} = x^{6-2} = x^4" title="Rendered by QuickLaTeX.com" height="30" width="138" style="vertical-align: -8px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b2c523c176c7223e5766fb96483088a4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{14^7}{14^5} = 14^{7-5} = 14^2" title="Rendered by QuickLaTeX.com" height="30" width="164" style="vertical-align: -8px;" /></p> <p>&nbsp;</p> <h5><strong>Zero Exponent</strong></h5> <p>For any number that has an exponent of 0, that number is always translated to 1. Try to keep this in mind as you start to deal with more complex equations involving exponents, as an equation can be better cleaned up by immediately translating a number or variable with an exponent of 0 to just be the number 1.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c0473001fb72398c6a1e822756c4d5bb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^0 = 1" title="Rendered by QuickLaTeX.com" height="19" width="58" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ab7c2ba722881802762ac0ecb6c89262_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="5^0 = 1" title="Rendered by QuickLaTeX.com" height="19" width="56" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1154d51e111a210f5b08a788c9e4258d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="251^0 = 1" title="Rendered by QuickLaTeX.com" height="19" width="77" style="vertical-align: 0px;" /></p> <p>&nbsp;</p> <h5><strong>Negative Exponents</strong></h5> <p>Negative exponents ask that the variable be flipped into (or sometimes out of) a fraction when translated. In the first example below, the x has to be flipped over into a denominator in order to get rid of the negative sign on its exponent. In the other, the x has to be flipped over to the numerator to get rid of the negative. In this case, however, the one in the denominator can be removed and the x no longer has to be part of a fraction.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-10b950b798aa10fe79b41eb6f97cd3bf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^{-3} = \frac{1}{x^3}" title="Rendered by QuickLaTeX.com" height="27" width="82" style="vertical-align: -8px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7d5fb74da796708d6d7f81c56b5f9229_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{1}{x^{-3}} = x^3" title="Rendered by QuickLaTeX.com" height="27" width="79" style="vertical-align: -8px;" /></p> <p>&nbsp;</p> <h5><strong>Scientific Notation</strong></h5> <p>Scientific notation is a process used to make either very big or small numbers easier to read. To translate a number into scientific notation, count the number of spaces it would take to get the first non-zero digit to become the one’s digit and then multiply that number by 10 to the number of digits moved. Here we can see an example. We have the number 0.000003. Simply start your pencil at the decimal and then create a bump around each 0 until you get to the other side of the number 3. Then, count the number of bumps. Remember that if you’re translating a big number to be smaller, the exponent next to the 10 should be positive. If you’re translating a small number to be bigger, the exponent next to the 10 should be negative.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f91fac45590177db53ef9159687fd9d5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="0.000003 = 3 \ast 10^{-6}" title="Rendered by QuickLaTeX.com" height="19" width="181" style="vertical-align: 0px;" /></p> <p><img class="alignnone size-full wp-image-303" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D12-300x83-1.png" alt="Example of scientific notation" width="300" height="83" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D12-300x83-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D12-300x83-1-65x18.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D12-300x83-1-225x62.png 225w" /></p> <p>&nbsp;</p> <h5><strong>What is a Polynomial?</strong></h5> <p>A polynomial is an equation which is created through the use of two or more algebraic terms. In the example below, each color represents a different term.</p> <p>Polynomials can be made up of some or all of the following:</p> <ul><li><strong>Variables:</strong> the letters in the equation</li> <li><strong>Constants:</strong> In the example above, the constant would be the 11. It’s a number that does not contain any modifiable variables.</li> <li><strong>Exponents:</strong> These are the numbers that you’ll typically find attached to variables</li> <li><strong>Addition, subtraction, multiplication, and division</strong></li> </ul> <p><img class="alignnone size-full wp-image-304" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D13.png" alt="Color-coded example of a polynomial" width="255" height="71" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D13.png 255w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D13-65x18.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D13-225x63.png 225w" /></p> <p>&nbsp;</p> <h5><strong>What isn’t Considered a Polynomial?</strong></h5> <p>While a polynomial can appear in many different ways, there are some rules about what is not considered a polynomial. A polynomial is NOT:</p> <p>An equation which contains division by a variable.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f0bc45cc68ac5ac57d39b91002f2aa67_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{2x^2 + 6x + 3}{x}" title="Rendered by QuickLaTeX.com" height="29" width="77" style="vertical-align: -7px;" /></p> <p>An equation that contains negative exponents.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b0f5a69af2b89320febb67a660c4e76e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^{-2} + 6x + 3" title="Rendered by QuickLaTeX.com" height="21" width="127" style="vertical-align: -2px;" /></p> <p>An equation that contains fractional exponents.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-86e05c0bfb008c69091c9a655c6762b9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^{1/2} + 6x + 3" title="Rendered by QuickLaTeX.com" height="22" width="131" style="vertical-align: -2px;" /></p> <p>An equation that contains radicals.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0e61469988ebfc24ce1292d0a40e4005_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2 + \sqrt{6x} + 3" title="Rendered by QuickLaTeX.com" height="22" width="132" style="vertical-align: -3px;" /></p> <p>&nbsp;</p> <h5><strong>Evaluating Polynomials</strong></h5> <p>Evaluating polynomials is just like solving any other math problem; make sure to use the order of operations. The order of operations is abbreviated to PEMDAS, which stands for Parentheses, Exponents, Multiplication and Division, and Addition and Subtraction. To start, just plug the value for x or whatever letter you’re working with and then use the order of operations until you get your most simplified answer. Check out the example below:</p> <p>&nbsp;</p> <ul><li>Evaluate <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b5a6d2b55c01c7ebbecbaa31eda4067f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="8x^2 + 4x + 7" title="Rendered by QuickLaTeX.com" height="21" width="117" style="vertical-align: -2px;" /> when <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4f433e0d0947e76f73a3249867e748b9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 2" title="Rendered by QuickLaTeX.com" height="14" width="50" style="vertical-align: 0px;" /></li> <li>Step 1: Substitute <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-00e3540ce4707e255c65bda35609f22d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="8 \ast 2^2 + 4 \ast 2 + 7" title="Rendered by QuickLaTeX.com" height="21" width="154" style="vertical-align: -2px;" /></li> </ul> </li> <li>Step 2: P<span style="background-color: #ffcc00">E</span>MDAS <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6da45851d869cbc07cafdf1aa88487d5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="8 \ast 4 + 4 \ast 2 + 7" title="Rendered by QuickLaTeX.com" height="17" width="144" style="vertical-align: -2px;" /></li> </ul> </li> <li>Step 3: PE<span style="background-color: #ffcc00">M</span>DAS <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ea68981d579b80e6cb1c687370db995d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="32 + 8 + 7" title="Rendered by QuickLaTeX.com" height="17" width="94" style="vertical-align: -2px;" /></li> </ul> </li> <li>Step 4: PEMD<span style="background-color: #ffcc00">A</span>S <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f17d5dc31861fc4b992335e08c189797_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="47" title="Rendered by QuickLaTeX.com" height="15" width="22" style="vertical-align: 0px;" /></li> </ul> </li> </ul> <p>&nbsp;</p> <h5><strong>Adding Polynomials</strong></h5> <p>When adding polynomials, keep in mind that you can only add together like terms. The like terms are highlighted in different colors in the example below. Don’t be intimidated by the parentheses when adding – just add like you normally would.</p> <p><img class="alignnone size-full wp-image-306" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D19-300x167-1.png" alt="Color-coded example of adding polynomials" width="300" height="167" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D19-300x167-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D19-300x167-1-65x36.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D19-300x167-1-225x125.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Subtracting Polynomials</strong></h5> <p>Just like when adding polynomials, only like terms can be subtracted from one another. However, in this case, you do need to keep the parentheses in mind because of the minus side to the left of the second polynomial. Treat the minus sign like a -1, as if you were about to multiply everything in the parentheses by -1. This means that everything that was once positive will be negative and vice versa.</p> <p><img class="alignnone size-full wp-image-307" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D20-300x171-1.png" alt="Color-coded example of subtracting polynomials" width="300" height="171" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D20-300x171-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D20-300x171-1-65x37.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D20-300x171-1-225x128.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Polynomials with Two Variables</strong></h5> <p>Don’t get distracted by the new variable! The rules from before still apply. Just be sure to separate each type of number by the base and then simplify.</p> <p><img class="alignnone size-full wp-image-308" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D21-300x224-1.png" alt="Color-coded example of two-variable polynomials" width="300" height="224" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D21-300x224-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D21-300x224-1-65x49.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D21-300x224-1-225x168.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Multiplying Monomials</strong></h5> <p>When working in equations that involve variables, multiplying two of the same variable results in an “addition” of exponents. For equations that simply have an “x” or “y,” imagine there’s a 1 exponent above it.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2b5eff67c71a4204cdb4f3fd3d24afed_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3x)(5x) = 15x^2" title="Rendered by QuickLaTeX.com" height="25" width="146" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8c3341f20eb86d50ba093edb4f97dd47_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(4x)(6x^2) = 24x^3" title="Rendered by QuickLaTeX.com" height="25" width="155" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-585d6c800df5859b53ea74d07fe6aa40_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(2xy)(3x) = 6x^2y" title="Rendered by QuickLaTeX.com" height="25" width="158" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6520e2c739581d10b7fe65c71e3de893_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3xy)(4xy) = 12x^2y^2" title="Rendered by QuickLaTeX.com" height="25" width="187" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Multiplying Monomials by Polynomials</strong></h5> <p>When multiplying monomials by polynomials, it’s important to multiply the monomial term by every term within the polynomial.</p> <p><img class="alignnone size-full wp-image-309" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D23-300x270-1.png" alt="Example of multiplying monomials by polynomials" width="300" height="270" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D23-300x270-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D23-300x270-1-65x59.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D23-300x270-1-225x203.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Using FOIL</strong></h5> <p>When multiplying two different polynomials, remember to FOIL. First, multiply the two <strong>F</strong>irst variables in each polynomial. Then, multiply the <strong>O</strong>utside variables together. Next, multiply the <strong>I</strong>nside variables. Finally, multiply the <strong>L</strong>ast variable of each polynomial.</p> <p><img class="alignnone size-full wp-image-310" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D24-300x271-1.png" alt="Example of how to use FOIL protocol" width="300" height="271" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D24-300x271-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D24-300x271-1-65x59.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D24-300x271-1-225x203.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Multiplying Binomials</strong></h5> <p>When multiplying binomials, it’s important to remember that each term of the first binomial should be multiplied with each term of the second.</p> <p><img class="alignnone size-full wp-image-311" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D25-300x280-1.png" alt="Example of how to multiply binomials" width="300" height="280" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D25-300x280-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D25-300x280-1-65x61.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D25-300x280-1-225x210.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Multiplying Larger Polynomials</strong></h5> <p>Don’t be intimidated by the added numbers. Just keep in mind that each variable needs to be multiplied by the other variables at some point. With larger polynomials like this one <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-cf0333834417fc54b9becc0bd4e8eeeb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="((a+b+c)(x+y+z))" title="Rendered by QuickLaTeX.com" height="22" width="212" style="vertical-align: -6px;" />, I typically just go from left to right: <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-632ee5757b4b4d0a1fa45c9069e6a3ad_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="a \ast x, a \ast y, a \ast z, b \ast x, b \ast y, b \ast z, c \ast x, c \ast y, c \ast z" title="Rendered by QuickLaTeX.com" height="19" width="443" style="vertical-align: -4px;" />.</p> <p>Check out the example below.</p> <p><img class="alignnone size-full wp-image-312" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26.png" alt="Color-coded example of multiplying large polynomials" width="426" height="479" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26.png 426w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26-267x300.png 267w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26-65x73.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26-225x253.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D26-350x394.png 350w" /></p> <p><img class="alignnone size-full wp-image-313" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D27-300x264-1.png" alt="Continuation of color-coded large-polynomial multiplication" width="300" height="264" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D27-300x264-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D27-300x264-1-65x57.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-D27-300x264-1-225x198.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Dividing Polynomials by Monomials</strong></h5> <p>Dividing a Polynomial by a Monomial of often easier when one breaks it up into smaller pieces.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9904ff7974b9d70b89a38e3dc211d82a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{6m^2 + 2m}{2m} = (\frac{6m^2}{2m}) + (\frac{2m}{2m}) = 3m + 1" title="Rendered by QuickLaTeX.com" height="29" width="310" style="vertical-align: -7px;" /></p> <p>&nbsp;</p> <h5><strong>Dividing a Polynomial by Another Polynomial</strong></h5> <p>Solve <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8f360ec9cd5ca3855decfece00feb0f5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{4x^2 + 7 +8x^3}{2x + 3}" title="Rendered by QuickLaTeX.com" height="31" width="85" style="vertical-align: -9px;" /></p> <p>Step 1: Rewrite in descending powers and include missing variables (<img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1679c5ffe52b223b4bc8eb3ba3e9c600_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="0x" title="Rendered by QuickLaTeX.com" height="14" width="23" style="vertical-align: 0px;" />)</p> <p style="padding-left: 40px"><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d2116a994c48b70b21722433975019d5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{8x^3 + 4x^2 +0x +7}{2x + 3}" title="Rendered by QuickLaTeX.com" height="31" width="115" style="vertical-align: -9px;" /></p> <p>Step 2: Long divide, asking yourself “what number multiplied by 2x would equal the number I’m focusing on?”</p> <p style="padding-left: 40px"><img class="alignnone size-full wp-image-442" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division.png" alt="An image showing how to divide two polynomials" width="386" height="260" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division.png 386w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division-300x202.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division-65x44.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division-225x152.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/polynomial-division-350x236.png 350w" /></p> <p>Step 3: Write the remainder out dividing it by the original polynomial</p> <p style="padding-left: 40px">Thus, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c618535675290924cc6c2e20bfb6feb7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{4x^2 + 7 +8x^3}{2x + 3} = 4x^2 - 4x + 6 + \frac{-11}{2x + 3}" title="Rendered by QuickLaTeX.com" height="31" width="298" style="vertical-align: -9px;" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/09/13/math-83-unit-d-overview/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/09/13/math-83-unit-d-overview/">Math Support Center blog</a> at the University of Baltimore on September 13, 2019.&nbsp;</em></p> </div></div>
<div class="chapter standard" id="chapter-factoring-polynomials" title="Factoring Polynomials"><div class="chapter-title-wrap"><h3 class="chapter-number">30</h3><h2 class="chapter-title">Factoring Polynomials</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Finding the Greatest Common Factor (GCF)</strong></h5> <p>Finding the greatest common factor simply involves finding the largest number or term which will fit evenly into each number or term in a list. The way I like to go about this is by breaking each number or term into its smallest parts. Break each number down until you are multiplying together only prime numbers. All the numbers that they have in common should then be multiplied back up to create the GCF.</p> <p>This is what it would look like in a list of numbers:</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-75039935b9148d0a07eaa7046852b71a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="28 = 2 \ast 2 \ast 7" title="Rendered by QuickLaTeX.com" height="15" width="120" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-29944cd94ef4e9cfd7c4ef1afb0ab955_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="40 = 2 \ast 2 \ast 2 \ast 5" title="Rendered by QuickLaTeX.com" height="15" width="150" style="vertical-align: 0px;" /></li> <li>The greatest common factor is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3b56ee24966d061cd4ca647f44a1b79b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2 \ast 2 = 4" title="Rendered by QuickLaTeX.com" height="14" width="78" style="vertical-align: 0px;" />.</li> </ul> <p>And this is what it would look like in a list of terms:</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-49debcc304fc0546cdfae2ed4bfba13c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^3 = x \ast x \ast x" title="Rendered by QuickLaTeX.com" height="19" width="125" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-28c83fd880f2df13dda672a609fd0043_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^5 = x \ast x \ast x \ast x \ast x" title="Rendered by QuickLaTeX.com" height="19" width="189" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ddfdd0b612936db21cf7531b9b5e8c03_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 = x \ast x" title="Rendered by QuickLaTeX.com" height="19" width="93" style="vertical-align: 0px;" /></li> <li>The greatest common factor is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7a926b0f5a49bc198876797058f11010_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x \ast x = x^2" title="Rendered by QuickLaTeX.com" height="19" width="92" style="vertical-align: 0px;" /></li> </ul> <p>Sometimes, you will have to factor out the GCF in a function. In this case, treat each term like an individual and follow the instructions mentioned above. Then, whatever the GCF is, divide each term by that GCF, rewrite the function, and make sure to place the GCF outside of a parenthesis which contains the new function.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-bcc4cf4c8510b2f94c6157c4d6980a58_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="8x + 14" title="Rendered by QuickLaTeX.com" height="16" width="69" style="vertical-align: -2px;" /> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-65605a9bfa4d4e40cfef81d4a751f1eb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2 \ast 4 \ast x + 2 \ast 7" title="Rendered by QuickLaTeX.com" height="17" width="139" style="vertical-align: -2px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-65605a9bfa4d4e40cfef81d4a751f1eb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2 \ast 4 \ast x + 2 \ast 7" title="Rendered by QuickLaTeX.com" height="17" width="139" style="vertical-align: -2px;" />; 2 is the greatest common factor</li> </ul> </li> <li>The final answer is <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0ed9106490967e1bc2a9e3dd7a4bf814_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2(4x+7)" title="Rendered by QuickLaTeX.com" height="22" width="84" style="vertical-align: -6px;" /></li> </ul> <p>&nbsp;</p> <h5><strong>Factoring by Grouping</strong></h5> <ul><li>Step 1: Group the terms in two groups of two terms so that each group has a common factor.</li> <li>Step 2: Factor out the GCF from each group.</li> <li>Step 3: If there is a common binomial factor, factor it out.</li> <li>Step 4: If not, rearrange the terms and try these steps again.</li> <li>Step 5: Make sure to check your work by multiplying (FOIL-ing) your answer to see if you can get back to the original problem.</li> </ul> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-91b63bf1fcd9a803349b9e0a09c2cc2e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="xy + 2x +3y + 6" title="Rendered by QuickLaTeX.com" height="18" width="155" style="vertical-align: -4px;" /></p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7321845f5a7cab5e921798c4bf9b6ccd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(xy + 2x) + (3y + 6)" title="Rendered by QuickLaTeX.com" height="22" width="185" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f1b219c7e4d5b9c3edd4206c59a688db_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x(y+2) + 3(y + 2)" title="Rendered by QuickLaTeX.com" height="22" width="175" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f1b219c7e4d5b9c3edd4206c59a688db_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x(y+2) + 3(y + 2)" title="Rendered by QuickLaTeX.com" height="22" width="175" style="vertical-align: -6px;" /></li> </ul> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3f3a58e6eabe4941ab260787455c7953_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(y + 2)(x + 3)" title="Rendered by QuickLaTeX.com" height="22" width="126" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Factoring Trinomials of the Form</strong> <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2ef003a8cf52e86cf18ff48687d60bdf_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2+bx+c" title="Rendered by QuickLaTeX.com" height="21" width="104" style="vertical-align: -2px;" /></h5> <p>Sometimes trinomials can be factored into two binomials. It helps to make a chart in order to find the correct pairs of numbers. The pairs of numbers to go into (x+_)(x+_) depend on whether the sum of a pair of factors for the third term is equal to the number presented in the second term.</p> <p><img class="alignnone size-full wp-image-321" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5.png" alt="Example of factoring trinomials" width="496" height="266" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5.png 496w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5-300x161.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5-65x35.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5-225x121.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E5-350x188.png 350w" /></p> <p>&nbsp;</p> <h5><strong>Factoring Trinomials of the Form</strong> <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3d98d80cc37e810a0d33cfb9f66ed6ea_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="ax^2+bx+c" title="Rendered by QuickLaTeX.com" height="21" width="115" style="vertical-align: -2px;" /></h5> <p>Step 1: Split the first term to <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1703b8ddfe898cd3fe3c437bb689e612_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="ax" title="Rendered by QuickLaTeX.com" height="9" width="23" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" />.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2376b462e8d6047d3ffe304b4fd03089_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x^2 + 11x + 6 = (3x + ?)(x + ?)" title="Rendered by QuickLaTeX.com" height="25" width="273" style="vertical-align: -6px;" /></li> </ul> <p>Step 2: Find all possible factors of the third term.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-27fe690262e4ba8ee3b1fa90d23d54b1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="6 = 1 \ast 6" title="Rendered by QuickLaTeX.com" height="14" width="79" style="vertical-align: 0px;" /> or <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-135fed30524f6f7dd961cc596214d6d5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="6 = 2 \ast 3" title="Rendered by QuickLaTeX.com" height="14" width="79" style="vertical-align: 0px;" /></li> </ul> <p>Step 3: Test Factors. This first one creates the wrong middle number.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a76a252bae009905f51212a7b7ad3816_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3x + 1)(x + 6) = 3x^2 + 19x + 6" title="Rendered by QuickLaTeX.com" height="25" width="292" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1495e36357bc4bcc05bd91f5a436018a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3x \ast 6) + (1 \ast x) = 19x" title="Rendered by QuickLaTeX.com" height="22" width="214" style="vertical-align: -6px;" /></li> </ul> <p>This one also doesn’t work. Just keep testing. Sometimes, you will try everything and it still doesn’t quite work. Try splitting ax differently. In this case, 3 can only be split into 3 and 1, but if you’re working with 6, for example, you could try splitting it into 6 and 1 or 2 and 3.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-05c0cd138d0d95752bbe61a026704917_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3x + 6)(x + 1) = 3x^2 + 9x + 6" title="Rendered by QuickLaTeX.com" height="25" width="282" style="vertical-align: -6px;" /></li> </ul> <p>Step 4: Finally, this set works. Make sure to check your work by FOIL-ing.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-db5e969aeede8c610377480ac41c58db_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(3x + 2)(x + 3) = 3x^2 + 11x + 6" title="Rendered by QuickLaTeX.com" height="25" width="292" style="vertical-align: -6px;" /></li> </ul> <p>&nbsp;</p> <h5><strong>Factoring Out the GCF with Polynomials</strong></h5> <p>Sometimes before we can even begin to factor a polynomial, we have the ability to factor out a common factor of all 3 terms. In the example below, notice all the numbers have a GCF of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9b0c36933b917219b4872617f2efbadd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2" title="Rendered by QuickLaTeX.com" height="19" width="30" style="vertical-align: 0px;" />, we can start by factoring it out.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6b5424aabbd32785b8ad738f0a15aa8c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="24x^4 + 40x^3 + 6x^2" title="Rendered by QuickLaTeX.com" height="21" width="166" style="vertical-align: -2px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-604d07cb9380e1704e5817119752fe72_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2(12x^2 + 20x + 3)" title="Rendered by QuickLaTeX.com" height="25" width="184" style="vertical-align: -6px;" /></p> <p>From there, we can continue to factor as we normally would.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-94cb45a1e99411f3a014686aa6605ff6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2(12x^2 + 20x + 3) = 2x^2(2x + 3)(6x + 1)" title="Rendered by QuickLaTeX.com" height="25" width="394" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Factoring Trinomials of the Form</strong> <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3d98d80cc37e810a0d33cfb9f66ed6ea_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="ax^2+bx+c" title="Rendered by QuickLaTeX.com" height="21" width="115" style="vertical-align: -2px;" /> <strong>by Grouping</strong></h5> <p>We can also use the grouping method for trinomials. Take the example problem below.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-04b73e62465044cf978d6ed91226905f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2 + 11x +12" title="Rendered by QuickLaTeX.com" height="21" width="136" style="vertical-align: -2px;" /></p> <p>We can rewrite this so that there is an even number of terms, as shown here:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c2f99e9e2d90f937cf55462d010a3fbd_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2 + ?x + ?x +12" title="Rendered by QuickLaTeX.com" height="21" width="154" style="vertical-align: -2px;" /></p> <p>We want to find a factor of 24 (because 2 from the first term multiplied by 12 of the third term) that also adds up to 11.</p> <p><img class="alignnone size-full wp-image-322" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16.png" alt="Table showing factors of 24" width="570" height="173" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16.png 570w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16-300x91.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16-65x20.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16-225x68.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E16-350x106.png 350w" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d7fd6b11fecd2b5587efff5258838dfc_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="2x^2 + 3x + 8x +12" title="Rendered by QuickLaTeX.com" height="21" width="174" style="vertical-align: -2px;" /></p> <p>Now that we have our middle numbers, we can group.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d3b17f9edb901f713e3ae68503000560_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(2x^2 + 3x) + (8x + 12)" title="Rendered by QuickLaTeX.com" height="25" width="206" style="vertical-align: -6px;" /></p> <p>Now we’ll factor out the greatest common factors within each group. It looks like we have a common binomial in each group.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ad1ba1c2db3ddeab895e695ee7c7779f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x(2x + 3) + 4(2x + 3)" title="Rendered by QuickLaTeX.com" height="22" width="198" style="vertical-align: -6px;" /></p> <p>So our end product is:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9e53a23564a24536ebc788dda314b122_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(2x + 3)(x + 4)" title="Rendered by QuickLaTeX.com" height="22" width="137" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Factoring Perfect Square Trinomials and the Difference of Two Squares</strong></h5> <p>Decide whether <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9707c9beeb9af0c0d3added010f93f55_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 + 8x + 16" title="Rendered by QuickLaTeX.com" height="21" width="116" style="vertical-align: -2px;" /> is a perfect square trinomial</p> <p>Two terms <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5eea69f1781017d713df672c642956f3_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2" title="Rendered by QuickLaTeX.com" height="19" width="20" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-1d3ca3c18e479017bbe5d4f5f9ba4eeb_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="16" title="Rendered by QuickLaTeX.com" height="14" width="20" style="vertical-align: 0px;" /> are squares <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6737f2707c575596e4758c6512f8d4c9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(16 - 4^2)" title="Rendered by QuickLaTeX.com" height="25" width="80" style="vertical-align: -6px;" /></p> <p>Twice the product of <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> and <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0a021498bd4c69254c88b92637194df8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="4" title="Rendered by QuickLaTeX.com" height="14" width="10" style="vertical-align: 0px;" /> is the final term of the trinomial <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b8b1cd7fa8d3c95e191fc0d069213ad7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(2 \ast x \ast 4) = 8" title="Rendered by QuickLaTeX.com" height="22" width="125" style="vertical-align: -6px;" /></p> <p>Thus, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9707c9beeb9af0c0d3added010f93f55_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 + 8x + 16" title="Rendered by QuickLaTeX.com" height="21" width="116" style="vertical-align: -2px;" /> is a perfect square trinomial</p> <p>This means that it can easily be factored to <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5de35fe2e1a9c0ad7ac5cd74131110c5_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x + 4)^2" title="Rendered by QuickLaTeX.com" height="25" width="71" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Factoring the Difference of Two Squares</strong></h5> <p>When both terms in a binomial are squares (meaning that they are a variable which includes a square or a number which can be evenly square-rooted) and the signs of the terms are different, one can use the following equation to factor it:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-883c9b6b21064df93e332ccf33e21e09_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="a^2 - b^2 = (a + b)(a - b)" title="Rendered by QuickLaTeX.com" height="25" width="216" style="vertical-align: -6px;" /></p> <p>For example, <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-64480f07bd064538d6f4588918be6c62_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="z^2 - 4" title="Rendered by QuickLaTeX.com" height="19" width="56" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b3a0ea2efcd9e3966ceace04a8b930e4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="z^2 - 2^2" title="Rendered by QuickLaTeX.com" height="19" width="65" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-31f54486a9475e4f8a083a93f59b58de_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(z + 2)(z - 2)" title="Rendered by QuickLaTeX.com" height="22" width="124" style="vertical-align: -6px;" /></p> <p>&nbsp;</p> <h5><strong>Solving Quadratic Equations by Factoring</strong></h5> <p>Factoring can also be used to solve quadratic equations, like so:</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-adddb791b9b71da45a0c9bc197108f8f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 - 9x - 22 = 0" title="Rendered by QuickLaTeX.com" height="19" width="154" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-dec877b35544aaf0058faa08198e21f9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 11)(x + 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="176" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9ee9664a914409241e98a51f8a00031a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 11) = 0" title="Rendered by QuickLaTeX.com" height="22" width="111" style="vertical-align: -6px;" />, and so <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d153cbaf87b09c35bd997d5963320564_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 11" title="Rendered by QuickLaTeX.com" height="14" width="60" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-84eacd4fad4574d4eea3b5222f55647c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x + 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="101" style="vertical-align: -6px;" />, and so <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5449777bc8ba6c0dfdfcfdf95125b417_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = -2" title="Rendered by QuickLaTeX.com" height="14" width="66" style="vertical-align: 0px;" /></li> </ul> <p>So the solutions to this quadratic equation are 11 and -2.</p> <p>If the equation has a degree greater than two, it can be solved by factoring and then using the method above.</p> <ul><li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8a651c5233984602170e0618d9646aa1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x^3 - 12x = 0" title="Rendered by QuickLaTeX.com" height="19" width="129" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-95f0e19d04197d01847e8e89e8cedbe4_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x(x^2 - 4) = 0" title="Rendered by QuickLaTeX.com" height="25" width="134" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-fe5ec6bc967d9f2c82ac86080798dc52_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x(x + 2)(x - 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="190" style="vertical-align: -6px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e27d84ee430af5f97fa895831abef116_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x = 0" title="Rendered by QuickLaTeX.com" height="14" width="61" style="vertical-align: 0px;" />, means <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e9861e510ec9fed1f81697af5092c649_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 0" title="Rendered by QuickLaTeX.com" height="14" width="50" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-84eacd4fad4574d4eea3b5222f55647c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x + 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="101" style="vertical-align: -6px;" />, means <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5449777bc8ba6c0dfdfcfdf95125b417_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = -2" title="Rendered by QuickLaTeX.com" height="14" width="66" style="vertical-align: 0px;" /></li> <li><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7a40166c9e4e2969c381a22011ab3c79_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="101" style="vertical-align: -6px;" />, means <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4f433e0d0947e76f73a3249867e748b9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 2" title="Rendered by QuickLaTeX.com" height="14" width="50" style="vertical-align: 0px;" /></li> </ul> <p>So the solutions are 0, 2, and -2.</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/09/25/factoring-polynomials/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/09/25/factoring-polynomials/">Math Support Center blog</a> at the University of Baltimore on September 25, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-rational-expressions-and-equations" title="Rational Expressions and Equations"><div class="chapter-title-wrap"><h3 class="chapter-number">31</h3><h2 class="chapter-title">Rational Expressions and Equations</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>What is a Rational Expression?</strong></h5> <p>A rational number is a number that can be written as a quotient of integers. In other words, it’s any number that can create a nice and neat fraction. A rational expression is also a quotient, it’s just made up of polynomials. A rational expression can be written in the form P/Q. For example:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-ef3f14fb6d4e7a9dc0eac11b85ff0dd9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{2}{3}" title="Rendered by QuickLaTeX.com" height="26" width="9" style="vertical-align: -7px;" /> or <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-04129dcda23804d66fdffa98747e78d7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{3y^8}{8}" title="Rendered by QuickLaTeX.com" height="30" width="25" style="vertical-align: -7px;" /> or <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e1d8d7192e7de8e03f82cfe2c8c8b934_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{-4p}{p^3 + 2p + 1}" title="Rendered by QuickLaTeX.com" height="32" width="68" style="vertical-align: -11px;" /></p> <p>&nbsp;</p> <h5><strong>Evaluating Rational Expressions</strong></h5> <p>Rational expressions have different numerical values depending on what values replace the variables. For example:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d0269bed7132720cd22632d46d0c30f3_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x + 4}{2x-3}" title="Rendered by QuickLaTeX.com" height="26" width="39" style="vertical-align: -7px;" /></p> <p>If <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7438cc68b4189dbfb9a23eb48b639146_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 5" title="Rendered by QuickLaTeX.com" height="15" width="50" style="vertical-align: 0px;" /> then <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-acca965f3e65b1910b95083025ca02b3_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{5+4}{2(5)-3} = \frac{9}{10-3} = \frac{9}{7}" title="Rendered by QuickLaTeX.com" height="31" width="162" style="vertical-align: -12px;" /></p> <p>If <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4f433e0d0947e76f73a3249867e748b9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = 2" title="Rendered by QuickLaTeX.com" height="14" width="50" style="vertical-align: 0px;" /> then <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-6796117c1ea65f48c63aee420ce2f5cc_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{2+4}{2(2)-3} = \frac{6}{4-3} = \frac{6}{1} = 6" title="Rendered by QuickLaTeX.com" height="31" width="193" style="vertical-align: -12px;" /></p> <p>&nbsp;</p> <h5><strong>Identifying When a Rational Expression is Undefined</strong></h5> <p>When the denominator of a rational expression is 0, then it is undefined. There are some equations in which the denominator is sometimes 0 and some in which the denominator is never 0. Take the following equation as an example.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-342da3fc3cbf938d4c3579f18a7d465c_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x}{x-3}" title="Rendered by QuickLaTeX.com" height="22" width="31" style="vertical-align: -7px;" /></p> <p>When we set <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2dcc4883410018980a0a66ddecd545bc_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x-3" title="Rendered by QuickLaTeX.com" height="14" width="48" style="vertical-align: 0px;" /> to equal 0, we find that when <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is 3, the denominator is 0. So when <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> is 3, the rational expression is undefined.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2679547dc51627a56fd0674968772a2d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x^3-6x^2-10x}{3}" title="Rendered by QuickLaTeX.com" height="29" width="94" style="vertical-align: -7px;" /></p> <p>The denominator of the above equation is never 0, and so it is never undefined.</p> <p>Sometimes you will need to factor the denominator to find the variables which make it undefined, like in the following example.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-70eb1a68b5267859ad45c5ab5d2c1d38_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x^2+2}{x^2-3x+2}" title="Rendered by QuickLaTeX.com" height="32" width="69" style="vertical-align: -10px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4b2035fd7856b3fe1cbc20efceba20d1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x^2 - 3x + 2 = 0" title="Rendered by QuickLaTeX.com" height="21" width="144" style="vertical-align: -2px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d1eb9e9b9e0f6cffde9dbb36f049eab1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 2)(x - 1) = 0" title="Rendered by QuickLaTeX.com" height="22" width="165" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-7a40166c9e4e2969c381a22011ab3c79_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 2) = 0" title="Rendered by QuickLaTeX.com" height="22" width="101" style="vertical-align: -6px;" /> <em>and</em> <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f69e969079f66f8da936e53e6314fd91_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x - 1) = 0" title="Rendered by QuickLaTeX.com" height="22" width="101" style="vertical-align: -6px;" /></p> <p>So when <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-18e47b3ed445355ba232dfccfd985d0d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x" title="Rendered by QuickLaTeX.com" height="9" width="12" style="vertical-align: 0px;" /> equals 2 or 1, the expression is undefined.</p> <p>&nbsp;</p> <h5><strong>Simplifying Rational Expressions</strong></h5> <p>Sometimes a fraction made of polynomials can be simplified. This can be done by taking out the greatest common factor, like in the example below.</p> <p><img class="alignnone size-full wp-image-327" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F6-300x264-1.png" alt="Example of simplifying rational expressions" width="300" height="264" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F6-300x264-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F6-300x264-1-65x57.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F6-300x264-1-225x198.png 225w" /></p> <p>This may also be done by factoring the polynomials in both the numerator and denominator.</p> <p><img class="alignnone size-full wp-image-328" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F7-198x300-1.png" alt="Example of simplifying rational expression with polynomials" width="198" height="300" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F7-198x300-1.png 198w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F7-198x300-1-65x98.png 65w" /></p> <p>&nbsp;</p> <h5><strong>Multiplying Rational Expressions</strong></h5> <p>Multiplying rational expressions follow the same rules as multiplying other kinds of fractions. The numerator of one fraction is multiplied by the numerator of the other, and the denominator of one fraction is multiplied by the other.</p> <p><img class="alignnone size-full wp-image-329" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8.png" alt="Example of multiplying rational expressions" width="472" height="86" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8.png 472w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8-300x55.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8-65x12.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8-225x41.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F8-350x64.png 350w" /></p> <p>This goes for polynomials as well. Just be sure to FOIL when necessary.</p> <p><img class="alignnone size-full wp-image-330" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F9-235x300-1.png" alt="Example of multiplying a rational expression with polynomials" width="235" height="300" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F9-235x300-1.png 235w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F9-235x300-1-65x83.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F9-235x300-1-225x287.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Dividing Rational Expressions</strong></h5> <p>Remember that dividing fractions is the same thing as flipping the second fraction and multiplying.</p> <p><img class="alignnone size-full wp-image-331" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10.png" alt="Example of dividing a rational expression" width="504" height="108" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10.png 504w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10-300x64.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10-65x14.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10-225x48.png 225w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F10-350x75.png 350w" /></p> <p>This is also true for polynomials. Again, don’t forget to FOIL when necessary.</p> <p><img class="alignnone size-full wp-image-332" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F11-200x300-1.png" alt="Example of dividing a rational expression with polynomials" width="200" height="300" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F11-200x300-1.png 200w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F11-200x300-1-65x98.png 65w" /></p> <p>&nbsp;</p> <h5><strong>Adding and Subtracting Rational Expressions with the Same Denominator</strong></h5> <p>When adding and subtracting with rational expressions that have the same denominator, all you have to worry about is adding and subtracting what’s in the numerator.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b37e4fa829db10b1fcbbb722045a1af8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{6}{5} + \frac{2}{5} = \frac{6+2}{5} = \frac{8}{5}" title="Rendered by QuickLaTeX.com" height="26" width="150" style="vertical-align: -7px;" /></p> <p>This is true for any kind of term that may be in the numerator or denominator.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e808d2840e3c59813821d07aae2ec58e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{5m}{2n} = \frac{m}{2n} = \frac{(5m+m)}{2n} = \frac{6m}{2n} = \frac{3m}{n}" title="Rendered by QuickLaTeX.com" height="29" width="280" style="vertical-align: -7px;" /></p> <p>or</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f9cf398823e4027682f9effe2551b950_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{2y}{2y-7} - \frac{7}{2y-7} = \frac{2y-7}{2y-7} = \frac{1}{1} = 1" title="Rendered by QuickLaTeX.com" height="31" width="257" style="vertical-align: -10px;" /></p> <p>&nbsp;</p> <h5><strong>Finding the Least Common Denominator</strong></h5> <p>Finding the least common denominator involves breaking each number up into its smallest variables and seeing how those numbers compare so that the smallest number that they both fit into appears. Take a look at the example below. 8 can be broken up into 2*2*2 and 6 can be broken up into 2*3. We need a number that satisfies the need for all of these numbers, even if a few overlap. The solution to this is 2*2*2*3. It incorporates the 8’s need to have 3 2’s and the 6’s need to have a 2 and a 3. 2*2*2*3 = 24 so our LCD is 24.</p> <p><img class="alignnone size-full wp-image-333" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E25.png" alt="Sample equation with LCM" width="271" height="111" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E25.png 271w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E25-65x27.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-E25-225x92.png 225w" /></p> <p>This also works with variables. We’re working with 5x and 15x^2. We need to find a number that fulfills the need of a 5 and an x as well as a 5, a 3, and 2 x’s. The equation 5*x*x*3 fulfills all of those needs. So our LCD is 15x^2.</p> <p><img class="alignnone size-full wp-image-334" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F15-300x260-1.png" alt="Sample variables equation with LCD" width="300" height="260" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F15-300x260-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F15-300x260-1-65x56.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F15-300x260-1-225x195.png 225w" /></p> <p>&nbsp;</p> <h5><strong>Writing Equivalent Rational Expressions</strong></h5> <p>When writing equivalent rational expressions, we are, in a sense, multiplying an expression of 1.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-4aaa6f01651629e248c9ac7aac3df533_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{P}{Q} = \frac{P}{Q} \ast 1 = \frac{P}{Q} \ast \frac{R}{R} = \frac{PR}{QR}" title="Rendered by QuickLaTeX.com" height="29" width="230" style="vertical-align: -10px;" /></p> <p>This is useful for when we want to translate one equation to keep the same value but use a different denominator.</p> <p>In this case, we have to ask ourselves, “what can I multiply by <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-9af8931ce1240f6c292b0cfcaf0cd418_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="9a" title="Rendered by QuickLaTeX.com" height="14" width="22" style="vertical-align: 0px;" /> in order to get <img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-f7bc263b5edf1d08182c3e28c2ca505f_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="27a^2b" title="Rendered by QuickLaTeX.com" height="19" width="49" style="vertical-align: 0px;" />?”</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c3b22cdb801cfa45d8091f6f5632ceb0_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{4b}{9a} = \frac{?}{27a^2b}" title="Rendered by QuickLaTeX.com" height="27" width="91" style="vertical-align: -8px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-72a2992b0da96dd7dec3459018eacd6b_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{4b}{9a} \ast \frac{3ab}{3ab} = \frac{12ab^2}{27a^2b}" title="Rendered by QuickLaTeX.com" height="30" width="139" style="vertical-align: -8px;" /></p> <p>&nbsp;</p> <h5><strong>Adding and Subtracting Rational Expressions with Different Denominators</strong></h5> <p>Find the LCD and then multiply one rational expression by what’s missing from it to get that LCD in the denominator. Make sure to do this with the other fraction, if necessary, so that in the end, both fractions have the same denominator – the LCD.</p> <p><img class="alignnone size-full wp-image-335" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F29-300x203-1.png" alt="Example rational expression with LCD" width="300" height="203" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F29-300x203-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F29-300x203-1-65x44.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2020/12/Unit-F29-300x203-1-225x152.png 225w" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-c9a20adbe58b1d1cb655a60c1f4d4875_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{3}{10x^2} = \frac{7}{25x} = \frac{3(5)}{10x^2(5)} + \frac{7(2x)}{25x(2x)}" title="Rendered by QuickLaTeX.com" height="34" width="266" style="vertical-align: -12px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-2d3f20a129f53428a805f460ad87026d_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{15}{50x^2} + \frac{14x}{50x^2} = \frac{15+14x}{50x^2}" title="Rendered by QuickLaTeX.com" height="27" width="185" style="vertical-align: -8px;" /></p> <p>&nbsp;</p> <h5><strong>Solving Equations Containing Rational Expressions</strong></h5> <p>Sometimes, you will be asked to solve for a missing variable when there are equations containing rational expressions. I recommend doing what is needed so that the denominators of all the numbers are eliminated and you can work with whole numbers again.</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-5836089bbdf249a2e75d6cb50ab70cf7_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{x}{2} + \frac{8}{3} = \frac{1}{6}" title="Rendered by QuickLaTeX.com" height="26" width="90" style="vertical-align: -7px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-e4325deca5c1e931a8da95bfc467a459_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="6(\frac{x}{2}) + 6(\frac{8}{3}) = 6(\frac{1}{6})" title="Rendered by QuickLaTeX.com" height="26" width="173" style="vertical-align: -7px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d7d2d48c762e8fd728f986278e924450_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{6x}{2} + \frac{48}{3} = 1" title="Rendered by QuickLaTeX.com" height="26" width="104" style="vertical-align: -7px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-8c19e54b752876cd97c411e4d394a264_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x + 16 = 1" title="Rendered by QuickLaTeX.com" height="16" width="106" style="vertical-align: -2px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-0eb1dee1866a7ec0f8d10dc8d12e7a2e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="3x = -15" title="Rendered by QuickLaTeX.com" height="15" width="88" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b426730b9bf739f18dee8555f574323a_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = -5" title="Rendered by QuickLaTeX.com" height="15" width="66" style="vertical-align: 0px;" /></p> <p>Another, more complex example would be:</p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d1030f9ccc61b61bda0259ff97469b7e_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{4x}{x^2+x=30} + \frac{2}{x-5} = \frac{1}{x+6}" title="Rendered by QuickLaTeX.com" height="29" width="193" style="vertical-align: -10px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3d215f81ce3b463e2b4c6e9bb2c7d3d9_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="(x+6)(x-5)(\frac{4x}{x^2+x-30} + \frac{2}{x-5}) = (x+6)(x-5)(\frac{1}{(x+6)})" title="Rendered by QuickLaTeX.com" height="31" width="497" style="vertical-align: -12px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-355a65d982aac2c27e6e115b99948aac_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="\frac{(x+6)(x-5)(4x)}{(x+6)(x-5)} + \frac{(2)(x+6)(x-5)}{x-5} = \frac{(x+6)(x-5)}{x+6}" title="Rendered by QuickLaTeX.com" height="34" width="372" style="vertical-align: -12px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-a29c096eb1780a0adfea50fbe4ecbcd6_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="4x + (2)(x + 6) = x - 5" title="Rendered by QuickLaTeX.com" height="22" width="216" style="vertical-align: -6px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-d02c00321d7125279d05c59abf3d3615_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="4x + 2x + 12 = x - 5" title="Rendered by QuickLaTeX.com" height="17" width="194" style="vertical-align: -2px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-3ba6ac216b4c73a5377612d5bf005305_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="6x +12 = x - 5" title="Rendered by QuickLaTeX.com" height="17" width="145" style="vertical-align: -2px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-b6d9dfa07766f312a5e79c8db5da96e8_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="5x = =17" title="Rendered by QuickLaTeX.com" height="15" width="88" style="vertical-align: 0px;" /></p> <p><img src="https://ubalt.pressbooks.pub/app/uploads/quicklatex/quicklatex.com-88d511d61886003b6c80ca1dc5e59ff1_l3.svg" class="ql-img-inline-formula quicklatex-auto-format" alt="x = -\frac{17}{5}" title="Rendered by QuickLaTeX.com" height="26" width="76" style="vertical-align: -7px;" /></p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/10/07/rational-expressions-and-equations/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/10/07/rational-expressions-and-equations/">Math Support Center blog</a> at the University of Baltimore on October 7, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-mathematical-ideas-problem-solving-techniques" title="Mathematical Ideas: Problem-Solving Techniques"><div class="chapter-title-wrap"><h3 class="chapter-number">32</h3><h2 class="chapter-title">Mathematical Ideas: Problem-Solving Techniques</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Solving Problems by Inductive Reasoning</strong></h5> <p>Before we can talk about how to use inductive reasoning, we need to define it and distinguish it from deductive reasoning.</p> <p><strong>Inductive reasoning</strong> is when one makes generalizations based on repeated observations of specific examples. For instance, if I have only ever had mean math teachers, I might draw the conclusion that all math teachers are mean. Because I witnessed multiple instances of mean math teachers and only mean math teachers, I’ve drawn this conclusion. That being said, one of the downfalls of inductive reasoning is that it only takes meeting one nice math teacher for my original conclusion to be proven false. This is called a <strong>counterexample</strong>. Since inductive reasoning can so easily be proven false with one counterexample, we don’t say that a conclusion drawn from inductive reasoning is the absolute truth unless we can also prove it using deductive reasoning. With inductive reasoning, we can never be sure that what is true in a specific case will be true in general, but it is a way of making an educated guess.</p> <p><strong>Deductive reasoning</strong> depends on a hypothesis that is considered to be true. In other words, if X = Y and Y = Z, then we can deduce that X = Z. An example of this might be that if we know for a fact that all dogs are good, and Lucky is a dog, then we can deduce that Lucky is good.</p> <p>&nbsp;</p> <h5><strong>Strategies for Problem Solving</strong></h5> <p>No matter what tool you use to solve a problem, there is a method for going about solving the problem.</p> <p>&nbsp;</p> <ol><li><em>Understand the Problem:</em> You may need to read a problem several times before you can conceptualize it. Don’t become frustrated, and take a walk if you need to. It might take some time to click.</li> <li><em>Devise a Plan:</em> There may be more than one way to solve the problem. Find the way which is most comfortable for you or the most practical.</li> <li><em>Carry Out the Plan:</em> Try it out. You may need to adjust your plan if you run into roadblocks or dead ends.</li> <li><em>Look Back and Check:</em> Make sure your answer gives sense given the context.</li> </ol> <p>There are several different ways one might go about solving a problem. Here are a few:</p> <p>&nbsp;</p> <ul><li><strong>Tables and Charts:</strong> Sometimes you’ll be working with a lot of data or computing a problem with a lot of different steps. It may be best to keep it organized in a table or chart so you can refer back to previous work later.</li> <li><strong>Working Backward:</strong> Sometimes you’ll be given a word problem where they describe a series of algebraic functions that took place and then what the end result is. Sometimes you’ll have to work backward chronologically.</li> <li><strong>Using Trial and Error:</strong> Sometimes you’ll know what mathematical function you need to use but not what number to start with. You may need to use trial and error to get the exact right number.</li> <li><strong>Guessing and Checking:</strong> Sometimes it will appear that a math problem will have more than one correct answer. Be sure to go back and check your work to determine if some of the answers don’t actually work out.</li> <li><strong>Considering a Similar, Simpler Problem:</strong> Sometimes you can use the strategy you think you would like to use on a simpler, hypothetical problem first to see if you can find a pattern and apply it to the harder problem.</li> <li><strong>Drawing a Sketch:</strong> Sometimes—especially with geometrical problems—it’s more helpful to draw a sketch of what is being asked of you.</li> <li><strong>Using Common Sense:</strong> Be sure to read questions very carefully. Sometimes it will seem like the answer to a question is either too obvious or impossible. There is usually a phrasing of the problem which would lead you to believe that the rules are one way when really it’s describing something else. Pay attention to literal language.</li> </ul> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/11/06/mathematical-ideas-problem-solving-techniques/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/11/06/mathematical-ideas-problem-solving-techniques/">Math Support Center blog</a> at the University of Baltimore on November 6, 2019.</em></p> </div></div>
<div class="chapter standard" id="chapter-mathematical-ideas-basic-concepts-of-set-theory" title="Mathematical Ideas: Basic Concepts of Set Theory"><div class="chapter-title-wrap"><h3 class="chapter-number">33</h3><h2 class="chapter-title">Mathematical Ideas: Basic Concepts of Set Theory</h2><h2 class="chapter-author">Jenna Lehmann</h2></div><div class="ugc chapter-ugc"> <h5><strong>Symbols and Terminology</strong></h5> <p>A set is a collection of objects of values that are in this case called <strong>elements</strong> or <strong>members</strong>. They can be described using words, lists, or set-builder notation.</p> <p>&nbsp;</p> <ul><li>Words: a set of odd numbers less than 6</li> <li>Listing: {1,3,5}</li> <li>Set-builder notation: {x|x is an odd counting number less than 6}</li> </ul> <p>If a set has no elements, it’s called an <strong>empty</strong> or <strong>null set</strong> and its symbol is Ø. Make sure not to write this as {Ø}, because that is technically incorrect.</p> <p>It is important to make sure that a set is <strong>well-defined</strong>, meaning that there’s no room for subjective interpretation about whether something belongs in a set or not. An example of a well-defined set is a set of all numbers between 1 and 10. We can say for sure that 5 belongs and 13 doesn’t. A set that is not well defined is a set of all numbers that are aesthetically pleasing. It’s not clear what would define aesthetically pleasing so we’re unsure about whether 5 or 13 would fit.</p> <p>The symbols <strong>∈</strong> and <strong>∉</strong> are used to describe whether something is or isn’t an element of a set. Going back to our example of all numbers between 1-10 (we’ll name this set A) we can say that 5 ∈ A while 13 ∉ A.</p> <p>&nbsp;</p> <h5><strong>Important Definitions for Sets</strong></h5> <p>Here are some important definitions before moving forward:</p> <ul><li><strong>Natural numbers</strong> or <strong>Counting number</strong>s are all integers starting at 1: {1, 2, 3, 4,…}</li> <li><strong>Whole numbers</strong> are all integers starting at 0: {0, 1, 2, 3, 4…}</li> <li><strong>Integers</strong> are all whole numbers from -∞ to ∞: {… -3, -2, -1, 0, 1, 2, 3…}</li> <li><strong>Rational numbers</strong> are numbers that can be created by dividing two integers (like 1/2 or 9/10 or 4/1).</li> <li><strong>Real Numbers</strong> are any number that isn’t imaginary, so the typical integer, fraction, or decimal we’re used to. An imaginary number is when a negative number is square rooted (√-1)</li> <li><strong>Irrational Numbers</strong> are decimals that can’t be expressed as the fraction of two integers. The square root of 2 would be an example of this because the decimals are ongoing and there is no discernible pattern to the decimals.</li> <li>A <strong>cardinal number</strong> is the number of elements in a set and it is written as n(A) and spoken as “n of A.” So if I was given set Z = {1, 5, 7, 2, 9, 10}, I would say n(Z) is 6.</li> <li>A <strong>finite set</strong> is a set that has a whole number as its cardinal number. So we could technically count how many elements are in a set.</li> <li>An <strong>infinite set</strong> is a set where the number of elements is infinite and we couldn’t possibly count them.</li> <li>Two sets are <strong>equal</strong> if two conditions are met: (1) every element of the first set is an element of the second set and (2) if every element of the second set is an element of the first one. That being said, it does not matter if the elements are written in a different order ({a, b, c, d} = {a, c, d, b}) and repeating elements doesn’t add a new element ({a, b, a, c, d, d} = {a, b, c, d}).</li> </ul> <p>&nbsp;</p> <h5><strong>Venn Diagrams and Subsets</strong></h5> <p>The universe in which we are working, or the area that we’re concerned within a set, is called the <strong>universal set</strong>. This consists of everything in the wider set. <strong>Venn diagrams</strong> are often used to discuss commonalities and differences between sets in the universal set. The universal set is everything within the rectangle encompassing the Venn diagram including the Venn diagram itself. The Venn diagram is made up of sets within U and can overlap. Set A is everything in circle A, set B is everything in circle B, and where they overlap are all the elements that they have in common.</p> <p><img class="alignnone size-full wp-image-356" src="https://ubalt.pressbooks.pub/app/uploads/sites/11/2021/01/venn-diagram-example-1-300x213-1.png" alt="Example of a Venn diagram" width="300" height="213" srcset="https://ubalt.pressbooks.pub/app/uploads/sites/11/2021/01/venn-diagram-example-1-300x213-1.png 300w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2021/01/venn-diagram-example-1-300x213-1-65x46.png 65w, https://ubalt.pressbooks.pub/app/uploads/sites/11/2021/01/venn-diagram-example-1-300x213-1-225x160.png 225w" /></p> <p><strong>Complements</strong> of a set are everything that a set is not. The complement of A is A’ (spoken as A prime) and it includes everything in U except what is included in A.</p> <p>&nbsp;</p> <h5><strong>Subsets of a Set</strong></h5> <p>A <strong>subset</strong> is any set that is also part of another set. For example, if U = {1, 2, 3, 4, 5} and A = {1, 4, 5}, then we would say that A is a subset of U. This is denoted like this: A ⊆ U. If B = {1, 2, 4, 7}, because 7 is not part of U, we would say B is not a subset of U, also denoted like this: B ⊄ U.</p> <p>There are different kinds of subsets. Any subset can be called a subset, but some can be described as a <strong>proper subset</strong>. A proper subset is a subset that has elements of a set but not exactly all of the elements in that set. For example, if set Y = {1, 2, 3} and Z = {1, 2, 3, 4}, then we could say that Y is a subset of Z and we could also say that Y is a proper subset of Z because it does not include all of the elements of Z. This is written as such: Y ⊂ Z.</p> <p>Sometimes you will be asked to calculate how many subsets exist within a set. This can be calculated using powers of 2. For example, if I have the set {1, 4, 6, 2, 7}, I can see that there are 5 elements in the set. I make that 5 an exponent of 2 (2^5) to calculate how many subsets are possible: 32. To calculate the number of proper subsets, the equation is (2^n) – 1.</p> <p>&nbsp;</p> <h5><strong>Set Operations</strong></h5> <p>An <strong>intersection</strong> of sets is the elements of two sets that they have in common. For example, {1, 4, 5} ∩ {6, 9, 5} = {5}. Put in other words, A ∩ B = {x|x ∈ A and x ∈ B}. If two sets have no elements in common, they are called <strong>disjointed sets</strong> and written as such: A ∩ B = ø. A <strong>union</strong> of sets is the set of all elements belonging to either set one or set two, written as such: A ∪ B = {x|x ∈ A or x ∈ B}. A <strong>difference</strong> of sets is the set of all elements of the first set and not the second. For example, if set A is {1, 2, 3, 4, 5, 6} and B is {1, 3, 5}, then the difference would be {2, 4,6}. In other words, A – B = {x|x ∈ A and x ∉ B}.</p> <p>When elements are placed in {braces}, it doesn’t matter in what order they are listed. When elements are placed in (parentheses), it’s called an <strong>ordered pair</strong> and it does matter what order they are listed in. In other words (a,b) ≠ (b,a). In the ordered pair (a,b), a is the <strong>first component</strong> and b is the <strong>second component</strong>.</p> <p>A <strong>cartesian product of sets</strong> is a way of creating a set of ordered pairs. It’s written like A X B and when presented with a problem asking you to find cartesian products, you have to create ordered pairs with each number in each set in the order that the notation dictates. For example, if A = {1, 2, 3} and B = {8, 9}, and you were asked to find A X B, then the answer would be {(1,8), (1,9), (2,8), (2,9), (3,8), (3,9)}. If you were asked to find B X A, the answer would be {(8,1), (8,2), (8,3), (9,1), (9,2), (9,3)}. The cardinal number of a cartesian product is going to be the cardinal number of set 1 times the cardinal number of set 2, or n(A) x n(B).</p> <hr /> <p><em>This chapter was originally posted to the <a href="https://blogs.ubalt.edu/mathsupportcenter/2019/11/11/mathematical-ideas-basic-concepts-of-set-theory/" data-url="https://blogs.ubalt.edu/mathsupportcenter/2019/11/11/mathematical-ideas-basic-concepts-of-set-theory/">Math Support Center blog</a> at the University of Baltimore on November 11, 2019.</em></p> </div></div>

</body>
</html>