Advance Praise for Improving the User Experience through Practical Data Analytics Mike Fritz, Manager of Usability, PeopleFluent, Massachusetts, USA Paul D. Berger, Visiting Scholar and Professor of Marketing, and Director of the Master of Science in Marketing Analytics (MSMA) program, Bentley University, Massachusetts, USA “It is only fitting that a statistics book written for UX professionals keeps its users in mind, and Mike Fritz and Paul D. Berger do that masterfully in “Improving the User Experience through Practical Data Analytics”. Readers will find mastering statistical techniques approachable and even enjoyable through the authors’ use of case studies, applied examples, and detailed instructions for analyzing data using both Excel and SPSS in each chapter. A great resource for UX professionals who desire to increase their statistical rigor.” –Nicholas Aramovich, Ph.D., Assistant Professor, Organizational Psychology Program, Alliant International University, San Diego “Improving the User Experience through Practical Data Analytics by Mike Fritz and Paul D. Berger is a handy-dandy desk reference for all usability professionals. Too often usability reports either don’t include necessary statistics, or worse, provide the wrong statistics. With this easy to use guide, now there’s no excuse. The book is laid out so that it is easy to map your study in to one of the chapters and find all the relevant tests and formulas to use.” –Bob Virzi, Raver Consulting, Adjunct Professor of Usability Testing at Bentley University, Waltham, MA (MS in Human Factors in Information Design program) “Are you intimidated by statistical concepts? Has your manager asked you if your test results are “significant”? Are you concerned that you are using the appropriate statistical test? Can you use Excel, a tool that you probably already know well, to do statistical analysis of your test data? “Improving the User Experience through Practical Data Analytics” by Mike Fritz and Paul D. Berger helps answer these questions with clear explanations of statistical concepts and methods, step-by-step procedures for using Excel as your analysis tool, case studies, tips on how to avoid common errors and biases, and a good dose of humor. Each chapter in the book describes a method, ranging from paired- samples t-tests to regression analysis, using a case study to put the method in context – an extremely useful and user-friendly approach. I recommend this book for all designers and researchers who are trying to understand both what method should be used and how to use that method.” –Chauncey E. Wilson, UX Architect, USA “This book stays above statistical detail by posing realistic business scenarios, then clarifying the concepts of data-driven design for usability students and prac- titioners, giving this growing profession the objectivity and repeatability that VPs seek. Choosing the applicable method and particularly interpreting the analysis for a business problem is emphasized while illustrated computer applications handle the numerical load.” –Charles N. Abernethy, BSEE, PhD, CHFP Improving the User Experience through Practical Data Analytics Gain Meaningful Insight and Increase Your Bottom Line Mike Fritz Paul D. Berger AMSTERDAM • BOSTON • HEIDELBERG • LONDON NEW YORK • OXFORD • PARIS • SAN DIEGO SAN FRANCISCO • SINGAPORE• SYDNEY • TOKYO Morgan Kaufmann is an imprint of Elsevier Acquiring Editor: Todd Green Editorial Project Manager: Kaitlin Herbert Project Manager: Punithavathy Govindaradjane Designer: Maria Inês Cruz Morgan Kaufmann is an imprint of Elsevier 225 Wyman Street, Waltham, MA 02451, USA Copyright © 2015 Elsevier Inc. All rights reserved. All Illustrations Copyright © 2015 Rick Pinchera. All Rights Reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions. This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein). Notices All characters appearing in this work are fictitious. Any resemblance to real persons, living or dead, is purely coincidental. Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility. To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein. ISBN: 978-0-12-800635-1 British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library Library of Congress Cataloging-in-Publication Data A catalogue record for this book is available from the Library of Congress For information on all Morgan Kaufmann publications visit our website at www.mkp.com To Mary, the love of my life – Mike To my wonderful wife, Susan – Paul Preface The book will help you utilize both descriptive and predictive statistical techniques to gain meaningful insight from data collected employing traditional UX research methods, including moderated usability studies, unmoderated usability studies, surveys and contextual inquiries. However, the analytic methods we described can easily be applied to data collected in a myriad of other UX research methods, including focus groups, live Web site analytics, card sorting, competitive research, and physiological testing like eye tracking, heart rate variance, and skin conductance. This book is a how-to guide, not a treatise on statistics. We provide practical advise about which methods to use in what situations, and how to interpret the results in a meaningful way. In addition, the book provides lots of easy-to-grasp tutoring for those who have a limited knowledge of statistics. We hope the book makes many of the calculations— such as calculating a simple correlation coefficient—seem almost effortless, while providing all the necessary “hand-holding” when utilizing a more complex method, such as logistic regression. WHY WE WROTE THE BOOK Over the past 5 years, excellent books have been published regarding collecting, analyzing, and presenting usability metrics. Arguably, the best ones in this category are the Morgan Kaufmann books, including Measuring the User Experience by Tom Tullis and Bill Albert, Beyond the Usability Lab by Bill Albert, Tom Tullis and Donna Tedesco, and Quantifying the User Experience by Jeff Sauro and James R. Lewis. These books do an outstanding job of instructing the UX professional how to choose the right metric, apply it, and effectively use the information it reveals. And yet, as we surveyed the UX research literature landscape, we saw there was currently no book that urges UX professionals to use predictive and other advanced statistical tools in their work. (The current books on usability metrics leave out the techniques often used for data analysis, such as multiple regression analysis.) But these statistical tools—which begin with basic correlation and regression analysis— are now fairly easy to access. In fact, if you have Excel, you probably have most of these tools already at your fingertips! At the same time, we recognize that many UX researchers come to the profession without formal statistical training. As a matter of fact, usability studies, contextual inquiries, surveys, and other UX research methods are sometimes performed on an ad hoc basis by designers, information architects, and front-end coders who have had no formal training in these methods, let alone training in the statistical tools used in the analysis of the data collected through such methods. xv xvi Preface Because of these realities, we start with an introductory chapter on basic statistical fundamentals. Then, we proceed gently into basic means comparison and ANOVA models. Then we move into basic correlation and more advanced regression analyses. Throughout, we strive to make techniques such as means comparisons, correlation, and regression analysis so easy to understand and apply that you will naturally turn to one of them after collecting your data. Armed with the meaning of the results, you will be able to make design decisions with authority and the backing of empirical evidence. HOW THIS BOOK IS SPECIAL • We show the real-world application of these techniques through the vignettes that begin and close each chapter. By seeing parallels between the problems introduced and resolved in each chapter and your own work, you’ll easily be able to ascertain the right statistical method to use in your particular UX research project. In addition, our hope is that you’ll find the vignettes, and the accompanying illustrations, entertaining. All characters appearing in this work are fictitious. Any resemblance to real persons, living or dead, is purely coincidental. • We provide clear insight into the statistical principles without getting bogged down in a theoretical treatise. But, we provide enough theory for you to understand why you’re applying a certain technique. After all, understanding why you’re doing something is just as important as knowing what you’re doing. • We minimize the amount of mathematical detail, while still doing full justice to the mathematical rigor of the presentation and the precision of our statements. In addition, many of our numerical examples use simple numbers. (This is a choice we consciously made, and it embraces a question posed by Ching Chun Li, Professor of Biometry at the University of Pittsburgh (1912–2003), which the authors took to heart and have incorporated into their writing: “How does one first learn to solve quadratic equations? By working with equations such as 242.5X2 − 683.1X − 19428.5 = 0, or with equations like X2 − 5X − 6 = 0?”) Our belief is that simpler numerical calculations aid the readers in the intuitive understanding of the material to a degree that more than offsets any disadvan- tage from using numbers that don’t look “real.” • We focus on how to get the software to do the analysis. There are a few exceptions, in those cases where Excel does not provide a built-in solution, when we show you how to use other Excel commands to duplicate, as closely as possible, what Excel would do if the technique were a built-in available one. Also, we provide end-of-chapter exercises that encourage, demonstrate, and, indeed, require the use of the statistical software described. By the way, we do not apologize for writing our chapters in a way that does not insist that the reader understand what the software is doing under the hood! Preface xvii • We’ve provided additional explanatory commentary through sidebars. The information contained in the sidebars is not essential to the task of applying the analytics to the research problem at hand, but we believe they add richness to the discussion. THE SOFTWARE WE USE We illustrate the use of statistical software packages with Excel and SPSS (Statistical Package for the Social Sciences). There are a large number of displays of both software packages in action. The Excel displays illustrate Excel 2007 for the PC. There is a specific module within Excel, named “Data Analysis,” that needs to be activated. We show you how to perform this activation. Once you are using “Data Analysis,” there is no difference at all between the Excel 2007 and Excel 2010. Since there are some minor—and not so minor—differences between the PC and Mac versions of Excel, we’ve provided a Mac addendum at the end of the book that shows you how to complete the same tasks step-by-step on the Mac version. Most of our displays of SPSS illustrate SPSS Edition 19. In the later chapters, we illustrate SPSS using SPSS Edition 22, the most recent version. For purposes of the techniques and analyses discussed and performed in this book, there is no meaningful difference between the two editions in how the techniques are accessed, and the resulting output format. (If you purchase SPSS, make sure that these techniques described in the book are available in your version before you buy; there are many different versions with different prices.) WHAT YOU NEED TO ALREADY KNOW Nothing! For the statistical beginner, we provide a chapter dedicated to some basic statistical concepts. We wrote this chapter assuming that a reader has not studied the subject matter before, but we believe that the vast majority of readers, even if they have studied the material before, will benefit from at least a cursory reading of this first chapter. The two key topics that we emphasize in the chapter are confidence intervals and hypothesis testing. We also provide some background for these two topics, centering around discussion of the bell-shaped (i.e., normal) probability distribution. A few other useful topics from a typical introductory statistics course are reviewed on an ad hoc basis. The principles and techniques discussed in this book transcend the area of their application to the UX field; the only difference from one application area to another is that different situations arise with different frequency, and correspondingly, the use of various techniques occurs with different frequency. Still, it is always helpful for people to actually see applications in their area of endeavor, and thus, we never forget that the aim is to illustrate application of the techniques in the UX area. After all, xviii Preface many people beginning their study of predictive analytics and statistical techniques “don’t know what they don’t know;” this includes envisioning the ways in which the material can be usefully applied. We assume a modest working knowledge of high school algebra. On occasion, we believe it is necessary to go a small distance beyond routine high school algebra. But, we strive to minimize the frequency of these occasions, and when it is necessary, we explain why, in the most intuitive way that we can. These circumstances exemplify how we aim to walk the fine line noted above: minimal mathematical presentation without compromising the rigor of the material or the precision of our statements. ORGANIZATION AND COVERAGE Our goal was to write a book that covered the most important and commonly used statistical methods employed by UX researchers. We have strived to keep the scope of the book at a level that is compatible with what most UX researchers can handle without great difficulty. At various points in the book, we refer to areas that we believe are beyond the scope of the book. However, these are not areas in which a lack of knowledge will materially hamper a cogent analysis and allow meaningful conclusions to be drawn from the methods demonstrated. We have made attempts to be consistent in our ordering of the topics and the references from one chapter to another. For example, in the six Chapters, 2–7, we essentially present three different techniques, devoting one chapter to the case of independent data and the other to the case of a “repeated-measures”/“within-subjects” design. Both of these situations arise frequently in the UX world, and we believe that it is important to not only know how to handle each case analytically, but to be able to recognize which case is applicable in a given situation, and how to “design” each situation. With this view, Chapters 2 and 3 go together, as do Chapters 4 and 5, and also Chapters 6 and 7. A special highlight of our book is the extensive coverage of the topic of correlation and regression analysis. We have three separate and extensive Chapters (9–11) on simple regression, multiple and stepwise regression, and binary logistic regression. EXERCISES AND SUPPLEMENTARY MATERIAL Each chapter (except for the introductory Chapter 1) has Exercises at the end of the chapter. The data for these exercises, in both Excel and SPSS (except in a few cases where Excel does not have the necessary functionality), and the corresponding out- put in Excel and SPSS, are available on the book’s companion Web site (booksite. elsevier.com/9780128006351). Also present on the Web site, on a separate file for each exercise, is a discussion of the solution based on the software output. The Exer- cise section of each chapter provides the exact names of the aforementioned files. About the Authors MIKE FRITZ Mike Fritz has been helping businesses make their products both more usable and useful for over 20 years. An ardent proponent of the user- centered design process, he’s helped to maxi- mize the user experience for Verizon, Monster, GlaxoSmithKline, Lilly, Windstream, Fidelity Investments, Forrester, WGBH (Boston), and PeopleFluent, among others. Mike’s specialty is collecting user data through a variety of UX research methods—including moderated and unmoderated usability tests, contextual inquiries, surveys, Web analytics, focus groups, interviews, and more—and making informed design deci- sions based on meaningful interpretation of that data. Mike’s motto of “Test Early, Test Often” has resulted in exceptional user expe- riences, whether you’re installing FiOS or applying for a job. Currently, Mike is the Manger of Usability at PeopleFluent, the leading provider of total talent management software and services. Mike is also CEO and Founder of BigUXData.com, a user experience research and design firm; the firm’s emphasis is on collecting and interpreting data collected from variety of UX research methods to inform designs that will maximize the usability and utility of any product or service. Mike holds a Bachelor in Arts in Journalism from the University of South Carolina and a Masters in Science in Human Factors in Information Design from Bentley University. In his free time, he plays jazz piano in the Boston area and swims in his beloved Lake Cochituate. You can reach Mike at [email protected]. PAUL D. BERGER Paul D. Berger is a Visiting Scholar and Profes- sor of Marketing at Bentley University, where he is also the director of the Master of Science in Marketing Analytics (MSMA) program. He earned his SB, SM, and PhD degrees from the Massachusetts Institute of Technology (MIT), Sloan School of Management. He has published several texts, including Experimental Design: with Applications in Management, Engineering, and xix