The SciTech Lawyer
VOLUME 14 ISSUE 1 | FALL 2017 | SECTION OF SCIENCE & TECHNOLOGY LAW | AMERICAN BAR ASSOCIATION

ALGORITHMS: IN CONTROL?
IN THIS ISSUE: MACHINE LEARNING • MEDICINE • AUTONOMOUS VEHICLES
LISA R. LIFSHITZ, LOIS D. MERMELSTEIN, AND LARRY W. THORPE, ISSUE EDITORS

Published in The SciTech Lawyer, Volume 14, Number 1, Fall 2017. © 2017 American Bar Association. Reproduced with permission. All rights reserved. This information or any portion thereof may not be copied or disseminated in any form or by any means or stored in an electronic database or retrieval system without the express written consent of the American Bar Association.

MESSAGE FROM THE CHAIR
Eileen Smith Ewing, Chair 2016–17

Artificial Intelligence: Revolution or Evolution?

Many fear advances in artificial intelligence (AI). No less a mind than that of Stephen Hawking said, “The development of full artificial intelligence could spell the end of the human race. . . . It would take off on its own, and re-design itself at an ever-increasing rate. Humans, who are limited by slow biological evolution, couldn’t compete, and would be superseded.”

Of course, other promising, revolutionary technological advances have raised similar, Armageddon-like fears: cloning and gene alteration, to name a few. As lawyers committed to the use of science for the betterment of humanity, we face a special challenge—it is we who must work alongside scientists to promote, monitor, and regulate the best uses of technological innovation—and to draft policies on potentially harmful uses. It’s a significant job, but one the members of this Section (and indeed readers of and contributors to this magazine) are uniquely qualified to approach.

In their respective articles in this issue, Natasha Duarte and April Doss each take on broad ethical and policy issues affecting the development of AI across fields of knowledge. Duarte introduces possible ethical frameworks for the use of AI in analyzing big data and making automated decisions. Doss argues that our traditional, common-law approach to forming law and policy—namely, the gradual accumulation of judicial decisions—is simply not dynamic enough to meet the rapidity of change in areas like AI.

There are a number of excellent pieces in this issue on more granular AI topics as well. In medicine, Matt Henshon advises the use of AI in small ways, to enhance human decision making. Aubrey Haddach and Jeffrey Licitra demonstrate the high human cost of an AI-driven false positive in medical diagnostics. Privacy issues come to the fore in Kay Firth-Butterfield’s article on data privacy and AI: the European Union, for one, seeks transparency in how automated decision-making systems may reach adverse decisions against consumers.

Professor Gary Marchant offers a hopeful note on how AI may affect the practice of law. He concludes we lawyers face an evolution, not a revolution, as many developing technologies will benefit our efforts but not replace the need for human oversight.

Speaking of change and evolution, this is my farewell column as Section Chair. In August, we had a very productive ABA Annual Meeting in New York City, during which we cosponsored a very successful ABA Showcase Program on Cybersecurity (the Section’s Eric Hibbard was among the panelists). Our past Section Chair Heather Rafter offered a timely CLE panel on copyright issues affecting music and technology. Committee leaders Katherine Lewis (our incoming Secretary) and Barron Oda provided two fascinating arts-focused CLE programs—one on virtual and augmented reality; the other on technological advances in detecting art forgery.

At the conclusion of the Annual Meeting, it was my privilege to pass the gavel to our new Section Chair: David Z. Bodenheimer of Washington, D.C. Nationally ranked by Chambers USA in the area of government contracts, David’s expertise in that field and in related areas, such as privacy, cybersecurity, and homeland security, will be a great boon to our Section. I commend him to you all.

EDITORIAL BOARD

CO-EDITOR-IN-CHIEF
LOIS D. MERMELSTEIN, The Law Office of Lois D. Mermelstein, Austin, TX, lois@loismermelstein.com

CO-EDITOR-IN-CHIEF
LISA R. LIFSHITZ, Torkin Manes LLP, Toronto, ON

DEPUTY-EDITOR-IN-CHIEF
CAROL HENDERSON, Stetson University College of Law, Gulfport, FL

ASSISTANT EDITORS
MICHAEL A. AISENBERG, Mitre Corp., McLean, VA
BEVERLY ALLEN, Washington, DC
HAROLD L. BURSTYN, Furgang & Adwar LLP, Syracuse, NY
KRISTA CARVER, Covington & Burling LLP, Washington, DC
EILEEN SMITH EWING, Boston, MA
PETER J. GILLESPIE, Laner Muchin, Ltd., Chicago, IL
AVERY GOLDSTEIN, Blue Filament Law, Birmingham, MI
STEPHEN M. GOODMAN, Pryor Cashman LLP, New York, NY
MATTHEW HENSHON, Henshon Klein LLP, Boston, MA
PETER MCLAUGHLIN, Burns & Levinson LLP, Boston, MA
SARAH MCMILLAN, McGlinchey Stafford PLLC, New Orleans, LA
RUSSELL MOY, Washington, DC
GEORGE LYNN PAUL, George L. Paul, P.C., Phoenix, AZ
LARRY W. THORPE, Springfield, TN
LISA MARIE VON BIELA, Sammamish, WA
CHARLES WOODHOUSE, Woodhouse Shanahan PA, Washington, DC

COMMITTEE LIAISONS
JONATHAN GANNON
JUNG JIN LEE

SECTION OF SCIENCE & TECHNOLOGY LAW OFFICERS

CHAIR, 2016–17: EILEEN SMITH EWING, Boston, MA
CHAIR, 2017–18: DAVID Z. BODENHEIMER, Crowell & Moring LLP, Washington, DC
CHAIR-ELECT: WILLIAM B. BAKER, Potomac Law Group PLLC, Washington, DC
VICE CHAIR: JULIE FLEMING, Fleming Strategic, Atlanta, GA
SECRETARY: KATHERINE LEWIS, Meister Seelig & Fein LLP, New York, NY
BUDGET OFFICER: GARTH JACOBSON, CT Corporation, Seattle, WA
SECTION DELEGATES: ELLEN J. FLANNERY, Covington & Burling LLP, Washington, DC; BONNIE FOUGHT, Hillsborough, CA
PAST CHAIR LIAISON TO OFFICERS: CYNTHIA H. CWIK, Jones Day, San Diego, CA

AMERICAN BAR ASSOCIATION CONTACTS

SECTION STAFF DIRECTOR: CARYN CROSS HAWK
ART DIRECTOR: KELLY BOOK
ABA PUBLISHING CONTRACT EDITOR: MELISSA VASICH
SECTION EMAIL ADDRESS: [email protected]
MEMBERSHIP QUESTIONS OR ADDRESS CHANGES? 1-800-285-2221 or [email protected]

The SciTech Lawyer (ISSN 1550-2090) is published quarterly as a service to its members by the Section of Science & Technology Law of the American Bar Association, 321 North Clark Street, Chicago, IL 60654-7598. It endeavors to provide information about current developments in law, science, medicine, and technology that is of professional interest to the members of the ABA Section of Science & Technology Law. Any member of the ABA may join the Section by paying its annual dues of $55. Subscriptions are available to nonmembers for $55 a year ($65 for foreign subscribers). Some back issues are available for $12 plus a $3.95 handling charge from the ABA Service Center, American Bar Association, 321 North Clark Street, Chicago, IL 60654-7598; 1-800-285-2221. Requests to reprint articles should be sent to ABA Copyrights & Contracts, www.americanbar.org/utility/reprint/Periodicals; all other correspondence and manuscripts should be sent to The SciTech Lawyer Contract Editor Melissa Vasich, [email protected].
For more information, visit www.americanbar.org/publications/scitech_lawyer_home.html. The material published in The SciTech Lawyer reflects the views of the authors and has not been approved by the Section of Science & Technology Law, the Editorial Board, the House of Delegates, or the Board of Governors of the ABA. Copyright © 2017 American Bar Association. All rights reserved.

TABLE OF CONTENTS

MESSAGE FROM THE CHAIR
Artificial Intelligence: Revolution or Evolution?
By Eileen Smith Ewing, Chair 2016–17

A SIMPLE GUIDE TO MACHINE LEARNING
“Artificial intelligence” (AI) usually refers to machine learning. Machine learning uses algorithms to perform inductive reasoning, figuring out “the rules” given the factual inputs and the results. Applying those rules to new sets of factual inputs can deduce results in new cases. Lawyers are already using machine learning to help with legal research, evaluate pleadings, perform large-scale document review, and more.
By Warren E. Agin

ARTIFICIAL INTELLIGENCE IN HEALTH CARE: APPLICATIONS AND LEGAL ISSUES
Big data and machine learning are enabling innovators to enhance clinical care, advance medical research, and improve efficiency, through the use of “black-box” algorithms that are too complex for their reasoning to be understood. Safety regulation, medical malpractice and product liability claims, intellectual property, and patient privacy will impact the way black-box medicine is developed and deployed.
By W. Nicholson Price II

AI AND MEDICINE: HOW FAST WILL ADAPTATION OCCUR?
Computers excel at working with “structured data,” such as billing codes or lab test results, but human medical judgment and doctor’s notes are much harder for a computer to analyze. In medicine, the cost of a false positive may be low, but the cost of a false negative can be catastrophic. Thus, applying AI to medicine requires small steps that can supplement and enhance—rather than replace—human decision making.
By Matthew Henshon

B-TECH CORNER: PRENATAL GENETIC TESTING: WHERE ALGORITHMS MAY FAIL
With noninvasive prenatal genetic testing, the cost of a false positive is not low for parents who relied on the results and unfortunately terminated the pregnancies or who otherwise planned for the arrival of a child with a trisomy condition. A better understanding of the technology that makes these tests possible should lead to better laws and patient outcomes.
By Aubrey Haddach and Jeffrey Licitra

ARTIFICIAL INTELLIGENCE AND THE FUTURE OF LEGAL PRACTICE
Despite the alarming headlines, AI will not replace most lawyers’ jobs, at least in the short term. It will create new legal issues for lawyers, such as the liability issues of autonomous cars and the safety of medical robots, and will transform the way lawyers practice, with technology-assisted review, legal analytics, and practice management assistants, but it will be an evolution, not a revolution.
By Gary E. Marchant

WHEN LAW AND ETHICS COLLIDE WITH AUTONOMOUS VEHICLES
Thought experiments can be used to study ethical issues involving autonomous vehicle (AV) algorithms. How should a manufacturer program an AV to respond to an inevitable crash, where continuing on the path will injure a large group, steering away will injure a single individual or small group, and attempting to avoid the collision may injure all? The legal and moral solutions may not be the same.
By Stephen S. Wu

ARTIFICIAL INTELLIGENCE AND THE LAW: MORE QUESTIONS THAN ANSWERS?
Current U.S. legislation involving AI is principally concerned with data privacy and autonomous vehicles. The European General Data Protection Regulation (GDPR) will give citizens the right to demand an account of how an adverse decision was achieved. This will require transparency in AI systems, which will raise intellectual property and privacy issues that will have to be reconciled with legislation or in the courts.
By Kay Firth-Butterfield

BUILDING ETHICAL ALGORITHMS
Ethical review of automated decision-making systems is a necessary prerequisite to the large-scale deployment of these systems. Several established frameworks provide ethical principles to guide organizations’ best practices around technology design and data use, and can be adapted to big data analytics, automated decision-making systems, and AI.
By Natasha Duarte

WHY CHANGES IN DATA SCIENCE ARE DRIVING A NEED FOR QUANTUM LAW AND POLICY, AND HOW WE GET THERE
We are living in a Newtonian age with respect to legal and policy issues for emerging technologies, content with traditional approaches and relying on the slow accretion of precedent. If we do not make the leap to quantum policy, embracing a duality where conflicting rights and ideals are balanced and encouraged to thrive at the same time, our entire ecosystem of jurisprudence and privacy rights will suffer.
By April F. Doss

IN MEMORIAM: CHARLES RAY “CHAS” MERRILL
The Section celebrates the life of Chas Merrill, a pioneer, legal intellect, and patient mentor who was a key leader in the Information Security Committee.
By Stephen S. Wu and Michael S. Baum
A SIMPLE GUIDE TO MACHINE LEARNING
BY WARREN E. AGIN

Warren E. Agin (wea@swiggartagin.com) is a principal of Analytic Law LLC in Boston, which helps law firms and legal departments find quantitative solutions to legal problems. He also chairs the ABA Business Law Section’s Legal Analytics Committee and teaches legal analytics as an adjunct professor at Boston College Law School. You can follow him on Twitter at @AnalyticLaw. Mr. Agin thanks Michael Bommarito of LexPredict and Thomas Hamilton of ROSS Intelligence for kindly reviewing and commenting on an earlier version of this article, published in the February 2017 issue of Business Law Today, but emphasizes that any errors are his, not theirs.

Lawyers know a lot about a wide range of subjects—the result of constantly dealing with a broad variety of factual situations. Nevertheless, most lawyers might not know much about machine learning and how it impacts lawyers in particular. This article provides a short and simple guide to machine learning geared to attorneys.

“Artificial intelligence” (AI) usually refers to machine learning in one form or another. It might appear as the stuff of science fiction, or perhaps academia, but in reality machine learning techniques are in wide use today. Such techniques recommend books for you on Amazon, help sort your mail, find information for you on Google, and allow Siri to answer your questions.

In the legal field, products built on machine learning are already starting to appear. Lexis and Westlaw now incorporate machine learning in their natural language search and other features. ROSS Intelligence is an AI research tool that finds relevant “phrases” from within cases and other sources in response to a plain language search. Through the use of natural language processing, you can ask ROSS questions in fully formed sentences. Kira Systems uses machine learning to quickly analyze large numbers of contracts.

These are just two of dozens of new, machine learning–based products. On the surface, these tools might seem similar to those currently available—but they actually do something fundamentally different, making them not only potentially far more efficient and powerful, but also disruptive. For example, machine learning is the “secret sauce” that enables ridesharing services like Uber to efficiently adjust pricing to maximize both the demand for rides and the availability of drivers, predict how long it will take a driver to pick you up, and calculate how long your ride will take. With machine learning, Uber and similar companies are rapidly displacing the traditional taxicab service. Understanding what machine learning is and what it can do is key to understanding its future effects on the legal industry.

What Is Machine Learning?

Humans are good at deductive reasoning. For example, if I told you that a bankruptcy claim for rent was limited to one year’s rent, you would easily figure out the amount of the allowed claim. If the total rent claim was $100,000, but one year’s rent was $70,000, you would apply the rule and deduce that the allowable claim is $70,000. No problem. You can determine the result easily, and you can also easily program a computer to consistently apply that rule to other situations. Now reverse the process. Assume I told you that your client was owed $100,000 and that the annual rent was $70,000, and then told you that the allowable claim was $70,000. Could you figure out how I got that answer? You might guess that the rule is that the claim is limited to one year’s rent, but could you be sure? Perhaps the rule was something entirely different. This is inductive reasoning, and it is much more difficult to do.

Machine learning techniques are computational methods for figuring out “the rules,” or at least approximations of the rules, given the factual inputs and the results. Those rules can then be applied to new sets of factual inputs to deduce results in new cases.

For instance, consider number series games. For example:

2 4 6 8 10 ?

The next number is 12, right? Here, the inputs are the series of numbers 2 through 10, and from this we induce the rule for getting the next number—add 2 to the last number in the series. Here is another one:

1 1 2 3 5 ?

The next number is 8. This is a Fibonacci sequence, and the rule is that you add together the last two numbers in the series.

With these games, what you are doing in your head is looking at a series of inputs and answers, and using inductive reasoning to figure out the rule. You then apply that rule to get the next number. Broken down a little, the prior game looks like this:

Input          Result
1 1            2
1 1 2          3
1 1 2 3        5
1 1 2 3 5      ?

We look at the group of inputs and induce a rule that gives us the displayed results. Once we have derived a workable rule, we can apply it to the last row to get the result 8, but more importantly we can apply it to any group of numbers in the Fibonacci sequence. This is a simple (very simple) example of what machine learning does.

Let’s take a more complex example. Assume we wanted to predict the amount of a debtor’s counsel’s fees in a Chapter 11 case. We could take a look at cases in the past and get information about each: for example, the number of creditors, the debtor’s market capitalization, where the case was filed, and, of course, the eventual fee awarded to the debtor’s counsel. We might compare these numbers and discover that if we graphed the fee awards against the debtor’s market capitalization, it looks something like figure 1 (purely hypothetically).

[Figure 1: scatter plot of Legal Fee (y axis) against Market Cap (x axis)]

There seems to be a trend. The larger the market capitalization (the x axis), the higher the legal fee seems to be. In fact, the data points look sort of like a line. We can calculate the line that best fits the data points using a technique called linear regression (see fig. 2).

[Figure 2: the same scatter plot of Legal Fee against Market Cap, with the best-fit line, its equation, and its R² value]

We can even see the equation that the line represents. You take the market capitalization for the debtor, multiply it by 4.92 percent, and add $116,314 (these two values are the “weighting mechanisms,” explained in detail below). This is called a “prediction model.” The prediction model might not perfectly fit the data used to create it—after all, not all the data points fall exactly on the line—but it provides a useful approximation. That approximation will provide a pretty good estimate for legal fees in future cases (that’s what the R² number on the graph tells us). For the record, the data here is imaginary, hand-tailored to demonstrate the methodology.

Naturally, real-world problems are more complex. Instead of a short series of numbers as inputs, a real-world problem might use dozens, perhaps thousands, of possible inputs that might be applied to an undiscovered rule to obtain a known answer. We also do not necessarily know which of the inputs the unknown rule uses!

To solve a more complex problem, we might begin by building a database with the relevant points of information about a large number of cases, in each instance collecting the data points that we think might affect the answer. To build our prediction model, we would select cases at random to use as a “training set,” putting the remainder aside to use as a “test set.” Then we would begin to analyze the various relationships among the data points in our training set using statistical methods. Statistical analytics can help us identify the factors that seem to correlate with the known results and the factors that clearly do not matter.

Advanced statistical methods might help us sort through the various relationships and find an equation that takes some of the inputs and provides an estimated result that is pretty close to the actual results. Assuming we find such an equation, we then try it out on the test set to see if it does a good job there as well—predicting results that are close to the real results. If our predictive model works on our test set, then we consider ourselves lucky. We can now predict a debtor’s counsel’s legal fees ahead of time; at least until changing circumstances—perhaps rules changes, a policy change at the U.S. Trustee’s Office, or the effect our very own model has on which counsel get hired for cases—render our model inaccurate. If our model does not work on the test set data, then we consider it flawed and go back to the drawing board.

For real-world problems, this kind of analysis is difficult. The job of collecting the data, cleaning it, and analyzing it for relationships takes a lot of time. Given the large number of potential variables that affect real-world relationships, identifying those that matter is somewhat a process of trial and error. We might get lucky and generate results quickly, we might invest substantial resources without finding an answer at all, or the relationships might simply prove to be too complex for the methods I described to work adequately. Inductive reasoning is difficult to do manually. This brings us to machine learning. Machine learning can efficiently find relationships using inductive reasoning.

As an example of what machine learning can do, consider these images:

[Figure: sample images of handwritten letters]

Assume we want to set up a computer system to identify these handwritten images and tell us what letter each image represents. Defining a rule set is too difficult for us to do by hand and come up with anything that is remotely usable, but we know there is a rule set. The letter A is clearly different from the letter P, and C is different from G, but how do you describe those differences in a way a computer can use to consistently determine which image represents which letter?

The answer is that you don’t. Instead, you reduce each image to a set of data points, tell the computer what the image is of, and let the computer induce the rule set that reliably matches all the sets of data points to the correct answers. For the image recognition problem, you might begin by defining each letter as a 20 pixel by 20 pixel image, with each pixel having a different grayscale score. That gives you 400 data points, each with a different value depending on how dark that pixel is. Each of these sets of 400 data points is associated with the answer—the letter they represent. These sets become the training set, and another database of data points and answers is the test set. We then feed that training set into our machine learning algorithm—called a “learner”—and let it go to work.

What does the learner actually do? This is a little more difficult to explain, partially because there are a lot of different types of learners using a variety of methods. Computer scientists have developed a number of different kinds of techniques that allow a computer program to infer rule sets from defined sets of inputs and known answers. Some are conceptually easier to understand than others. In this article, I describe, in simple terms, how one of these techniques works. Machine learning programs will use a variation of one or more of these techniques. The most advanced systems include several techniques, using the one that fits the specific problem best or seems to generate the most accurate answers.

In general, think of a learner as including four components. First, you have the input information from the training set. This might be data from a structured, or highly defined, database, or unstructured data like you might find in a set of discovery documents. Second, you have the answers. With a structured database, a particular answer will be closely identified with the input information. With unstructured information, the answer might be a category, such as which letter an image represents or whether a particular email is spam; or the answer might be part of a relationship, such as text in a court decision that relates to a legal question asked by a researcher. Third, you have the learning algorithm itself—the software code that explores the relationships between the input information and the answers. Finally, you have weighting mechanisms—basically parts of the algorithm that help define the relationships between the input information and the answers, within the confines of the algorithm. Once you have these four components, the learner simply adjusts the weighting mechanisms in a controlled manner until it finds values for the weighting mechanisms that allow the algorithm to accurately match the input information with the known correct answers.

Let’s see how this might work with my hypothetical system for estimating a debtor’s counsel’s fees. In the example (see fig. 2), the market capitalizations are the input information (X). The known legal fees for each case are the answers (Y). For purposes of illustration, let’s assume the algorithm is Y = aX + b (a vast simplification, but I’m going to use it to demonstrate a point). The weighting mechanisms are the two variables a and b. Instead of manually calculating the values of a and b using linear regression, a machine learning program might instead try different values of a and b, each time checking to see how well the line fits the actual data points mathematically. If a change in a or b improves the fit of the line, the learner might continue to change a and b in the same direction, until the changes no longer improve the line’s fit.

Of course, in my example it is easier just to calculate a and b using linear regression techniques.
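As a rough illustration of the two routes just described, the sketch below fits Y = aX + b both ways: with the kind of trial-and-error search a learner might use, and with the ordinary least-squares formulas behind linear regression. The six data points are invented for this example (market capitalization in $ millions, fees in $ thousands), and the function names, starting step size, and stopping threshold are assumptions made for illustration, not details of any real product.

```python
# Hypothetical data, invented for illustration only.
market_caps = [1.0, 2.0, 4.0, 5.0, 8.0, 10.0]            # X: market capitalization, $ millions
legal_fees = [170.0, 210.0, 320.0, 355.0, 515.0, 605.0]  # Y: fee awarded, $ thousands


def misfit(a, b):
    """Sum of squared gaps between the line Y = aX + b and the known answers."""
    return sum((a * x + b - y) ** 2 for x, y in zip(market_caps, legal_fees))


def fit_by_trial_and_error(step=100.0, smallest_step=1e-6):
    """Nudge a or b up or down, keeping any nudge that improves the fit."""
    a, b = 0.0, 0.0
    best = misfit(a, b)
    while step > smallest_step:
        improved = False
        for da, db in ((step, 0.0), (-step, 0.0), (0.0, step), (0.0, -step)):
            trial = misfit(a + da, b + db)
            if trial < best:
                a, b, best = a + da, b + db, trial
                improved = True
                break
        if not improved:
            step /= 2  # no nudge of this size helps, so try finer nudges
    return a, b


def fit_by_linear_regression():
    """Ordinary least squares, computed directly from the data."""
    n = len(market_caps)
    mean_x = sum(market_caps) / n
    mean_y = sum(legal_fees) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(market_caps, legal_fees))
    spread = sum((x - mean_x) ** 2 for x in market_caps)
    a = covariance / spread
    return a, mean_y - a * mean_x


searched_a, searched_b = fit_by_trial_and_error()
exact_a, exact_b = fit_by_linear_regression()
print(f"trial and error:   a = {searched_a:.3f}, b = {searched_b:.3f}")
print(f"linear regression: a = {exact_a:.3f}, b = {exact_b:.3f}")
```

Both routes should settle on nearly the same line. The search is far slower than the formula here, but the same loop still works when the model has too many weights for a neat formula to exist.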
I don’t even need to have math skills to do it—the functionality is built right into Microsoft Excel and other common software products. Given a spreadsheet with the data, I can perform the calculation with a few mouse clicks. Machine learning programs, however, can figure out the relationships when there are millions of data points and billions of relationships—when modeling the systems is impossible to do by hand because of the complexity. Machine learning systems are limited only by the quality of the data and the power of the computers running them.

Now, let’s look at an example of a machine learning system.

Neural Networks

The term “neural network” conveys the impression of something obscure and mysterious, but it is probably the easiest form of a machine learning system to explain to the uninitiated. This is because it is made up of layers of a relatively simple construct called a “perceptron.”

[Figure: diagram of a perceptron. Credit: https://blog.dbrgn.ch/2013/3/26/perceptrons-in-python/]

This perceptron contains four components, the first being one or more inputs represented by the circles on the left. The input is simply a number, perhaps between 0 and 1. It might represent part of our input information, or it might be the output from another perceptron.

Second, each input number is given a weight—a percentage by which the input is multiplied. For example, if the perceptron has four inputs of equal importance, each input is multiplied by 25 percent. Alternatively, one input might be multiplied by 70 percent while the other three are each multiplied by 10 percent, reflecting that one input is far more important than the others.

Third, these weighted input numbers are added to generate a weighted sum—a single number that reflects the weights given to the various inputs.

Fourth, the weighted sum is fed into a step function. This is a function that outputs a single number based on the weighted sum. A simple step function might output a 0 if the weighted sum is between 0 and 0.5, and a 1 if the weighted sum is between 0.5 and 1. Usually a perceptron will use a logistic (sigmoid) function designed to generate a number between, say, 0 and 1 along an S-shaped curve so that most weighted sums will generate a result at or near 0, or at or near 1, but some will generate a result in the middle.

Some systems will include a fifth element: a “bias.” The bias is a variable that is added or subtracted from the weighted sum to bias the perceptron toward outputting a higher or lower result.

In summary, the perceptron is a simple mathematical construct that takes in a bunch of numbers and outputs a single number. By computing the weighted sum of the inputs, adjusting that sum with a bias, and running the result through the step function, the perceptron tells you whether the collection of inputs produces a result above or below a threshold level. This mechanism works much like a switch. The result of that switch might be fed to another perceptron, or it might relate to a particular “answer.” For example, if your learner is doing handwriting recognition, you might have a perceptron that tells you the image is the letter A based on whether the output number is closer to a 1 than a 0.

In a neural network, the perceptrons typically are stacked in layers. The first layer receives the input information for the learner, and the last layer outputs the results.

[Figure: diagram of a layered neural network. Credit: http://www.intechopen.com/books/cerebral-palsy-challenges-for-the-future/brain-computer-interfaces-for-cerebral-palsy]

In between are what are called “hidden layers” of perceptrons, each taking in one or more input numbers from a prior layer and outputting a single number to one or more perceptrons in the next layer. By stacking the layers of perceptrons, the “deep learner” acts a little bit like a computer circuit, one whose operations are programmed by the changes in the weights.

The computer scientist building the neural network determines its design—how many perceptrons the system uses, where the input data comes from, how the perceptrons connect, what step function gets used, and how the system interprets the output numbers. However, the learner itself decides what weights are given to each input as the numbers move through the network, and what biases are applied to each perceptron. As the weights and biases change, the outputs will change. The learner’s goal is to keep adjusting the weights and biases used by the system until the system produces answers using the input information that most closely approximate the actual, known answers.

Returning to the handwriting recognition example, remember that we broke down each letter image into 400 pixels, each with a grayscale value. Each of those 400 data points would become an input number into our system and be fed into one or more of the perceptrons in the first input layer. Those outputs would pass through some hidden layers in the middle. Finally, we would have an output layer of 26 perceptrons, one for each letter. The output perceptron with the highest output value will tell us what letter the system thinks the image represents.

Then, we pick some initial values for the weights and biases, run all the samples in our training set through the system, and see what happens. Do the output answers match the real answers? Probably not even close the first time through. So, the system begins adjusting weights and biases, with small, incremental changes, testing against the training set and continuously looking for improvements in the results until it becomes as accurate as it is going to get.
© 2017 American Bar Association. Reproduced with permission. All rights reserved. This information or any portion thereof may not be copied or disseminated in any form or by any means or stored in an electronic database or retrieval system without the express written consent of the American Bar Association. see if the set of weights and biases we just exact match where the word “definition” determined produces accurate results. If occurs in the same sentence as the term Legal tools based it does, we now have an algorithm that “adequate protection.” ROSS does some- we can use to interpret handwriting. thing different. Using the Watson AI on machine It might seem a little like magic, but systems and its own algorithms, it looks learning have even a relatively simple neural network, within the search query for word groups properly constructed, can be used to read it recognizes and then finds the results enormous handwriting with a high degree of accu- it has learned to associate with those racy. Neural networks are particularly word groups. If you search for “what is application. good at sorting things into categories, the definition of adequate protection,” especially when using a discrete set of the system will associate the query “what input data points. What letter is it? Is it a is the definition” with similar queries, picture of a face or something else? Is a such as “what is the meaning of” or just and to allow for periodic retraining. As proof of claim filed in a bankruptcy case “what is.” It will also recognize the term a result, it will become extremely adept objectionable or not? “adequate protection” as a single concept at providing immediate responses to instead of two separate words, and likely, the most common queries by users. It Machine Learning in Action given the context, understand it as a word might also be able to eventually give you a These examples are basic, designed to found in bankruptcy materials. 
Finally, it confidence level in its answer, comparing provide some understanding of what are will have associated a successful response the information it provides against the fairly abstract systems. Machine learners as being one that gives you certain types entire scope of reported decisions and its come in many flavors—some suitable for of clauses including the term “ade- users’ reactions to similar, prior responses, performing basic sorting mechanisms, quate protection.” It will not understand to let you know how reliable the results and others capable of identifying and specifically that you are looking for a def- provided might be. Even though the indexing complex relationships among inition, but because others who used the system doesn’t understand the material in information in unstructured databases. system and made similar inquiries pre- the same manner as a human, its ability Some systems work using fairly simple ferred responses providing definitions, to track relationship building over a large programs and can run on a typical office you will get clauses containing similar scope of content and a large number of computer, and others are highly com- language patterns and, viola, you will get interactions allows it to behave as you plex and require supercomputers or large your definition. might, if you had researched a particular server farms to accomplish their tasks. You should not even have to use the point or issue thoroughly many times To understand the power of machine term “adequate protection” to get an previously. This provides a research learning systems compared with non- answer back discussing the concept when tool far more powerful than existing learning analytic tools, let’s revisit an that is the appropriate answer to your methodologies. earlier example: ROSS Intelligence. ROSS question. 
So long as your question trig- Legal tools based on machine learning is built on the IBM Watson system, gers the right associations, the system have enormous application. Lawyers although it also includes its own machine will, over time, learn to return the correct are already using learners to help with learning systems to perform many of responses. legal research, categorize document sets its tasks. Watson’s search tools employ a The key is that a machine learning for discovery, evaluate pleadings and number of machine learning algorithms system learns. In a way, we do the same transactional documents for structural working together to categorize seman- thing ROSS does. The first time we errors or ambiguity, perform large- tic relationships in unstructured textual research a topic, we might look at a lot scale document review in mergers and databases. In other words, if you give of cases and go down a lot of dead ends. acquisitions, and identify contracts Watson a large database of textual mate- The next time, we are more efficient. After affected by systemic changes like Brexit. rial dealing with a particular subject, dealing with a concept several times, we General Motors’ legal department, Watson begins by indexing the material, no longer need to do the research. We and likely other large companies, are noting the vocabulary and which words remember what the key case is, and at exploring using machine learning tend to associate with other words. Even most we check to see if there is anything techniques to evaluate and predict though Watson does not actually under- new. We know how the cases link together, litigation outcomes and even help choose stand the text’s meaning, it develops, so the new materials are easy to find. which law firms they employ. Machine through this analysis, the ability to mimic A machine learning–based research learning is not the solution for every understanding by finding the patterns in tool can do this on a much broader scale. 
question, but it can help answer a large the text. It learns not just from our particular number of questions that simply were For example, when you conduct a research efforts, but also from those of not answerable in the past, and that is Boolean search in a traditional service everyone who uses the system. As the why the advent of machine learning for “definition /s ‘adequate protection,’” system receives more use, it employs user in the legal profession will prove truly the service searches its database for an feedback to assess how its model performs transformational. u FALL 2017 TheSciTechLawyer 9 Published in The SciTech Lawyer, Volume 14, Number 1, Fall 2017. © 2017 American Bar Association. Reproduced with permission. All rights reserved. This information or any portion thereof may not be copied or disseminated in any form or by any means or stored in an electronic database or retrieval system without the express written consent of the American Bar Association. artificial intelligence in health care APPLICATIONS AND LEGAL ISSUES by w. nicholson price ii a rtificial intelligence (AI) is rapidly smartphones or recorded on fitness track- of anatomical pathologists or radiologists moving to change the healthcare sys- ers. Machine learning techniques, a subset within the span of years.5 Another cur- tem. Driven by the juxtaposition of of AI, use simple learning rules and itera- rent algorithm can predict which trauma big data and powerful machine learning tive techniques to find and use patterns victims are likely to hemorrhage by con- techniques—terms I will explain momen- in these vast amounts of data. 
The result- stantly analyzing vital signs and can in tarily—innovators have begun to develop ing algorithms can make predictions and turn call for intervention to forestall catas- tools to improve the process of clinical group sets—how long is a patient expected trophe; such prognostic algorithms could care, to advance medical research, and to live given his collection of symptoms, come into use in a similarly short time to improve efficiency. These tools rely on and does that picture of a patch of skin frame.6 A bit farther off, black-box algo- algorithms, programs created from health- look like a benign or a cancerous lesion?— rithms could be used for diagnosis more care data that can make predictions or but typically, these techniques cannot generally, to recommend off-label uses for recommendations. However, the algo- explain why or how they reach the conclu- existing drugs, to allocate scarce resources rithms themselves are often too complex sion they do. Either they cannot explain to patients most likely to benefit from for their reasoning to be understood or it at all, or they can give explanations that them, to detect fraud or problematic med- even stated explicitly. Such algorithms may are accurate but meaningless in terms of ical behavior, or to guide research into be best described as “black-box.”1 This medical understanding.2 Because of this new diseases or conditions. In fact, black- article briefly describes the concept of AI inherent opacity (which might or might box algorithms are already in use today in medicine, including several possible not be augmented with deliberate secrecy in smartphone apps that aim to identify applications, then considers its legal impli- about how the algorithms were devel- developmental disorders in infants based cations in four areas of law: regulation, oped and validated), I describe this field on facial features7 or autism in young chil- tort, intellectual property, and privacy. 
as to “black-box medicine,” though it has dren based on eye movement tracking.8 also been referred to as AI in medicine or The potential for benefit from such black- AI in Medicine “predictive analytics.”3 To add to the com- box medicine is substantial, but it comes Medicine, like many other fields, is expe- plexity, when more data are available for with its own challenges: scientific and riencing a confluence of two recent the machine learning algorithms, those medical, certainly, but also legal. How do developments: the rise of big data, and the data can be incorporated to refine future we ensure that black-box medicine is safe growth of sophisticated machine learn- predictions, as well as to change the algo- and effective, how do we ensure its effi- ing/AI techniques that can be used to rithms themselves. The algorithms at the cient development and deployment, and find complex patterns in those data. Big heart of black-box medicine, then, are not how do we protect patients and patient data as a phenomenon is characterized only opaque but also likely to change over privacy throughout the process? by the “three Vs” of volume (large quan- time. tities of data), variety (heterogeneity in Black-box medicine has tremendous Regulation the data), and velocity (fast access to the potential for use throughout the health- The first question to ask is perhaps the data). In medicine, the data come from care system, including in prognostics, most fundamental: How do we ensure that many sources: electronic health records, diagnostics, image analysis, resource allo- black-box algorithms are high quality— medical literature, clinical trials, insurance cation, and treatment recommendations. that is, that they do what they say, and that claims data, pharmacy records, and even Machine learning is most familiar in the they do it well and safely? 
New and emerg- information entered by patients into their context of image recognition, and an algo- ing medical technologies and devices are rithm has already been developed that can typically regulated for safety and efficacy W. Nicholson Price II, PhD (wnp@ identify skin cancer by analyzing images by the Food and Drug Administration umich.edu) is an assistant professor of of skin lesions; the algorithm performs as (FDA). Whether the FDA actually has law at the University of Michigan Law well as board-certified dermatologists.4 A statutory authority over free-standing School. His work focuses on innovation recent New England Journal of Medicine algorithms used to make medical deci- in the life sciences, with a significant article suggests that such algorithms could sions (or to help make them) depends on emphasis on the use of big data and soon enter widespread use in image analy- the relatively complex question of what is artificial intelligence in health care. sis, aiding or displacing much of the work a “medical device.” The FDA’s regulation 10 TheSciTechLawyer FALL 2017 Published in The SciTech Lawyer, Volume 14, Number 1, Fall 2017. © 2017 American Bar Association. Reproduced with permission. All rights reserved. This information or any portion thereof may not be copied or disseminated in any form or by any means or stored in an electronic database or retrieval system without the express written consent of the American Bar Association.