ebook img

Concise Computer Vision: an Introduction into Theory and Algorithms PDF

441 Pages·2014·25.343 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Concise Computer Vision: an Introduction into Theory and Algorithms

Undergraduate Topics in Computer Science Reinhard Klette Concise Computer Vision An Introduction into Theory and Algorithms Undergraduate Topics in Computer Science UndergraduateTopicsinComputerScience(UTiCS)delivershigh-qualityinstructionalcontentforun- dergraduatesstudyinginallareasofcomputingandinformationscience.Fromcorefoundationaland theoreticalmaterialtofinal-yeartopicsandapplications,UTiCSbookstakeafresh,concise,andmod- ernapproachandareidealforself-studyorforaone-ortwo-semestercourse.Thetextsareallauthored byestablishedexpertsintheirfields,reviewedbyaninternationaladvisoryboard,andcontainnumer- ousexamplesandproblems.Manyincludefullyworkedsolutions. Forfurthervolumes: www.springer.com/series/7592 Reinhard Klette Concise Computer Vision An Introduction into Theory and Algorithms ReinhardKlette ComputerScienceDepartment UniversityofAuckland Auckland,NewZealand SeriesEditor IanMackie AdvisoryBoard SamsonAbramsky,UniversityofOxford,Oxford,UK KarinBreitman,PontificalCatholicUniversityofRiodeJaneiro,RiodeJaneiro,Brazil ChrisHankin,ImperialCollegeLondon,London,UK DexterKozen,CornellUniversity,Ithaca,USA AndrewPitts,UniversityofCambridge,Cambridge,UK HanneRiisNielson,TechnicalUniversityofDenmark,KongensLyngby,Denmark StevenSkiena,StonyBrookUniversity,StonyBrook,USA IainStewart,UniversityofDurham,Durham,UK ISSN1863-7310 ISSN2197-1781(electronic) UndergraduateTopicsinComputerScience ISBN978-1-4471-6319-0 ISBN978-1-4471-6320-6(eBook) DOI10.1007/978-1-4471-6320-6 SpringerLondonHeidelbergNewYorkDordrecht LibraryofCongressControlNumber:2013958392 ©Springer-VerlagLondon2014 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartof thematerialisconcerned,specificallytherightsoftranslation,reprinting,reuseofillustrations,recitation, broadcasting,reproductiononmicrofilmsorinanyotherphysicalway,andtransmissionorinformation storageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilarmethodology nowknownorhereafterdeveloped.Exemptedfromthislegalreservationarebriefexcerptsinconnection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’slocation,initscurrentversion,andpermissionforusemustalwaysbeobtainedfromSpringer. PermissionsforusemaybeobtainedthroughRightsLinkattheCopyrightClearanceCenter.Violations areliabletoprosecutionundertherespectiveCopyrightLaw. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublication doesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevant protectivelawsandregulationsandthereforefreeforgeneraluse. Whiletheadviceandinformationinthisbookarebelievedtobetrueandaccurateatthedateofpub- lication,neithertheauthorsnortheeditorsnorthepublishercanacceptanylegalresponsibilityforany errorsoromissionsthatmaybemade.Thepublishermakesnowarranty,expressorimplied,withrespect tothematerialcontainedherein. Printedonacid-freepaper SpringerispartofSpringerScience+BusinessMedia(www.springer.com) Dedicatedtoallwhohavedreams Computer vision may count the trees, estimate the distance to the islands, but it cannotdetectthefantasiesthepeoplemighthavehadwhovisitedthisbay Preface This is a textbook for a third- or fourth-year undergraduate course on Computer vision,whichisadisciplineinscienceandengineering. SubjectAreaoftheBook ComputerVisionaimsatusingcamerasforanalysing or understanding scenes in the real world. This discipline studies methodological andalgorithmicproblemsaswellastopicsrelatedtotheimplementationofdesigned solutions. Incomputervisionwemaywanttoknowhowfarawayabuildingistoacam- era, whether a vehicle drives in the middle of its lane, how many people are in a scene,orweevenwanttorecognizeaparticularperson—alltobeansweredbased on recorded images or videos. Areas of application have expanded recently due to a solid progress in computer vision. There are significant advances in camera andcomputingtechnologies,butalsointheoreticalfoundationsofcomputervision methodologies. In recent years, computer vision became a key technology in many fields. For modern consumer products, see, for example apps for mobile phones, driver- assistanceforcars,oruserinteractionwithcomputergames.Inindustrialautoma- tion,computervisionisroutinelyusedforqualityorprocesscontrol.Therearesig- nificantcontributionsforthemovieindustry(e.g.theuseofavatarsorthecreation ofvirtualworldsbasedonrecordedimages,theenhancementofhistoricvideodata, orhigh-qualitypresentationsofmovies).Thisisjustmentioningafewapplication areas, which all come with particular image or video data, and particular needs to processoranalysethosedata. FeaturesoftheBook Thistextbookprovidesageneralintroductionintobasicsof computervision,aspotentiallyofuseformanydiverseareasofapplications.Math- ematical subjects play an important role, and the book also discusses algorithms. Thebookisnotaddressingparticularapplications. Inserts(greyboxes)inthebookprovidehistoriccontextinformation,references orsourcesforpresentedmaterial,andparticularhintsonmathematicalsubjectsdis- cussed first time at a given location. They are additional readings to the baseline materialprovided. vii viii Preface Thebookisnot aguideoncurrentresearchincomputervision,anditprovides onlyaveryfewreferences;thereadercanlocatemoreeasilyonthenetbysearch- ingforkeywordsofinterest.Thefieldofcomputervisionisactuallysovivid,with countless references, such that any attempt would fail to insert in the given lim- ited space a reasonable collection of references. But here is one hint at least: visit homepages.inf.ed.ac.uk/rbf/CVonline/ for a web-based introduction into topics in computervision. TargetAudiences Thistextbookprovidesmaterialforanintroductorycourseat third-orfourth-yearlevelinanEngineeringorScienceundergraduateprogramme. Having some prior knowledge in image processing, image analysis, or computer graphics is of benefit, but the first two chapters of this text book also provide a first-timeintroductionintocomputationalimaging. Previous Uses of the Material Parts of the presented materials have been used inmylecturesintheMechatronicsandComputerScienceprogrammesatTheUni- versityofAuckland,NewZealand,atCIMATGuanajuato,Mexico,atFreiburgand GöttingenUniversity,Germany,attheTechnicalUniversityCordoba,Argentina,at theTaiwanNationalNormalUniversity,Taiwan,andatWuhanUniversity,China. Thepresentedmaterialalsobenefitsfromfourearlierbookpublications,[R.Klette andP.Zamperoni.HandbookofImageProcessingOperators.Wiley,Chichester,1996],[R.Klette, K. Schlüns, and A. Koschan. Computer Vision. Springer, Singapore, 1998], [R. Klette and A. Rosenfeld. Digital Geometry. Morgan Kaufmann, San Francisco, 2004], and [F. Huang, R.Klette,andK.Scheibe.PanoramicImaging.Wiley,WestSussex,2008]. The first two of those four books accompanied computer vision lectures of the authorinGermanyandNewZealandinthe1990sandearly2000s,andthethirdone alsomorerecentlectures. Notes to the Instructor and Suggested Uses The book contains more material than what can be covered in a one-semester course. An instructor should select accordingtogivencontextsuchaspriorknowledgeofstudentsandresearchfocus insubsequentcourses. Each chapter ends with some exercises, including programming exercises. The bookdoesnotfavouranyparticularimplementationenvironment.Usingprocedures from systems such as OpenCV will typically simplify the solution. Programming exercisesareintentionallyformulatedinawaytoofferstudentsawiderangeofop- tionsforansweringthem.Forexample,forExercise2.5inChap.2,youcanuseJava appletstovisualizetheresults(butthetextdoesnotaskforit),youcanusesmall-or large-sizedimages(thetextdoesnotspecifyit),andyoucanlimitcursormovement toacentralpartoftheinputimagesuchthatthe11 11squarearoundlocationp × isalwayscompletelycontainedinyourimage(oryoucanalsocoverspecialcases when moving the cursor also closer to the image border). As a result, every stu- dentshouldcomeupwithher/hisindividualsolutiontoprogrammingexercises,and creativityinthedesignedsolutionshouldalsobehonoured. Preface ix Supplemental Resources The book is accompanied by supplemental material (data,sources,examples,presentations)onawebsite.Seewww.cs.auckland.ac.nz/ ~rklette/Books/K2014/. Acknowledgements Inalphabeticalorderofsurnames,Iamthankingthefollow- ing colleagues, former or current students, and friends (if I am just mentioning a figure,thenIamactuallythankingforjointworkorcontactsaboutasubjectrelated tothatfigure): A-Kn AliAl-Sarraf (Fig.2.32),HernanBadino(Fig.9.25),AnkoBörner(various commentsondraftsofthebook,andalsocontributionstoSect.5.4.2),HugoCarlos (support while writing the book at CIMAT), Diego Caudillo (Figs. 1.9, 5.28, and 5.29), Gilberto Chávez (Figs. 3.39 and 5.36, top row), Chia-Yen Chen (Figs. 6.21 and 7.25), Kaihua Chen (Fig. 3.33), Ting-Yen Chen (Fig. 5.35, contributions to Sect. 2.4, to Chap. 5, and provision of sources), Eduardo Destefanis (contribution to Example 9.1 and Fig. 9.5), Uwe Franke (Figs. 3.36, 6.3, and bottom, right, in 9.23), Stefan Gehrig (comments on stereo analysis parts and Fig. 9.25), Roberto Guzmán(Fig.5.36,bottomrow),WangHan(havinghisstudentsinvolvedincheck- ingadraftofthebook),RalfHaeusler(contributionstoSect.8.1.5),GabrielHart- mann(Fig.9.24),SimonHermann(contributionstoSects.5.4.2and8.1.2,Figs.4.16 and7.5),VáclavHlavácˇ(suggestionsforimprovingthecontentsofChaps.1and2), Heiko Hirschmüller (Fig. 7.1), Wolfgang Huber (Fig. 4.12, bottom, right), Fay Huang(contributionstoChap.6,inparticulartoSect.6.1.4),RuyiJiang(contribu- tionstoSect.9.3.3),WaqarKhan(Fig.7.17),RonKimmel(presentationsuggestions on local operators and optic flow—which I need to keep mainly as a project for a futurerevisionofthetext),KarstenKnoeppel(contributionstoSect.9.3.4), Ko-Sc Andreas Koschan (comments on various parts of the book and Fig. 7.18, right), Vladimir Kovalevsky (Fig. 2.15), Peter Kovesi (contributions to Chaps. 1 and2regardingphasecongruency,includingthepermissiontoreproducefigures), WalterKropatsch(suggestionstoChaps.2and3),RichardLewis-Shell(Fig.4.12, bottom,left),FajieLi(Exercise5.9),JuanLin(contributionstoSect.10.3),YizheLin (Fig.6.19),DongweiLiu(Fig.2.16),YanLiu(permissiontopublishFig.1.6),Rocío Lizárraga(permissiontopublishFig.5.2,bottomrow),PeterMeer (commentson Sect.2.4.2),JamesMilburn(contributionstoSect.4.4).PedroReal(commentson geometricandtopologicsubjects),MahdiRezaei(contributionstofacedetectionin Chap.10,includingtextandfigures,andExercise10.2),BodoRosenhahn(Fig.7.9, right), John Rugis (definition of similarity curvature and Exercises 7.2 and 7.6), JamesRussell(contributionstoSect.5.1.1),JorgeSanchez(contributiontoExam- ple9.1,Figs.9.1,right,and9.5),KonstantinSchauwecker(commentsonfeaturede- tectorsandRANSACplanedetection,Figs.6.10,right,7.19,9.9,and2.23),Karsten Scheibe(contributionstoChap.6,inparticulartoSect.6.1.4),andFig.7.1),Karsten Schlüns(contributionstoSect.7.4), Sh-Z Bok-SukShin(Latexeditingsuggestions,commentsonvariouspartsofthe book,contributionstoSects.3.4.1and5.1.1,andFig.9.23withrelatedcomments), x Preface Eric Song (Fig. 5.6, left), Zijiang Song (contributions to Chap. 9, in particular to Sect.9.2.4),KathrinSpiller(contributionto3DcaseinSect.7.2.2),JunliTao(con- tributionstopedestriandetectioninChap.10,includingtextandfiguresandExer- cise10.1,andcommentsaboutthestructureofthischapter),AkihikoTorii(contri- butionstoSect.6.1.4),JohanVanHorebeek(commentsonChap.10),TobiVaudrey (contributions to Sect. 2.3.2 and Fig. 4.18, contributions to Sect. 9.3.4, and Exer- cise9.6),MouWei(commentsonChap.4),Shou-KangWei(jointworkonsubjects related to Sect. 6.1.4), Tiangong Wei (contributions to Sect. 7.4.3), Jürgen Wiest (Fig.9.1,left),YihuiZheng(contributionstoSect.5.1.1),ZezhongXu(contributions toSect.3.4.1andFig.3.40),ShenghaiYuan(commentsonSects.3.3.1and3.3.2), Qi Zang (Exercise 5.5, and Figs. 2.21, 5.37, and 10.1), Yi Zeng (Fig. 9.15), and JovišaŽunic´(contributionstoSect.3.3.2). The author is, in particular, indebted to Sandino Morales (D.F., Mexico) for implementing and testing algorithms, providing many figures, contributions to Chaps. 4 and 8, and for numerous comments about various parts of the book, to Władysław Skarbek (Warsaw, Poland) for manifold suggestions for improving the contents, and for contributing Exercises 1.9, 2.10, 2.11, 3.12, 4.11, 5.7, 5.8, and6.10,andtoGarryTee(Auckland,NewZealand)forcarefulreading,comment- ing,forpartsofInsert5.9,thefootnoteonp.402,andmanymorevaluablehints. Ithankmywife,GiselaKlette,forauthoringSect.3.2.4abouttheEuclideandis- tance transform and critical views on structure and details of the book while the bookwaswrittenatCIMATGuanajuatobetweenmidJulytobeginningofNovem- ber2013duringasabbaticalleavefromTheUniversityofAuckland,NewZealand. Guanajuato,Mexico ReinhardKlette 3November2013

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.