ebook img

Soft Computing for Image and Multimedia Data Processing PDF

278 Pages·2013·9.813 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Soft Computing for Image and Multimedia Data Processing

Siddhartha Bhattacharyya  Ujjwal Maulik Soft Computing for Image and Multimedia Data Processing Soft Computing for Image and Multimedia Data Processing Siddhartha Bhattacharyya • Ujjwal Maulik Soft Computing for Image and Multimedia Data Processing 123 SiddharthaBhattacharyya UjjwalMaulik DepartmentofInformationTechnology DepartmentofComputerScience& RCCInstituteofInformationTechnology Engineering Kolkata JadavpurUniversity India Kolkata India ISBN978-3-642-40254-8 ISBN978-3-642-40255-5(eBook) DOI10.1007/978-3-642-40255-5 SpringerHeidelbergNewYorkDordrechtLondon LibraryofCongressControlNumber:2013950983 © Springer-VerlagBerlinHeidelberg2013 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartof thematerialisconcerned,specificallytherightsoftranslation,reprinting,reuseofillustrations,recitation, broadcasting,reproductiononmicrofilmsorinanyotherphysicalway,andtransmissionorinformation storageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilarmethodology nowknownorhereafterdeveloped.Exemptedfromthislegalreservationarebriefexcerptsinconnection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’slocation,initscurrentversion,andpermissionforusemustalwaysbeobtainedfromSpringer. PermissionsforusemaybeobtainedthroughRightsLinkattheCopyrightClearanceCenter.Violations areliabletoprosecutionundertherespectiveCopyrightLaw. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublication doesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevant protectivelawsandregulationsandthereforefreeforgeneraluse. While the advice and information in this book are believed to be true and accurate at the date of publication,neithertheauthorsnortheeditorsnorthepublishercanacceptanylegalresponsibilityfor anyerrorsoromissionsthatmaybemade.Thepublishermakesnowarranty,expressorimplied,with respecttothematerialcontainedherein. Printedonacid-freepaper SpringerispartofSpringerScience+BusinessMedia(www.springer.com) To myrespected parentsLateAjitKumar BhattacharyyaandLateHashi Bhattacharyya SIDDHARTHABHATTACHARYYA To myrespected parentsManojKumar MaulikandGouriMaulik,beloved sonUtsav andallmystudents UJJWALMAULIK Preface Inthisinformationera,processingofnoisyandnoise-freeimagesandmultimedia data for faithful analysis and retrieval of useful information has assumed utmost importance.One ofthe main tenetsof thisimage processingtask is the extraction of relevant features through segmentation of the data under consideration. Proper analysis of image data for relevant object specific information requires efficient extractionandsegmentationtechniques.Faithfuldetectionofimageedgesforthis purposeis also a grave matter of concernfor researchers.Handlingof image data in the binary domain may be a trivial task but it becomes severe when it comes to the gray scale or color intensity gamut. This is purely due to the enormity and varietyoftheunderlyingdataunderconsideration.Ifoneisrequiredtohandleonly two (0 or 255) values in the binary domain, in the gray scale it is 256 shades of gray. The situation becomes more difficult in the color domain where there are 16,777,216(D256(cid:2)256(cid:2)256)colorstodealwith.Addtoittheprocessingoverhead involvedinhandlingmultimediadata,whereeachandeveryimageframewithina videosequenceneedstobeprocessed.Tobeprecise,theentireprocessistootime- complex. Several classical approaches for processing images and multimedia data exist in the literature. Among the score of intelligent approaches in this direction, the soft computing paradigm is equipped with several tools and techniques, which incorporate intelligent concepts and principles. Artificial neural networks, fuzzy sets and fuzzy logic, rough sets and evolutionarycomputationform the backbone of this intelligent computing framework. Out of the several components of the soft computing paradigm, these four components either operate independently or insynergy. Artificial neural networks are electrical analogues of biological neurons in the human brain. These operate on numeric data and are well known for their learning, generalization and approximation capabilities. The constituent neurons are prone to graceful degradation. These neurons remain densely interconnected withneighboringneuronsfollowingthearchitectureofbiologicalneuronsbymeans of some interconnection strengths (or weights). When real-world data is incident on these neurons, they generate a weighted sum of the incident inputs. They also vii viii Preface impressanactivationoftheinputdataandpassitontotheconnectedneuronsvia theinterconnectionweights.Artificialneuralnetworksfigureindifferenttopologies. Theyoperateinbothsupervisedandunsupervisedforms.Inthesupervisedtypeof models, the network is trained with an input–outputrelationship. In unsupervised learning, the network does not require such training. On the contrary,it adapts to thegroundconditions. Fuzzy sets and systems deals with ambiguity, imprecision, vagueness and uncertaintyobservedinreal-worldsituations.Thesesystemsresorttofuzzylogical connectivesfor modelinghumanreasoningand operatein a linguistic framework. Theirstrengthliesintheircapabilitytoperformapproximatereasoning.Whilethe crisp set contains its members with either a membership of 1 or 0, a fuzzy set includes all elements of the universal set of the domain but with varying degrees ofmembershipintheinterval[0,1]. Roughsettheoryhascomeupoflate,asanewmathematicalapproachtomodel imperfectknowledge.Itpresentsanotherattempttohandlereal-worlduncertainties. Theroughsettheoreticapproachseemstobeoffundamentalimportancetoartificial intelligence and cognitive sciences, especially in the areas of machine learning, knowledge acquisition, decision analysis, knowledge discovery from databases, expert systems, inductive reasoning and pattern recognition. The main advantage of rough set theory in data analysis is that it does not need any preliminary or additional information about data—unlike probability in statistics, or basic probability assignment in Dempster–Shafer theory, grade of membership or the valueofpossibilityinfuzzysettheory. The evolutionary computation and metaheuristic search techniques provide powerfulsearchandoptimizationmethodologies.Thesearebasedonapooloftrial solutions/individuals(population)inthesearchspacetostartwith.Severalin-built operatorsareappliedontheindividualsofthepopulationtoderivebettersolutions. Thesuitabilityofaparticularindividualinthepopulationinaparticulargeneration isdeterminedbyafitnessfunctionalsoreferredtoastheobjectivefunction.Oncethe suitabilityoftheindividualsinthepopulationinaparticulargenerationismeasured, the operators are applied to generate a newer population for the next generation. Typical examples of these techniques include genetic algorithms, ant colony optimization,particle swarm optimization,the differentialevolutionaryalgorithm, and simulated annealing, to name a few. These techniques have been lately used withmultipleconflictingobjectives.Theseversionsofthesetechniques,referredto asmultiobjectiveevolutionarytechniques,dealwithmultipleconflictingobjectives in that a set of solutions is finally obtained out of the search procedure. Typical examplesincludethemultiobjectivegeneticalgorithm(MOGA),themultiobjective differentialevolutionaryalgorithm(MODE),themultiobjectivesimulatedannealing (MOSA),tonameafew. This book is aimed at addressing the problem of image segmentation, object extractionandedge detectionusing a soft computingframework.The mainaspect ofthiseffortliesinremovingtheinherentlimitationsintheexistingneuralnetwork topologies as revealed by theoretical investigations. Newer topologies have been Preface ix presented taking into cognizance the space- and time-complexity of the existing methods. Chapter1ofthebookthrowslightonthedifferenttenetsofthesoftcomputing paradigm.Itfirstdiscussestheanatomicalcharacteristicsofthehumanbrainalong with synaptic learning,transmission, storage and processingof informationin the humanbrain.Furthermore,itdrawsananalogybetweentheartificialneuralnetwork and the human brain. Next, it discusses the different learning methodologies of artificialneuralnetworks.Itexploresthetopologyandfunctionsofbothsupervised and unsupervisedmodelsof artificial neuralnetworks. Fuzzy sets and fuzzy logic arealsotoucheduponinthischapter.Next,themathematicalprerequisitesofrough settheoryrelevanttothisbookareelucidated.Therestofthechapterdealswiththe optimizationproblemsandsolutiontechniquesinexistence.Theseincludegenetic algorithms, the classical differential evolutionary algorithm, simulated annealing, andtheirmultiobjectivecounterparts.Thechapteralsothrowssomelightonparticle swarmoptimization. Chapter2focussesontheimagerecognitionandtransformationdetectioncapa- bilities of a trained multilayer perceptron (MLP) architecture. The generalization and approximation capabilities of an MLP architecture are also illustrated in this chapterwiththehelpofsuitableexamples. Chapter3presentsanapplicationofamultilayerperceptron(MLP)fordetecting and controlling the lighting conditions in an illuminated scene. Here also the generalizationand approximationcapabilities of the multilayer perceptron (MLP) comeintoplay. Asoftcomputingtechniquefortrackingofhigh-speedobjectsinreal-lifevideo sequences is presented in Chap.4. The underlying principle of the approach is to segment the optical flow field regionscomputed between the successive image framesofreal-lifevideosequencesintocoherentandincoherentflowregions.The examplesillustratedinthischapterapplyfuzzyhostilityindex-basedsegmentation oftheflowregions. InChap.5thelimitationsofthemultilayerself-organizingneuralnetworkarchi- tecture(MLSONN),intermsofitstransfercharacteristics,thresholdingmechanism andthe erroradjustmentmethodology,are discussed and addressed.A three-layer bidirectionalself-organizingneuralnetwork(BDSONN)architectureapplicablefor real-timeimageprocessingapplicationsispresented.Thearchitectureusescounter- propagation of network states for self-organizing input information into outputs. Thenetworkdynamicsandoperationarediscussed.Thenetworkusesanadaptive fuzzy context-sensitive thresholding (CONSENT) mechanism for the processing task, which enhances the generalization capabilities of the network architecture. Thisisbecausethenetworktakesintocognizancetheinherentheterogeneitiesinthe inputimages.Thenetworkinterconnectionweightsareassignedandupdatedbythe relative fuzzy membershipsof the representative pixels in the image information, rather than through backpropagation strategies. The efficiency of the proposed networkarchitectureoverits MLSONN counterpartusing a bilevelsigmoidaland a beta function is reported with regards to immunity to different types of noise. Theimprovementofthe qualityoftheextractedobjectsis also demonstratedwith

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.