ebook img

Visual Attention Mechanisms PDF

282 Pages·2002·19.088 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Visual Attention Mechanisms

Visual Attention Mechanisms Visual Attention Mechanisms Edited by Virginio Cantoni University of Pa via Pavia, ltaly Maria Marinaro University of Salemo Baronissi, ltaly and Alfredo Petrosino National Research Council Naples, ltaly Springer Science+Business Media, LLC Library of Congress Cataloging-in-Publieation Data International School on Neural Networks "E.R. CaianielIo" on Visual Attention Mechanisms (5th : 2000 : Vietri sul Mare, Italy) Visual attention mechanisms 1 edited by Virginio Cantoni, Maria Marinaro, and Alfredo Petrosino. p.em. Proeeedings ofthe Fifth International Sehool on Neural Networks "E.R. Caianiello" on Visual Attention Mechanisms, Oet. 23-28, 2000, Vietri sul Mare, Italy. Includes bibliographieal referenees and index. ISBN 978-1-4613-4928-0 ISBN 978-1-4615-0111-4 (eBook) DOI 10.1007/978-1-4615-0111-4 1. Computer vision--Congresses. 2. Visual pereeption--Congresses. 1. Cantoni, V. II. Marinaro, M. III. Petrosino, Alfredo. IV. Title. TA1634 .1687 2000 006.3'7--dc21 2002040777 Proceedings of the Fifth International School on Neural Networks "E.R. Caianiello" on Visual Attention Mechanisms, held October 23-28, 2000, in Vietri sul Mare, Italy ISBN 978-1-4613-4928-0 © 2002 Springer Science+Business Media New York Originally published by Kluwer Academic/Plenurn Publishers, New York in 2002 Softcover reprint ofthe hardcover lst edition 2002 http://www.wkap.nV 10987654321 A c.I.P. record for this book is available from the Library of Congress. All rights reserved No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, microfilming, recording, or otherwise, without written permission from the Publisher, with the exception of any material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. PREFACE The following are the proceedings of5th Courseofthe International School on Neural Nets "E.R. Caianiello"onVisual AttentionMechanismsheldinVietri suI Mare, Italy, onOctober23 28 2000,jointlyorganizedbythe International Institute for Advanced ScientificStudies(IIASS) and the Ettore Majorana Foundation and Center for Scientific Culture (EMFCSC). The school, openedto all suitablyqualifiedscientistsfrom aroundthe world, isorganizedintolectures,panel discussions and poster presentations and covered a number ofbroad themes relevant to visual attention,rangingfrom computervisiontopsychologyandphysiologyofvision. The theme ofthe school was the attention processes ofvision systems and the aim was to point out the analogies and the divergences ofbiological vision with the frameworks introduced bycomputerscientistsinartificial vision. For what concerns biological vision, the technological developments over the last decade, leading to new instruments (imagery from PET, MRI, new neuronal tracers, etc.) and consequently new and more detailed data, allowed theories, that even in the recent past could only be conjectured, to be validated and proved. As a consequence, new insights were provided intobiological solutions forattentional mechanismsand ingeneral for the primatevisual system. In particular, several authors proposed and questioned whether there was a dichotomy in the visual system differentiating the two visual streams that respectively support where and what (for) in visual processing. Moreover, the studies on vision in primate has identified two processing phases: a pre-attentive one in which the visual system detects regions of interest withinitsfield ofviewbymeansalsoofalertingmechanisms,andanattentiveoneinwhicheach detectedregionisanalyzedindetail. Broadlyspeaking, thehighcomputerperformancedepends onlypartiallyonusingfasterand morereliablehardware, butto a largeextentitdepends onthearchitectureand ontheprocessing techniques. This is true particularly in the field ofimageprocessingand computervision, which is characterized by very large quantity ofsensory data, but in which most ofthe information collected is meaningless for the task at end. Real time performances can be achieved only applying some attentional mechanisms that allow to restrict the computationjustonthe relevant data,attheright time. Toimplementthisstrategycomputervisionresearchershavefocused their efforts on the realization ofspace-variant sensors able to emulate human vision which is both active and space-variant. Such approach is basedon a foveal/multiresolution search guided both bylow level detection, alerting, and tracking schemes and byhigh level interpretationprocesses. It leads to the use ofvariable resolution grids, according to the imagedetail required each time, soexploitingthecapabilitiesofmulti-resolutionandpyramidcomputervisionsystems. Both these biological and artificial strategies are then following a general framework, as illustrated in Figure I, in which a space variant/multiresolution environment is exploited. A peripheralbehavior, aimingto theselectionoftherelevantareas, consistsofaparallel activityof simpleoperators, onthe completefield ofview, atalow resolution. This pre-attentivelikephase isimplementedon specialized and massivelyparallel 'hardware'. Then the full capabilitiesofthe system at the most sophisticated levels ofdetail are required to analyze sequentially selected v ~o{'\\s{1'I !d({'~:,used Atten1.' rV¥~ ~i:-~<:),' lOA"»~ o ~, ~ parallelselectlonn otevents ondregJon restricted seuent\olcrlOl~ otspecillcregions Figure1.The8cycleofbiologicalandartificialattentionsystems. image segments (or regions of interest ROIs) to interpret the scene reality and follow its evolution. The workshop consisted ofa pilot phase ofseven foundation lectures on the physiological and functional infrastructures for visual attention inbiological and artificial systems, and offour subsequent modules each_consisting of four lectures (dealing with solutions in nature and machines respectively) and ofa number ofpanel presentations and ofpanel discussion. Table I shows the scheduled program for the pilot phase aimed to the introduction of a common framework for the different involved communities. The following modules, listed in Table 2, consider in more details somebasic aspects ofthe general common framework (the 8-cycle) for boththelogical-functional andimplementationviewpoint. Table1.Foundations. Nature Machine MagnoandParvoVisual Streams EarlyVision WhereandWhatfor-Perceptionand ActivePerception Bottom-upandTop-downMechanisms Log-mapanalysis The lectures focused on presenting the state-of-the-art and outlining open questions. In particular, they sought to stress links, suggesting possible synergies between different cultural areas. The number ofparticipants to the workshop was limited to 50. Besides the 20 invited lecturers, 30 more participants were admitted. Priority for these positions was given to young researcherswhomadesignificantcontributionsto theopendiscussions. Boththe lecturesandthe contentsofthepanelareincludedinthepresentproceedings. VI Table 2. Basics from the logical-functional-implementation aspects of attention vision systems. Subject Nature Machine ThePop-outTheory FuzzyEngagement Mechanisms TheTextonTheory AttentionalEngagements Experienceswiththe Stimulus-DrivenCapture Eyeputerandother ofAttention Scanpathrecorders MechanismsofAttentional Search(A)Symmetries Control Detectionversus VisualSearch MechanismsofSpatial Discrimination Attention TheComplexityofSearch HierarchicalPerceptual Multiresolution and Tasks Loops Planning ConnectionistModelsof AttentionandAction Attentional Processing Planning Neurobiological Modelsof Hierarchicaland AttentionalVisual Visual Attention AssociativeNetworks Architectures TheSelectiveTuning Attentional Pyramidal Model Neural Mechanisms The contents ofthe book vary somewhat from the scheduled workshop program in orderto accommodatethepositionsthatemergedfrom theworkshop. Acknowledgements The workshop, and thus indirectly this book, was made possible through the generous financial support of three Institutions: the Department of "Scienze Fisiche" of Salerno University, the Cybernetic and Biophysics Group (GNCB) of the Italian National Research Council (CNR), and the Department ofComputer and System Engineering ofPaviaUniversity. Theirsupportisgratefullyacknowledged. Special thanks should go to Ornellade PasqualeandTinaNappi, for their precioushelp and patienceinorganizingtheworkshop. Asecondspecial thankmustbegiven to Vietri sui Marefor theweather; forthehistorical culture; andforthewonderful scenariooftheAmalficoast! VirginioCantoni,MariaMarinaro,and Alfredo Petrosino vii CONTENTS Foundations Visual AttentionandtheParallel Visual Pathways C. A. Marzi EarlyVisionandSoftComputing 7 V. DiGesu PsychophysicalMeasurementofAttentionalModulationinLow-Level Vision UsingtheLateral-InteractionsParadigm 25 E. Freeman,J. Driver,D. Sagi Log-mapAnalysis 41 L. Lombardi,M. Porta Visual AttentionMechanismsinMotion Analysis 53 V. Roberto Bottom-upandTop-DownMechanisms 61 E. Pessa Pop-OutTheory: SegmentationWithoutClassificationbythePrimaryVisualCortex 69 Z.P.Li Model-BasedAttentionFixation usingLog-PolarImages 79 A. Bernardino,J. Santos-Victor,G. Sandini TemporallyFaithfulRepresentationsofSalientStimulusMovement PatternsintheEarlyVisual System 93 A. Thiel,S. D.Wilke, M. Geschner,M. Bongard,1. Ammermtiller, C. W. Eurich,H. Schwegler AttentionalEngagements FuzzyEngagementsMechanisms 101 M. Piastra SaliencyandFigure-GroundEffects 115 Z.P.Li IX Stimulus-DrivenandGoal-DirectedAttentional Control 125 S. Yantis EyeMovement AnalysisDuringVisualExplorationofGraphical Interfaces 135 D.Zambarbieri,C.Robino,S. Ramat VisualSearch Neural MechanismsofAttentional Control 145 S.Yantis SymmetryinComputerVision 155 V. DiGesu SpatialSizeLimitsinStereoscopicVision 171 B. Y. Schlesinger,Y. Yeshurun Multiresolution andPlanning TheComplexityofVisual SearchTasks 185 J. K. Tsotsos Short-TermandLong-TermEffectsofSelectiveVisuo-SpatialAttentionon VisualFieldDefectsinPatientswithCerebralLesions 195 D. Poggel,E. Kasten,E.M. Muller-Oehring,B. A. Sabel OnDesigningMoirePatterns 205 G. Lebanon, A. Bruckstein ConnectionistModelsofAttentional Processing 219 E.Pessa AttentionalVisualArchitectures NeurobiologicalModelsofVisual Attention 229 J. K. Tsotsos TheSelectiveTuningModelforVisual Attention 239 J. K. Tsotsos Multiresolution andAssociativeRepresentationofObjects 251 A. Merigot AttentionalPyramidal Neural Mechanisms 267 A. Petrosino Index 281 x VISUAL ATTENTION ANDTHE PARALLELVISUAL PATHWAYS CarloA. Marzi Departmentof Neurologicaland Visual Sciences UniversityofVerona 37134,Verona, Italy INTRODUCTION Before examining the differential role played by the two major parallel visual pathways in attention I should like to very briefly review the current status of the parvo-magno functional segregationatvariouslevelsofthe visual pathways,mainlyreferringtoprimates. In the optic pathways retinal axons separate into several tracts that terminate in different subcortical areas: the lateral geniculate nucleus (LGN), the superior colliculus (sq, the pretectum, the suprachiasmatic nucleus, and the accessory optic nuclei. These centers extract different aspects ofthe light signal: For example, the LGN and SC encode visual information for form and ambient vision while the other two use the light for completely different purposes, namely, toregulatethecircadianrhythm and tostabilizetheeyes in theorbit,respectively. There are up to 20 different types of ganglion cells whose receptive fields cover the retina homogeneouslythus constitutingparallel filters encodingdifferent aspectsofthe visual world 1,14. Strictlyspeaking, then, the parallel pathwaysare not limited to two orthree, as we shall see later, but areas manyas the numberoffunctionally different ganglion cells. The ganglion cells receive their input from the photoreceptors indirectly through bipolarcells. There are about 10 different types of bipolar cells and this implies that parallel processing starts immediately at the first synapse ofthe retina, namely that between the cone and the bipolar cell. Additionally, there are two types of inhibitory intemeurons: the horizontal cells that exerts their effects at the photoreceptor-bipolar synapse and the amacrine cells that function at the bipolar-ganglion cells synapse4 . Ganglioncellsproject theiraxons to different layers ofthe LGN according to whether they are midget cells, which project to parvocellular layers, or parasol cells, which project to the magnocellular layers. The former show red-green color opponency and have on average smaller receptive fields; the latter are sensitive to luminance modulation and have larger receptive fields. The projections ofM and P into separate territories in the LGN are unique to primates, although parallel visual channels exist in othermammalian species. Forexample, in the cat, different relay cell types (X and Y ganglion cells), are largely intermingled within the A laminae ofthe LGN, instead ofbeing separated in different layers as in primates. Interestingly, the difference in the organization ofthe parallel pathways in primate and non-primate species is present even at early VisualAttentionMechanisms EditedbyCantonietal.,KluwerAcademic/PlenumPublishers,NewYork,2002 stages of embryonic development. There is evidence that retinal ganglion cells diverge into parasol and midget subtypes soon after their last mitotic division and that optic axons project directlyandselectivelytoeithertheMorPmoietiesofthedevelopingembryonicLGN 16. In addition to the Mand Pcells, the LGNofprimatescontainsanotherclass ofrelay cells, the K cells 7. Our understanding of the structure and function of the K relay cells, which can be consideredasathirdparallelpathway,israpidlyincreasingII. Inmonkeys, theyform three layers located betweenthe main LGN layers and terminate within singlecytochromeoxidase (CO) blob columns in cortical layer III where they relay short-wave input and within cortical layer Iwhere they relay low-acuity visual input. They resemble both structurally and physiologically the W cellsofthecat'svisualsystemand, in keepingwith that, theyare functionally related with theSC. Overall, the morphology ofthe K axons is similar in distantly related species suggesting that the basic features ofthispathwayarecommontoallprimates. The bulk ofvisual input is transmitted from the LGN to M- and P-related sublayers and modules in the primary visual cortex whose microcircuitry is far too complicated for the present purposes; for a recent, comprehensivereview, see 2. Suffice here to say that the parallel pathways continue to be largely separated in VI where the P pathway is divided into color and form subsystems, corresponding to CO blobs and interblobs, respectively. From here the information is channeled into the ventral stream and eventually reaches the inferotemporal cortex. In contrast, the M pathway,subservesvisual motionand visuo-spatialoperationsand from the upperpartoflayerIV ofVI sends its information through the dorsal stream to parietal areas. Both in VI and V2 there are interconnections between the two systems that challenge a strict segregation, however, the overallbalanceisstill in favorofatripartitic(M, Pand Kpathways)subdivisionoflabor. PARALLELCORTICALPATHWAYS Beyondtheprimaryvisualcortex,themonkeycortexcontainsat least30separatevisualareasthat areorganized into two functionallyspecializedprocessingpathwaysoriginating from theprimary visual cortex (VI) 23. The occipitotemporal pathway, or "ventral stream," is mainly, but not exclusively, fed byP inputand iscrucial for the identificationofobjects (the"what"pathway). In contrast, the occipitoparietal pathway, or "dorsal stream," which is mainly fed by the M input, is crucial for spatial vision (the "where" pathway) and for the visual guidance of movements. Below, I will briefly discuss the evidence for a functional dichotomy outside the primary visual cortex. PhysiologicalEvidence. Cells in areas within the ventral stream such as VI, V2, V4, and infero-temporal cortex respond preferentiallyto shape, colorand texture whereas cells in areas within the dorsal stream, VI, V2, V3, middle temporal area (MT), medial superior temporal area (MST) respond selectively to spatial aspects of stimuli, such as direction ofmotion and velocity, as well as to tracking eye movements. Aprinciple common to both pathways is their hierarchical organization: the input is progressively processed according to increasingly more elaborate modes. In the ventral stream simple visual attributes such as line orientation are decoded in VI while at further stations, i.e. the inferotemporal cortex, there are neurons responding specifically to objects or faces. By the same token, in the dorsal stream, VI cells show direction selectivity while in further areas, e.g. the lateral intraparietal area (LIP) or MST, neurons respond selectively to complex patterns of motionandoptic flow. An implicationofsuchanorganization is that "higher" visual areas in bothstreamsdisplay longer visual response latencies than do "lower" ones as a result ofthe time required for the transferof information from one processing stage to the next. A recent study by Schmolesky et al. 19 has 2

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.