Table Of ContentShi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page i 23.2.2008 2:45pm CompositorName:BMani
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page ii 23.2.2008 2:45pm CompositorName:BMani
IMAGE PROCESSING SERIES
Series Editor: Phillip A. Laplante, Pennsylvania State University
Published Titles
Adaptive Image Processing: A Computational Intelligence Perspective
Stuart William Perry, Hau-San Wong, and Ling Guan
Color Image Processing: Methods and Applications
Rastislav Lukac and Konstantinos N. Plataniotis
Image Acquisition and Processing with LabVIEW™
Christopher G. Relf
Image and Video Compression for Multimedia Engineering
Second Edition
Yun Q. Shi and Huiyang Sun
Multimedia Image and Video Processing
Ling Guan, S.Y. Kung, and Jan Larsen
Shape Analysis and Classification: Theory and Practice
Luciano da Fontoura Costa and Roberto Marcondes Cesar Jr.
Software Engineering for Image Processing Systems
Phillip A. Laplante
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page iii 23.2.2008 2:45pm CompositorName:BMani
Yun Q. Shi
New Jersey Institute of Technology
Newark, New Jersey, USA
Huifang Sun
Mitsubishi Electric Research Laboratories
Cambridge, Massachusetts, USA
Boca Raton London New York
CRC Press is an imprint of the
Taylor & Francis Group, an informa business
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page iv 23.2.2008 2:45pm CompositorName:BMani
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
© 2008 by Taylor & Francis Group, LLC
CRC Press is an imprint of Taylor & Francis Group, an Informa business
No claim to original U.S. Government works
Printed in the United States of America on acid-free paper
10 9 8 7 6 5 4 3 2 1
International Standard Book Number-13: 978-0-8493-7364-0 (Hardcover)
This book contains information obtained from authentic and highly regarded sources Reasonable efforts have been
made to publish reliable data and information, but the author and publisher cannot assume responsibility for the valid-
ity of all materials or the consequences of their use. The Authors and Publishers have attempted to trace the copyright
holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this
form has not been obtained. If any copyright material has not been acknowledged please write and let us know so we may
rectify in any future reprint
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, transmitted, or uti-
lized in any form by any electronic, mechanical, or other means, now known or hereafter invented, including photocopy-
ing, microfilming, and recording, or in any information storage or retrieval system, without written permission from the
publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright.com (http://
www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC) 222 Rosewood Drive, Danvers, MA 01923,
978-750-8400. CCC is a not-for-profit organization that provides licenses and registration for a variety of users. For orga-
nizations that have been granted a photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for
identification and explanation without intent to infringe.
Library of Congress Cataloging-in-Publication Data
Shi, Yun Q.
Image and video compression for multimedia engineering : fundamentals, algorithms, and
standards / Yun Q. Shi and Huifang Sun. -- 2nd ed.
p. cm. -- (Image processing series)
Includes bibliographical references and index.
ISBN 978-0-8493-7364-0 (alk. paper)
1. Multimedia systems. 2. Image compression. 3. Video compression. I. Sun, Huifang. II. Title.
QA76.575.S555 2008
006.7--dc22 2007048389
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
and the CRC Press Web site at
http://www.crcpress.com
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page v 23.2.2008 2:45pm CompositorName:BMani
To beloved Kong Wai Shih,Wen Su,
Yi Xi Li,Shu Jun Zheng, and
Xian Hong Li
and
To beloved Xuedong,
Min, Yin, Andrew, Rich, Haixin,and
Allison
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page vii 23.2.2008 2:45pm CompositorName:BMani
Contents
Prefaceto the Second Edition................................................................................................... xxi
Prefaceto the First Edition..................................................................................................... xxiii
Content and Organization of the Book.................................................................................xxvii
Authors...................................................................................................................................... xxxi
Part I Fundamentals
Chapter 1 Introduction
1.1 Practical Needs for Image and Video Compression..............................................4
1.2 Feasibility of Image and Video Compression........................................................4
1.2.1 Statistical Redundancy.............................................................................................5
1.2.1.1 SpatialRedundancy...................................................................................5
1.2.1.2 Temporal Redundancy..............................................................................7
1.2.1.3 Coding Redundancy..................................................................................9
1.2.2 Psychovisual Redundancy.....................................................................................10
1.2.2.1 Luminance Masking................................................................................11
1.2.2.2 TextureMasking.......................................................................................13
1.2.2.3 Frequency Masking..................................................................................15
1.2.2.4 Temporal Masking...................................................................................16
1.2.2.5 Color Masking..........................................................................................16
1.2.2.6 Color Masking and ItsApplication in Video Compression..............19
1.2.2.7 Summary:DifferentialSensitivity..........................................................20
1.3 Visual Quality Measurement................................................................................20
1.3.1 Subjective Quality Measurement..........................................................................21
1.3.2 Objective QualityMeasurement...........................................................................22
1.3.2.1 Signal to Noise Ratio...............................................................................22
1.3.2.2 An Objective Quality Measure Basedon Human
VisualPerception.....................................................................................23
1.4 Information Theory Results..................................................................................26
1.4.1 Entropy.....................................................................................................................27
1.4.1.1 InformationMeasure...............................................................................27
1.4.1.2 AverageInformation per Symbol..........................................................27
1.4.2 Shannon’sNoiseless Source Coding Theorem....................................................28
1.4.3 Shannon’sNoisyChannel Coding Theorem.......................................................29
1.4.4 Shannon’sSourceCoding Theorem.....................................................................29
1.4.5 InformationTransmission Theorem.....................................................................29
1.5 Summary................................................................................................................30
Exercises.........................................................................................................................30
References.......................................................................................................................31
Chapter 2 Quantization
2.1 Quantization and the Source Encoder..................................................................33
2.2 Uniform Quantization...........................................................................................35
2.2.1 Basics.........................................................................................................................36
vii
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page viii 23.2.2008 2:45pm CompositorName:BMani
viii
2.2.1.1 Definitions.................................................................................................36
2.2.1.2 Quantization Distortion..........................................................................38
2.2.1.3 Quantizer Design.....................................................................................39
2.2.2 Optimum UniformQuantizer...............................................................................40
2.2.2.1 UniformQuantizer with UniformlyDistributed Input......................40
2.2.2.2 Conditions of Optimum Quantization..................................................41
2.2.2.3 Optimum UniformQuantizer with Different
Input Distributions...................................................................................44
2.3 Nonuniform Quantization....................................................................................45
2.3.1 Optimum (Nonuniform) Quantization................................................................45
2.3.2 Companding Quantization....................................................................................48
2.4 Adaptive Quantization..........................................................................................50
2.4.1 ForwardAdaptive Quantization..........................................................................52
2.4.2 Backward AdaptiveQuantization........................................................................52
2.4.3 Adaptive Quantization witha One-Word Memory..........................................53
2.4.4 Switched Quantization...........................................................................................53
2.5 Pulse Code Modulation.........................................................................................54
2.6 Summary................................................................................................................56
Exercises.........................................................................................................................57
References.......................................................................................................................58
Chapter 3 Differential Coding
3.1 Introduction to DPCM...........................................................................................59
3.1.1 Simple Pixel-to-Pixel DPCM..................................................................................60
3.1.2 GeneralDPCMSystems.........................................................................................63
3.2 Optimum Linear Prediction..................................................................................64
3.2.1 Formulation..............................................................................................................65
3.2.2 Orthogonality Condition and Minimum Mean Square Error..........................66
3.2.3 Solution to Yule–Walker Equations.....................................................................66
3.3 Some Issues in the Implementation of DPCM.....................................................66
3.3.1 Optimum DPCM System.......................................................................................67
3.3.2 1-D, 2-D, and 3-D DPCM.......................................................................................67
3.3.3 Orderof Predictor...................................................................................................68
3.3.4 Adaptive Prediction................................................................................................68
3.3.5 Effect ofTransmission Errors................................................................................68
3.4 Delta Modulation...................................................................................................70
3.5 Interframe Differential Coding.............................................................................73
3.5.1 Conditional Replenishment...................................................................................73
3.5.2 3-D DPCM................................................................................................................74
3.5.3 Motion Compensated Predictive Coding............................................................75
3.6 Information-Preserving Differential Coding........................................................76
3.7 Summary................................................................................................................77
Exercises.........................................................................................................................78
References.......................................................................................................................78
Chapter 4 Transform Coding
4.1 Introduction...........................................................................................................81
4.1.1 Hotelling Transform...............................................................................................81
4.1.2 Statistical Interpretation.........................................................................................83
4.1.3 Geometrical Interpretation.....................................................................................84
Shi/ImageandVideoCompressionforMultimediaEngineering 7364_C000 FinalProof page ix 23.2.2008 2:45pm CompositorName:BMani
ix
4.1.4 Basis VectorInterpretation....................................................................................85
4.1.5 Procedures of TransformCoding.........................................................................86
4.2 Linear Transforms.................................................................................................87
4.2.1 2-D Image Transformation Kernel........................................................................87
4.2.1.1 Separability................................................................................................87
4.2.1.2 Symmetry..................................................................................................88
4.2.1.3 Matrix Form..............................................................................................88
4.2.1.4 Orthogonality............................................................................................89
4.2.2 Basis Image Interpretation.....................................................................................89
4.2.3 Subimage Size Selection.........................................................................................90
4.3 Transforms of Particular Interest..........................................................................92
4.3.1 Discrete Fourier Transform....................................................................................92
4.3.2 Discrete Walsh Transform.....................................................................................93
4.3.3 Discrete Hadamard Transform.............................................................................93
4.3.4 Discrete CosineTransform.....................................................................................95
4.3.4.1 Background...............................................................................................95
4.3.4.2 TransformationKernel............................................................................95
4.3.4.3 Relationship withDFT............................................................................96
4.3.5 Performance Comparison......................................................................................98
4.3.5.1 Energy Compaction.................................................................................98
4.3.5.2 MeanSquare Reconstruction Error.......................................................99
4.3.5.3 ComputationalComplexity..................................................................101
4.3.5.4 Summary.................................................................................................101
4.4 Bit Allocation.......................................................................................................101
4.4.1 ZonalCoding.........................................................................................................101
4.4.2 Threshold Coding.................................................................................................102
4.4.2.1 Thresholding and Shifting....................................................................103
4.4.2.2 Normalization and Roundoff...............................................................104
4.4.2.3 Zigzag Scan.............................................................................................106
4.4.2.4 Huffman Coding....................................................................................106
4.4.2.5 SpecialCode Words...............................................................................107
4.4.2.6 Rate Buffer Feedback and Equalization..............................................107
4.5 Some Issues..........................................................................................................108
4.5.1 Effectof Transmission Error................................................................................108
4.5.2 Reconstruction ErrorSources..............................................................................108
4.5.3 Comparison between DPCM and TC................................................................109
4.5.4 HybridCoding......................................................................................................109
4.6 Summary..............................................................................................................110
Exercises.......................................................................................................................112
References.....................................................................................................................112
Chapter 5 Variable-Length Coding: Information Theory Results (II)
5.1 Some Fundamental Results.................................................................................115
5.1.1 Coding an Information Source............................................................................115
5.1.2 Some Desired Characteristics..............................................................................116
5.1.2.1 Block Code..............................................................................................116
5.1.2.2 Uniquely Decodable Code....................................................................116
5.1.2.3 InstantaneousCodes..............................................................................118
5.1.2.4 CompactCode........................................................................................120
5.1.3 Discrete MemorylessSources..............................................................................120
Description:Multimedia hardware still cannot accommodate the demand for large amounts of visual data. Without the generation of high-quality video bitstreams, limited hardware capabilities will continue to stifle the advancement of multimedia technologies. Thorough grounding in coding is needed so that applicat