ebook img

Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications PDF

509 Pages·2015·50.46 MB·English
by  Müller
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications

Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications Fundamentals of Music Processing ü Meinard M ller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 123 Meinard Müller International AudioLaboratories Erlangen Erlangen Germany ISBN978-3-319-21944-8 ISBN978-3-319-21945-5 (eBook) DOI 10.1007/978-3-319-21945-5 LibraryofCongressControlNumber:2015945158 SpringerChamHeidelbergNewYorkDordrechtLondon ©SpringerInternationalPublishingSwitzerland2015 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpart of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission orinformationstorageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilar methodologynowknownorhereafterdeveloped. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publicationdoesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfrom therelevantprotectivelawsandregulationsandthereforefreeforgeneraluse. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authorsortheeditorsgiveawarranty,expressorimplied,withrespecttothematerialcontainedhereinor foranyerrorsoromissionsthatmayhavebeenmade. Printedonacid-freepaper SpringerInternationalPublishingAGSwitzerlandispartofSpringerScience+BusinessMedia (www.springer.com) ToMichaelClausenandHans-PeterSeidel Preface Music is a ubiquitous and vital part of the lives of billions of people worldwide. Musicalcreationsandperformancesareamongstthemostcomplexandintricateof ourculturalartifacts,andtheemotionalpowerofmusiccantouchusinsurprising andprofoundways.Musicspansanenormousrangeofformsandstyles,fromsim- ple, unaccompanied folk songs, to popular and jazz music, to symphonies for full orchestras. The digital revolution in music distribution and storage has simultane- ouslyfueledtremendousinterestinandattentiontothewaysthatinformationtech- nologycanbeappliedtothiskindofcontent.Frombrowsingpersonalcollections, todiscoveringnewartists,tomanagingandprotectingtherightsofmusiccreators, computers are now deeply involved in almost every aspect of music consumption, whichisnoteventomentiontheirvitalroleinmuchoftoday’smusicproduction. Despite the importance of music, music processing is still a relatively young disciplinecomparedwithspeechprocessing,aresearchfieldwithalongtradition. Actually,alargerresearchcommunityrepresentedbytheInternationalSocietyfor MusicInformationRetrieval(ISMIR),whichsystematicallydealswithawiderange of computer-based music analysis, processing, and retrieval topics, was formed in theyear2000.Traditionally,computer-basedmusicresearchhasmostlybeencon- ductedonthebasisofsymbolicrepresentationsusingmusicnotationorMIDIrep- resentations. Because of the increasing availability of digitized audio material and anexplosionofcomputingpower,automatedprocessingofwaveform-basedaudio signalsisnowincreasinglyinthefocusofresearchefforts. Many of these research efforts are directed towards the development of tech- nologiesthatallowuserstoaccessandexploremusicinallitsdifferentfacets.For example,audiofingerprintingtechniquesarenowadaysintegratedintocommercial productsthathelpuserstoorganizetheirprivatemusiccollections.Musicprocess- ingtechniquesareusedinextendedaudioplayersthathighlightthecurrentmeasures withinsheetmusicwhileplayingbackarecordingofasymphony.Ondemand,ad- ditionalinformationaboutmelodicandharmonicprogressionsorrhythmandtempo isautomaticallypresentedtothelistener.Interactivemusicinterfacesdisplaystruc- tural parts of the current piece of music and allow users to directly jump to any keypartsuchasthechorussection,themainmusicaltheme,orasolosectionwith- vii viii Preface outtediousfast-forwardingandrewinding.Furthermore,listenersareequippedwith Google-like search engines that enable them to explore large music collections in variousways.Forexample,theusermaycreateaquerybyspecifyingacertainnote constellation,orsomeharmonicorrhythmicpatternbywhistlingamelodyortap- ping a rhythm, or simply by selecting a short passage from a CD recording; the system then provides the user with a ranked list of available music excerpts from thecollectionthataremusicallyrelatedtothequery.Inmusicprocessing,onemain objectiveistocontributeconcepts,models,algorithms,implementations,andeval- uationsfortacklingsuchtypesofanalysisandretrievalproblems. This textbook is devoted to the emerging fields of music processing and mu- sicinformationretrieval(MIR)—interdisciplinaryresearchareaswhicharerelated to various disciplines including signal processing, information retrieval, machine learning,multimediaengineering,libraryscience,musicology,anddigitalhumani- ties.Themaingoalofthisbookistogiveanintroductiontothisvibrantandexciting newresearchareaforawidereadership.Well-establishedtopicsinmusicanalysis andretrievalhavebeenselectedtoserveasmotivatingapplicationscenarios.Within thesescenarios,fundamentaltechniquesandalgorithmsthatareapplicabletoawide rangeofanalysisandretrievalproblemsarepresentedindepth. Thisbookismeanttobeatextbook thatissuitableforcoursesattheadvanced undergraduateandbeginningmasterlevel.Bymixingtheoryandpractice,thebook providesbothprofoundtechnologicalknowledgeaswellasacomprehensivetreat- mentofmusicprocessingapplications.Furthermore,byincludingnumerousexam- ples, illustrations (the book contains more than 300 figures), and exercises, I hope thatthebookprovidesinterestingmaterialforcoursesinvariousfieldssuchascom- puterscience,multimediaengineering,informationscience,anddigitalhumanities. Thesubsequentsectionsofthisprefacecontainfurtherinformationontheoverall structure of the book, the interconnections between the various topics and tech- niques, and suggestions on how this book may be used as a basis for different courses.Wefirstgiveanoverviewofthebook’scontentbyquicklygoingthrough the individual chapters. Then, we explain various ways of reading and using the book,eachtimefocusingonadifferentaspect.Westartwiththeviewofalecturer whowantstousethistextbookasabasisforanintroductorycourseinmusicpro- cessingormusicinformationretrieval.Then,weshowhowthebookmaybeusedfor anintroductorycourseonFourieranalysisanditsapplications.Finally,weassume the view of a computer scientist who wants to teach fundamental issues on data representations and algorithms, where music may serve as an underlying applica- tiondomain.Describingthesedifferentviews,wetrytoworkoutthedependencies between the chapters as well as the conceptual relationships between the various musicprocessingtasks. Preface ix Content This textbook consists of eight chapters. The first two chapters cover fundamental materialonmusicrepresentationsandtheFouriertransform—conceptsthatarere- quiredthroughoutthebook.Thesetwochaptersmakethebookself-containedtoa great extent. In the subsequent chapters, concrete music processing tasks serve as startingpointsforourinvestigations.Eachofthesechaptersisorganizedinasimilar fashion.Achapterstartswithageneraldescriptionofthemusicprocessingscenario at hand and integrates the topic into a wider context. Motivated by the scenario at hand,eachchapterdiscussesimportanttechniquesandalgorithmsthataregenerally applicable to a wide range of analysis, classification, and retrieval problems. All thesetechniquesaretreatedinamathematicallyrigorousway.Atthesametime,the techniquesareimmediatelyappliedtoaconcretemusicprocessingtask.Bymixing theoryandpractice,thebook’sgoalistoconveybothprofoundtechnologicalknowl- edgeaswellasasolidunderstandingofmusicprocessingapplications.Eachofthe chapters ends with a section that includes links to the research literature, hints for furtherreading,alistofreferences,andexercises.Beforewediscusshowthistext- bookmaybeemployedinacourseorusedforself-study,wefirstgiveanoverview oftheindividualchaptersandthemaintopics. Musical information can be represented in many different ways. In Chapter1, we consider three widely used music representations: sheet music, symbolic, and audio representations. This first chapter also introduces basic terminology that is usedthroughoutthebook.Inparticular,wediscussmusicalandacousticproperties ofaudiosignalsincludingaspectssuchasfrequency,pitch,dynamics,andtimbre. Important technical terminology is covered in Chapter2. In particular, we ap- proachtheFouriertransform—whichisperhapsthemostfundamentaltoolinsignal processing—from various perspectives. For the reader who is more interested in the musical aspects of the book, Section2.1 provides a summary of the most im- portant facts on the Fourier transform. In particular, the notion of a spectrogram, whichyieldsatime–frequencyrepresentationofanaudiosignal,isintroduced.The remainderofthechaptertreatstheFouriertransformingreatermathematicaldepth and also includes the fast Fourier transform (FFT)—an algorithm of great beauty andhighpracticalrelevance. As a first music processing task, we study in Chapter3 the problem of mu- sic synchronization. The objective is to temporally align compatible representa- tions of the same piece of music. Considering this scenario, we explain the need for musically informed audio features. In particular, we introduce the concept of chroma-basedmusicfeatures,whichcapturepropertiesthatarerelatedtoharmony and melody. Furthermore, we study an alignment technique known as dynamic time warping (DTW), a concept that is applicable for the analysis of general time series. For its efficient computation, we discuss an algorithm based on dynamic programming—a widely used method for solving a complex problem by breaking itdownintoacollectionofsimplersubproblems. InChapter4,weaddressacentralandwell-researchedareawithinMIRknown as music structure analysis. Given a music recording, the objective is to identify x Preface Music Notions, Techniques& Chapter Processing Algorithms Scenario Music Musicnotation, MIDI, audiosignal, 1 Representations waveform,pitch, loudness, timbre Discrete/analog signal, sinusoid, Fourier Analysis exponential,Fourier transform, 2 ofSignals Fourier representation, DFT, FFT, STFT Chroma feature, dyamic Music 3 programming, dyamictime warping Synchronization (DTW),alignment, userinterface Similaritymatrix, repetition, Music Structure thumbnail, homogeneity, novelty, 4 Analysis evaluation, precision, recall, F- measure, visualization, scapeplot Harmony, musictheory, chords, Chord 5 scales, templates, hiddenMarkov Recognition model(HMM), evaluation Onset, novelty,tempo, tempogram, Tempo andBeat 6 beat, periodicity, Fourier analysis, Tracking autocorrelation Identification,fingerprint, indexing, Content-Based 7 invertedlist, matching,version, cover Audio Retrieval song Harmonic/percussivecomponent, Musically signalreconstruction, instanteneous 8 InformedAudio frequency, fundamental frequency Decomposition (F0), trajectory, nonnegativematrix factorization (NMF) importantstructuralelementsandtotemporallysegmenttherecordingaccordingto theseelements.Withinthisscenario,wediscussfundamentalsegmentationprinci- ples based on repetitions, homogeneity, and novelty—principles that also apply to othertypesofmultimediabeyondmusic.Asanimportanttechnicaltool,westudy in detail the concept of self-similarity matrices and discuss their structural prop- erties. Finally, we briefly touch the topic of evaluation, introducing the notions of precision,recall,andF-measure.Thesemeasuresareusedtocomparethecomputed resultsthatareobtainedbyanautomatedprocedurewithso-calledgroundtruthan- notationsthataretypicallygeneratedmanuallybysomedomainexpert. In Chapter5, we consider the problem of analyzing harmonic properties of a piece of music by determining a descriptive progression of chords from a given Preface xi audiorecording.Wetakethisopportunitytofirstdiscusssomebasictheoryofhar- monyincludingconceptssuchasintervals,chords,andscales.Then,motivatedby the automated chord recognition scenario, we introduce template-based matching procedures and hidden Markov models—a concept of central importance for the analysisoftemporalpatternsintime-dependentdatastreamsincludingspeech,ges- tures,andmusic. Tempoandbeatarefurtherfundamentalpropertiesofmusic.InChapter6,we introduce the basic ideas on how to extract tempo-related information from audio recordings.Inthisscenario,afirstchallengeistolocatenoteonsetinformation—a taskthatrequiresmethodsfordetectingchangesinenergyandspectralcontent.To derive tempo and beat information, note onset candidates are then analyzed with regard to quasiperiodic patterns. This leads us to the study of general methods for localperiodicityanalysisoftimeseries. Oneimportanttopicininformationretrievalisconcernedwiththedevelopment of search engines that enable users to explore music collections in a flexible and intuitive way. In Chapter7, we discuss audio retrieval strategies that follow the query-by-exampleparadigm:givenanaudioquery,thetaskistoretrievealldocu- ments that are somehow similar or related to the query. Starting with audio iden- tification, a technique used in many commercial applications such as Shazam, we study various retrieval strategies to handle different degrees of similarity. Further- more, considering efficiency issues, we discuss fundamental indexing techniques basedoninvertedlists—aconceptoriginallyusedintextretrieval. InthefinalChapter8onaudiodecomposition,wepresentachallengingresearch directionthatiscloselyrelatedtosourceseparation.Withinthiswideresearcharea, weconsiderthreesubproblems:harmonic–percussiveseparation,mainmelodyex- traction,andscore-informedaudiodecomposition.Withinthesescenarios,wedis- cussanumberofkeytechniquesincludinginstantaneousfrequencyestimation,fun- damentalfrequency(F0)estimation,spectrograminversion,andnonnegativematrix factorization(NMF).Furthermore,weencounteranumberofacousticandmusical propertiesofaudiorecordingsthathavebeenintroducedanddiscussedinprevious chapters,whichroundsoffthebook. TargetReadership In the last fifteen years, music processing and music information retrieval (MIR) have developed into a vibrant and multidisciplinary area of research. Because of the diversity and richness of music, this area brings together researchers and stu- dents from a multitude of fields including information science, audio engineering, computerscience,andmusicology.Thisbook’sintentionistoofferinterestingma- terialforcoursesinthesefields.Themaintargetgroupsofthisbookaremasterand advancedbachelorstudents.Furthermore,wealsohopethatresearcherswhoarein- terestedindelvingintothefieldofmusicprocessingwillbenefitfromthistextbook. The eight chapters are organized in a modular fashion, thus offering lecturers and

Description:
This textbook provides both profound technological knowledge and a comprehensive treatment of essential topics in music processing and music information retrieval. Including numerous examples, figures, and exercises, this book is suited for students, lecturers, and researchers working in audio engin
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.