ebook img

Data Science: Innovative Developments in Data Analysis and Clustering PDF

346 Pages·2017·5.137 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Data Science: Innovative Developments in Data Analysis and Clustering

Studies in Classifi cation, Data Analysis, and Knowledge Organization Francesco Palumbo Angela Montanari Maurizio Vichi Editors Data Science Innovative Developments in Data Analysis and Clustering Studies in Classification, Data Analysis, and Knowledge Organization ManagingEditors EditorialBoard H.-H.Bock,Aachen D.Baier,Cottbus W.Gaul,Karlsruhe F.Critchley,MiltonKeynes M.Vichi,Rome R.Decker,Bielefeld C.Weihs,Dortmund E.Diday,Paris M.Greenacre,Barcelona C.N.Lauro,Naples J.Meulman,Leiden P.Monari,Bologna S.Nishisato,Toronto N.Ohsumi,Tokyo O.Opitz,Augsburg G.Ritter,Passau M.Schader,Mannheim Moreinformationaboutthisseriesathttp://www.springer.com/series/1564 Francesco Palumbo • Angela Montanari (cid:129) Maurizio Vichi Editors Data Science Innovative Developments in Data Analysis and Clustering 123 Editors FrancescoPalumbo AngelaMontanari DepartmentofPoliticalSciences DepartmentofStatisticalSciencesPaolo UniversityofNaplesFedericoII Fortunati Napoli,Italy AlmaMaterStudiorum,University ofBologna Bologna,Italy MaurizioVichi DepartmentofStatisticalSciences SapienzaUniversityofRome Rome,Italy ISSN1431-8814 ISSN2198-3321 (electronic) StudiesinClassification,DataAnalysis,andKnowledgeOrganization ISBN978-3-319-55722-9 ISBN978-3-319-55723-6 (eBook) DOI10.1007/978-3-319-55723-6 LibraryofCongressControlNumber:2017942955 ©SpringerInternationalPublishingAG2017 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartof thematerialisconcerned,specificallytherightsoftranslation,reprinting,reuseofillustrations,recitation, broadcasting,reproductiononmicrofilmsorinanyotherphysicalway,andtransmissionorinformation storageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilarmethodology nowknownorhereafterdeveloped. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublication doesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevant protectivelawsandregulationsandthereforefreeforgeneraluse. Thepublisher,theauthorsandtheeditorsaresafetoassumethattheadviceandinformationinthisbook arebelievedtobetrueandaccurateatthedateofpublication.Neitherthepublishernortheauthorsor theeditorsgiveawarranty,expressorimplied,withrespecttothematerialcontainedhereinorforany errorsoromissionsthatmayhavebeenmade.Thepublisherremainsneutralwithregardtojurisdictional claimsinpublishedmapsandinstitutionalaffiliations. Printedonacid-freepaper ThisSpringerimprintispublishedbySpringerNature TheregisteredcompanyisSpringerInternationalPublishingAG Theregisteredcompanyaddressis:Gewerbestrasse11,6330Cham,Switzerland Preface On 4 July 1985 in Cambridge (UK), six classification societies gave birth to the InternationalFederationofClassificationSocieties;theSocietàItalianadiStatistica (SIS) played an active propulsive role in the IFCS constitution. During the 30 years of the IFCS life, many other classification societies from all around the world have joined the IFCS. Throughthe active participation of its members, the SIS has actively and enthusiastically contributed to the IFCS growth. In 1997 the first conference of the Classification and Data Analysis Group of the SIS was hosted by the Universityof Chieti-Pescara. Followingthis long story of presence, in the occasion of IFCS’s 30th birthday, under the IFCS presidency of Maurizio Vichi, the Classification and Data Analysis Group of the SIS and the Department of Statistical Sciences P. Fortunati of the Alma Mater Studiorum University of Bologna were proudly willing to organize the IFCS conference in Bologna. The conferenceorganizerwasAngelaMontanari(UniversityofBologna)andFrancesco Palumbo(UniversityofNaplesFedericoII)servedaschairofthescientificprogram committee.Theconferencewasheldbetween5and8July2015. Scholars from many different countries attended the conference. The commit- mentofthelocalorganizingcommitteeandtheearnestnessoftheScientificProgram Committeeensuredasuccessfulandworthwhileconference.Wearegratefultothe membersoftheScientificProgramCommittee:A.Cerioli(ClaDAG,Italy),D.Choi (KCS, Korea), C. Cuevas Covarrubias(SOLCAD, Mexico), N. Dean (BCS, UK), A.Ferligoj(SSS, Slovenia),P. Giudici(ClaDAG,Italy),C. Hennig(BSC, UK),T. Imaizumi (JCS, Japan), B. Lausen (GfKl, UK), P. McNicholas (CS, Canada), M. Nadif (SFC, France), A. Okada (JCS, Japan), I. Papadimitriou (GSDA, Greece), J. Pociecha (SKAD, Poland), A. Sbihi (MCA, Marocco), B. Scotney (IPRCS, Ireland), F. Sousa (CLAD, Portugal), D. Steinley (CS, USA), I. Van Mechelen (VOC,Belgium),andJ.Vermunt(VOC,theNetherlands).IFCShaslong-standing tradition of cooperation and exchange with other scientific statistical societies; in occasion of the IFCS conference in Bologna, V. Esposito Vinzi (France) and P. Groenen(the Netherlands) were invited to join the Scientific Program Committee asdelegatesoftheISBISandIASCsocieties,respectively. v vi Preface More than 200 contributions were organized into specialized sessions, con- tributedpapersessions,andonepostersession.Moreover,fivekeynotelectureswere givenbyeminentcolleaguesondifferenttopicsofdataanalysisandclassification. The opening plenary session was devoted to the IFCS birthday celebration and a special session celebrated the 25th anniversary of the publication of the book on GeneralizedAdditiveModelsbyHastieandTibshirani. Thanks to the collaboration with the publisher Springer and to its interest and attention to the IFCS activities during these 30 years, according to the longly consolidated tradition, the present post proceeding volume has been edited after theconference. Thescientificcommunityunanimouslyconsidersdatascienceasoneofthemost promising fields where to direct scientific research in the next years. However, already in occasion of the fifth IFCS conference, which was held in the year 1996 in Kobe (Japan), the related proceedingsvolume was entitled Data Science, Classification,andRelatedMethods(Hayashietal.eds;SpringerJapan,publisher). To emblematize the line of continuity along the IFCS, in occasion of the 30th birthdayconference,wehavedecidedtoentitlethevolumeDataScience:Innovative Developmentsin DataAnalysisandClustering. Thevolumeis a collectionoffull papers submitted after the conference. Papers were selected after a peer-review process,accordingtothehigh-qualitystandardsoftheseries. The volume is made of 27 contributions organized in three parts including contributionson: (cid:129) Classificationmethodsforhigh-dimensionaldata (cid:129) Clusteringmethodsandapplications (cid:129) Multivariatemethodsandapplications Bologna,Italy AngelaMontanari Napoli,Italy FrancescoPalumbo Roma,Italy MaurizioVichi November2016 Acknowledgments We are indebted to many people who allowed the success of the IFCS 2015 conferencewiththeircommitment.Thisbookrepresentsthefinaloutcomeofallthe workdonefortheorganizationoftheconference,duringthedaysoftheconference and after the end of it. First, we are grateful to the Department of Statistical SciencesoftheUniversityofBolognawhohostedtheconference.Inparticularour thanksareaddressedtothemembersoftheorganizingcommittee:L.Anderlucci,S. Bianconcini,S.Cagnone,L.DeAngelis,G.Galimberti,A.Lubisco,M.Lupparelli, P.Monari,L.Stracqualursi,andC.Viroli.TwomoreadditionalthanksareforLaura Anderlucci,who has taken care of the conferenceweb site, and for Paola Monari who,discreetlybuteffectively,hasmadeallher experienceandshrewdnessin the organizationavailableforthesuccessoftheconference. We are also indebted to our colleagues that have collaborated in the review processofthisvolume: AndreaCerioli, ClaudioConversano, PasqualeDolce, LeonardoGrilli, PatrickGroenen, ChristianHennig, TadashiImaizumi, BertholdLausen, AntonelloMaruotti, PaulMcNicholas, FionnMurtagh, MohamedNadif, AkinoriOkada, DomenicoPiccolo, GiancarloRagozini, IvenVanMechelen, JoséFernandoVera, RosannaVerde, VincenzoEspositoVinzi, DomenicoVistocco. Lastbutnotleast,wearealsoindebtedwithSASInstitute,Springer,APTServizi Regione Emilia Romagna and Ascom Bologna for their financial support to the conference. vii Contents PartI ClassificationMethodsforHighDimensionalData MissingDataImputationandItsEffectontheAccuracy ofClassification................................................................... 3 LynetteA.Hunt On Coupling Robust Estimation with Regularization forHigh-DimensionalData ..................................................... 15 JanKalinaandJaroslavHlinka ClassificationMethodsintheResearchontheFinancialStanding ofConstructionEnterprisesAfterBankruptcyinPoland................... 29 Barbara Pawełek, Krzysztof Gałuszka, Jadwiga Kostrzewska, andMaciejKostrzewski On the Identification of Correlated Differential Features forSupervisedClassificationofHigh-DimensionalData.................... 43 ShuKayNgandGeoffreyJ.McLachlan PartII ClusteringMethodsandApplications T-SharperImagesandT-LevelCutsofFuzzyPartitions.................... 61 SlavkaBodjanova Benchmarking for Clustering Methods Based on Real Data: AStatisticalView................................................................. 73 Anne-LaureBoulesteixandMyriamHatz RepresentableHierarchicalClusteringMethodsforAsymmetric Networks.......................................................................... 83 Gunnar Carlsson, Facundo Mémoli, Alejandro Ribeiro, andSantiagoSegarra ix x Contents AMedian-BasedConsensusRuleforDistanceExponentSelection intheFrameworkofIntelligentandWeightedMinkowskiClustering .... 97 Renato Cordeiro de Amorim, Nadia Tahiri, Boris Mirkin, andVladimirMakarenkov FindingPrototypesThroughaTwo-StepFuzzyApproach.................. 111 MarioFordelloneandFrancescoPalumbo Clustering Air MonitoringStationsAccordingtoBackground and Ambient Pollution Using Hidden Markov Models andMultidimensionalScaling.................................................. 123 ÁlvaroGómez-Losada MarkedPointProcessesforMicroarrayDataClustering................... 133 KhadidjaHenni,OlivierAlata,AbdellatifElIdrissi,BrigitteVannier, LyndaZaoui,andAhmedMoussa Social Differentiation of Cultural Taste and Practice inContemporaryJapan:NonhierarchicalAsymmetric ClusterAnalysis.................................................................. 149 MikiNakai TheClassificationandVisualizationofTwitterTrendingTopics ConsideringTimeSeriesVariation............................................. 161 AtsuhoNakayama Handling Missing Data in Observational Clinical Studies ConcerningCardiovascularRisk:AnInsightintoCriticalAspects........ 175 NadiaSolaro,DanielaLucini,andMassimoPagani PartIII MultivariateMethodsandApplications PredictionErrorinDistance-BasedGeneralizedLinearModels........... 191 EvaBoj,TeresaCosta,andJosepFortiana AnInflatedModeltoAccountforLargeHeterogeneityinOrdinal Data................................................................................ 205 StefaniaCapecchi,RosariaSimone,andDomenicoPiccolo FunctionalDataAnalysisforOptimizingStrategiesofCash-Flow Management...................................................................... 219 FrancescaDiSalvo,MarcelloChiodi,andPietroPatricola The Five FactorModel ofPersonalityandEvaluationofDrug ConsumptionRisk ............................................................... 231 ElaineFehrman,AwazK.Muhammad,EvgenyM.Mirkes,Vincent Egan,andAlexanderN.Gorban CorrelationAnalysisforMultivariateFunctionalData ..................... 243 TomaszGórecki,MirosławKrzys´ko,andWaldemarWołyn´ski

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.