Table Of ContentCloudera Administration
ImportantNotice
©2010-2021Cloudera,Inc.Allrightsreserved.
Cloudera,theClouderalogo,andanyotherproductor
servicenamesorsloganscontainedinthisdocumentaretrademarksofClouderaand
itssuppliersorlicensors,andmaynotbecopied,imitatedorused,inwholeorinpart,
withoutthepriorwrittenpermissionofClouderaortheapplicabletrademarkholder.If
thisdocumentationincludescode,includingbutnotlimitedto,codeexamples,Cloudera
makesthisavailabletoyouunderthetermsoftheApacheLicense,Version2.0,including
anyrequirednotices.AcopyoftheApacheLicenseVersion2.0,includinganynotices,
isincludedherein.AcopyoftheApacheLicenseVersion2.0canalsobefoundhere:
https://opensource.org/licenses/Apache-2.0
HadoopandtheHadoopelephantlogoaretrademarksoftheApacheSoftware
Foundation.Allothertrademarks,registeredtrademarks,productnamesandcompany
namesorlogosmentionedinthisdocumentarethepropertyoftheirrespectiveowners.
Referencetoanyproducts,services,processesorotherinformation,bytradename,
trademark,manufacturer,supplierorotherwisedoesnotconstituteorimply
endorsement,sponsorshiporrecommendationthereofbyus.
Complyingwithallapplicablecopyrightlawsistheresponsibilityoftheuser.Without
limitingtherightsundercopyright,nopartofthisdocumentmaybereproduced,stored
inorintroducedintoaretrievalsystem,ortransmittedinanyformorbyanymeans
(electronic,mechanical,photocopying,recording,orotherwise),orforanypurpose,
withouttheexpresswrittenpermissionofCloudera.
Clouderamayhavepatents,patentapplications,trademarks,copyrights,orother
intellectualpropertyrightscoveringsubjectmatterinthisdocument.Exceptasexpressly
providedinanywrittenlicenseagreementfromCloudera,thefurnishingofthisdocument
doesnotgiveyouanylicensetothesepatents,trademarkscopyrights,orother
intellectualproperty.ForinformationaboutpatentscoveringClouderaproducts,see
http://tiny.cloudera.com/patents.
Theinformationinthisdocumentissubjecttochangewithoutnotice.Clouderashall
notbeliableforanydamagesresultingfromtechnicalerrorsoromissionswhichmay
bepresentinthisdocument,orfromuseofthisdocument.
Cloudera,Inc.
395PageMillRoad
PaloAlto,CA94306
info@cloudera.com
US:1-888-789-1488
Intl:1-650-362-0488
www.cloudera.com
ReleaseInformation
Version:ClouderaEnterprise5.5.x
Date:February3,2021
Table of Contents
About Cloudera Administration................................................................................7
Managing CDH and Managed Services.....................................................................8
ManagingCDHandManagedServicesUsingClouderaManager........................................................................8
Configuration Overview.........................................................................................................................................................8
Managing Clusters...............................................................................................................................................................32
ManagingServices...............................................................................................................................................................36
Managing Roles...................................................................................................................................................................45
ManagingHosts...................................................................................................................................................................49
MaintenanceMode..............................................................................................................................................................62
ManagingCDHUsingtheCommandLine..........................................................................................................65
StartingCDHServicesUsingtheCommandLine..................................................................................................................66
StoppingCDHServicesUsingtheCommandLine................................................................................................................71
MigratingDatabetweenClustersUsingdistcp....................................................................................................................73
ManagingIndividualServices.............................................................................................................................78
ManagingFlume..................................................................................................................................................................78
ManagingtheHBaseService...............................................................................................................................................80
Managing HDFS.................................................................................................................................................................134
ManagingHive...................................................................................................................................................................165
Managing Hue...................................................................................................................................................................173
Managing Impala..............................................................................................................................................................197
ManagingKey-ValueStoreIndexer....................................................................................................................................208
ManagingMapReduceandYARN......................................................................................................................................209
ManagingOozie.................................................................................................................................................................217
Managing Solr...................................................................................................................................................................225
ManagingSpark.................................................................................................................................................................228
ManagingtheSqoop1Client.............................................................................................................................................231
ManagingSqoop2.............................................................................................................................................................232
ManagingZooKeeper.........................................................................................................................................................232
ConfiguringServicestoUsetheGPLExtrasParcel.............................................................................................................233
Resource Management........................................................................................235
Schedulers........................................................................................................................................................235
Cloudera Manager Resource Management.....................................................................................................235
LinuxControlGroups(cgroups)........................................................................................................................237
ResourceManagementwithControlGroups.....................................................................................................................239
ConfiguringResourceParameters......................................................................................................................................240
StaticServicePools...........................................................................................................................................241
Dynamic Resource Pools..................................................................................................................................242
ManagingDynamicResourcePools...................................................................................................................................243
YARNPoolStatusandConfigurationOptions....................................................................................................................245
AssigningApplicationsandQueriestoResourcePools......................................................................................................246
Configuration Sets.............................................................................................................................................................248
Scheduling Rules................................................................................................................................................................249
Managing Impala Admission Control...............................................................................................................250
Managingthe Impala Llama ApplicationMaster..............................................................................................252
EnablingIntegratedResourceManagementUsingClouderaManager............................................................................253
DisablingIntegratedResourceManagementUsingClouderaManager...........................................................................254
ConfiguringLlamaUsingClouderaManager.....................................................................................................................254
Impala Resource Management........................................................................................................................254
AdmissionControlandQueryQueuing..............................................................................................................................254
IntegratedResourceManagementwithYARN...................................................................................................................262
Performance Management...................................................................................264
OptimizingPerformanceinCDH.......................................................................................................................264
ChoosingaDataCompressionFormat.............................................................................................................267
Tuningthe Solr Server......................................................................................................................................268
TuningtoCompleteDuringSetup......................................................................................................................................268
GeneralTuning...................................................................................................................................................................268
Other Resources.................................................................................................................................................................275
Tuning Spark Applications................................................................................................................................275
Tuning YARN.....................................................................................................................................................281
Overview............................................................................................................................................................................282
Cluster Configuration.........................................................................................................................................................285
YARN Configuration...........................................................................................................................................................286
MapReduce Configuration.................................................................................................................................................288
Step7:MapReduceConfiguration.....................................................................................................................................288
Step7A:MapReduceSanityChecking................................................................................................................................288
ConfiguringYourClusterInClouderaManager.................................................................................................................288
High Availability...................................................................................................290
HDFS High Availability......................................................................................................................................290
IntroductiontoHDFSHighAvailability...............................................................................................................................290
ConfiguringHardwareforHDFSHA...................................................................................................................................291
EnablingHDFSHA..............................................................................................................................................................292
DisablingandRedeployingHDFSHA..................................................................................................................................304
ConfiguringOtherCDHComponentstoUseHDFSHA.......................................................................................................305
AdministeringanHDFSHighAvailabilityCluster...............................................................................................................308
ChangingaNameserviceNameforHighlyAvailableHDFSUsingClouderaManager......................................................312
MapReduce(MRv1)andYARN(MRv2)HighAvailability..................................................................................312
YARN(MRv2)ResourceManagerHighAvailability.............................................................................................................313
WorkPreservingRecoveryforYARNComponents.............................................................................................................320
MapReduce(MRv1)JobTrackerHighAvailability..............................................................................................................322
ClouderaNavigatorKeyTrusteeServerHighAvailability.................................................................................334
ConfiguringKeyTrusteeServerHighAvailabilityUsingClouderaManager......................................................................334
ConfiguringKeyTrusteeServerHighAvailabilityUsingtheCommandLine......................................................................335
RecoveringaKeyTrusteeServer........................................................................................................................................337
KeyTrusteeKMSHighAvailability....................................................................................................................337
HighAvailabilityforOtherCDHComponents...................................................................................................338
HBaseHighAvailability......................................................................................................................................................338
Hive Metastore High Availability.......................................................................................................................................343
HueHighAvailability.........................................................................................................................................................345
Llama High Availability......................................................................................................................................................348
ConfiguringOozieforHighAvailability..............................................................................................................................349
Search High Availability.....................................................................................................................................................350
ConfiguringClouderaManagerforHighAvailabilityWithaLoadBalancer.....................................................352
IntroductiontoClouderaManagerDeploymentArchitecture...........................................................................................352
PrerequisitesforSettingupClouderaManagerHighAvailability......................................................................................353
High-LevelStepstoConfigureClouderaManagerHighAvailability.................................................................................354
DatabaseHighAvailabilityConfiguration..........................................................................................................................380
TLSandKerberosConfigurationforClouderaManagerHighAvailability.........................................................................381
Backup and Disaster Recovery..............................................................................383
PortRequirementsforBackupandDisasterRecovery.....................................................................................383
Data Replication...............................................................................................................................................384
Designating a Replication Source......................................................................................................................................386
HDFS Replication................................................................................................................................................................387
Hive Replication.................................................................................................................................................................396
Impala Metadata Replication............................................................................................................................................404
UsingSnapshotswithReplication......................................................................................................................................404
EnablingReplicationBetweenClustersinDifferentKerberosRealms................................................................................404
Replication of Encrypted Data...........................................................................................................................................406
HBase Replication..............................................................................................................................................................407
Snapshots.........................................................................................................................................................414
Cloudera Manager Snapshot Policies................................................................................................................................414
Managing HBase Snapshots..............................................................................................................................................417
ManagingHDFSSnapshots................................................................................................................................................428
Cloudera Manager Administration........................................................................433
Starting,Stopping,andRestartingtheClouderaManagerServer...................................................................433
ConfiguringClouderaManagerServerPorts....................................................................................................433
MovingtheClouderaManagerServertoaNewHost.....................................................................................433
ManagingtheClouderaManagerServerLog...................................................................................................434
Viewing the Log.................................................................................................................................................................434
SettingtheClouderaManagerServerLogLocation..........................................................................................................435
ClouderaManagerAgents................................................................................................................................435
Starting,Stopping,andRestartingClouderaManagerAgents..........................................................................................436
ConfiguringClouderaManagerAgents.............................................................................................................................437
ManagingClouderaManagerAgentLogs.........................................................................................................................440
ChangingHostnames........................................................................................................................................441
Configuring Network Settings..........................................................................................................................443
Alerts................................................................................................................................................................443
Managing Alerts................................................................................................................................................................444
Managing Licenses...........................................................................................................................................451
SendingUsageandDiagnosticDatatoCloudera.............................................................................................456
ConfiguringaProxyServer.................................................................................................................................................456
ManagingAnonymousUsageDataCollection..................................................................................................................456
ManagingHueAnalyticsDataCollection..........................................................................................................................456
DiagnosticDataCollection.................................................................................................................................................457
ExportingandImportingClouderaManagerConfiguration.............................................................................459
Backing up Cloudera Manager.........................................................................................................................459
BackingupDatabases........................................................................................................................................................461
OtherClouderaManagerTasksandSettings...................................................................................................461
Settings..............................................................................................................................................................................461
Alerts..................................................................................................................................................................................462
Users..................................................................................................................................................................................462
Kerberos.............................................................................................................................................................................462
License...............................................................................................................................................................................462
UserInterfaceLanguage....................................................................................................................................................462
Peers..................................................................................................................................................................................462
Cloudera Management Service........................................................................................................................462
Cloudera Navigator Data Management Component Administration......................468
ClouderaNavigatorAuditServer......................................................................................................................468
PublishingAuditEvents......................................................................................................................................................471
ClouderaNavigatorMetadataServer...............................................................................................................472
Managing Metadata Extraction........................................................................................................................................478
ManagingMetadataPolicies.............................................................................................................................................479
Appendix: Apache License, Version 2.0.................................................................480
AboutClouderaAdministration
About Cloudera Administration
ThisguidedescribeshowtoconfigureandadministeraClouderadeployment.Administratorsmanageresources,
availability,andbackupandrecoveryconfigurations.Inaddition,thisguideshowshowtoimplementhighavailability,
anddiscussesintegration.
ClouderaAdministration|7
ManagingCDHandManagedServices
Managing CDH and Managed Services
IfyouuseClouderaManagertomanageyourcluster,configuringandmanagingyourcluster,aswellasindividual
servicesandhosts,usesadifferentparadigmthanifyouuseCDHwithoutClouderaManager.Forthisreason,many
oftheseconfigurationtasksoffertwodifferentsubtasks,oneeachforclustersmanagedbyClouderaManagerand
oneforclusterswhichdonotuseClouderaManager.Often,thetasksarenotinterchangeable.Forinstance,ifyouuse
ClouderaManageryoucannotusestandardHadoopcommand-lineutilitiestostartandstopservices.Instead,youuse
ClouderaManagertoperformthesetasks.
Managing CDH and Managed Services Using Cloudera Manager
YoumanageCDHandmanagedservicesusingtheClouderaManagerAdminConsoleandClouderaManagerAPI.
ThefollowingsectionsfocusontheClouderaManagerAdminConsole.
ConfigurationOverview
WhenClouderaManagerconfiguresaservice,itallocatesrolesthatarerequiredforthatservicetothehostsinyour
cluster.Theroledetermineswhichservicedaemonsrunonahost.
Forexample,foranHDFSserviceinstance,ClouderaManagerconfigures:
• OnehosttoruntheNameNoderole.
• OnehosttorunasthesecondaryNameNoderole.
• OnehosttoruntheBalancerrole.
• RemaininghostsastorunDataNoderoles.
Arolegroupisasetofconfigurationpropertiesforaroletype,aswellasalistofroleinstancesassociatedwiththat
group.ClouderaManagerautomaticallycreatesadefaultrolegroupnamedRoleTypeDefaultGroupforeachrole
type.
Whenyouruntheinstallationorupgradewizard,ClouderaManagerconfiguresthedefaultrolegroupsitadds,and
addsanyotherrequiredrolegroupsforagivenroletype.Forexample,aDataNoderoleonthesamehostasthe
NameNodemightrequireadifferentconfigurationthanDataNoderolesrunningonotherhosts.ClouderaManager
createsaseparaterolegroupfortheDataNoderolerunningontheNameNodehostandusesthedefaultconfiguration
forDataNoderolesrunningonotherhosts.
ClouderaManagerwizardsautoconfigurerolegrouppropertiesbasedontheresourcesavailableonthehosts.For
propertiesthatarenotdependentonhostresources,ClouderaManagerdefaultvaluestypicallyalignwithCDHdefault
valuesforthatconfiguration.ClouderaManagerdeviateswhentheCDHdefaultisnotarecommendedconfiguration
orwhenthedefaultvaluesareillegal.
ClouderaManagerConfigurationLayout
AfterrunningtheInstallationwizard,useClouderaManagertoreconfiguretheexistingservicesandaddandconfigure
additionalhostsandservices.
ClouderaManagerconfigurationscreensoffertwolayoutoptions:new(thedefault)andclassic.Youcanswitchbetween
layoutsusingtheSwitchtoXXXlayoutlinkatthetoprightofthepage.Keepthefollowinginmindwhenyouselecta
layout:
• Ifyouswitchtotheclassiclayout,ClouderaManagerpreservesthatsettingwhenyouupgradetoanewversion.
• Selectionsmadeinonelayoutarenotpreservedwhenyouswitch.
• Certainfeatures,includingcontrolsforconfiguringNavigatorauditeventsandHDFSlogredaction,aresupported
onlyinthenewlayout.
8|ClouderaAdministration
ManagingCDHandManagedServices
Newlayoutpagescontaincontrolsthatallowyoufilterconfigurationpropertiesbasedonconfigurationstatus,category,
andgroup.Forexample,todisplaytheJournalNodemaximumlogsizeproperty(JournalNodeMaxLogSize),clickthe
CATEGORY>JournalNodeandGROUP>Logsfilters:
Whenaconfigurationpropertyhasbeensettoavaluedifferentfromthedefault,aresettodefaultvalueicon
displays.
Classiclayoutpagesareorganizedbyrolegroupandcategorieswithintherolegroup.Forexample,todisplaythe
JournalNodemaximumlogsizeproperty(JournalNodeMaxLogSize),selectJournalNodeDefaultGroup>Logs.
ClouderaAdministration|9
ManagingCDHandManagedServices
Whenaconfigurationpropertyhasbeensettoavaluedifferentfromthedefault,aResettothedefaultvaluelink
displays.
Thereisnomechanismforresettingtoanautoconfiguredvalue.However,youcanusetheconfigurationhistoryand
rollbackfeaturetorevertanyconfigurationchanges.
ModifyingConfigurationPropertiesUsingClouderaManager
Note:
ThistopicdiscusseshowtoconfigurepropertiesusingtheClouderaManager"newlayout."Theolder
layout,calledthe"classiclayout"isstillavailable.Forinstructionsonusingtheclassiclayout,see
ModifyingConfigurationProperties(ClassicLayout)onpage15.
Toswitchbetweenthelayouts,clickeithertheSwitchtothenewlayoutorSwitchtotheclassic
layoutlinksintheupper-rightportionofallconfigurationpages.
MinimumRequiredRole:Configurator(alsoprovidedbyClusterAdministrator,FullAdministrator)
WhenaserviceisaddedtoClouderaManager,eitherthroughtheinstallationorupgradewizardorwiththeAdd
Servicesworkflow,ClouderaManagerautomaticallysetstheconfigurationproperties,basedontheneedsoftheservice
andcharacteristicsoftheclusterinwhichitwillrun.Theseconfigurationpropertiesincludebothservice-wide
configurationproperties,aswellasspecificpropertiesforeachroletypeassociatedwiththeservice,managedthrough
rolegroups.Arolegroupisasetofconfigurationpropertiesforaroletype,aswellasalistofroleinstancesassociated
withthatgroup.ClouderaManagerautomaticallycreatesadefaultrolegroupnamedRoleTypeDefaultGroupfor
eachroletype.SeeRoleGroupsonpage48.
ChangingtheConfigurationofaServiceorRoleInstance
1. Gototheservicestatuspage.(Cluster>servicename)
2. ClicktheConfigurationtab.
3. Locatethepropertyyouwanttoedit.Youcantypeallorpartofthepropertynameinthesearchbox,orusethe
filtersontheleftsideofthescreen:
• TheStatussectionlimitsthedisplayedpropertiesbytheirstatus.Possiblestatusesinclude:
10|ClouderaAdministration
Description:Hadoop and the Hadoop elephant logo are trademarks of the Apache Software. Foundation. All other trademarks, registered trademarks, product