· Course name: Hadoop Administrator (CCAH) Certification
· Start date: 2024-06-15
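【Course Overview】
Learn the concepts and practice of Hadoop system administration, from installation and configuration, through load balancing and tuning, to diagnosing and resolving deployment problems.
Intended for administrators who need to build or maintain Hadoop clusters. Participants need basic Linux knowledge; no prior Hadoop knowledge is required.
CCA Administrator Exam (CCA131)
Exam format: 120 minutes; passing score 70%; candidates complete 8 to 12 scenario-based tasks on a preconfigured Cloudera Enterprise cluster.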
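As a core big data technology, Hadoop gives enterprises a highly scalable, highly redundant, fault-tolerant, and cost-effective "data-driven" solution. In response to the current widespread shortage of technical staff with large-scale data skills, the 青藍咨詢 CCAH course is aimed at people who already have Linux system administration and networking skills and experience. No prior Hadoop background or experience is required.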
【Target Audience】
System administrators, or anyone who needs to manage Apache Hadoop clusters (including production and development environments).
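【Course Content】
· How the Hadoop Distributed File System (HDFS) and MapReduce work
· Hardware planning for Hadoop clusters
· Network planning for Hadoop clusters
· Hadoop cluster configuration and tuning
· How to configure NameNode HA
· How to configure NameNode Federation
· How to configure the FairScheduler so that multiple users can share a Hadoop cluster
· How to install and implement Kerberos-based security for a Hadoop cluster
· How to maintain and monitor a Hadoop cluster
· How to use Flume to load dynamically generated files, and Sqoop to import and export data from relational databases
· System administration tasks for Hadoop ecosystem tools such as Hive, Pig, and HBase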
Module | Content |
The Case for Apache Hadoop | · Why Hadoop? · A Brief History of Hadoop · Core Hadoop Components · Fundamental Concepts |
HDFS | · HDFS Features · Writing and Reading Files · NameNode Considerations · Overview of HDFS Security · Using the Namenode Web UI · Using the Hadoop File Shell |
Getting Data into HDFS | · Ingesting Data from External Sources with Flume · Ingesting Data from Relational Databases with Sqoop · REST Interfaces · Best Practices for Importing Data |
MapReduce | · What Is MapReduce? · Features of MapReduce · Basic Concepts · Architectural Overview · MapReduce Version 2 · Failure Recovery · Using the JobTracker Web UI |
Planning Your Hadoop Cluster | · General Planning Considerations · Choosing the Right Hardware · Network Considerations · Configuring Nodes · Planning for Cluster Management |
Hadoop Installation and Initial Configuration | · Deployment Types · Installing Hadoop · Specifying the Hadoop Configuration · Performing Initial HDFS Configuration · Performing Initial MapReduce Configuration · Log File Locations |
Installing and Configuring Hive, Impala, and Pig | · Hive · Impala · Pig |
Hadoop Clients | · What is a Hadoop Client? · Installing and Configuring Hadoop Clients · Installing and Configuring Hue · Hue Authentication and Configuration |
Cloudera Manager | · The Motivation for Cloudera Manager · Cloudera Manager Features · Standard and Enterprise Versions · Cloudera Manager Topology · Installing Cloudera Manager · Installing Hadoop Using Cloudera Manager · Performing Basic Administration Tasks · Advanced Cluster Configuration · Advanced Configuration Parameters · Configuring Hadoop Ports · Explicitly Including and Excluding Hosts · Configuring HDFS for Rack Awareness · Configuring HDFS High Availability |
Hadoop Security | · Why Hadoop Security Is Important · Hadoop’s Security System Concepts · What Kerberos Is and How it Works · Securing a Hadoop Cluster with Kerberos |
Managing and Scheduling Jobs | · Managing Running Jobs · Scheduling Hadoop Jobs · Configuring the FairScheduler |
Cluster Maintenance | · Checking HDFS Status · Copying Data Between Clusters · Adding and Removing Cluster Nodes · Rebalancing the Cluster · NameNode Metadata Backup · Cluster Upgrading |
Cluster Monitoring and Troubleshooting | · General System Monitoring · Managing Hadoop’s Log Files · Monitoring Hadoop Clusters · Common Troubleshooting Issues |
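Note: The actual start date may be adjusted according to circumstances. Please follow the official 青藍咨詢 WeChat account for updates, or ask a course advisor.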
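【Contact 青藍咨詢】
Address: Room 309, 3/F, Block B, TCL Building, No. 06 Gaoxin South 1st Road, Nanshan District, Shenzhen (bus stop: Dachong; metro: Line 1, Hi-Tech Park station, Exit C)
Postal code: 518057
Phone: 0755-86950769
Website: http://www.mycalorietracker.com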