課程信息
課程名稱: Hadoop管理工程師(CCAH)認(rèn)證
公開(kāi)班、定制班
開(kāi)課時(shí)間:2024-06-15
課程介紹
【課程簡(jiǎn)介】
從安裝及配置、負(fù)載均衡及調(diào)整,以及 診斷和解決部署問(wèn)題等各方面了解 Hadoop 系統(tǒng)管理員的概念和實(shí)踐。
面向需要建立或維護(hù) Hadoop 集群的管理員。培訓(xùn)對(duì)象要求具備 Linux 基本知識(shí),Hadoop相關(guān)知識(shí)不作要求。
CCA Administrator Exam (CCA131) 管理員認(rèn)證考試
考試形式:120分鐘;70%通過(guò);基于一個(gè)預(yù)配置的Cloudera企業(yè)版集群,解決8~12個(gè)場(chǎng)景下的任務(wù)
【課程簡(jiǎn)介】
作為大數(shù)據(jù)核心技術(shù),hadoop 為企業(yè)提供了高擴(kuò)展、高冗余、高容錯(cuò)、和經(jīng)濟(jì)有效的“數(shù)據(jù)驅(qū)動(dòng)”解決方案。針對(duì)目前普遍缺乏海量數(shù)據(jù)技術(shù)人員的現(xiàn)狀,青藍(lán)咨詢的CCAH課程面向具備和掌握Linux系統(tǒng)管理和網(wǎng)絡(luò)相關(guān)技能和經(jīng)驗(yàn)。無(wú)需具備Hadoop基礎(chǔ)和經(jīng)驗(yàn)。
【授課對(duì)象】
系統(tǒng)管理員或者任何需要管理Apache Hadoop機(jī)群的人員(包括產(chǎn)品及開(kāi)發(fā)環(huán)境)。
【授課內(nèi)容】
· Hadoop分布式文件系統(tǒng)和MapReduce工作原理
· Hadoop集群硬件配置規(guī)劃
· Hadoop集群網(wǎng)絡(luò)配置規(guī)劃
· Hadoop集群配置及優(yōu)化
· 如何配置NameNode HA
· 任何配置NameNode Federation
· 任何配置FairScheduler為多用戶共享Hadoop集群
· 任何為Hadoop集群安裝和實(shí)現(xiàn)基于Kerberos的安全性
· 如何維護(hù)和監(jiān)測(cè)Hadoop集群
· 如何使用Flume加載動(dòng)態(tài)產(chǎn)生的文件以及使用Sqoop連接關(guān)系數(shù)據(jù)庫(kù)進(jìn)行數(shù)據(jù)導(dǎo)入導(dǎo)出
· Hive、Pig和HBase等Hadoop生態(tài)系統(tǒng)工具相關(guān)的系統(tǒng)管理工作
模塊 |
內(nèi)容 |
The Case for Apache Hadoop |
l Why Hadoop? l A Brief History of Hadoop l Core Hadoop Components l Fundamental Concepts |
HDFS
|
l HDFS Features l Writing and Reading Files l NameNode Considerations l Overview of HDFS Security l Using the Namenode Web UI l Using the Hadoop File Shell |
Getting Data into HDFS |
l Ingesting Data from External Sources with Flume l Ingesting Data from Relational Databases with Sqoop l REST Interfaces l Best Practices for Importing Data |
MapReduce |
l What Is MapReduce? l Features of MapReduce l Basic Concepts l Architectural Overview l MapReduce Version 2 l Failure Recovery l Using the JobTracker Web UI |
Planning Your Hadoop Cluster
|
l General Planning Considerations l Choosing the Right Hardware l Network Considerations l Configuring Nodes l Planning for Cluster Management |
Hadoop Installation and Initial Configuration
|
l Deployment Types l Installing Hadoop l Specifying the Hadoop Configuration l Performing Initial HDFS Configuration l Performing Initial MapReduce Configuration l Log File Locations l |
Installing and Configuring Hive, Impala, and Pig
|
l Hive l Impala l Pig |
Hadoop Clients
|
l What is a Hadoop Client? l Installing and Configuring Hadoop Clients l Installing and Configuring Hue l Hue Authentication and Configuration |
Cloudera Manager
|
l The Motivation for Cloudera Manager l Cloudera Manager Features l Standard and Enterprise Versions l Cloudera Manager Topology l Installing Cloudera Manager l Installing Hadoop Using Cloudera Manager l Performing Basic Administration Tasks l Advanced Cluster Configuration l Advanced Configuration Parameters l Configuring Hadoop Ports l Explicitly Including and Excluding Hosts l Configuring HDFS for Rack Awareness l Configuring HDFS High Availability |
Hadoop Security
|
l Why Hadoop Security Is Important l Hadoop’s Security System Concepts l What Kerberos Is and How it Works l Securing a Hadoop Cluster with Kerberos |
Managing and Scheduling Jobs
|
l Managing Running Jobs l Scheduling Hadoop Jobs l Configuring the FairScheduler Cluster Maintenance l Checking HDFS Status l Copying Data Between Clusters l Adding and Removing Cluster Nodes l Rebalancing the Cluster l NameNode Metadata Backup l Cluster Upgrading |
Cluster Monitoring and Troubleshooting
|
l General System Monitoring l Managing Hadoop’s Log Files l Monitoring Hadoop Clusters l Common Troubleshooting Issues |
注:具體開(kāi)課時(shí)間將根據(jù)實(shí)際進(jìn)行調(diào)整,請(qǐng)關(guān)注青藍(lán)咨詢官方公眾號(hào)消息或咨詢課程顧問(wèn)!
【聯(lián)系青藍(lán)咨詢】
地址: 深圳市南山區(qū)高新南一道06號(hào)TCL大廈B座3樓309室 (公交站:大沖 地鐵站:一號(hào)線高新園C出口)
郵編:518057
電話:0755-86950769
網(wǎng)址:http://www.mycalorietracker.com