Course Information

Course name: Hadoop Administrator (CCAH) Certification

Public classes and custom classes

Start date: 2024-06-15

Course Description


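Course Overview

Learn the concepts and practice of Hadoop system administration, from installation and configuration, through load balancing and tuning, to diagnosing and resolving deployment problems.

Intended for administrators who need to build or maintain Hadoop clusters. Participants should have basic Linux knowledge; no prior Hadoop knowledge is required.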

CCA Administrator Exam (CCA131)

Exam format: 120 minutes; 70% to pass; candidates complete 8 to 12 scenario-based tasks on a preconfigured Cloudera Enterprise cluster.

 

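Course Overview

As a core big data technology, Hadoop gives enterprises a highly scalable, highly redundant, fault-tolerant, and cost-effective "data-driven" solution. In response to the current shortage of engineers with large-scale data skills, Qinglan Consulting's CCAH course is aimed at people who already have skills and experience in Linux system administration and networking. No prior Hadoop background or experience is required.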


Target Audience

System administrators and anyone else who needs to manage an Apache Hadoop cluster (including production and development environments).


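Course Content

· How the Hadoop Distributed File System (HDFS) and MapReduce work

· Planning hardware for a Hadoop cluster

· Planning the network for a Hadoop cluster

· Configuring and tuning a Hadoop cluster

· How to configure NameNode HA

· How to configure NameNode Federation

· How to configure the FairScheduler so that multiple users can share a Hadoop cluster

· How to install and implement Kerberos-based security for a Hadoop cluster

· How to maintain and monitor a Hadoop cluster

· How to use Flume to load dynamically generated files and Sqoop to import and export data from relational databases

· System administration tasks for Hadoop ecosystem tools such as Hive, Pig, and HBase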


Module

Content

The Case for Apache Hadoop

Why Hadoop?

A Brief History of Hadoop

Core Hadoop Components

Fundamental Concepts

HDFS

 

HDFS Features

Writing and Reading Files

NameNode Considerations

Overview of HDFS Security

Using the NameNode Web UI

Using the Hadoop File Shell

Getting Data into HDFS

Ingesting Data from External Sources with Flume

Ingesting Data from Relational Databases with Sqoop

REST Interfaces

Best Practices for Importing Data

MapReduce

What Is MapReduce?

Features of MapReduce

Basic Concepts

Architectural Overview

MapReduce Version 2

Failure Recovery

Using the JobTracker Web UI

Planning Your Hadoop Cluster

 

General Planning Considerations

Choosing the Right Hardware

Network Considerations

Configuring Nodes

Planning for Cluster Management

Hadoop Installation and Initial Configuration

 

 Deployment Types

 Installing Hadoop

 Specifying the Hadoop Configuration

 Performing Initial HDFS Configuration

 Performing Initial MapReduce Configuration

 Log File Locations

Installing and Configuring Hive, Impala, and Pig

 

 Hive

 Impala

 Pig

Hadoop Clients

 

 What is a Hadoop Client?

 Installing and Configuring Hadoop Clients

 Installing and Configuring Hue

 Hue Authentication and Configuration

Cloudera Manager

 

 

 The Motivation for Cloudera Manager

 Cloudera Manager Features

 Standard and Enterprise Versions

 Cloudera Manager Topology

 Installing Cloudera Manager

 Installing Hadoop Using Cloudera Manager

 Performing Basic Administration Tasks

Advanced Cluster Configuration

 Advanced Configuration Parameters

 Configuring Hadoop Ports

 Explicitly Including and Excluding Hosts

 Configuring HDFS for Rack Awareness

 Configuring HDFS High Availability

Hadoop Security

 

 Why Hadoop Security Is Important

 Hadoop’s Security System Concepts

 What Kerberos Is and How it Works

 Securing a Hadoop Cluster with Kerberos

Managing and Scheduling Jobs

 

 Managing Running Jobs

 Scheduling Hadoop Jobs

 Configuring the FairScheduler

Cluster Maintenance

 Checking HDFS Status

 Copying Data Between Clusters

 Adding and Removing Cluster Nodes

 Rebalancing the Cluster

 NameNode Metadata Backup

 Cluster Upgrading

Cluster Monitoring and Troubleshooting

 

 General System Monitoring

 Managing Hadoop’s Log Files

 Monitoring Hadoop Clusters

 Common Troubleshooting Issues


Note: The actual start date may be adjusted as circumstances require. Follow the official Qinglan Consulting WeChat account or contact a course advisor for updates.




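[Contact Qinglan Consulting]

Address: Room 309, 3rd Floor, Block B, TCL Building, No. 06 Gaoxin South 1st Road, Nanshan District, Shenzhen (Bus stop: Dachong; Metro: Line 1, Gaoxinyuan Station, Exit C)

    Postal code: 518057

    Phone: 0755-86950769

    Email: peixun@shzhchina.com

    Website: http://www.mycalorietracker.com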

 
