Hadoop Platform and Application Framework
date_range Débute le 20 mars 2017
event_note Se termine le 24 avril 2017
list 5 séquences
assignment Niveau : Introductif
chat_bubble_outline Langue : Anglais
card_giftcard 15 points
Envie de partager ce MOOC dans votre entreprise ?
My Mooc
For Business
- /5
Avis de la communauté
0 avis

Les infos clés

credit_card Formation gratuite
timer 25 heures de cours

En résumé

This course is for novice programmers or business people who would like to understand the core tools used to wrangle and analyze big data. With no prior experience, you will have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. You will be comfortable explaining the specific components and basic processes of the Hadoop architecture, software stack, and execution environment. In the assignments you will be guided in how data scientists apply the important concepts and techniques such as Map-Reduce that are used to solve fundamental problems in big data. You'll feel empowered to have conversations about big data and the data analysis process.

more_horiz Lire plus
more_horiz Lire moins
dns

Le programme

  • Week 1 - Hadoop Basics
    Welcome to the first module of the Big Data Platform course. This first module will provide insight into Big Data Hype, its technologies opportunities and challenges. We will take a deeper look into the Hadoop stack and tool and technologies associated with B...
  • Week 2 - Introduction to the Hadoop Stack
    In this module we will take a detailed look at the Hadoop stack ranging from the basic HDFS components, to application execution frameworks, and languages, services.
  • Week 3 - Introduction to Hadoop Distributed File System (HDFS)
    In this module we will take a detailed look at the Hadoop Distributed File System (HDFS). We will cover the main design goals of HDFS, understand the read/write process to HDFS, the main configuration parameters that can be tuned to control HDFS performance an...
  • Week 4 - Introduction to Map/Reduce
    This module will introduce Map/Reduce concepts and practice. You will learn about the big idea of Map/Reduce and you will learn how to design, implement, and execute tasks in the map/reduce framework. You will also learn the trade-offs in map/reduce and how t...
  • Week 5 - Spark
    Welcome to module 5, Introduction to Spark, this week we will focus on the Apache Spark cluster computing framework, an important contender of Hadoop MapReduce in the Big Data Arena. Spark provides great performance advantages over Hadoop MapReduce,especially...
record_voice_over

Les intervenants

  • Natasha Balac, Director, Predictive Analytics Center of Excellence (PACE)
    San Diego Supercomputer Center
  • Paul Rodriguez, Research Programmer
    San Diego Supercomputer Center (SDSC)
  • Andrea Zonca, HPC Applications Specialist
    San Diego Supercomputer Center (SDSC)
store

Le concepteur

UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory.
assistant

La plateforme

Coursera est une entreprise numérique proposant des formation en ligne ouverte à tous fondée par les professeurs d'informatique Andrew Ng et Daphne Koller de l'université Stanford, située à Mountain View, Californie.

Ce qui la différencie le plus des autres plateformes MOOC, c'est qu'elle travaille qu'avec les meilleures universités et organisations mondiales et diffuse leurs contenus sur le web.

Vous êtes le concepteur de ce MOOC ?
Quelle note donnez-vous à cette ressource ?
Contenu
0/5
Plateforme
0/5
Animation
0/5