Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

Cours
en
Anglais
Sous-titres disponibles
12 h
Ce contenu est noté 3.7407 sur 5
Source
  • Sur www.coursera.org
Conditions
  • À son rythme
  • Accès libre
  • Certificat payant
Plus d'informations
  • 4 séquences
  • Niveau Introductif
  • Sous-titres en Korean

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Découvrir Edflex

Détails du cours

Déroulé

WEEK 1 : Course Orientation
You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course. 
In Module 1, we introduce you to the world of Big Data applications. We start by introducing you to Apache Spark, a common framework used for many different tasks throughout the course. We then introduce some Big Data distro packages, the HDFS file system, and finally the idea of batch-based Big Data processing using the MapReduce programming paradigm. 

WEEK 2 : Large Scale Data Storage
In this module, you will learn about large scale data storage technologies and frameworks. We start by exploring the challenges of storing large data in distributed systems. We then discuss in-memory key/value storage systems, NoSQL distributed databases, and distributed publish/subscribe queues. 

WEEK 3 : Streaming Systems
This module introduces you to real-time streaming systems, also known as Fast Data. We talk about Apache Storm in length, Apache Spark Streaming, and Lambda and Kappa architectures. Finally, we contrast all these technologies as a streaming ecosystem.  

WEEK 4 : Graph Processing and Machine Learning
In this module, we discuss the applications of Big Data. In particular, we focus on two topics: graph processing, where massive graphs (such as the web graph) are processed for information, and machine learning, where massive amounts of data are used to train models such as clustering algorithms and frequent pattern mining. We also introduce you to deep learning, where large data sets are used to train neural networks with effective results.  

Prerequisites : This course is intended for practitioners. We introduce a wide range of Big Data technologies and frameworks that are very commonly used across computer industry. We assume you are familiar with some programming language (such as Python or Java), and are now interested to take your knowledge to the next step by leveraging "frameworks" that do much of the heavy lifting involved in distributed Big Data systems. Most of the code snippets introduced in the lectures can be read as pseudocode.

Prérequis

Aucun.

Intervenants

Reza Farivar
Data Engineering Manager at Capital One, Adjunct Research Assistant Professor of Computer Science
Department of Computer Science

Roy H. Campbell
Professor of Computer Science
Department of Computer Science

Éditeur

L’université de l’Illinois à Urbana-Champaign est fondée en 1867 (UIUC). Le campus principal de l’université de l’Illinois est situé dans les villes jumelles de Champaign et Urbana, à deux cents kilomètres au sud de Chicago. 

Cette grande université est classée parmi les plus prestigieuses mondialement par divers moyens de mesure, comme le Center for World University Rankings qui la place 22ème mondialement pour la période 2020-21.

Plateforme

Coursera est une entreprise numérique proposant des formations en ligne ouverte à tous fondée par les professeurs d'informatique Andrew Ng et Daphne Koller de l'université Stanford, située à Mountain View, Californie.

Ce qui la différencie le plus des autres plateformes MOOC, c'est qu'elle travaille qu'avec les meilleures universités et organisations mondiales et diffuse leurs contenus sur le web.

Ce contenu est noté 3.7407 sur 5
(aucun avis)
Ce contenu est noté 3.7407 sur 5
(aucun avis)
Complétez cette ressource pour donner votre avis