Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

课程
en
英语
字幕可用
12 时
此内容评级为 3.7407/5
来源
  • 来自www.coursera.org
状况
  • 自定进度
  • 免费获取
  • 收费证书
更多信息
  • 4 序列
  • 等级 介绍
  • 字幕在 Korean

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Learn more

课程详情

教学大纲

WEEK 1 : Course Orientation
You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course. 
In Module 1, we introduce you to the world of Big Data applications. We start by introducing you to Apache Spark, a common framework used for many different tasks throughout the course. We then introduce some Big Data distro packages, the HDFS file system, and finally the idea of batch-based Big Data processing using the MapReduce programming paradigm. 

WEEK 2 : Large Scale Data Storage
In this module, you will learn about large scale data storage technologies and frameworks. We start by exploring the challenges of storing large data in distributed systems. We then discuss in-memory key/value storage systems, NoSQL distributed databases, and distributed publish/subscribe queues. 

WEEK 3 : Streaming Systems
This module introduces you to real-time streaming systems, also known as Fast Data. We talk about Apache Storm in length, Apache Spark Streaming, and Lambda and Kappa architectures. Finally, we contrast all these technologies as a streaming ecosystem.  

WEEK 4 : Graph Processing and Machine Learning
In this module, we discuss the applications of Big Data. In particular, we focus on two topics: graph processing, where massive graphs (such as the web graph) are processed for information, and machine learning, where massive amounts of data are used to train models such as clustering algorithms and frequent pattern mining. We also introduce you to deep learning, where large data sets are used to train neural networks with effective results.  

Prerequisites : This course is intended for practitioners. We introduce a wide range of Big Data technologies and frameworks that are very commonly used across computer industry. We assume you are familiar with some programming language (such as Python or Java), and are now interested to take your knowledge to the next step by leveraging "frameworks" that do much of the heavy lifting involved in distributed Big Data systems. Most of the code snippets introduced in the lectures can be read as pseudocode.

先决条件

没有。

讲师

Reza Farivar
Data Engineering Manager at Capital One, Adjunct Research Assistant Professor of Computer Science
Department of Computer Science

Roy H. Campbell
Professor of Computer Science
Department of Computer Science

编辑

伊利诺伊大学香槟分校(UIUC)成立于 1867 年。伊利诺伊大学的主校区位于芝加哥以南 200 公里处的香槟和厄巴纳双城。

根据世界大学排名中心(Center for World University Rankings)等多项排名,这所重点大学跻身全球最负盛名的大学之列,2020-21 年的全球排名为第 22 位。

平台

Coursera是一家数字公司,提供由位于加利福尼亚州山景城的计算机教师Andrew Ng和达芙妮科勒斯坦福大学创建的大型开放式在线课程。

Coursera与顶尖大学和组织合作,在线提供一些课程,并提供许多科目的课程,包括:物理,工程,人文,医学,生物学,社会科学,数学,商业,计算机科学,数字营销,数据科学 和其他科目。

此内容评级为 3.7407/5
(没有评论)
此内容评级为 3.7407/5
(没有评论)
完成这个资源,写一篇评论