Data Manipulation at Scale: Systems and Algorithms

Data Manipulation at Scale: Systems and Algorithms

Course
en
English
24 h
This content is rated 0 out of 5
Source
  • From www.coursera.org
Conditions
  • Self-paced
  • Free Access
  • Fee-based Certificate
More info
  • 4 Sequences
  • Introductive Level

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Learn more

Course details

Syllabus

  • Week 1 - Data Science Context and Concepts
    Understand the terminology and recurring principles associated with data science, and understand the structure of data science projects and emerging methodologies to approach them. Why does this emerging field exist? How does it relate to other fields? Ho...
  • Week 2 - Relational Databases and the Relational Algebra
    Relational Databases are the workhouse of large-scale data management. Although originally motivated by problems in enterprise operations, they have proven remarkably capable for analytics as well. But most importantly, the principles underlying relational d...
  • Week 3 - MapReduce and Parallel Dataflow Programming
    The MapReduce programming model (as distinct from its implementations) was proposed as a simplifying abstraction for parallel manipulation of massive datasets, and remains an important concept to know when using and evaluating modern big data platforms.
  • Week 4 - NoSQL: Systems and Concepts
    NoSQL systems are purely about scale rather than analytics, and are arguably less relevant for the practicing data scientist. However, they occupy an important place in many practical big data platform architectures, and data scientists need to understand the...
  • Week 4 - Graph Analytics
    Graph-structured data are increasingly common in data science contexts due to their ubiquity in modeling the communication between entities: people (social networks), computers (Internet communication), cities and countries (transportation networks), or corpor...

Prerequisite

None.

Instructors

Bill Howe
Director of Research
Scalable Data Analytics

Editor

The University of Washington is a public research university in Seattle, Washington. Founded on November 4, 1861 as Territorial University, Washington is one of the oldest universities on the West Coast and was established in Seattle about a decade after the city's founding.

The university has a 703-acre main campus located in the city's University District, as well as campuses in Tacoma and Bothell. Overall, UW comprises more than 500 buildings and more than 20 million gross square feet of space, including one of the world's largest library systems with more than 26 academic libraries, art centres, museums, laboratories, lecture halls and stadiums.

Washington is the flagship institution of Washington State's six public universities. It is renowned for its medical, technical and scientific research.

Platform

Coursera is a digital company offering massive open online course founded by computer teachers Andrew Ng and Daphne Koller Stanford University, located in Mountain View, California. 

Coursera works with top universities and organizations to make some of their courses available online, and offers courses in many subjects, including: physics, engineering, humanities, medicine, biology, social sciences, mathematics, business, computer science, digital marketing, data science, and other subjects.

This content is rated 4.5 out of 5
(no review)
This content is rated 4.5 out of 5
(no review)
Complete this resource to write a review