Managing Big Data with R and Hadoop

Closed
课程
en
英语
20 时
此内容评级为 0/5
来源
  • 来自www.futurelearn.com
更多信息
  • 5 序列
  • 等级 介绍
  • 从5 五月 2019开始
  • 以8 六月 2019结束

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Learn more

课程详情

教学大纲

What topics will you cover?

  1. Welcome to BIG DATA
  2. Working with Hadoop
  3. First steps in R and RHadoop
  4. Statistical learning with RHadoop: clustering
  5. Statistical learning with RHadoop: regression and classification   

先决条件

This course is designed for people interested in data science, computational statistics and machine learning and have basic experiences with them. It will be also useful for advanced undergraduate students and first year PhD students in data analysis, statistics or bioinformatics, who wish to understand how to manage big data with Hadoop using R programming language.

We expect that the learners will also have basic experiences with linux, bash and R and are capable to download and run virtual machine.

What software or tools do you need?

All software needed to actively participate the course is provided within the virtual machine that the followers are supposed to download and run on the local machine. No extra software is needed. You will need a modest local machine with 15GB free disk space and 2GB RAM.

讲师

Janez Povh
I am an active researcher in mathematical optimization, which has many applications in data science and where HPC is an inevitable tool.


Biljana Mileva Boshkoska
Biljsna Mileva Boshkoska is an assistant professor in computer science. Her interests include decision support systems, data mining and working with big data.


Leon Kos
Leon Kos is a 25+ years veteran of using Linux desktop on a daily basis to build digital relationships for research, teaching, and getting the job done by programming.

编辑

The Partnership for Advanced Computing in Europe (PRACE) is an international non-profit association with its seat in Brussels. The PRACE Research Infrastructure provides a persistent world-class high performance computing service for scientists and researchers from academia and industry in Europe.

The computer systems and their operations accessible through PRACE are provided by 4 PRACE members (BSC representing Spain, CINECA representing Italy, CSCS representing Switzerland, GCS representing Germany and GENCI representing France). The Implementation Phase of PRACE receives funding from the EU’s Seventh Framework Programme (FP7/2007-2013) under grant agreement RI-312763 and from the EU’s Horizon 2020 Research and Innovation Programme (2014-2020) under grant agreement 653838.

平台

FutureLearn est une plate-forme d'apprentissage proposant des formations en ligne ouvertes à tous (MOOC)

Fondée en Décembre 2012, la société est entièrement détenue par l'Open University à Milton Keynes, en Angleterre.

Elle est la 1ère plateforme offrant des MOOC au Royaume-Uni, avec à son actif plus d'une cinquantaine d'universités partenaires provenant du Royaume Uni mais aussi du reste du monde.

FutureLearn se différencie également par des partenariats avec des entités non-universitaires comme le British Museum, le British Council, la British Library et la national Film and Television School.

此内容评级为 4.5/5
(没有评论)
此内容评级为 4.5/5
(没有评论)
完成这个资源,写一篇评论