Managing Big Data with R and Hadoop

Closed
Cours
en
Anglais
20 h
Ce contenu est noté 0 sur 5
Source
  • Sur www.futurelearn.com
Plus d'informations
  • 5 séquences
  • Niveau Introductif
  • Débute le 5 mai 2019
  • Clôture le 8 juin 2019

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Découvrir Edflex

Détails du cours

Déroulé

What topics will you cover?

  1. Welcome to BIG DATA
  2. Working with Hadoop
  3. First steps in R and RHadoop
  4. Statistical learning with RHadoop: clustering
  5. Statistical learning with RHadoop: regression and classification   

Prérequis

This course is designed for people interested in data science, computational statistics and machine learning and have basic experiences with them. It will be also useful for advanced undergraduate students and first year PhD students in data analysis, statistics or bioinformatics, who wish to understand how to manage big data with Hadoop using R programming language.

We expect that the learners will also have basic experiences with linux, bash and R and are capable to download and run virtual machine.

What software or tools do you need?

All software needed to actively participate the course is provided within the virtual machine that the followers are supposed to download and run on the local machine. No extra software is needed. You will need a modest local machine with 15GB free disk space and 2GB RAM.

Intervenants

Janez Povh
I am an active researcher in mathematical optimization, which has many applications in data science and where HPC is an inevitable tool.


Biljana Mileva Boshkoska
Biljsna Mileva Boshkoska is an assistant professor in computer science. Her interests include decision support systems, data mining and working with big data.


Leon Kos
Leon Kos is a 25+ years veteran of using Linux desktop on a daily basis to build digital relationships for research, teaching, and getting the job done by programming.

Éditeur

The Partnership for Advanced Computing in Europe (PRACE) is an international non-profit association with its seat in Brussels. The PRACE Research Infrastructure provides a persistent world-class high performance computing service for scientists and researchers from academia and industry in Europe.

The computer systems and their operations accessible through PRACE are provided by 4 PRACE members (BSC representing Spain, CINECA representing Italy, CSCS representing Switzerland, GCS representing Germany and GENCI representing France). The Implementation Phase of PRACE receives funding from the EU’s Seventh Framework Programme (FP7/2007-2013) under grant agreement RI-312763 and from the EU’s Horizon 2020 Research and Innovation Programme (2014-2020) under grant agreement 653838.

Plateforme

FutureLearn est une plate-forme d'apprentissage proposant des formations en ligne ouvertes à tous (MOOC)

Fondée en Décembre 2012, la société est entièrement détenue par l'Open University à Milton Keynes, en Angleterre.

Elle est la 1ère plateforme offrant des MOOC au Royaume-Uni, avec à son actif plus d'une cinquantaine d'universités partenaires provenant du Royaume Uni mais aussi du reste du monde.

FutureLearn se différencie également par des partenariats avec des entités non-universitaires comme le British Museum, le British Council, la British Library et la national Film and Television School.

Ce contenu est noté 4.5 sur 5
(aucun avis)
Ce contenu est noté 4.5 sur 5
(aucun avis)
Complétez cette ressource pour donner votre avis