Data Science - Theory to practice

Catalogue des cours de Télécom SudParis

Code

IGFE CSC 7018

Niveau

MSc

Graduate

PostGraduate

Domaine

Informatique

Programme

Master of Science

Crédits ECTS

4

Heures programmées

27

Charge de travail

14 hour coursework, 20 hours practice

Coordonnateur(s)

Département

  • Réseaux et Services Multimédia Mobiles

Organisation

Cours/TD/TP/projet/examen :

Acquis d'apprentissage

The goal of the course is to have a broad introduction on data
science and artificial intelligence techniques. The course is split into
three parts:
- Introduction to Data Science, in which we learn the why data
is the value and what are the existing challenges that needs mining
of the data.
- Unsupervised learning, in which we study the concept and
some of the related algorithms: hierarchical clustering, kmeans,
dbscan, hdbscan, etc.
- Supervised learning, in which we study the concept and
some of the related algorithm: regression (linear and logistic),
decision trees, Naïve Bayes, SVM, random forest
- Text analysis (supervised and unsupervised) in which we will
review the specificities of text analysis
Each course is followed by practical work using R and/or python
In cooperation with Total.

Contenu

- Data Science in scale
- Big Data problems
- Introduction to Data mining
- Data handling with R / Python
- Supervised Machine Learning algorithms
- Unsupervised Machine Learning algorithms
- Text mining

Formule de l'évaluation

Practical session grading