Data Science - Theory to practice

Catalog of Télécom SudParis courses

Code

IGFE CSC 7018

Level

MSc

Graduate

PostGraduate

Domain

Informatique

Program

Master of Science

ECTS Credits

4

Class hours

27

Workload

14 hour coursework, 20 hours practice

Program Manager(s)

Department

  • Réseaux et Services Multimédia Mobiles

Organisation

Cours/TD/TP/projet/examen :

Learning objectives

The goal of the course is to have a broad introduction on data
science and artificial intelligence techniques. The course is split into
three parts:
- Introduction to Data Science, in which we learn the why data
is the value and what are the existing challenges that needs mining
of the data.
- Unsupervised learning, in which we study the concept and
some of the related algorithms: hierarchical clustering, kmeans,
dbscan, hdbscan, etc.
- Supervised learning, in which we study the concept and
some of the related algorithm: regression (linear and logistic),
decision trees, Naïve Bayes, SVM, random forest
- Text analysis (supervised and unsupervised) in which we will
review the specificities of text analysis
Each course is followed by practical work using R and/or python
In cooperation with Total.

Content

- Data Science in scale
- Big Data problems
- Introduction to Data mining
- Data handling with R / Python
- Supervised Machine Learning algorithms
- Unsupervised Machine Learning algorithms
- Text mining

Assessment formula

Practical session grading