Catalogue des cours de Télécom SudParis - Multimédia indexing (HTI 5)

Code

IGFE IMA 5001

Niveau

Graduate

Semestre

Fall

Domaine

Image

Programme

Programme Ingénieur

Langue

Anglais/English

Crédits ECTS

Heures programmées

Charge de travail

Coordonnateur(s)

Titus ZAHARIA

Département

Advanced Research and Techniques for Multidimensional Imaging Systems

Equipe pédagogique

Organisation

Cours/TD/TP/projet/examen : 36/0/9/0

Acquis d'apprentissage

At the end of this module the students should be able to:
- Exploit the content-based representations with visual descriptors for indexing, searching and enriching heterogeneous content,
- Select appropriate descriptors for solving a given retrieval application, and integrate it within a content-based search engine,
- Analyze and evaluate interoperable indexing tools based on multimedia standards and description languages (MPEG-7, XML…),
- Conceive and implement a semantic learning chain for a given semantic indexing application, focalized on a specific type of content, including feature extraction, descriptor specification and machine learning stages,
- Construct a deep learning approach for solving an artificial intelligence-related problem (e.g. image/object recognition).

Compétences CDIO

1.1.1 - Mathématiques (y compris statistiques)
1.2 - Connaissance des principes fondamentaux d'ingénierie
1.3 - Connaissances avancées en ingénierie : méthodes et outils
2.1.4 - Analyse en contexte non parfaitement défini
2.1.5 - Solutions et recommandations

Prérequis

None

Mots-clés

Multimedia indexation, visual descriptors, shape, color, motion, texture, description scheme, description languages, MPEG-7standard, semantic learning, object recognition, deep learning

Contenu

- 2D/3D shape extraction
- Color representions
- Extraction of texture primitives
- Motion analysis
- The metadata era: a new multimedia consumption
- Low-level descriptors for content indexing and content-based access: color descriptors, shape descriptors (2D, 3D and 2D/3D), motion descriptors, texture descriptors
- Overview of the MPEG-7 standard
- Interest point descriptors
- Automatic extraction of semantics: from pixels to semantic entities based on machine learning approaches
- Artificial intelligence: deep learning techniques
- Video structuring and abstraction: shot/scene detection, keyframe selection
- Object-based representations
- Query by example and similarity metrics
- Towards high-level descriptions: description schemes, hierarchical and multigranular representations
- Structural and semantic descriptions of multimedia documents
- Training, profiles and relevance feedback
- Search engines and data mining
- Deep learning
- Applications (video archiving, sign language, face recognition)

Evaluation

Continuous evaluation based on lab assignments (BE) and personal supervised project (P) linked to real industrial applications or to national/European research projects.
Final mark = Average (BE, P)