Digital Media and Artificial Intelligence (AI): Applications for sounds and pictures
Article REF: TE5897 V1

Digital Media and Artificial Intelligence (AI): Applications for sounds and pictures

Author : Jean-Noël GOUYET

Publication date: August 10, 2022, Review date: September 14, 2024 | Lire en français

Logo Techniques de l'Ingenieur You do not have access to this resource.
Request your free trial access! Free trial

Already subscribed?

Overview

ABSTRACT

Artificial intelligence (AI) has experienced accelerated growth in the digital media field since the 2015s. This article first provides a reminder of the principles, components and techniques of artificial intelligence, in particular machine learning, and deep learning with neural networks. It then proposes a sample of AI applications developed in the field of images in photography, for old films, and in video. In the field of sounds, the article presents some examples related to the automatic processing of speech, 3D audio and music.

Read this article from a comprehensive knowledge base, updated and supplemented with articles reviewed by scientific committees.

Read the article

AUTHOR

  • Jean-Noël GOUYET: Training engineer in digital media techniques and management - Former Research Manager at INA (Institut National de l'Audiovisuel)

 INTRODUCTION

Back in the 1950s, the pioneers of artificial intelligence (AI) assumed that learning and artificial intelligence (AI) could be simulated by a machine. Particularly since the 2000s, the numerous projects, research and application developments testify, on the one hand, to the growth of this IT sector, and, on the other, to the major human and financial investments made by the world's leading players in the development of projects and products incorporating AI. In the United States: Google, Apple, Facebook, Amazon, Microsoft – and in China: Baidu, Alibaba, Tencent...

Another characteristic of AI is the wide range of knowledge and technologies involved: cognitive sciences, learning modes and machine learning, automatic speech processing, signal and image analysis and processing, computer vision, robotics...

The aim of this series of two articles is to provide an overview of the quantity and diversity of AI applications in digital media, which have been multiplying since the mid-2010s.

This first article is divided into three parts:

  • a review of the principles, components and techniques of AI, as well as its uses;

  • a sample of AI applications in the field of images (photo, film, video);

  • a sample of AI applications in the field of sound (speech, 3D audio, music).

The second article [TE 5 898] :

  • presents these and other AI applications in the broadcast and media industry;

  • focuses on two case studies:

    • journalism and AI,

    • deepfakes.

       

Specific products or services mentioned in this article are for illustrative purposes only and do not represent a promotion, recommendation or endorsement by the author of this document. All articles or specialized sites presenting and evaluating them (referenced in the appendix) are the sole responsibility of their respective authors.

Numerous references detailing AI techniques and models used in applications are provided in the "Further reading" appendix, for the interested reader to consult. These are generally...

You do not have access to this resource.
Logo Techniques de l'Ingenieur

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed?


KEYWORDS

film   |   artificial intelligence   |   photo   |   video   |   3D audio   |   AI   |   speech processing   |   music

Ongoing reading
Digital media and Artificial Intelligence (AI): Image and sound applications

Article included in this offer

"Digital documents and content management"

( 71 articles )

Complete knowledge base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

View offer details