Deep Learning in Digital Media
This is a special topics course with emphasis on theoretical and practical concepts of deep learning for digital video applications ranging from entertainment to broadcasting, autonomous driving, smart cities and digital health. Topics include overview of neural networks, deep convolutional neural networks for image and video applications, Recurrent Neural networks for video processing, cost functions and regularization. Concepts of Digital image and video processing, capturing, compression and display technologies, Human perception, measures of visual quality and emerging visual trends of high dynamic range and light field will also be covered.
- Overview of Digital Image & Video Processing
- Capturing, Compression & Display technologies
- Human Visual System & Human Perception and
- Measures of Visual Quality
- High Dynamic Range and Light Field Technologies
- Brief overview of Machine Learning & Neural Networks
- Deep Learning
- Deep Convolutional Neural Networks (CNN) for Image processing
- Cost Function, CNN parameters and hyper parameters tuning
- Under-fitting, Overfitting and regularization for video processing
- Recurrent Neural Network, Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU) for Video processing
- Generative adversarial network for the video content
- Auto-encoder for image and video compression
- Transfer learning for digital video
- Students are organized in teams and work on one of the following projects:
- Deep Learning in Digital Video processing and broadcasting (denoising, compression)
- Deep Learning in Autonomous Driving
- Deep Learning in Video Super Resolution (Upscaling)
- Deep Learning in Smart Cities (Vehicle to Vehicle video communication, parking, etc.)
- Deep Learning in Digital Health (Human Monitoring, Behavior)
- Oral Presentation on the project topic by end of 3rd lecture
- A technical Report on the topic is due the day before. This should include introduction, background information, proposed approach, and bibliography.
- Final Project completion date: Last day of the course
- A final technical report is expected in the form of a paper. Software code and executable should also be submitted. Each team should demonstrate its work.
- Oral exam and questioning period will be allocated for each team. Questions will cover the course material and project work.
- Technical Report/Proposal 10%
- Presentation 1 (Oral Exam) 10%
- Final Technical Report 30%
- Presentation and Demo 15%
- Oral Exam 25%
- Teamwork 10%