The CAVSR centre is used to record real-time video lectures. Audio and video files are extracted from the recordings for data collection, and the recorded content was annotated to build the training and testing corpora.
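The extraction step can be sketched as follows. The source does not say which tool was used, so this is only an illustration that assumes the ffmpeg command-line tool is installed; the function names, the output file names, and the 16 kHz mono WAV format are hypothetical choices, not details from the project.

```python
from pathlib import Path
from typing import List


def audio_extract_cmd(recording: Path, out_dir: Path, sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that writes the audio track as a mono WAV file.

    The 16 kHz mono format is an assumption, not a setting stated in the source.
    """
    out_wav = out_dir / (recording.stem + ".wav")
    return [
        "ffmpeg", "-y",           # overwrite the output file if it exists
        "-i", str(recording),     # input lecture recording
        "-vn",                    # drop the video stream
        "-acodec", "pcm_s16le",   # uncompressed 16-bit PCM audio
        "-ac", "1",               # one channel (mono)
        "-ar", str(sample_rate),  # resample to the requested rate
        str(out_wav),
    ]


def video_extract_cmd(recording: Path, out_dir: Path) -> List[str]:
    """Build an ffmpeg command that writes the video stream without audio."""
    out_mp4 = out_dir / (recording.stem + "_video.mp4")
    return [
        "ffmpeg", "-y",
        "-i", str(recording),
        "-an",                    # drop the audio stream
        "-c:v", "copy",           # copy the video stream without re-encoding
        str(out_mp4),
    ]
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` to perform the extraction for one recording.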
Module 1: Automatic Speech Recognition (ASR)
1.1 ASR with BERT-based Spell Corrector
1.2 Self-Supervised Knowledge Distillation Framework for ASR
Module 2: Visual Speech Recognition (VSR)
Module 3: Audio Visual Speech Recognition (AVSR)
Copyright Approved on Automated Captioning for E-Learning Contents for Hearing Impaired Using Cross modal Transformer based Audio and Visual Fusion Framework.
3.2 Intermediary Level AVSR Fusion
Module 4: Knowledge Distillation Framework for Audio Visual Speech Recognition
Copyright Applied on Cross Modal based Knowledge Distillation Framework for Audio Visual Speech Recognition
Dr. L. Ashok Kumar,
Professor,
Department of Electrical & Electronics Engineering
PSG College of Technology
Coimbatore - 641004
Tamil Nadu, India
Phone: 0422-2572167 Extn: 255