PSG TECH DST

PSG College Of Technology
Centre for Audio Visual Speech Recognition
PSG CAVSR


List of Technical Publications:

  1. Kumar, L. A., Renuka, D. K., Rose, S. L., & Wartana, I. M. (2022). Deep learning based assistive technology on audio visual speech recognition for hearing impaired. International Journal of Cognitive Computing in Engineering, 3, 24-30.

  2. Shunmugapriya, M. C., Renuka, D. K., & Kumar, L. A. (2022). Towards improving speech recognition model with post-processing spell correction using BERT. Journal of Intelligent & Fuzzy Systems, (Preprint), 1-10.

  3. Priya, M. S., Renuka, D. K., Kumar, L. A., & Rose, S. L. (2022). Multilingual low resource Indian language speech recognition and spell correction using Indic BERT. Sādhanā, 47(4), 227.

  4. Attention based Multi-Modal Learning for Audio-Visual Speech Recognition, 4th International Conference on Artificial Intelligence and Speech Technology, 2022.

  5. Shunmugapriya, M. C. (2021). Recurrent network-based hybrid acoustic model for Automatic Speech Recognition. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(10), 7308-7315.

  6. Kumar, A., & Priya, S. (2021, December). Analysis of Audio Visual Feature Extraction Techniques for AVSR System. In Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India.

  7. Kumar, A., & Raajkumar, G. (2021, December). Automatic Speech Recognition for Indian Accent Lectures contents using End-to-End Speech Recognition model. In Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India.

  8. L Ashok Kumar, D Karthika Renuka, Dineshraja V and Fatima Abdul Jabbar. Cross Modal Knowledge Distillation for Audio Visual Speech Recognition, 11th International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA2023), Cardiff Metropolitan University, Llandaff Campus, Cardiff, United Kingdom, 11th - 12th April 2023 (Accepted)

  9. L Ashok Kumar, D Karthika Renuka, V Harsha Priya, S Sudarshan. Spoken Language Translation using Conformer model, IEEE International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS 2023), 10th - 11th February 2023 (Accepted)

  10. L Ashok Kumar, D Karthika Renuka, M C Shanmuga Priya. Towards Robust Speech Recognition Model using Deep Learning, IEEE International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS 2023), 10th - 11th February 2023 (Accepted)

  11. L Ashok Kumar, D Karthika Renuka, Naveena K S and Sree Resmi S. CRDNN-BiLSTM Knowledge Distillation Model towards Enhancing the Automatic Speech Recognition (Under Processing)

  12. L Ashok Kumar, D Karthika Renuka, Dineshraja V, Fatima Abdul Jabbar, Naveena K S, Sree Resmi S. Conformer CTC with Language Models for Improving the Performance of Automatic Speech Recognition (Under Processing)

Contact

Dr. L. Ashok Kumar,
Professor,
Department of Electrical & Electronics Engineering,
PSG College of Technology
Coimbatore - 641004
Tamil Nadu, India


Phone: 0422-2572167 Extn: 255

Email: psgdeeplearning@gmail.com