For the last 16 years, I have been working on signal processing and machine learning, mainly targeting speech applications. In collaboration with the Govt. of India (DIT, MCIT, DST) and other premier technological institutes of India, we have developed various speech systems in Indian languages. During the initial period of my career, my focus was on the acquisition and incorporation of prosody for developing various speech systems. Later, my focus shifted to (i) expressive speech analysis/synthesis, (ii) development of robust speech systems, (iii) vocal fold activity analysis and synthesis for speech and biomedical applications, (iv) development of appropriate signal processing methods to extract characteristic features from Hindustani music, and (v) big-data analysis frameworks and audio and multimedia analytics.
My current focus is on (i) development of robust speech interfaces for Indian languages, targeted at objectives such as E-Governance, Digital India and smartphones, (ii) exploring signal processing and machine learning paradigms for automatic processing of Hindustani music, and (iii) exploring big-data analytics for speech, music, audio and video document representation, indexing and retrieval tasks.
- Prosody modification using instants of significant excitation. Sreenivasa Rao K., Yegnanarayana B. 14, 972-980 (2006)
- Voice/Non-voice Detection Using Phase of Zero Frequency Filtered Speech Signal. S. B. Sunil Kumar and K. Sreenivasa Rao. Speech Communication, Vol. 81, Elsevier, 90-103 (2016)
- Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. K. Sreenivasa Rao, S. R. M. Prasanna and B. Yegnanarayana. IEEE Signal Processing Letters, Vol. 14, IEEE, 762-765 (2007)
- Duration modification using Glottal Closure Instants and Vowel Onset Points. Sreenivasa Rao K., Yegnanarayana B. Speech Communication 51, 1263-1269 (2009)
- Voice Conversion by Mapping the Speaker-specific features using Pitch Synchronous Approach. K. Sreenivasa Rao. Computer Speech and Language, Vol. 24, Elsevier, 474-494 (2010)
Principal Investigator
- National Language Translation Mission (NLTM): BHASHINI
Ph.D. Students
Abhijit Debnath
Area of Research: Multimedia Data Analytics
Annepu. Sai Sriharsha
Area of Research: Speech and Natural Language Processing
Aravinda Reddy P N
Area of Research: Speech Processing
Arup Kumar Dutta
Area of Research: Speech and Audio Processing
Haque Arijul
Area of Research: Speech Processing
Priya Dharshini G
Area of Research: Speech Processing
Saikat Biswas
Area of Research: Audio Data Analytics
Soumen Paul
Area of Research: Human Computer Interactions - Computer Vision
Soumya Majumdar
Area of Research: Speech Processing
Sudhakar P
Area of Research: Speech Processing
Y Madhu Keerthana
Area of Research: Speech Processing