Speech and Image Processing

Faculty:


  • Dr. Shashidhar G. Koolagudi
  • Dr. Jeny Rajan

Research Scholars:

  • Mr. Y V Srinivasa Murthy
  • Mr. Pravin Bhaskar Ramteke
  • Ms. Fathima Afroz
  • Mrs. Nagaratna B. Chittaragi
  • Mr. Manjunath Mulimani
  • Ms. Sreeja Suresh
  • Mr. Krishna Kumar P
  • Mr. Yamanappa
  • Mr. Girish G N
  • Mr. Tojo Mathew
Ongoing Research:
  • Content-based Music Information Retrieval and its Applications towards Music Industry
  • Phonology Analysis from Children’s Speech
  • Characterization and Quantification of Stuttering Events using Speech Features
  • Speech Processing Approaches Towards Characterization and Identification of Dialects
  • Acoustic Scene Classification by Characterizing Different Events using Speech Features
  • Mispronunciation Detection in Children’s Speech
  • Enhancement and Analysis of Magnetic Resonance and Ultrasound Images
  • Development of Improved Image Denoising Methods
  • Segmentation and Quantification of Intra-retinal Cysts from Optical Coherence Tomography Scans
Journals:
  1. P. Krishna Kumar, Tadashi Araki, Jeny Rajan, Luca Saba, Francesco Lavra, Nobutaka Ikeda, Aditya M. Sharma et al. "Accurate lumen diameter measurement in curved vessels in carotid ultrasound: an iterative scale-space and spatial transformation approach." Medical & Biological Engineering & Computing (2016): 1-20.
  2. Araki, Tadashi, Asheed M. Kumar, P. Krishna Kumar, Ajay Gupta, Luca Saba, Jeny Rajan, Francesco Lavra et al. "Ultrasound-based automated carotid lumen diameter/stenosis measurement and its validation system." Journal for Vascular Ultrasound 40, no. 3 (2016): 120-134.
  3. Araki, Tadashi, P. Krishna Kumar, Harman S. Suri, Nobutaka Ikeda, Ajay Gupta, Luca Saba, Jeny Rajan et al. "Two automated techniques for carotid lumen diameter measurement: regional versus boundary approaches." Journal of medical systems 40, no. 7 (2016): 1-19.
  4. Saba, Luca, Tadashi Araki, P. Krishna Kumar, Jeny Rajan, Francesco Lavra, Nobutaka Ikeda, Aditya M. Sharma et al. "Carotid inter‐adventitial diameter is more strongly related to plaque score than lumen diameter: An automated tool for stroke analysis." Journal of Clinical Ultrasound (2016).
  5. Sharma, Aditya M., Ajay Gupta, P. Krishna Kumar, Jeny Rajan, Luca Saba, Ikeda Nobutaka, John R. Laird, Andrew Nicolades, and Jasjit S. Suri. "A review on carotid ultrasound atherosclerotic tissue characterization and stroke risk stratification in machine learning framework." Current atherosclerosis reports 17, no. 9 (2015): 1-13.
  6. P. Krishna Kumar, P. Darshan, Sheethal Kumar, Rahul Ravindra, Jeny Rajan, Luca Saba, and Jasjit S. Suri. "Magnetic resonance image denoising using nonlocal maximum likelihood paradigm in DCT‐framework." International Journal of Imaging Systems and Technology 25, no. 3 (2015): 256-264.
Conferences:
  1. Koolagudi, S. G., Vishwanath, B. K., Akshatha, M., & Murthy, Y. V. (2017). Performance Analysis of LPC and MFCC Features in Voice Conversion Using Artificial Neural Networks. In Proceedings of the International Conference on Data Engineering and Communication Technology (pp. 275-280). Springer Singapore.
  2. Mayank Varshney, Shashidhar G. Koolagudi, Sudhakar Velusamy and Pravin B. Ramteke, “Ensuring Performance of Graphics Processing Units: A Programmer's Perspective,” Proceedings of the International Conference on Data Engineering and Communication Technology: ICDECT 2016, Vol. 2, pages 225-235, 2017.
  3. Maonica B., Priyanka Das, Pravin B. Ramteke and Shashidhar G. Koolagudi, “Selective Cropper for Geometrical Objects in OpenFlipper,” Proceedings of the International Conference on Data Engineering and Communication Technology: ICDECT 2016, Vol. 1, pages 391-399, 2017.
  4. Thomas, M., Jothish, Mintu., Thomas, Navin., Koolagudi, S. G., and Murthy, Y. S., (2016, November). Detection of Similarity in Music Files using Signal Level Analysis. In 36th IEEE International Conference on Technologies for Smart Nation (TENCON), Singapore. IEEE, 2016.
  5. Luitel, B., Murthy, Y. S., and Koolagudi, S. G. (2016, August). Sound Event Detection in Urban Soundscape using Two-level Classification. In Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER), IEEE (pp. 259-263). IEEE.
  6. G. N. Girish, Abhishek R. Kothari and Jeny Rajan, “Automated Segmentation of Intra-Retinal Cysts from Optical Coherence Tomography Scans Using Marker Controlled Watershed Transform,” in 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’16). Orlando, Florida, USA: IEEE, Aug 2016.
  7. Thomas, M., Murthy, Y. S., and Koolagudi, S. G. (2016, May). Detection of largest possible repeated patterns in Indian audio songs using spectral features. In Electrical and Computer Engineering (CCECE), 2016 IEEE Canadian Conference on (pp. 1-5). IEEE.
  8. Sharma, R., Murthy, Y. S., and Koolagudi, S. G. (2016). Audio Songs Classification Based on Music Patterns. In Proceedings of the Second International Conference on Computer and Communication Technologies (pp. 157-166). Springer India.
  9. Sooraj Kumar R., G. N. Girish, Pravin B. Ramteke and Shashidhar G. Koolagudi, “Text Independent Automatic Accent Identification System for Kannada Language,” in International Conference on Data Engineering and Communication Technology (ICDECT-2016). Pune, INDIA: Springer, Mar 2016.
  10. Narendra Rao T.J., G. N. Girish, and Jeny Rajan, “An Improved Contextual Information Based Approach for Anomaly Detection via Adaptive Inference for Surveillance Application,” in International Conference on Computer Vision and Image Processing (CVIP-2016). IIT Roorkee, INDIA: Springer, Feb 2016. 
  11. P. Krishna Kumar, C. Kesavadas and Jeny Rajan. "A Semi-automatic Method for Carotid Artery Wall Segmentation in MR Images." INDICON 2016, IEEE, 2016. (waiting for publication).
  12. Soorajkumar, R., P. Krishna Kumar, D. Girish, and Jeny Rajan. "Coupled PDE for Ultrasound Despeckling Using ENI Classification." Procedia Computer Science 89 (2016): 658-665.
  13. Soorajkumar, R., P. Krishna Kumar, D. Girish, and Jeny Rajan. "Fourth order PDE based ultrasound despeckling using ENI classification." In Signal Processing and Communications (SPCOM), 2016 International Conference on, pp. 1-5. IEEE, 2016.
  14. Raghuram, M. A., Nikhil R. Chavan, Shashidhar G. Koolagudi, and Pravin B. Ramteke. "Efficient audio segmentation in soccer videos." In Electrical and Computer Engineering (CCECE), 2016 IEEE Canadian Conference on, pp. 1-4. IEEE, 2016.
  15. Vikrant Chaugule, Aadheeshwar Vijayakumar, Abhishek D., Pravin Bhaskar Ramteke, and Shashidhar G. Koolagudi, “Product Review Based on Optimized Facial Expression Detection,” Ninth International Conference on Contemporary Computing (IC3 2016), 2016.
  16. Chandana Velaga, Shivani Gupta, Kavya sree B, Pravin Bhaskar Ramteke, and Shashidhar G. Koolagudi, “Classical Mechanics Tutor: A graphical kit,” International Conference on Intelligent Computing and Applications (ICICA 2016), 2016.
  17. Sumukh R. M., Shashidhar G. Koolagudi, Naresh V., Fathima Afroz, and Abhishek Reddy Y N, "Realistic Golf Flight Simulation," In Computing for Sustainable Global Development (INDIACom), 2016 3rd International Conference on, pp. 2215-2219. IEEE, 2016.
  18. Meenakshy Balachandran, Kriti Nagori, Aishwarya Rajan, Shashidhar G. Koolagudi, and Fathima Afroz, "Optimization of Declarative Graphics by Parallel Programming," In Proceedings of the 36th IEEE Conference on Technologies of Smart Nation (TENCON 2016). IEEE, 2016.
  19. Manjunath Mulimani, Shashidhar G. Koolagudi, "Acoustic Scene Classification using MFCC and MP features," in IEEE AASP challenge on Detection and Classification of Acoustic Scenes and Events - 2016, Budapest, Hungary, 2016.
  20. Chirag Jamadagni, Abhijith Anilkumar, Kevin Thomas Mathew, Manjunath Mulimani, and Shashidhar Koolagudi, "Dynamic 3D graph visualizations in Julia," in Proceedings of the Summer Computer Simulation Conference, Society for Computer Simulation International, 2016.
  21. Savin P. S., Pravin B. Ramteke, and Shashidhar G. Koolagudi, “Recognition of Repetition and Prolongation in Stuttered Speech Using ANN,” in Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics: ICACNI 2015, Vol. 1, pages 65-71, 2016.
  22. Pravin B. Ramteke, Shashidhar G. Koolagudi, and Fathima Afroz, “Repetition Detection in Stuttered Speech,” in Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics: ICACNI 2015, Vol. 1, pages 611-617, 2016. 
  23. Murthy, Y. S., and Koolagudi, S. G. (2015, May). Classification of vocal and non-vocal regions from audio songs using spectral features and pitch variations. In Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on (pp. 1271-1276). IEEE.
  24. Shashidhar G. Koolagudi, B. Shivakranthi, K. Sreenivasa Rao and Pravin B. Ramteke. "Contribution of Telugu vowels in identifying emotions." In Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on, Kolkata, pp. 1-6. IEEE, 2015.
  25. Pravin B. Ramteke, Shashidhar G. Koolagudi, and Arun Prabhakar, “Feature Analysis for Mispronounced Phonemes in the case of Alveolar Approximant (/r/) Substituted with Voiced Dental Consonant (/∂/),” Contemporary Computing (IC3), 2015 Eighth International Conference on, Noida, 2015, pp. 132-137.
  26. Kedia Yash, Aditya Hendre, Shreyans Jain, and Fathima Afroz, “Layer based 3D clipping,” 12th Annual IEEE India Conference, pp. 1-5, 2015.
  27. Koolagudi Shashidhar G., Sriyak Sridhar, Nagendra Elango, Karthik Kumar and Fathima Afroz, "Advertisement Detection in Commercial Radio Channels," In Industrial & Information Systems (ICIIS), 2015 IEEE 10th International Conference on, pp. 272-277. IEEE, 2015.


Contact us

Dr. B. R. Chandavarkar
Head of the Department
Department of CSE, NITK, Surathkal
P. O. Srinivasnagar, Mangalore - 575 025
Karnataka, India.
Hotline: +91-0824-2474053
Email: hodcse[AT]nitk[DOT]ac[DOT]in