Digitization and Semantic Tagging in Vedic Literature: A Review of Existing Tools and Online Databases

Author: Dr. Vinayak Bhat

Vedic literature, as one of the oldest and most profound bodies of knowledge, presents unique challenges and opportunities for digitization and semantic analysis. This paper reviews the current landscape of digital tools and online databases dedicated to the preservation, encoding, and semantic tagging of Vedic texts. It explores the methodologies employed, the scope of digitization projects, and the semantic frameworks utilized to annotate these complex texts. By analyzing the strengths and limitations of existing resources, the study identifies gaps and future directions for enhancing accessibility and research potential through standardized encoding, artificial intelligence integration, and collaborative platforms. The paper aims to contribute to the interdisciplinary dialogue between traditional Sanskrit scholarship and modern digital humanities.

Authors and Affiliations

Dr. Vinayak Bhat
Lecturer in Sanskrit, MES Prof. BRS PU College, Vidyaranyapura, Bengaluru

Keywords: Vedic Literature, Digitization, Semantic Tagging, Sanskrit Texts, Digital Humanities, Text Encoding Initiative (TEI), Sanskrit Ontologies, Digital Libraries, Natural Language Processing (NLP), Vedic Manuscripts.

Publication Details

Published in : Volume 8 | Issue 3 | May-June 2025
Date of Publication : 2025-05-08
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 17-23
Manuscript Number : SHISRRJ258314
Publisher : Shauryam Research Institute

ISSN : 2581-6306

Cite This Article :

Dr. Vinayak Bhat, "Digitization and Semantic Tagging in Vedic Literature: A Review of Existing Tools and Online Databases", Shodhshauryam, International Scientific Refereed Research Journal (SHISRRJ), ISSN: 2581-6306, Volume 8, Issue 3, pp. 17-23, May-June 2025
URL : https://shisrrj.com/SHISRRJ258314
