2025 Research · Dataset

Unifying Classical Tamil under one queryable, multi‑framework schema.

10,393 verses across Sangam, Bhakti, Epic & Spiritual texts — annotated through Tolkappiyam poetics, rasa, dhvani, and a cross‑cultural bridge layer. Built on decades of upstream digitization work.

JSON Schema · Soniox STT · NVIDIA NIM · Wiktionary · 🤗 HF
indic‑corpora / sentamizh‑corpus