Skip to content
Change the repository type filter

All

    Repositories list

    • The Monocorpus project is a collection of tools designed to facilitate the development of a Tatar language monocorpus
      Python
      0000Updated Aug 21, 2025Aug 21, 2025
    • Automated extraction of structured data from Tatar-language PDF documents using Google Gemini and Yandex Disk public links. Supports chunked processing, prompt engineering, and JSON output validation.
      Python
      0400Updated Jun 5, 2025Jun 5, 2025
    • tahrirgoh

      Public
      Tahrirgoh is a web platform for dataset collection for the Grammatical Error Correction (GEC) task. The only difference from original is translation of interface to Tatar language
      Python
      2000Updated Apr 7, 2025Apr 7, 2025
    • 😎Awesome list about everything in Tatar 🌱Искиткеч татар галәме исемлеге
      11200Updated Apr 3, 2025Apr 3, 2025
    • tat-lm

      Public
      Language modeling and instruction tuning for Tatar
      0000Updated Oct 16, 2024Oct 16, 2024
    • Script to setup new project in Label Studio
      HTML
      0000Updated Oct 11, 2024Oct 11, 2024
    • gizgech

      Public
      Curated Collection of Projects for Tatar Language Development
      0000Updated Sep 6, 2024Sep 6, 2024
    • template

      Public template
      Streamlined template for GitHub repos.
      0000Updated Apr 9, 2024Apr 9, 2024
    • Python
      0000Updated Mar 9, 2024Mar 9, 2024
    • Pipelines, crawlers and tools for mining Speech-to-Text corpus for Tatar language
      0000Updated Dec 20, 2023Dec 20, 2023