Project Anuvaad
  • Sunbird Anuvaad Overview
    • Features
  • USE
    • Playbook
    • Video Tutorials
  • LEARN
    • Architecture
    • Technology Stack
    • Repository structure and developers guide
    • Setting up Anuvaad on your own
    • Git branching strategies
    • Anuvaad Module Config Guidelines
  • MODULES
    • Modulewise Appendix
    • Anuvaad Workflow Manager
    • User management
    • Document converter
    • Auditor
    • OCR Content handler
    • Block merger
    • Content Handler
    • Document Digitization
    • File uploader
    • Aligner
    • ETL Translator
    • File translator
    • Anuvaad Zuul Gateway System
    • Anuvaad Translator
    • Tokenizer
    • Analytics
    • NMT
  • Legacy
    • Model Retraining
    • NMT Inference
    • Integration
      • Registration
      • Login and auth token
      • Supported Language pairs and translation models
      • Translate texts
    • Service Contracts
    • API Host Endpoints
  • ENGAGE
    • FAQ
    • KT Videos
    • Source Code Repository
    • Discuss
    • Tools
      • anuvaad-corpus-tools
      • layout-mt-corpus
      • ocr-toolkit
      • anuvaad-ocr-corpus
      • parallel-corpus
      • anuvaad-em
Powered by GitBook
On this page
Edit on GitHub
Export as PDF
  1. LEARN

Technology Stack

Technology Stack

PreviousArchitectureNextRepository structure and developers guide

Last updated 6 months ago

Component
Details

Internal modules are integrated through Kafka messaging.

Primary data storage.

Secondary in memory storage.

Cloud Storage

Samba storage is used to store user input files.

Serve as a redirection server and also takes care of system level configs. Ngnix acts as the gateway.

API Gateway to apply filters on client requests,authenticate,authorize,throttle client requests.

AI ML Assets

Component
Details

Layout detection model.

Used for Line detection.

Custom trained Tesseract used for OCR.

Custom trained model used for translation.

open-source platform for serving language AI models at scale.

Apache Kafka
MongoDB
Redis
NGINX
Zuul
PRIMA
CRAFT
Tesseract
IndicTrans2
Dhruva