Project Anuvaad
  • Sunbird Anuvaad Overview
    • Features
  • USE
    • Playbook
    • Video Tutorials
  • LEARN
    • Architecture
    • Technology Stack
    • Repository structure and developers guide
    • Setting up Anuvaad on your own
    • Git branching strategies
    • Anuvaad Module Config Guidelines
  • MODULES
    • Modulewise Appendix
    • Anuvaad Workflow Manager
    • User management
    • Document converter
    • Auditor
    • OCR Content handler
    • Block merger
    • Content Handler
    • Document Digitization
    • File uploader
    • Aligner
    • ETL Translator
    • File translator
    • Anuvaad Zuul Gateway System
    • Anuvaad Translator
    • Tokenizer
    • Analytics
    • NMT
  • Legacy
    • Model Retraining
    • NMT Inference
    • Integration
      • Registration
      • Login and auth token
      • Supported Language pairs and translation models
      • Translate texts
    • Service Contracts
    • API Host Endpoints
  • ENGAGE
    • FAQ
    • KT Videos
    • Source Code Repository
    • Discuss
    • Tools
      • anuvaad-corpus-tools
      • layout-mt-corpus
      • ocr-toolkit
      • anuvaad-ocr-corpus
      • parallel-corpus
      • anuvaad-em
Powered by GitBook
On this page
  • Modules
  • Converter Module
Edit on GitHub
Export as PDF
  1. MODULES

Document converter

PreviousUser managementNextAuditor

Last updated 11 months ago

This microservice is intended to generate the final document after translation and digitization. This currently supports pdf, txt, xlsx document generation.

  • API Contract:

  • Code:

Modules

Converter Module

DocumentConverter

API to create digitized txt & xlsx files for Translation Flow. RBAC enabled.

Mandatory parameters: record_id, user_id, file_type

Actions:

  • Validating input params as per the policies

  • Page data is converted into dataframes

  • Writing the data into file and storing them on Samba store

DocumentConverter CURL Request
curl --location --request POST 'http://localhost:5001//anuvaad-etl/document-converter/v0/document-converter' \
--header 'Content-Type: application/json' \
--data-raw '{ 
  "record_id":"A_OD10GV-IVRCU-1617009019569%7C0-16170090212740283.json", 
  "user_id":"d4e0b570-b72a-44e5-9110-5fdd54370a9d", 
  "file_type":"txt" 
}'

DocumentExporter

API to create digitized txt & pdf files on Document Digitization flow. RBAC enabled.

Mandatory parameters: record_id, user_id, file_type

Actions:

  • Validating input params as per the policies

  • Generating the docs using ReportLab

  • Writing the data into file and storing them on Samba store

DocumentExporter CURL Request
curl --location --request POST 'http://localhost:5001//anuvaad-etl/document-converter/v0/document-exporter' \
--header 'Content-Type: application/json' \
--data-raw '{ 
  "record_id":"A_OD10GV-IVRCU-1617009019569%7C0-16170090212740283.json", 
  "user_id":"d4e0b570-b72a-44e5-9110-5fdd54370a9d", 
  "file_type":"txt" 
}'
here
here