Technology Stack

Technology Stack

ComponentDetails

Translator and OpenNMT are integrated through Kafka messaging.

Primary data storage.

Secondary in memory storage.

Cloud Storage

Samba storage is used to store user input files.

Serve as a redirection server and also takes care of system level configs. Ngnix acts as the gateway.

API Gateway to apply filters on client requests,authenticate,authorize,throttle client requests.

AI ML Assets

ComponentDetails

Layout detection model.

Used for OCR in Document Digitization v1.0 , v1.5. Replaced with custom trained Tesseract in latest versions.

Used for Line detection.

Custom trained Tesseract used for OCR.

Custom trained OpenNMT used for translation.

Last updated