analytiq-hub/doc-router

The Smart Document Router is an open source document processing data layer.

  • It ingests unstructured documents from faxes, emails, and ERPs through REST APIs and integrations.
  • It processes documents at scale with OCR and LLMs.
  • It chunks, embeds, and organizes documents into queryable knowledge bases.
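The chunking step mentioned above can be pictured with a minimal sketch. This is not DocRouter's actual implementation: the function name `chunk_words` and the `size` and `overlap` parameters are hypothetical, and word-window splitting is only one of several possible chunking strategies.

```python
def chunk_words(text: str, size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into chunks of up to `size` words.

    Consecutive chunks share `overlap` words, so a sentence that straddles
    a chunk boundary still appears whole in at least one chunk.
    Hypothetical helper for illustration; not part of DocRouter.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks
```

Each chunk would then be passed to an embedding model and stored alongside its source document. The overlap trades some extra storage for better recall at chunk boundaries.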

The Document Router is designed to work standalone or with a human-in-the-loop, and can process medical, insurance, financial, supply chain, and legal documents.

It acts as the system of record for extraction schemas and prompts, and it is portable across all major clouds and LLM providers.
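To make the "system of record" idea concrete, here is a small sketch of a versioned extraction schema stored together with its prompt, plus a check of an LLM result against that schema. Everything in it is an assumption for illustration: `ExtractionSpec`, `validate`, and the invoice fields are hypothetical names, and the sketch uses standard-library dataclasses to stay dependency-free, whereas the project lists Pydantic in its tech stack.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractionSpec:
    """A versioned schema-plus-prompt record (hypothetical, for illustration)."""
    name: str
    version: int
    prompt: str
    fields: dict  # field name -> expected Python type


def validate(spec: ExtractionSpec, result: dict) -> list[str]:
    """Return a list of problems found in `result`; an empty list means it conforms."""
    problems = []
    for key, expected in spec.fields.items():
        if key not in result:
            problems.append(f"missing field: {key}")
        elif not isinstance(result[key], expected):
            problems.append(
                f"{key}: expected {expected.__name__}, got {type(result[key]).__name__}"
            )
    return problems


# Hypothetical invoice schema; the field names are assumptions.
invoice_v1 = ExtractionSpec(
    name="invoice",
    version=1,
    prompt="Extract the invoice number and total amount.",
    fields={"invoice_number": str, "total": float},
)
```

Keeping the prompt and the schema in one versioned record means a processed result can always be traced back to the exact instructions and field definitions that produced it, regardless of which LLM provider ran the extraction.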

A Document Agent is available to configure prompts and extractions, and to review processed results.

Tech stack

  • NextJS, NextAuth, MaterialUI, TailwindCSS
  • FastAPI
  • MongoDB
  • Pydantic
  • LiteLLM
  • OpenAI, Anthropic, Gemini, Vertex AI for GCP, AWS Bedrock, xAI, OpenRouter...

The PyData Boston DocRouter Slides (Feb '24) have more details about the tech stack and about how Cursor AI was used to build DocRouter.

User Experience

[Screenshots: Document List, Document Agent]

Example Deployment

[Image: Smart Document Router]

Presentations

Docs
