Yes, HIPAA compliance is on the roadmap and should be out in a few weeks. We spent a lot of time on healthcare/sensitive data use cases.
Google Document AI and Watson SDU seem to be an afterthought for IBM/Google. The accuracy and configurability often fall short when you want to use them in a production setting.
Comparing to other legacy document processing companies, I think there are a few areas where we differentiate:
1. We handle end-to-end workflows from integrating with data sources, defining the transformation, and automatically triggering new runs when there’s an update to the data.
2. We built our entire stack on LLM and Vision transformers and use OCR/parser to check the results. This allows the mapping and tasks to be a lot more robust and flexible.
3. We have validations, reference checking, and confidence score metrics that enable fast human-in-the-loop iteration.
How do your capabilities compare to Google Document AI or Watson SDU? Also what about standalone competitors such as Indico Data or DocuPanda?