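OCR powerhouse: Surya. It supports 90+ languages, layout analysis, and table recognition, with performance comparable to Google Cloud Vision and Tesseract and a processing speed of 0.62 seconds per page.
1. Line-level text detection
2. Layout analysis covering tables, images, headings, and more
3. Reading order detection
4. Table recognition that detects rows and columns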
GitHub: https://t.co/EDlOl6WZ1u
GitHub - transformerlab/transformerlab-app: Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
Announcing @MistralAI OCR - the world’s best document understanding API.
🔍 State-of-the-art understanding of complex documents
🌍 Natively multilingual and multimodal
⚡ Fastest in its category
📄 Doc-as-prompt, structured output
🔒 Available for on-prem deployment https://t.co/fUzlznYIsc