feat: add PaddleOCR-VL parser engine #3189
Conversation
Before merging this PR, it may be worth discussing the abstraction layer first: see #3197 (RFC: pluggable OCR/VLM parser abstraction). The current approach of adding a new four-file subpackage per engine is the same pattern we've seen for MinerU and Docling, and the list of requested engines is growing (SmolDocling, qwen3-vl, DeepSeek-OCR, GLM-OCR, Mistral OCR, Upstage Document Parse, ...). A shared
Good point. I agree that a shared I'm happy to either:
Which approach do you prefer?
Option 1 makes more sense. Let's finalize the protocol interface in #3197 first; then your PR can be the first concrete implementation against it. I'm waiting for maintainer feedback on the RFC direction before drafting the interface spec. Once that's in, I'm happy to collaborate on the design in #3197 so you have a clear target to implement against.
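To make the discussion concrete, here is a rough sketch of what a pluggable parser protocol with a small registry could look like. Nothing here comes from #3197 or from LightRAG's code: `ParserEngine`, `ParsedDocument`, `register`, `get_engine`, and the toy `EchoParser` are all hypothetical names chosen for illustration, and the real interface may end up looking quite different.

```python
from dataclasses import dataclass, field
from typing import Protocol, runtime_checkable


@dataclass
class ParsedDocument:
    """Hypothetical result type: extracted text plus free-form metadata."""
    text: str
    metadata: dict = field(default_factory=dict)


@runtime_checkable
class ParserEngine(Protocol):
    """Hypothetical interface that each engine (MinerU, Docling, ...) would satisfy."""
    name: str

    def parse(self, data: bytes, filename: str) -> ParsedDocument: ...


class EchoParser:
    """Toy engine used only to exercise the registry; it decodes UTF-8 bytes."""
    name = "echo"

    def parse(self, data: bytes, filename: str) -> ParsedDocument:
        return ParsedDocument(
            text=data.decode("utf-8"),
            metadata={"source": filename, "engine": self.name},
        )


# Hypothetical registry mapping engine names to engine instances.
_REGISTRY: dict = {}


def register(engine: ParserEngine) -> None:
    """Add an engine to the registry after a structural check against the protocol."""
    if not isinstance(engine, ParserEngine):
        raise TypeError("engine does not satisfy the ParserEngine protocol")
    _REGISTRY[engine.name] = engine


def get_engine(name: str) -> ParserEngine:
    """Look up a registered engine by name, failing loudly on unknown names."""
    try:
        return _REGISTRY[name]
    except KeyError:
        raise ValueError(f"unknown parser engine: {name!r}") from None


if __name__ == "__main__":
    register(EchoParser())
    doc = get_engine("echo").parse(b"hello", "note.txt")
    print(doc.text, doc.metadata)
```

With a shape like this, a new engine would only need to provide one class with a `name` and a `parse` method and register it, instead of adding a whole subpackage per engine.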
What does this PR do?
Adds PaddleOCR-VL as a new parser engine for LightRAG, providing an alternative to MinerU and Docling for document parsing.
Fixes #3114
Features
Files Added
Configuration
Environment Variables
Usage
Dependencies
Required
Testing
Start PaddleOCR-VL server:
Run LightRAG with PaddleOCR-VL:
Notes