### 🚀 The feature

- [x] datasets: DocLayNet, PubLayNet, DocBank, M6Doc, RanLayNet, PRImA, ? (offline work: merge into a single dataset)
- [x] model: https://github.com/Zeba-Xie/RTMDet-R2 (two stages: backbone, neck + head + losses) / LW-DETR (https://arxiv.org/abs/2406.03459)
- [x] metrics: mmAP (75, 50), rotated (ref.: https://github.com/open-mmlab/mmrotate/blob/main/mmrotate/core/evaluation/eval_map.py)
- [x] Implement train / eval / latency scripts
  - reuse DetectionDataset for KIE annotations ?
- [x] Integrate into pipeline
  - standalone predictor
  - Extend DocumentBuilder components
- [ ] If layout information is available, improve sorting to keep reading order (heuristic ? graph-based ordering (with networkx) ?)
- [ ] Needs another ticket: Allow different output formats (markdown, ..)

### Motivation, pitch

TODO: Split into single issues and add better descriptions
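
For the open reading-order item, here is a minimal sketch of what the "heuristic" option could look like. It is an assumption for discussion, not the project's implementation. The function name `sort_reading_order`, the `(xmin, ymin, xmax, ymax)` box format, and the 50% horizontal-overlap threshold are all hypothetical. Boxes are grouped into columns by horizontal overlap, then read column by column from left to right and top to bottom within each column.

```python
from typing import List, Tuple

# Hypothetical box format: (xmin, ymin, xmax, ymax), any consistent unit.
Box = Tuple[float, float, float, float]


def sort_reading_order(boxes: List[Box]) -> List[int]:
    """Return box indices in an approximate reading order.

    Assumed heuristic: group boxes into columns by horizontal overlap,
    then read columns left to right and boxes top to bottom.
    """
    columns: List[List[int]] = []  # box indices per column
    spans: List[List[float]] = []  # [xmin, xmax] covered by each column

    # Visit boxes from left to right so columns are built in x order.
    for idx in sorted(range(len(boxes)), key=lambda i: boxes[i][0]):
        xmin, _, xmax, _ = boxes[idx]
        for col, span in zip(columns, spans):
            overlap = min(xmax, span[1]) - max(xmin, span[0])
            narrowest = min(xmax - xmin, span[1] - span[0])
            # Assumed threshold: join a column when more than half of
            # the narrower extent overlaps horizontally.
            if overlap > 0.5 * narrowest:
                col.append(idx)
                span[0] = min(span[0], xmin)
                span[1] = max(span[1], xmax)
                break
        else:
            # No matching column found: start a new one.
            columns.append([idx])
            spans.append([xmin, xmax])

    order: List[int] = []
    for col, _ in sorted(zip(columns, spans), key=lambda cs: cs[1][0]):
        order.extend(sorted(col, key=lambda i: boxes[i][1]))
    return order


# Two-column page, boxes given in scrambled order.
example = [
    (0.55, 0.1, 0.95, 0.4),  # right column, top
    (0.05, 0.5, 0.45, 0.9),  # left column, bottom
    (0.05, 0.1, 0.45, 0.4),  # left column, top
    (0.55, 0.5, 0.95, 0.9),  # right column, bottom
]
reading_order = sort_reading_order(example)
```

A known limitation of this sketch: a full-width block such as a page title overlaps every column, so the layout collapses into a single top-to-bottom column. Handling that case is where the graph-based ordering mentioned above might help.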