Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) April 5, 2025 by kamal Comments