Bounding-Box Masking을 통한 PDF 테이블 중복 제거 및 Token 소모 75% 절감
How to Fix PDF Table Duplication in RAG / LLM Pipelines (Python)
How to Fix PDF Table Duplication in RAG / LLM Pipelines (Python)
Building a Free AI PDF Assistant: How I Solved Parsing Issues and Minimized LLM Costs
How I built a PDF bank statement analyzer in 8 languages (and what I learned)
Stop Parsing PDFs at Render Time: A Better Architecture for Structured Extraction
Github Copilot helped us cut down 50-75% time in our e-commerce business
I Built a Free AI Research Paper Reader -Here's How
How We Built a Federal Court Rules Database
I built two Apify actors that scrape U.S. Congress trading data — directly from government sources, no QuiverQuant
I Built CrabPDF: a Privacy-First PDF Editor That Runs Locally in the Browser
The fastest non-VLM parser that preserves document structure: tables, headings, lists is OpenDataLoader PDF.
I tried automating PCB library creation from datasheets