Extract structured data from PDF invoices