Feb. 17, 2024, 11:26 a.m. | /u/4AVcnE

Machine Learning www.reddit.com

Hello,

I'm working for a German insurance company looking to automate the extraction of data from customer invoices received as PDFs. We're particularly interested in details like invoice numbers, date, names, addresses, and line items with prices, aiming to output this information as JSON for further processing. These entities may appear multiple times or not at all.

We've tried several methods without success:

* **GPT-4 and various models**: Didn't consistently provide structured JSON output.
* **Impira/LayoutLM for invoices**: Struggled with …

advice automate automated customer data extraction german hello information insurance invoice json line machinelearning ner numbers pdfs processing

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US