[D] Extract Data From PDF file in the form of JSON Object. | allainews.com

April 20, 2024, 5:54 p.m. | /u/Ghulam_Nabi

Machine Learning www.reddit.com

https://preview.redd.it/q93ittqnaovc1.png?width=777&format=png&auto=webp&s=aec5c1767690bd3269ba9e601623e4d85378fd37

This is the image which i captured from the Pdf file, one thing the pdf text is selectable like I can select all the text written in heading and in tables as well. I have tried the couple of technique:

1: Used the MultiModalVectorStoreIndex using the llama-index-multi-modal-llms-openai (GPT4-API) by first converting the PDF into the Images using the OCR and then retrived the tables from the PDF but one thing I need to define the number of pages from …

api file gpt4 image images index llama llms machinelearning modal multi-modal ocr openai pdf tables text

More from www.reddit.com / Machine Learning

[D] Is there a more systematic way of choosing the layers or how deep the … 8 hours ago | www.reddit.com

architecture deep learning least machinelearning +6

[D] Where does the real value of a data scientist come from? 12 hours ago | www.reddit.com

code companies data data scientist +11

[D] NVIDIA GPU Benchmarks & Comparison 15 hours ago | www.reddit.com

a100 ada cards cloud +15

[N] 1st Workshop on In-Context Learning at ICML 2024 15 hours ago | www.reddit.com

context context learning icml in-context learning +2

[R] A Careful Examination of Large Language Model Performance on Grade School Arithmetic 16 hours ago | www.reddit.com

abstract benchmark benchmarks claim +21

[D] [R] Are there any methods/works that enable extracting high-quality dense feature map from CLIP/OpenCLIP … 18 hours ago | www.reddit.com

clip compute feature finetuning +8

[P] [D] Is inference time the important performance metric for ML Models on edge/mobile? 23 hours ago | www.reddit.com

apps devices edge embed +15

[D] UI-based Agents - the next big thing? 1 day ago | www.reddit.com

agents ai agents become big +10

[D] Any-dimensional equivariant neural networks 1 day, 1 hour ago | www.reddit.com

abstract assumptions authors cases +18

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net