[D] Extract Data From PDF file in the form of JSON Object. | allainews.com

April 20, 2024, 5:54 p.m. | /u/Ghulam_Nabi

Machine Learning www.reddit.com

https://preview.redd.it/q93ittqnaovc1.png?width=777&format=png&auto=webp&s=aec5c1767690bd3269ba9e601623e4d85378fd37

This is an image I captured from the PDF file. Note that the PDF text is selectable: I can select all the text in the headings and in the tables as well. I have tried a couple of techniques:

1: Used MultiModalVectorStoreIndex with llama-index-multi-modal-llms-openai (GPT-4 API), first converting the PDF into images using OCR and then retrieving the tables from the PDF. One issue: I need to define the number of pages from …
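Since the text in the PDF is selectable, the tables can in principle be read straight from the text layer instead of going through images and OCR. Below is a minimal sketch of that idea, not the approach tried above. It assumes the third-party pdfplumber library is installed and that the first row of each table is its header; the helper names `rows_to_records` and `pdf_tables_to_json` are made up for illustration.

```python
import json


def rows_to_records(rows):
    """Turn one table (first row assumed to be the header) into a list of dicts."""
    if not rows:
        return []
    # Empty cells may come back as None, so normalise them to "".
    header = [(cell or "").strip() for cell in rows[0]]
    records = []
    for row in rows[1:]:
        cells = [(cell or "").strip() for cell in row]
        records.append(dict(zip(header, cells)))
    return records


def pdf_tables_to_json(pdf_path):
    """Read every table from the PDF's text layer and return a JSON string."""
    import pdfplumber  # third-party; imported here so the helper above works without it

    result = []
    with pdfplumber.open(pdf_path) as pdf:
        # Walk all pages, so no page count has to be supplied up front.
        for page_number, page in enumerate(pdf.pages, start=1):
            for rows in page.extract_tables():
                result.append({"page": page_number, "rows": rows_to_records(rows)})
    return json.dumps(result, indent=2)


# Hypothetical sample data standing in for one extracted table.
sample = [["Item", "Qty"], ["Widget", "2"], ["Gadget", None]]
print(json.dumps(rows_to_records(sample)))
```

Calling `pdf_tables_to_json("file.pdf")` on a real document would return one JSON entry per detected table. How well table detection works depends on the PDF layout, so merged cells or multi-line headers would likely need extra handling.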


