Try NVIDIA NIM APIs

OCDNet and OCRNet are pre-trained models designed for optical character detection and recognition respectively.

771

1y

Model for table extraction that receives an image as input, runs OCR on the image, and returns the text within the image and its bounding boxes.

189K

8mo

Cutting-edge vision-language model exceling in retrieving text and metadata from images.

295K

9mo