Translate images of plots into tables with one-shot visual language understanding.
The Google DePlot model is a one-shot visual language understanding solution that translates images of plots or charts into linearized tables.
By using this model, you are agreeing to the terms and conditions of the license, acceptable use policy and Google Research privacy policy.
Architecture Type: Transformer
Network Architecture: Pix2Struct
Input Format: Red, Green, Blue (RGB) Image + Text
Input Parameters: None
Other Properties Related to Input: None
Output Format: Text
Output Parameters: temperature, top_p, max_tokens
Other Properties Related to Output: stream
Linux
Engine: Triton
Test Hardware: Other