IMAGE EXTRACTION


1. Image to Text - Image Binary Input

Method: POST
URL: /image-to-text
Headers: Content-Type: application/octet-stream

Query Parameters

Attribute       Description                                               Type   Required   Options
text_language   Language of the text in the image, for better accuracy.   text   false      One of the values listed below

Afrikaans
Amharic
Arabic
Assamese
Azerbaijani
Azerbaijani (Cyrillic)
Belarusian
Bengali
Tibetan
Bosnian
Breton
Bulgarian
Catalan
Cebuano
Czech
Chinese Simplified
Chinese Simplified (Vertical)
Chinese Traditional
Chinese Traditional (Vertical)
Cherokee
Corsican
Welsh
Danish
German
Divehi
Dzongkha
Greek
English
Middle English
Esperanto
Math / Equation Detection
Estonian
Basque
Faroese
Persian
Filipino
Finnish
French
Frankish
Middle French
Frisian
Scottish Gaelic
Irish
Galician
Ancient Greek
Gujarati
Haitian Creole
Hebrew
Hindi
Croatian
Hungarian
Armenian
Inuktitut
Indonesian
Icelandic
Italian
Old Italian
Javanese
Japanese
Japanese (Vertical)
Kannada
Georgian
Old Georgian
Kazakh
Khmer
Kyrgyz
Kurdish (Kurmanji)
Korean
Lao
Latin
Latvian
Lithuanian
Luxembourgish
Malayalam
Marathi
Macedonian
Maltese
Mongolian
Maori
Malay
Burmese
Nepali
Dutch
Norwegian
Occitan
Oriya
Orientation and script detection (OSD)
Punjabi
Polish
Portuguese
Pashto
Quechua
Romanian
Russian
Sanskrit
Arabic Script
Armenian Script
Bengali Script
Canadian Aboriginal Script
Cherokee Script
Cyrillic Script
Devanagari Script
Ethiopic Script
Fraktur Script
Georgian Script
Greek Script
Gujarati Script
Gurmukhi Script
HanS Script
HanS (Vertical) Script
HanT Script
HanT (Vertical) Script
Hangul Script
Hangul (Vertical) Script
Hebrew Script
Japanese Script
Japanese (Vertical) Script
Kannada Script
Khmer Script
Lao Script
Latin Script
Malayalam Script
Myanmar Script
Oriya Script
Sinhala Script
Syriac Script
Tamil Script
Telugu Script
Thaana Script
Thai Script
Tibetan Script
Vietnamese Script
Sinhalese
Slovak
Slovenian
Sindhi
Spanish
Old Spanish
Albanian
Serbian
Serbian (Latin)
Sundanese
Swahili
Swedish
Syriac
Tamil
Tatar
Telugu
Tajik
Thai
Tigrinya
Tongan
Turkish
Uighur
Ukrainian
Urdu
Uzbek
Uzbek (Cyrillic)
Vietnamese
Yiddish
Yoruba

Body: The request body contains the raw binary data of the image.

Example Request

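# image.png is a placeholder for the local image file.
# --data-binary sends the file bytes unmodified; plain -d would treat them as text.
curl -X POST 'https://pdf.msquare.pro/image-to-text?appName=DocCrafter&text_language=eng' \
     -H 'content-type: application/octet-stream' \
     -H 'authorization: YOUR_TOKEN_HERE' \
     --data-binary '@image.png'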
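
The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the function name build_binary_request is hypothetical, the appName value and the "eng" language value are copied from the curl example, and passing the token as the raw authorization header value is assumed from that example.

```python
import urllib.parse
import urllib.request
from typing import Optional

# Endpoint taken from the curl example in this section.
BASE_URL = "https://pdf.msquare.pro/image-to-text"


def build_binary_request(
    image_bytes: bytes,
    token: str,
    text_language: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request whose body is the raw image bytes."""
    # appName value copied from the curl example; its meaning is not documented here.
    query = {"appName": "DocCrafter"}
    if text_language is not None:
        # Optional query parameter described in the table above.
        query["text_language"] = text_language
    url = f"{BASE_URL}?{urllib.parse.urlencode(query)}"
    return urllib.request.Request(
        url,
        data=image_bytes,
        method="POST",
        headers={
            "Content-Type": "application/octet-stream",
            "Authorization": token,  # assumed: token sent as-is, as in the curl example
        },
    )


# Usage (hypothetical file name and token):
# with open("image.png", "rb") as f:
#     req = build_binary_request(f.read(), "YOUR_TOKEN_HERE", "eng")
# with urllib.request.urlopen(req) as resp:
#     body = resp.read()
```

Building the request separately from sending it makes it easy to inspect the URL, headers, and body before any network call is made.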
           

2. Image to Text - Image URL Input

Method: POST
URL: /image-to-text
Headers: Content-Type: application/json

Body Parameters


Attribute       Description                                               Type   Required   Options
image_url       URL of the image.                                         text   true
text_language   Language of the text in the image, for better accuracy.   text   false      Same values as listed for text_language in section 1

Example Request

curl -X POST 'https://pdf.msquare.pro/image-to-text?appName=DocCrafter' \
     -H 'content-type: application/json' \
     -H 'authorization: YOUR_TOKEN_HERE' \
     -d '{"image_url":"https://images.pexels.com/photos/268533/pexels-photo-268533.jpeg?auto=compress&cs=tinysrgb&dpr=1&w=500"}'
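
For the URL variant, the JSON body can be assembled as in the Python sketch below. It is an illustration only: build_url_request is a hypothetical name, the appName value is copied from the curl example, and text_language is sent as a body field because it is listed under Body Parameters above.

```python
import json
import urllib.request
from typing import Optional

# Endpoint and appName value taken from the curl example in this section.
ENDPOINT = "https://pdf.msquare.pro/image-to-text?appName=DocCrafter"


def build_url_request(
    image_url: str,
    token: str,
    text_language: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request with a JSON body naming the image URL."""
    payload = {"image_url": image_url}  # required body parameter
    if text_language is not None:
        payload["text_language"] = text_language  # optional body parameter
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": token,  # assumed: token sent as-is, as in the curl example
        },
    )


# Usage (hypothetical token and image URL):
# req = build_url_request("https://example.com/scan.png", "YOUR_TOKEN_HERE")
# with urllib.request.urlopen(req) as resp:
#     body = resp.read()
```

Using json.dumps rather than hand-written JSON keeps characters such as & and ? in the image URL correctly escaped inside the body.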
           

Response

The response will be a JSON object containing the extracted text under the data field.
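
Reading the result might look like the following Python sketch. The helper name extract_text is hypothetical, and the sample body used in the usage comment is made up for illustration; the only thing taken from this page is that the extracted text sits under the data field. Whether data holds a plain string or a nested structure is not stated here, so the helper returns whatever it finds.

```python
import json
from typing import Any


def extract_text(response_body: bytes) -> Any:
    """Return the value stored under the "data" field of a JSON response body."""
    parsed = json.loads(response_body)
    # Fail loudly if the body is not a JSON object with a "data" field.
    if not isinstance(parsed, dict) or "data" not in parsed:
        raise ValueError('response has no "data" field')
    return parsed["data"]


# Usage with an invented sample body (not a real API response):
# text = extract_text(b'{"data": "Hello world"}')
```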

Notes