1. Image Recognition
API Documents - GuGuData | Production-Ready APIs Built for Developers
  • GuGuData API documents
  • Metadata
    • DEMO
      • [DEMO] Global QS World University Rankings
      • [DEMO] Global University
      • [DEMO] Chinese Poem
      • [DEMO] Global QS World University Rankings
      • [DEMO] Stock US Symbols
      • [DEMO] Stock HK Symbols
    • Global QS World University Rankings
      GET
    • Global University
      GET
    • Chinese Poem
      GET
    • Global QS World University Rankings
      GET
    • Stock US Symbols
      GET
    • Stock HK Symbols
      GET
  • Website Tools
    • DEMO
      • [DEMO] Webpage Readable Content Extraction
      • [DEMO] Domain SSL Certificate Information Parsing
      • [DEMO] Domain DNS Information Query
      • [DEMO] Query Website Favicon and Title
      • [DEMO] Format International Phone
      • [DEMO] URL to static HTML
      • [DEMO] URL to Image
      • [DEMO] URL to Markdown
      • [DEMO] Get URL links
      • [DEMO] Website Snapshot
      • [DEMO] Domain Whois
      • [DEMO] IP Address
      • [DEMO] Article Extract
      • [DEMO] Geographic Coordinate System Converter
      • [DEMO] Extract Structured JSON from Webpage
      • [DEMO] Short Link API
    • Webpage Readable Content Extraction
      POST
    • Domain SSL Certificate Information Parsing
      GET
    • Domain DNS Information Query
      GET
    • Query Website Favicon and Title
      GET
    • Format International Phone
      GET
    • URL to static HTML
      POST
    • URL to Image
      GET
    • URL to Markdown
      POST
    • Get URL links
      GET
    • Website Snapshot
      POST
    • Domain Whois
      GET
    • IP Address
      GET
    • Article Extract
      POST
    • Geographic Coordinate System Converter
      GET
    • Extract Structured JSON from Webpage
      POST
    • Short Link API
      POST
  • Image Recognition
    • DEMO
      • [DEMO] HTML to PDF
      • [DEMO] Image OCR
      • [DEMO] PDF to Format
      • [DEMO] PDF Splitting
      • [DEMO] Markdown to PDF
      • [DEMO] PPT to Images
      • [DEMO] PDF to HTML
      • [DEMO] PDF Summary
      • [DEMO] Image Compress
      • [DEMO] Word to HTML
      • [DEMO] Convert HTML to Word
      • [DEMO] Convert PPT to PDF
    • HTML to PDF
      POST
    • Image OCR
      POST
    • PDF to Format
      POST
    • PDF Splitting
      POST
    • Markdown to PDF
      POST
    • PPT to Images
      POST
    • PDF to HTML
      POST
    • PDF Summary
      POST
    • Image Compress
      POST
    • Word to HTML
      POST
    • Convert HTML to Word
      POST
    • Convert PPT to PDF
      POST
  • QRcode and Barcode
    • DEMO
      • [DEMO] QR Code
      • [DEMO] Wifi QR Code
      • [DEMO] Bar Code
      • [DEMO] Decode QR Code from Image
    • QR Code
    • Wifi QR Code
    • Bar Code
    • Decode QR Code from Image
  • Text Tools
    • DEMO
      • [DEMO] Simplified and Traditional Chinese Converter
      • [DEMO] Text Similarity Calculator
      • [DEMO] Detect Text Language
      • [DEMO] ISBN Book Metadata Lookup
    • Simplified and Traditional Chinese Converter
    • Text Similarity Calculator
    • Detect Text Language
    • ISBN Book Metadata Lookup
  • healthcheck
    GET
  1. Image Recognition

PDF to Format

POST
https://api.gugudata.io/v1/imagerecognition/pdf2format
Last modified:2026-05-14 10:39:32
PDF Parsing and Formatted Output
Convert an uploaded PDF into a structured downstream format such as Markdown or text, based on the selected output type.
Method: POST
Path: /v1/imagerecognition/pdf2format
Demo: https://api.gugudata.io/v1/imagerecognition/pdf2format/demo
OpenAPI: https://gugudata.io/assets/openapi/gugudata.openapi.3.1.json
Request Parameters:
appkey (string, required): Application key used for request authentication. Supply the value as a query parameter, form field, or multipart field according to the request content type.
type (string, required): Endpoint-specific type selector. Refer to the endpoint description for supported values.
pdffile (file, required): PDF file uploaded as multipart form data.
Response Fields:
DataStatus.statusCode (integer, required): Application-level status code returned by the current v1 contract.
DataStatus.statusDescription (string, required): Application-level status message returned by the current v1 contract.
DataStatus.responseDateTime (string, required): Response timestamp returned by the current service contract.
DataStatus.dataTotalCount (integer, required): Total number of records that match the request.
Data.result (string, required): Parsed PDF data returned by the API, the format is determined by the type parameter
HTTP Status Codes:
200: Request processed successfully. Some endpoints expose a separate application-level status field in the response body, such as dataStatus.statusCode.
400: Invalid request parameters or request format. Check required fields, data types, and request body format.
401: Missing or unknown application key. Provide a valid appkey with the request.
403: The application key is recognized but access is not allowed. The key may be expired, inactive, or not permitted for the requested API.
429: Request rate or trial usage limit exceeded. Reduce concurrency or retry after the limit window resets.
500: Internal service error. Retry later or contact support if the error persists.
503: Upstream service unavailable. Retry later; the requested upstream dependency is temporarily unavailable.
Business Status Codes:
100 Normal response: No additional remark.
101 Parameter error: No additional remark.
102 Request rate limited: Requests cannot exceed 100 per second
103 Account overdue: No additional remark.
104 Invalid APPKEY: Please check if the passed APPKEY is the one obtained from the developer center
110 API response error: No additional remark.
Key Features:
General recognition API, supports standard PDF file parsing.
Multiple format output, supports TEXT, HTML, XML, TAG.
HTML includes perfect formatting.
Recognition accuracy improves continuously with machine learning.
1M file millisecond-level recognition performance.
Fully supports HTTPS (TLS v1.0 / v1.1 / v1.2 / v1.3).
Fully compatible with Apple ATS.
Nationwide multi-node CDN deployment.
Fast API response, load balancing across multiple servers.
Details:
https://gugudata.io/details/pdf2format

Request

Query Params

Body Params multipart/form-data

Request Code Samples

Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --globoff 'https://api.gugudata.io/v1/imagerecognition/pdf2format?appkey=sk-******' \
--form 'type="html"' \
--form 'pdffile=@"null"'

Responses

🟢200成功
application/json
Bodyapplication/json

Example
{
  "DataStatus": {
    "statusCode": 0,
    "statusDescription": "string",
    "responseDateTime": "string",
    "dataTotalCount": 0,
    "StatusCode": 100,
    "StatusDescription": "OK",
    "ResponseDateTime": "2026-01-01 00:00:00",
    "DataTotalCount": 1
  },
  "Data": {
    "result": "string"
  }
}
Modified at 2026-05-14 10:39:32
Previous
Image OCR
Next
PDF Splitting
Built with