1. Website Tools
API Documents - GuGuData | Production-Ready APIs Built for Developers
  • GuGuData API documents
  • Metadata
    • DEMO
      • [DEMO] Global QS World University Rankings
      • [DEMO] Global University
      • [DEMO] Chinese Poem
      • [DEMO] Global QS World University Rankings
      • [DEMO] Stock US Symbols
      • [DEMO] Stock HK Symbols
    • Global QS World University Rankings
      GET
    • Global University
      GET
    • Chinese Poem
      GET
    • Global QS World University Rankings
      GET
    • Stock US Symbols
      GET
    • Stock HK Symbols
      GET
  • Website Tools
    • DEMO
      • [DEMO] Webpage Readable Content Extraction
      • [DEMO] Domain SSL Certificate Information Parsing
      • [DEMO] Domain DNS Information Query
      • [DEMO] Query Website Favicon and Title
      • [DEMO] Format International Phone
      • [DEMO] URL to static HTML
      • [DEMO] URL to Image
      • [DEMO] URL to Markdown
      • [DEMO] Get URL links
      • [DEMO] Website Snapshot
      • [DEMO] Domain Whois
      • [DEMO] IP Address
      • [DEMO] Article Extract
      • [DEMO] Geographic Coordinate System Converter
      • [DEMO] Extract Structured JSON from Webpage
      • [DEMO] Short Link API
    • Webpage Readable Content Extraction
      POST
    • Domain SSL Certificate Information Parsing
      GET
    • Domain DNS Information Query
      GET
    • Query Website Favicon and Title
      GET
    • Format International Phone
      GET
    • URL to static HTML
      POST
    • URL to Image
      GET
    • URL to Markdown
      POST
    • Get URL links
      GET
    • Website Snapshot
      POST
    • Domain Whois
      GET
    • IP Address
      GET
    • Article Extract
      POST
    • Geographic Coordinate System Converter
      GET
    • Extract Structured JSON from Webpage
      POST
    • Short Link API
      POST
  • Image Recognition
    • DEMO
      • [DEMO] HTML to PDF
      • [DEMO] Image OCR
      • [DEMO] PDF to Format
      • [DEMO] PDF Splitting
      • [DEMO] Markdown to PDF
      • [DEMO] PPT to Images
      • [DEMO] PDF to HTML
      • [DEMO] PDF Summary
      • [DEMO] Image Compress
      • [DEMO] Word to HTML
      • [DEMO] Convert HTML to Word
      • [DEMO] Convert PPT to PDF
    • HTML to PDF
      POST
    • Image OCR
      POST
    • PDF to Format
      POST
    • PDF Splitting
      POST
    • Markdown to PDF
      POST
    • PPT to Images
      POST
    • PDF to HTML
      POST
    • PDF Summary
      POST
    • Image Compress
      POST
    • Word to HTML
      POST
    • Convert HTML to Word
      POST
    • Convert PPT to PDF
      POST
  • QRcode and Barcode
    • DEMO
      • [DEMO] QR Code
      • [DEMO] Wifi QR Code
      • [DEMO] Bar Code
      • [DEMO] Decode QR Code from Image
    • QR Code
    • Wifi QR Code
    • Bar Code
    • Decode QR Code from Image
  • Text Tools
    • DEMO
      • [DEMO] Simplified and Traditional Chinese Converter
      • [DEMO] Text Similarity Calculator
      • [DEMO] Detect Text Language
      • [DEMO] ISBN Book Metadata Lookup
    • Simplified and Traditional Chinese Converter
    • Text Similarity Calculator
    • Detect Text Language
    • ISBN Book Metadata Lookup
  • healthcheck
    GET
  1. Website Tools

Extract Structured JSON from Webpage

POST
https://api.gugudata.io/v1/websitetools/url2json
Last modified:2026-05-14 10:39:32
Extract Structured JSON from Webpage
Extract structured JSON from a webpage according to a natural-language prompt.
Method: POST
Path: /v1/websitetools/url2json
Demo: https://api.gugudata.io/v1/websitetools/url2json/demo
OpenAPI: https://gugudata.io/assets/openapi/gugudata.openapi.3.1.json
Request Parameters:
appkey (string, required): Application key used for request authentication. Supply the value as a query parameter, form field, or multipart field according to the request content type.
url (string, required): Target webpage URL.
prompt (string, required): Natural-language instruction that describes the structured data to extract from the webpage.
Response Fields:
dataStatus (object, required): Response metadata returned by the current v1 contract.
dataStatus.requestParameter (string, required): Normalized request parameters echoed by the service. Sensitive credentials are omitted when available.
dataStatus.statusCode (integer, required): Application-level status code returned by the current v1 contract. Successful demo responses currently return 200.
dataStatus.status (string, required): Application-level status enum. Successful demo responses currently return SUCCESS.
dataStatus.statusDescription (string, required): Application-level status message returned by the current v1 contract.
dataStatus.responseDateTime (string, required): Response timestamp returned by the current service contract.
dataStatus.dataTotalCount (integer, required): Total number of records that match the request.
data (object, required): Primary response payload returned by the endpoint.
HTTP Status Codes:
200: Request processed successfully. Some endpoints expose a separate application-level status field in the response body, such as dataStatus.statusCode.
400: Invalid request parameters or request format. Check required fields, data types, and request body format.
401: Missing or unknown application key. Provide a valid appkey with the request.
403: The application key is recognized but access is not allowed. The key may be expired, inactive, or not permitted for the requested API.
429: Request rate or trial usage limit exceeded. Reduce concurrency or retry after the limit window resets.
500: Internal service error. Retry later or contact support if the error persists.
503: Upstream service unavailable. Retry later; the requested upstream dependency is temporarily unavailable.
Business Status Codes:
200 Normal return: Structured JSON data successfully extracted from the webpage
400 Parameter error: Missing or invalid parameters (url and prompt are both required, url must be valid HTTP/HTTPS)
429 Request frequency limited: Cannot exceed 100 requests per second
403 Account in arrears: Payment required to continue using the service
402 APPKEY error: Please check whether the APPKEY passed is obtained from the developer center
500 API response error: Internal server error during JSON extraction
503 Service unavailable: External AI service temporarily unavailable
Key Features:
AI-powered structured data extraction from any webpage.
Custom prompt-based extraction for flexible data schemas.
Returns clean, structured JSON ready for programmatic use.
Supports any publicly accessible HTTP/HTTPS webpage.
Handles JavaScript-rendered pages via headless browser.
Full API support for HTTPS (TLS v1.0 / v1.1 / v1.2 / v1.3).
Fully compatible with Apple ATS.
Nationwide multi-node CDN deployment.
Ultra-fast response, API interface load balancing built with multiple servers.
Details:
https://gugudata.io/details/url2json

Request

Query Params

Header Params

Body Params application/json

Example
{
    "url": "https://www.gugudata.io",
    "prompt": "Extract the title and main content."
}

Request Code Samples

Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --globoff 'https://api.gugudata.io/v1/websitetools/url2json?appkey=sk-******' \
--header 'Content-Type: application/json' \
--data '{
    "url": "https://www.gugudata.io",
    "prompt": "Extract the title and main content."
}'

Responses

🟢200成功
application/json
Bodyapplication/json

Example
{
  "dataStatus": {
    "statusCode": 0,
    "status": "string",
    "statusDescription": "string",
    "responseDateTime": "string",
    "dataTotalCount": 0
  },
  "data": {},
  "DataStatus": {
    "StatusCode": 100,
    "StatusDescription": "OK",
    "ResponseDateTime": "2026-01-01 00:00:00",
    "DataTotalCount": 1
  }
}
Modified at 2026-05-14 10:39:32
Previous
Geographic Coordinate System Converter
Next
Short Link API
Built with