GET /v1/content

Extract clean HTML, Markdown, and structured metadata from any public URL. Ideal for AI pipelines, content monitoring, and indexing.

curl "https://api.chuger.com/v1/content?url=https://example.com/blog/article" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

const res = await fetch(
  `https://api.chuger.com/v1/content?url=${encodeURIComponent('https://example.com/blog/article')}`,
  { headers: { Authorization: `Bearer ${process.env.CHUGER_TOKEN}` } },
);
const data = await res.json();

import os, requests

r = requests.get(
    "https://api.chuger.com/v1/content",
    params={"url": "https://example.com/blog/article"},
    headers={"Authorization": f"Bearer {os.environ['CHUGER_TOKEN']}"},
)
data = r.json()

{
  "url": "https://example.com/blog/article",
  "success": true,
  "statusCode": 200,
  "errorMessage": null,
  "html": "<article>...</article>",
  "markdown": "# Article title\n\nFirst paragraph...",
  "metadata": {
    "title": "Article title",
    "description": "Short summary of the article",
    "keywords": "web, scraping, example",
    "author": "Jane Doe",
    "language": "en",
    "favicon": "https://example.com/favicon.ico",
    "ogImage": "https://example.com/cover.png",
    "ogImageWidth": "1200",
    "ogImageHeight": "630",
    "ogTitle": "Article title",
    "ogType": "article"
  }
}

Turn a web page into clean HTML, Markdown, and structured metadata.

GET https://api.chuger.com/v1/content

If you just need the raw HTML, use /v1/scrape. To process many URLs at once, use /v1/content/bulk.

Authentication

Send your API token as a Bearer token in the Authorization header. See Authentication.

Cost

Plan	Credits per request
Basic	2
Pro	2
Business	2

Credits are only deducted on success.

Query parameters

url (string, required)

The URL to extract content from. Must be HTTP or HTTPS, max 180 characters. Raw IP hosts and non-default ports are rejected.
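
The constraints above can be checked locally before a request is sent, which avoids a round trip that would end in a 422. The following is a minimal sketch, not the API's own validation: the function name check_target_url is hypothetical, and it assumes that an explicitly written default port (80 for http, 443 for https) is acceptable, which the parameter description does not state.

```python
import ipaddress
from urllib.parse import urlsplit

MAX_URL_LENGTH = 180  # limit stated in the parameter description
DEFAULT_PORTS = {"http": 80, "https": 443}


def check_target_url(url: str) -> list[str]:
    """Return a list of problems; an empty list means the URL looks acceptable."""
    problems = []
    if len(url) > MAX_URL_LENGTH:
        problems.append(f"longer than {MAX_URL_LENGTH} characters")

    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    if scheme not in DEFAULT_PORTS:
        problems.append("scheme must be http or https")
        return problems

    host = parts.hostname
    if not host:
        problems.append("missing host")
        return problems

    try:
        ipaddress.ip_address(host)
        problems.append("raw IP hosts are rejected")
    except ValueError:
        pass  # not an IP literal, so treat it as a hostname

    try:
        port = parts.port
    except ValueError:
        problems.append("invalid port")
        return problems
    # Assumption: writing the default port explicitly is allowed.
    if port is not None and port != DEFAULT_PORTS[scheme]:
        problems.append("non-default ports are rejected")
    return problems
```

An empty result only means the URL passed these local checks; the API remains the authority on what it accepts.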

Response fields

url (string, required)

The URL that was extracted.

success (boolean, required)

true when content was successfully extracted.

statusCode (integer, required)

The HTTP status code returned by the target site.

errorMessage (string)

Error description, populated when success is false; null otherwise.

html (string)

Cleaned, readable HTML of the main content. Boilerplate (nav, footer, ads) is removed.

markdown (string)

The same content rendered as Markdown.

metadata (object, required)

Page-level metadata.

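metadata.title (string)

Contents of the page's <title> tag.

metadata.description (string)

Contents of the meta description tag.

metadata.keywords (string)

Contents of the meta keywords tag.

metadata.author (string)

Page author.

metadata.language (string)

Language code, e.g. en.

metadata.favicon (string)

Favicon URL.

metadata.ogImage (string)

Open Graph image URL.

metadata.ogImageWidth (string)

Open Graph image width in pixels, returned as a string, e.g. "1200".

metadata.ogImageHeight (string)

Open Graph image height in pixels, returned as a string, e.g. "630".

metadata.ogTitle (string)

Open Graph title.

metadata.ogType (string)

Open Graph type, e.g. article.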
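
Because success and errorMessage travel in the response body, a caller will usually want to check them before touching the content fields. The sketch below shows one way to do that on an already-decoded JSON body. The names ExtractionFailed and read_markdown are hypothetical, and the empty-string fallback is a choice made here because html and markdown are not marked Required.

```python
class ExtractionFailed(Exception):
    """Raised when the response body reports success == false."""


def read_markdown(payload: dict) -> str:
    """Return the markdown field, or raise if the body reports a failure."""
    if not payload.get("success"):
        # errorMessage is documented as populated when success is false.
        reason = payload.get("errorMessage") or "no errorMessage provided"
        raise ExtractionFailed(
            f"{payload.get('url')}: {reason} "
            f"(target status {payload.get('statusCode')})"
        )
    # markdown is not marked Required, so fall back to an empty string.
    return payload.get("markdown") or ""
```

Usage, with data being the decoded response body: text = read_markdown(data).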

Errors

Status	When
`401`	Missing or invalid token
`402`	No plan, or insufficient credits
`422`	`url` missing, malformed, too long, raw IP, or non-default port
`429`	Rate limit or monthly quota exceeded
`503`	Content could not be extracted

See Errors for the full reference.
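
One way to act on the status codes in the table is sketched below. It treats 429 and 503 as worth retrying with exponential backoff and everything else as a caller error. That split is an assumption, not documented behavior: a 429 caused by an exhausted monthly quota will not clear on retry, and a 503 may be permanent for some pages. The send callable and the fetch_with_retry name are hypothetical; send stands in for whatever performs one HTTP request and returns the status code with the decoded body.

```python
import time

ERROR_HINTS = {
    401: "missing or invalid token",
    402: "no plan, or insufficient credits",
    422: "url missing, malformed, too long, raw IP, or non-default port",
    429: "rate limit or monthly quota exceeded",
    503: "content could not be extracted",
}
RETRYABLE = {429, 503}  # assumption: these may succeed on a later attempt


def fetch_with_retry(send, max_attempts=3, sleep=time.sleep):
    """Call send() until it returns status 200 or the attempts run out.

    send() performs one request and returns (status, payload).
    """
    for attempt in range(max_attempts):
        status, payload = send()
        if status == 200:
            return payload
        last_attempt = attempt == max_attempts - 1
        if status not in RETRYABLE or last_attempt:
            hint = ERROR_HINTS.get(status, "unexpected status")
            raise RuntimeError(f"HTTP {status}: {hint}")
        # Exponential backoff: 1 s, 2 s, 4 s, ... capped at 30 s.
        sleep(min(30.0, 2.0 ** attempt))
```

Passing sleep as a parameter keeps the function easy to exercise in tests without real delays.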

Tips

markdown is convenient for feeding into LLMs and search indexes. Use the metadata block to populate previews, cards, and search results in your UI without an extra parse step.
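
As an illustration of the metadata tip, the sketch below builds a small preview-card dictionary from a decoded response. The function name preview_card, the output keys, and the fallback order (ogTitle, then title, then the URL) are choices made for this example rather than anything the API prescribes. The image dimensions are converted because the response documents them as strings.

```python
def preview_card(payload: dict) -> dict:
    """Build a UI-friendly preview from the metadata block of a response."""
    meta = payload.get("metadata") or {}

    def to_int(value):
        # ogImageWidth and ogImageHeight are documented as strings.
        try:
            return int(value)
        except (TypeError, ValueError):
            return None

    return {
        # Fallback order is an assumption made for this sketch.
        "title": meta.get("ogTitle") or meta.get("title") or payload.get("url"),
        "summary": meta.get("description") or "",
        "image": meta.get("ogImage"),
        "image_width": to_int(meta.get("ogImageWidth")),
        "image_height": to_int(meta.get("ogImageHeight")),
        "icon": meta.get("favicon"),
    }
```

Missing metadata fields come back as None or an empty string, so the card can be rendered without further checks.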

For batch jobs, /v1/content/bulk is much more efficient than firing many parallel requests here.
