GET /v1/content
Extract clean HTML, Markdown, and structured metadata from any public URL. Ideal for AI pipelines, content monitoring, and indexing.
curl -G "https://api.chuger.com/v1/content" \
  --data-urlencode "url=https://example.com/blog/article" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
const res = await fetch(
  `https://api.chuger.com/v1/content?url=${encodeURIComponent('https://example.com/blog/article')}`,
  { headers: { Authorization: `Bearer ${process.env.CHUGER_TOKEN}` } },
);
const data = await res.json();
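import os
import requests
r = requests.get(
    "https://api.chuger.com/v1/content",
    params={"url": "https://example.com/blog/article"},  # requests URL-encodes this value
    headers={"Authorization": f"Bearer {os.environ['CHUGER_TOKEN']}"},
    timeout=30,  # avoid waiting indefinitely if the API is slow to respond
)
data = r.json()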
{
  "url": "https://example.com/blog/article",
  "success": true,
  "statusCode": 200,
  "errorMessage": null,
  "html": "<article>...</article>",
  "markdown": "# Article title\nFirst paragraph...",
  "metadata": {
    "title": "Article title",
    "description": "Short summary of the article",
    "keywords": "web, scraping, example",
    "author": "Jane Doe",
    "language": "en",
    "favicon": "https://example.com/favicon.ico",
    "ogImage": "https://example.com/cover.png",
    "ogImageWidth": "1200",
    "ogImageHeight": "630",
    "ogTitle": "Article title",
    "ogType": "article"
  }
}
{
  "message": "Failed to extract content from the URL using available services."
}
Turn a web page into clean HTML, Markdown, and structured metadata.
GET https://api.chuger.com/v1/content
If you just need the raw HTML, use /v1/scrape. To process many URLs at once, use /v1/content/bulk.
Authentication
Send your API token as a Bearer token in the Authorization header. See Authentication.
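As a minimal sketch of how the header and the url parameter fit together, the snippet below builds (but does not send) a request using only the Python standard library. The helper name build_content_request is hypothetical and is not part of any official client.

```python
import urllib.parse
import urllib.request

API_BASE = "https://api.chuger.com/v1/content"  # endpoint documented on this page


def build_content_request(target_url: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for /v1/content. Hypothetical helper."""
    # urlencode percent-encodes the target URL, so its own ':', '/', '?' and '&'
    # cannot be confused with the API's query string.
    query = urllib.parse.urlencode({"url": target_url})
    return urllib.request.Request(
        f"{API_BASE}?{query}",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


req = build_content_request("https://example.com/blog/article?x=1&y=2", "YOUR_API_TOKEN")
print(req.full_url)
```

Encoding matters most when the target URL carries its own query string. Without it, everything after the first & would be read as a separate parameter of the API call.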
Cost
| Plan | Credits per request |
|---|---|
| Basic | 2 |
| Pro | 2 |
| Business | 2 |
Credits are only deducted on success.
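For budgeting, credit usage can be estimated from the responses you have collected. The sketch below assumes that "success" for billing purposes corresponds to a response body with success set to true. This page does not state that explicitly, so treat the result as an estimate. The function name credits_used is hypothetical.

```python
CREDITS_PER_REQUEST = 2  # same for Basic, Pro, and Business per the table above


def credits_used(responses: list[dict]) -> int:
    """Estimate credits consumed by a list of parsed /v1/content response bodies."""
    # Assumption: a request is billed only when its body has "success": true.
    return sum(CREDITS_PER_REQUEST for r in responses if r.get("success") is True)
```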
Query parameters
url (required): The URL to extract content from. Must be HTTP or HTTPS, max 180 characters. Raw IP hosts and non-default ports are rejected.
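These constraints can be approximated client-side to catch obviously invalid URLs before sending a request. The sketch below is not the server's validation logic, and the function name check_target_url is hypothetical. It assumes the 180-character limit applies to the unencoded URL and that an explicitly written default port (such as :443 with https) is accepted.

```python
import ipaddress
from typing import Optional
from urllib.parse import urlsplit

MAX_URL_LENGTH = 180  # limit stated on this page
DEFAULT_PORTS = {"http": 80, "https": 443}


def check_target_url(url: str) -> Optional[str]:
    """Return a reason the URL would likely be rejected, or None if it looks fine."""
    if len(url) > MAX_URL_LENGTH:
        return "too long"
    try:
        parts = urlsplit(url)
        port = parts.port  # raises ValueError on a malformed port
    except ValueError:
        return "malformed"
    if parts.scheme not in DEFAULT_PORTS:
        return "scheme must be http or https"
    host = parts.hostname
    if not host:
        return "missing host"
    try:
        ipaddress.ip_address(host)  # succeeds only for IPv4/IPv6 literals
        return "raw IP host"
    except ValueError:
        pass  # not an IP literal, so it is a hostname
    if port is not None and port != DEFAULT_PORTS[parts.scheme]:
        return "non-default port"
    return None
```

A URL that passes this check can still be rejected with 422 by the API, so the server response remains the source of truth.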
Example
Response fields
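| Field | Description |
|---|---|
| url | The URL that was extracted. |
| success | true when content was successfully extracted. |
| statusCode | The HTTP status code returned by the target site. |
| errorMessage | Populated when success is false. |
| html | Cleaned, readable HTML of the main content. Boilerplate (nav, footer, ads) is removed. |
| markdown | The same content rendered as Markdown. |
| metadata | Page-level metadata. |
| metadata.title | The page's <title> tag. |
| metadata.description | Meta description. |
| metadata.keywords | Meta keywords. |
| metadata.author | Page author. |
| metadata.language | Language code, e.g. en. |
| metadata.favicon | Favicon URL. |
| metadata.ogImage | Open Graph image URL. |
| metadata.ogImageWidth | Open Graph image width. |
| metadata.ogImageHeight | Open Graph image height. |
| metadata.ogTitle | Open Graph title. |
| metadata.ogType | Open Graph type, e.g. article. |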
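A minimal sketch of reading these fields from a parsed response body: check success first, and fall back to errorMessage when extraction failed. The function name extract_markdown is hypothetical. The fallback to a top-level message key is an assumption based on the sample error body shown on this page.

```python
def extract_markdown(body: dict) -> str:
    """Return the Markdown from a parsed /v1/content response body, or raise."""
    if body.get("success") is not True:
        # errorMessage is populated when success is false. The sample error body
        # on this page uses a top-level "message" instead, so check both.
        reason = body.get("errorMessage") or body.get("message") or "unknown error"
        raise ValueError(f"extraction failed: {reason}")
    return body["markdown"]
```

Raising on failure keeps empty or partial content from flowing silently into a downstream index or prompt.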
Errors
| Status | When |
|---|---|
| 401 | Missing or invalid token |
| 402 | No plan, or insufficient credits |
| 422 | url missing, malformed, too long, raw IP, or non-default port |
| 429 | Rate limit or monthly quota exceeded |
| 503 | Content could not be extracted |
See Errors for the full reference.
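One way to act on these status codes is to retry only the ones that may be transient. The sketch below treats 429 and 503 as retryable, which is an assumption: this page does not say which errors are transient, and a 429 caused by an exhausted monthly quota will not clear on retry. The names should_retry and backoff_seconds are hypothetical.

```python
RETRYABLE_STATUSES = {429, 503}  # assumption: these may succeed on a later attempt


def should_retry(status: int, attempt: int, max_attempts: int = 3) -> bool:
    """Decide whether to retry after a failed attempt (attempt is 0-based)."""
    # 401, 402, and 422 indicate a problem with the token, plan, or input,
    # so repeating the same request would not help.
    return status in RETRYABLE_STATUSES and attempt + 1 < max_attempts


def backoff_seconds(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Exponential backoff delay for a 0-based attempt, capped at `cap` seconds."""
    return min(cap, base * (2 ** attempt))
```

A caller would sleep for backoff_seconds(attempt) between attempts for as long as should_retry returns True.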
Tips
markdown is convenient for feeding into LLMs and search indexes. Use the metadata block to populate previews, cards, and search results in your UI without an extra parse step.
For batch jobs, /v1/content/bulk is much more efficient than firing many parallel requests here.
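As an illustration of the metadata tip, the sketch below builds a small preview card from the metadata block. It assumes any metadata field may be missing or null, which this page does not specify. The function name build_preview and the shape of the returned card are hypothetical.

```python
def build_preview(metadata: dict) -> dict:
    """Build a link-preview card from a /v1/content metadata block."""
    return {
        # Prefer the Open Graph title, then fall back to the <title> tag.
        "title": metadata.get("ogTitle") or metadata.get("title") or "",
        "description": metadata.get("description") or "",
        # Use the favicon when no Open Graph image is available.
        "image": metadata.get("ogImage") or metadata.get("favicon"),
    }
```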