gologin-webunlocker

v0.2.0

Published

12 days ago

Minimal TypeScript SDK for Gologin Web Unlocker scraping API

Downloads

365

0High
0Medium
0Low

gologin

gologin webunlocker sdk scraping

Gologin Web Unlocker SDK (TypeScript)

Minimal Node.js SDK for Gologin Web Unlocker scraping API.

The backend endpoint is:

GET https://parsing.webunlocker.gologin.com/v1/scrape?url={encoded_url}

Authentication is sent via header: apikey: <API_KEY>.

The backend response is raw HTML/text.

Install

npm install gologin-webunlocker

Install the CLI globally:

npm install -g gologin-webunlocker

Get API Key

To get a Web Unlocker API key, create an account and complete onboarding at:

https://gologin.com/web-unlocker

Then use the key in:

apikey request header
GOLOGIN_WEBUNLOCKER_API_KEY environment variable

CLI

After build/install, CLI command:

gologin-webunlocker <command> <url> [options]

Commands:

scrape (raw HTML/text from API)
text (derived from HTML in SDK)
markdown (derived from HTML in SDK)
json (derived metadata from HTML in SDK)

Options:

--api-key <key> or GOLOGIN_WEBUNLOCKER_API_KEY
--base-url <url>
--timeout-ms <number>
--max-retries <number>

Examples:

gologin-webunlocker scrape https://example.com --api-key wu_live_xxx
GOLOGIN_WEBUNLOCKER_API_KEY=wu_live_xxx gologin-webunlocker text https://example.com
GOLOGIN_WEBUNLOCKER_API_KEY=wu_live_xxx gologin-webunlocker json https://example.com

Quick Start

import { WebUnlocker } from "gologin-webunlocker";

const client = new WebUnlocker({
  apiKey: process.env.GOLOGIN_WEBUNLOCKER_API_KEY!
});

const result = await client.scrape("https://example.com");
console.log(result.status);
console.log(result.content.slice(0, 500));

Constructor Options

new WebUnlocker({
  apiKey: "wu_live_xxx",
  baseUrl: "https://parsing.webunlocker.gologin.com",
  timeoutMs: 15000,
  maxRetries: 2
});

apiKey: string required, sent as apikey header
baseUrl?: string defaults to https://parsing.webunlocker.gologin.com
timeoutMs?: number defaults to 15000
maxRetries?: number defaults to 2

Normalized `scrape()` Response

/v1/scrape returns raw HTML/text from the upstream page.
The SDK wraps it into a normalized object:

type ScrapeResult = {
  success: true;
  url: string;
  content: string;
  status?: number | null;
  contentType?: string | null;
  headers?: Record<string, string>;
};

scrape() throws typed errors for non-2xx responses.

Example:

const result = await client.scrape("https://example.com");
console.log(result.status);
console.log(result.contentType);
console.log(result.content.slice(0, 500));

`scrapeRaw()` Example

Use scrapeRaw() when you need direct access to native fetch Response:

const response = await client.scrapeRaw("https://example.com");
console.log(response.status);
const html = await response.text();

scrapeRaw() returns the raw Response object as-is (including non-2xx statuses).

`buildScrapeUrl()` Example

const requestUrl = client.buildScrapeUrl("https://example.com");
console.log(requestUrl);
// https://parsing.webunlocker.gologin.com/v1/scrape?url=https%3A%2F%2Fexample.com

SDK-Side Derived Methods

These methods are derived from the HTML returned by the API.
They do not require additional backend features.

`scrapeText()` (derived from HTML)

const result = await client.scrapeText("https://example.com");
console.log(result.text.slice(0, 500));

`scrapeMarkdown()` (derived from HTML)

const result = await client.scrapeMarkdown("https://example.com");
console.log(result.markdown.slice(0, 500));

`scrapeJSON()` (derived from HTML)

const result = await client.scrapeJSON("https://example.com");
console.log(result.data.title);
console.log(result.data.description);
console.log(result.data.links.slice(0, 5));

`batchScrape()` (client-side helper)

const results = await client.batchScrape(
  ["https://example.com", "https://gologin.com"],
  { concurrency: 2 }
);
console.log(results.map((r) => ({ url: r.url, status: r.status })));

Typed Errors

import {
  WebUnlocker,
  WebUnlockerError,
  AuthenticationError,
  RateLimitError,
  APIError,
  TimeoutError,
  NetworkError
} from "gologin-webunlocker";

try {
  const client = new WebUnlocker({ apiKey: "wu_live_xxx" });
  await client.scrape("https://example.com");
} catch (error) {
  if (error instanceof AuthenticationError) {
    console.error("Invalid API key");
  } else if (error instanceof RateLimitError) {
    console.error("Rate limited");
  } else if (error instanceof TimeoutError) {
    console.error("Request timed out");
  } else if (error instanceof NetworkError) {
    console.error("Network failure");
  } else if (error instanceof APIError) {
    console.error("Server/API error");
  } else if (error instanceof WebUnlockerError) {
    console.error("SDK error");
  } else {
    console.error("Unknown error", error);
  }
}

Error mapping:

401/403 -> AuthenticationError
429 -> RateLimitError
500+ -> APIError
abort/timeout -> TimeoutError
fetch/network issues -> NetworkError

Local Example

GOLOGIN_WEBUNLOCKER_API_KEY=wu_live_xxx npm run example

Development

git clone https://github.com/GologinLabs/gologin-webunlocker.git
cd gologin-webunlocker
npm install
npm run build

Release

npm run release:check
npm publish --access public

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme