puppeteer-proxy

Proxies Puppeteer Page requests.

Usage no npm install needed!

<script type="module">
  import puppeteerProxy from 'https://cdn.skypack.dev/puppeteer-proxy';
</script>

README

puppeteer-proxy 🎎

Travis build status Coveralls NPM version Canonical Code Style Twitter Follow

Proxies Puppeteer Page requests.

  • Allows to change proxy per Page and per Request.
  • Allows to authenticate using http://username:password@proxy schema.
  • Handles cookies.
  • Handles binary files.
  • Supports custom HTTP(S) agents.

Motivation

This package addresses several issues with Puppeteer:

  • It allows to set a proxy per Page and per Request (#678)
  • It allows to authenticate against proxy when making HTTPS requests (#3253)

The side-benefit of this implementation is that it allows to route all traffic through Node.js, i.e. you can use externally hosted Chrome instance (such as Browserless.io) to render DOM & evaluate JavaScript, and route all HTTP traffic through your Node.js instance.

The downside of this implementation is that it will introduce additional latency, i.e. requests will take longer to execute as request/ response will need to be always exchanged between Puppeteer and Node.js.

Implementation

puppeteer-proxy intercepts requests after it receives the request metadata from Puppeteer. puppeteer-proxy uses Node.js to make the HTTP requests. The response is then returned to the browser. When using puppeteer-proxy, browser never makes outbound HTTP requests.

Setup

You must call page.setRequestInterception(true) before using pageProxy.proxyRequest.

API

import {
  Agent as HttpAgent,
} from 'http';
import {
  Agent as HttpsAgent,
} from 'https';
import type {
  Page,
  Request,
} from 'puppeteer';
import {
  proxyRequest,
} from 'puppeteer-proxy';

/**
 * @property agent HTTP(s) agent to use when making the request.
 * @property page Instance of Puppeteer Page.
 * @property proxyUrl HTTP proxy URL. A different proxy can be set for each request.
 * @property request Instance of Puppeteer Request.
 */
type ProxyRequestConfigurationType = {|
  +agent?: HttpAgent | HttpsAgent,
  +page: Page,
  +proxyUrl?: string | { http: string, https: string },
  +request: Request,
|};

proxyRequest(configuration: ProxyRequestConfigurationType): PageProxyType;

Usage

Making a GET request using proxy

import puppeteer from 'puppeteer';
import {
  proxyRequest,
} from 'puppeteer-proxy';

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();

  await page.setRequestInterception(true);

  page.on('request', async (request) => {
    await proxyRequest({
      page,
      proxyUrl: 'http://127.0.0.1:3000',
      request,
    });
  });

  await page.goto('http://gajus.com');
})();