Introduction

The proxy system is a component of the scraping browser that forwards connections through a proxy server. It is primarily used to prevent IP blocking, access geographically restricted content, and enhance anonymity. Scraping Browser provides a configurable proxy system that allows for precise control over automated traffic routing, including anonymity, geolocation targeting, and higher reliability, making it easy to integrate proxies into your workflow.

Core Capabilities

The core capability of the proxy system lies in hiding the real IP address through an intermediary server and automatically or on-demand rotating different addresses from an IP pool. It also allows selecting proxy servers in specific geographic locations, simulating user access from around the world.

Application Scenarios

Primarily applicable to scenarios requiring large-scale data scraping and circumventing target website IP restrictions or blocks. It is also a key tool for accessing geographically restricted content and maintaining online anonymity in tasks such as market research.

Classification

Scrapeless offers two proxy configuration options:

  • Using built-in proxies: Easily route traffic through our managed residential proxies.
  • Using custom proxies: Use custom SOCKS5/HTTPS proxies for more flexible control over network routing.

Operation Guide

Using Built-in Proxies

Using Scraping Browser’s built-in proxies routes traffic through its managed residential proxies.

const puppeteer = require('puppeteer-core');
const connectionURL = 'wss://browser.scrapeless.com/browser?token=APIKey&session_ttl=180&proxy_country=ANY';
 
(async () => {
    const browser = await puppeteer.connect({browserWSEndpoint: connectionURL});
    const page = await browser.newPage();
    await page.goto('https://www.scrapeless.com');
    console.log(await page.title());
    await browser.close();
})();

Proxy geographic location by country

Our built-in proxies support 195 countries. Please refer to the detailed Scrapeless proxy documentation Scrapeless proxy documentation.

Custom Proxies

If you want to use your own proxy service/provider, you can specify the following parameters when creating a session. If you use your own proxy provider, you will not be charged for proxy usage.

const puppeteer = require('puppeteer-core');
 
(async () => {
  try {
    const proxyUrl = 'http://gw-us.scrapeless.com:8789';
    const username = 'username';
    const password = 'password';
    const browser = await puppeteer.launch({ args: [`--proxy-server=${proxyUrl}`], headless: false });
    const page = await browser.newPage();
    await page.authenticate({ username, password });
    await page.goto('API_URL');
    await browser.close();
  } catch (e) {
    console.log(e)
  }
})();

You can also purchase Scrapeless proxy services separately.

Proxy Playground: https://app.scrapeless.com/scrapelessteam/products/proxy/channel-list

Proxy Documentation: https://docs.scrapeless.com/en/proxies/features/proxy/