Browser API 常见问题解答

如何配置 Browser API 以在特定国家/地区运行？

使用 Browser API 时，可使用与其他 Bright Data 代理产品相同的国家定位参数。特定国家/地区设置脚本时，在 Bright Data 端点的 “USER” 凭据后添加 “-country” 标志，然后添加该国家/地区的 2 个字母的 ISO 代码。例如，对于在美国使用 Puppeteer 的 Browser API：

const SBR_WS_ENDPOINT = `wss://${USERNAME-country-us:PASSWORD}@brd.superproxy.io:9222`;

欧盟地区欧盟地区 You can target the entire European Union region in the same manner as “Country” above by adding “eu” after “country” in your request: “-country-eu”. 请求 sent using -country-eu, will use IPs from one of the countries below which are included automatically within “eu”: AL, AZ, KG, BA, UZ, BI, XK, SM, DE, AT, CH, UK, GB,IE, IM, FR, ES, NL, IT, PT, BE, AD, MT, MC, MA, LU, TN, DZ, GI, LI, SE, DK, FI, NO, AX, IS, GG, JE, EU, GL, VA, FX, FO.

Which coding languages, libraries, and webdrivers are supported by Browser API?

Bright Data 的 Browser API 支持多种编程语言和库。目前，我们使用 puppeteer、playwright 和 selenium 为 Node.js 和 Python 提供全面的本地支持，还可以使用下面的其他库集成其他语言，让您可以灵活地将 Browser API 直接集成到您当前的技术栈中。

语言/平台	puppeteer	playwright	selenium
Python	N/A	playwright-python	Selenium WebDriver
JS / Node	原生支持	原生支持	WebDriverJS
Ruby	Puppeteer-Ruby	playwright-ruby-client	适用于 Ruby 的 Selenium WebDriver
C#	.NET: Puppeteer Sharp	适用于 .NET 的 Playwright	适用于 .NET 的 Selenium WebDriver
Java	Puppeteer Java	适用于 Java 的 Playwright	原生支持
Go	chromedp	playwright-go	适用于 Go 的 Selenium WebDriver

如何调试 Browser API 会话的幕后情况？

在哪里可以找到 Browser API Debugger？

如何在本地自动启动开发工具以查看实时浏览器会话？

如果您想在每次会话中自动启动开发工具以查看实时浏览器会话，可以集成以下代码片段：

NodeJS - Puppeteer

// Node.js Puppeteer - launch devtools locally  

const {
    exec
} = require('child_process');
const chromeExecutable = 'google-chrome';

const delay = ms => new Promise(resolve => setTimeout(resolve, ms));
const openDevtools = async (page, client) => {
    // get current frameId  
    const frameId = page.mainFrame()._id;
    // get URL for devtools from Browser API  
    const {
        url: inspectUrl
    } = await client.send('Page.inspect', {
        frameId
    });
    // open devtools URL in local chrome  
    exec(`"${chromeExecutable}" "${inspectUrl}"`, error => {
        if (error)
            throw new Error('Unable to open devtools: ' + error);
    });
    // wait for devtools ui to load  
    await delay(5000);
};

const page = await browser.newPage();
const client = await page.target().createCDPSession();
await openDevtools(page, client);
await page.goto('http://example.com');

Debugger 演示下面就来看看 Browser API Debugger 的运行情况<inser-video-here>

如何直观地了解浏览器中发生的情况？

只要在代码中添加以下内容，就能随时轻松触发浏览器截图：

NodeJS

// node.js puppeteer - Taking screenshot to file screenshot.png 
await page.screenshot({ path: 'screenshot.png', fullPage: true });

要截取 Python 和 C# 的屏幕截图，请参阅此处: https://docs.brightdata.com/cn/scraping-automation/scraping-browser/code-examples

为什么某些页面的初始导航时间比其他页面长？

解锁目标网站需要大量的“幕后”工作。有些网站只需要几秒钟就能完成导航，而有些网站可能需要长达一两分钟才能完成导航，因为它们需要更复杂的解锁程序。因此，我们建议将导航超时时间设置为 “2 分钟”，以便在需要时有足够时间完成导航。在脚本中的 page.goto 调用前添加以下一行，即可将导航超时设置为 2 分钟。

// node.js puppeteer - Navigate to site with 2 min timeout  
page.goto('<https://example.com>', { timeout: 2*60*1000 });

最常见的错误代码有哪些？


错误代码	含义	您能做些什么？
服务器意外响应：407	远程浏览器的端口有问题	请检查您的远程浏览器的端口。Browser API 的正确端口是端口:9222
服务器意外响应：403	身份验证错误	从 Bright Data 控制面板检查身份验证凭据（用户名和密码），并检查是否使用了正确的 “Browser API” 区域。
服务器意外响应：503	服务不可用	我们现在可能正在扩展浏览器以满足需求。请尝试在 1 分钟后重新连接。

我似乎无法连接，是不是连接有问题？

您可以使用下面的 curl 检查您的连接：

Shell

curl -v -u USER:PASS https://brd.superproxy.io:9222/json/protocol

如有任何其他问题，请参阅我们的故障排除指南或联系技术支持: https://help.brightdata.com/hc/en-us/requests/new

如何将 Browser API 与 .NET Puppeteer Sharp 集成？

使用 C# 与 Browser API 产品集成需要修补 PuppeteerSharp 库，以添加对 websocket 身份验证的支持。具体方法如下：

C# PuppeteerSharp

using PuppeteerSharp;  
using System.Net.WebSockets;  
using System.Text;  
  
// Set the authentication credentials  
var auth = "USER:PASS";  
// Construct the WebSocket URL with authentication  
var ws = $"wss://{auth}@zproxy.lum-superproxy.io:9222";  
// Custom WebSocket factory function  
async Task<WebSocket> ws_factory(Uri url, IConnectionOptions options, CancellationToken cancellationToken)  
  
{  
    // Create a new ClientWebSocket instance
    var socket = new ClientWebSocket();  
    // Extract the user information (username and password) from the URL  
    var user_info = url.UserInfo;  
    if (user_info != "")  
    {  
        // Encode the user information in Base64 format  
        var auth = Convert.ToBase64String(Encoding.Default.GetBytes(user_info));  
        // Set the "Authorization" header of the WebSocket options with the encoded credentials  
        socket.Options.SetRequestHeader("Authorization", $"Basic {auth}");  
    }  
  
    // Disable the WebSocket keep-alive interval  
    socket.Options.KeepAliveInterval = TimeSpan.Zero;  
    // Connect to the WebSocket endpoint  
    await socket.ConnectAsync(url, cancellationToken);  
    return socket;  
}  
  
// Create ConnectOptions and configure the options  
var options = new ConnectOptions()  
  
{  
    // Set the BrowserWSEndpoint to the WebSocket URL  
    BrowserWSEndpoint = ws,  
    // Set the WebSocketFactory to the custom factory function  
    WebSocketFactory = ws_factory,  
};  
  
// Connect to the browser using PuppeteerSharp  
Console.WriteLine("Connecting to browser...");  
  
using (var browser = await Puppeteer.ConnectAsync(options))  
{  
    Console.WriteLine("Connected! Navigating...");  
    // Create a new page instance  
    var page = await browser.NewPageAsync();  
    // Navigate to the specified URL  
    await page.GoToAsync("https://example.com");  
    Console.WriteLine("Navigated! Scraping data...");  
    // Get the content of the page  
    var content = await page.GetContentAsync();  
    Console.WriteLine("Done!");  
    Console.WriteLine(content);  
}

Browser API 支持哪些编程语言？