What is Parsing?

Bright Data’s SERP API is a comprehensive solution that not only provides powerful scraping capabilities for various search engines but also includes advanced parsing functionality specifically for Google & Bing.

In SERP API, parsing is the process of transforming raw HTML into structured data fields and values.

When parsing is enabled, the SERP HTML is structured into usable fields and values (such as rank, link, title, description, rating, and dozens more), enabling you to monitor competitor SERP rankings, analyze keyword trends, and gather valuable market insights.

Send a Basic Request

The following is the simplest parsed GET request with SERP API:

curl "https://www.google.com/search?q=pizza&brd_json=1" \
   --proxy brd.superproxy.io:22225 \
   --proxy-user brd-customer-<CUSTOMER_ID>-zone-<ZONE_NAME>:<ZONE_PASSWORD>

The request above is synchronous: the response is returned in real time. To send an asynchronous parsed request, see here.


Basic Request - Breakdown

brd.superproxy.io: Address of our load balancer, which finds the fastest Super Proxy for your request.
22225: Infrastructure port of our Super Proxies, used to receive your requests.
--proxy-user brd-customer-<CUSTOMER_ID>-zone-<ZONE_NAME>: Username authentication. In its most basic form, it defines your username and the zone used for your request.
<ZONE_PASSWORD>: Zone password. All zones have passwords that are used for authentication.
brd_json=1: Returns parsed JSON instead of raw HTML.
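
The components in this breakdown can also be assembled in code. The following is a minimal Python sketch, not an official client: the helper names build_proxy_url and build_search_url are hypothetical, and the "http://" proxy scheme is an assumption based on how curl treats a bare host:port proxy.

```python
from urllib.parse import quote, urlencode

PROXY_HOST = "brd.superproxy.io"  # load balancer address from the breakdown
PROXY_PORT = 22225                # Super Proxy port from the breakdown


def build_proxy_url(customer_id: str, zone: str, password: str) -> str:
    """Combine the username parts and the zone password into one proxy URL.

    Hypothetical helper; the username layout follows the curl example.
    """
    username = f"brd-customer-{customer_id}-zone-{zone}"
    # Percent-encode credentials so special characters survive inside a URL.
    user = quote(username, safe="")
    secret = quote(password, safe="")
    return f"http://{user}:{secret}@{PROXY_HOST}:{PROXY_PORT}"


def build_search_url(query: str, parsed: bool = True) -> str:
    """Build a Google search URL; brd_json=1 requests parsed JSON."""
    params = {"q": query}
    if parsed:
        params["brd_json"] = "1"
    return "https://www.google.com/search?" + urlencode(params)
```

Calling build_search_url("pizza") produces the same target URL as the curl example, with brd_json=1 appended to the query string.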

By default, a SERP API response without the brd_json=1 parameter returns the raw, unparsed HTML of the targeted SERP. To receive a parsed JSON response, add the following parameter to the end of your search query:

# Returns a single parsed JSON (instead of raw HTML)

curl --proxy brd.superproxy.io:22225 \
  --proxy-user brd-customer-<CUSTOMER_ID>-zone-<ZONE_NAME>:<ZONE_PASSWORD> \
  -k "https://www.google.com/search?q=pizza&brd_json=1"

Parsing is supported for both the Google and Bing search engines.
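
For readers calling the API from a script rather than curl, the request can be sketched with Python's standard library. This is an illustrative sketch under assumptions: make_opener and fetch_parsed are hypothetical helper names, the proxy URL is expected in the form http://USERNAME:PASSWORD@brd.superproxy.io:22225, and verify_tls=False is only there to mirror curl's -k flag.

```python
import json
import ssl
import urllib.request


def make_opener(proxy_url: str, verify_tls: bool = False):
    """Build an opener that routes HTTP and HTTPS traffic through the proxy.

    verify_tls=False mirrors curl's -k flag (certificate checks disabled).
    """
    context = ssl.create_default_context()
    if not verify_tls:
        # check_hostname must be turned off before verify_mode is relaxed.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    return urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url}),
        urllib.request.HTTPSHandler(context=context),
    )


def fetch_parsed(search_url: str, opener, timeout: float = 30.0) -> dict:
    """Send the search request and decode the parsed JSON body."""
    if "brd_json=1" not in search_url:
        # Without the parameter the response would be raw HTML, not JSON.
        raise ValueError("search_url must include brd_json=1")
    with opener.open(search_url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Passing the opener in as an argument keeps fetch_parsed easy to exercise without network access, since any object with a compatible open method can stand in for the real one.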

Expected Parsed Output when using brd_json=1

The following is the exact JSON response received when sending the request above:

Next, we will examine a number of the important fields within the parsed JSON data to understand the type of structured data we have to offer.


A Comprehensive Guide to SERP API

At the top of the JSON response, you can find the general field, which contains details about the search you ran, including the results count from the response.

General-Field

The following fields can be found in the general field:

general.search_engine: The search engine used for the search.
general.query: The keywords used for the search.
general.results_cnt: The results count. Google doesn’t display a results count for mobile, so this field is supported only with desktop search results.
general.search_time: The response time to get the results page.
general.language: The language that was set for the search. Default: hl=en
general.location: The location targeted by the search (based on the localization and geo-location parameters).
general.mobile: The device the search was performed with (desktop or mobile).
general.basic_view: Deprecated.
general.search_type: The type of search set for the request.
general.page_title: The results page title.
general.code_version: The Bright Data parser version.
general.timestamp: The time when the search was executed.
input.original_url: The URL used for the search, including all parameters applied to the search.
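
As an illustration of how these fields might be read once the JSON is decoded, here is a small Python sketch. The function name summarize_general is hypothetical and the sample values are invented for the example, not real API output; missing keys are tolerated because results_cnt is not returned for Google mobile searches.

```python
def summarize_general(response: dict) -> dict:
    """Pick a few documented keys out of the 'general' field.

    Missing keys become None instead of raising, since some fields
    (such as results_cnt on mobile) are not always present.
    """
    general = response.get("general", {})
    return {
        "engine": general.get("search_engine"),
        "query": general.get("query"),
        "results_count": general.get("results_cnt"),
        "language": general.get("language"),
        "location": general.get("location"),
    }


# Invented sample shaped like the documented fields (not real output).
sample = {
    "general": {
        "search_engine": "google",
        "query": "pizza",
        "language": "en",
    }
}

summary = summarize_general(sample)
print(summary["engine"], summary["query"], summary["results_count"])
```

With the sample above, results_count and location come back as None because those keys are absent from the invented response.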

Starter fields to know

JSON field: Description
type: The field type (site_link, text, rating, etc.).
title: The header text, usually the link text.
description: The description shown under the link.
referral_link: The redirection link.
image: The image as a base64 string or an image URL.
image_alt: The image's alternative text.
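
The starter fields above can be pulled out of any result item with a few lines of Python. This is a sketch only: pick_starter_fields and image_kind are hypothetical helpers, and the rule used to tell a URL from base64 data (checking for an http:// or https:// prefix) is an assumption, since the exact form of the image value is not specified here.

```python
STARTER_FIELDS = ("type", "title", "description", "referral_link", "image", "image_alt")


def pick_starter_fields(item: dict) -> dict:
    """Keep only the starter fields that are present on a result item."""
    return {name: item[name] for name in STARTER_FIELDS if name in item}


def image_kind(image: str) -> str:
    """Guess whether an image value is a URL or base64 data.

    Assumption: anything that is not an http(s) URL is treated as base64.
    """
    if image.startswith(("http://", "https://")):
        return "url"
    return "base64"


# Invented item for illustration; real items carry many more fields.
item = {
    "type": "text",
    "title": "Example Pizza",
    "description": "An example description.",
    "image": "https://example.com/pizza.png",
    "rank": 1,
}

print(pick_starter_fields(item))
print(image_kind(item["image"]))
```

Filtering by name rather than indexing directly keeps the helper safe for items that omit optional fields such as image or referral_link.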

Explanation of other fields