Overview
We are thrilled to introduce a new feature in our Marketplace Dataset API, which is designed to enrich your data acquisition process. This update allows for a more granular and streamlined way to request and manage your data collections, facilitating more effective dataset generation according to your specific needs.
First Marketplace Collection
This refers to the initial data collection directly from our marketplace, conducted without any customizations or specific views set by you.
Requesting a Collection
The initial step to define what data you want to collect. This is used when specifying the parameters for a new dataset collection like defining the dataset ID and type or when discovering new data or specifying URL collections.
Checking Status of the Collection
This step is about querying information on an existing data collection request to understand its current state. It’s used when checking the status or details of a collection request, such as the total number of lines, the freshness of data, or other pertinent details.
Initiating a Collection
This step triggers the data collection, transitioning from the request phase to the actual data gathering phase. It’s used after defining the collection parameters and when ready to start the data collection process.
Subsequent Collection with Custom View
This is the follow-up data collection, performed after you’ve initiated your first collection and defined a custom view or specific requirements for your dataset.
Initiating an Existing Dataset
This endpoint allows users to re-initiate data collection for an existing dataset using its dataset ID, collection type, and view. It’s particularly useful for updating or refreshing a dataset based on the latest information or for recurring data collection tasks. The API call is similar to initiating a new collection but targets existing datasets.
Checking Status by Snapshot ID
After initiating a collection, users can check the progress or status of their data collection using the snapshot ID.
Downloading File by Snapshot ID
Once the data collection is complete, users can download the dataset using the snapshot ID. This is a GET request for downloading the actual data collected, enabling users to retrieve and utilize their tailored dataset.
This documentation provides a comprehensive guide to using the Marketplace API for initial data collection without a customer-defined view, and for managing collections after defining a view. For further information or assistance, users are encouraged to contact Bright Data support.
Was this page helpful?