How it works
You send one or more Reddit URLs or keywords to the Bright Data Reddit Scraper API. Bright Data handles the scraping infrastructure and returns clean, structured JSON.dataset_id to specify the data type (posts or comments) and return results in JSON, NDJSON or CSV.
Supported data types
Posts
Titles, descriptions, upvotes, community stats and attached media. Collect by post URL or discover by keyword or subreddit URL.
Comments
Comment text, replies, upvotes and post context. Collect by post or comment URL, with optional day-based filtering.
Endpoints
The Reddit Scraper API exposes two dataset families. Each family supports one or more input shapes.| Dataset | Input | What it returns |
|---|---|---|
Posts — Collect by URL (gd_lvz8ah06191smkebj4) | url (post URL) | A single post with metadata, community info and media |
Posts — Discover by keyword (gd_lvz8ah06191smkebj4) | keyword, date, num_of_posts | Posts matching a search term, filtered by date |
Posts — Discover by subreddit URL (gd_lvz8ah06191smkebj4) | url (subreddit URL), sort_by | Posts from one subreddit, sorted by new, top or hot |
Comments — Collect by URL (gd_lvzdpsdlw09j6t702) | url (post or comment URL), days_back | Comments from a post with full thread context |
Request methods
The Bright Data Reddit Scraper API supports two request methods. Choose based on your volume and latency needs.
Learn more in Understanding sync vs. async requests.
Capabilities and limits
| Capability | Detail |
|---|---|
| Output formats | JSON, NDJSON, CSV |
| Max URLs per sync request | 20 |
| Max URLs per async request | 5,000 |
| Data freshness | Real-time (scraped on demand) |
| Delivery options | API download, webhook, Amazon S3, Snowflake, Azure, GCS (all options) |
| Pricing | Pay per successful record (see pricing) |
Common questions
Is the data scraped in real time?
Is the data scraped in real time?
Yes. Each request triggers a live scrape against Reddit. There is no cached or stale data. Response times vary by endpoint: Collect-by-URL requests typically return in 10 to 30 seconds, while discovery requests may take longer depending on result volume.
What is the difference between Collect and Discover?
What is the difference between Collect and Discover?
Collect by URL scrapes a specific Reddit page you provide — a single post, a comment thread or a subreddit. Discover finds Reddit posts matching search criteria such as a keyword or a subreddit listing, then scrapes the results. Discovery is most useful via async requests when you don’t know the exact URLs up front.
Can I collect only recent comments from a post?
Can I collect only recent comments from a post?
Yes. The Comments API accepts an optional
days_back parameter. Pass an integer to limit results to comments posted within that many days.How is this different from scraping using proxies or Web Unlocker?
How is this different from scraping using proxies or Web Unlocker?
When scraping using proxies or Web Unlocker, you still need to write and maintain your own parsing logic and update it whenever Reddit changes its page structure. The Reddit Scraper API handles the entire stack: proxy rotation, anti-bot bypassing and parsing. You simply send a Reddit URL or keyword and get clean, structured JSON back with no scraping infrastructure or parser maintenance required on your end.
Next steps
Quickstart
Scrape your first Reddit post in 5 minutes.
Send your first request
Full code examples for every endpoint in cURL, Python and Node.js.
API reference
Endpoint specs, parameters and response schemas.