Help Center

Topic: Integrations


Tracking pages protected with CAPTCHA

Help Center IntegrationsLast updated: 9 February, 2023

CAPTCHA, or Completely Automated Public Turing test to tell Computers and Humans Apart, is a security mechanism used by websites to prevent automated scraping and other malicious activities. CAPTCHA typically requires users to perform a simple task, such as identifying letters or solving a puzzle, to prove that they are human. While this is an effective way to prevent bots from accessing a website, it can also pose a challenge for legitimate users who want to track changes in some websites.

If you're having trouble tracking pages protected with CAPTCHA, PageCrawl.io can help. PageCrawl.io is a cloud-based web scraping platform that enables you to extract data from any website with ease. With its user-friendly interface and robust set of features, PageCrawl.io is a top choice for businesses and individuals looking to collect data from the web.

One of the key features of PageCrawl.io is its integration with 2captcha.com, a leading CAPTCHA solving service. This integration allows PageCrawl.io to bypass CAPTCHA blocks and continue scraping data from protected pages. Here's how it works:

  1. When PageCrawl.io encounters a CAPTCHA on a website, it sends the CAPTCHA to 2captcha.com for solving.
  2. 2captcha.com uses a combination of human and machine intelligence to quickly and accurately solve the CAPTCHA.
  3. Once the CAPTCHA is solved, 2captcha.com returns the solution to PageCrawl.io, which uses it to access the protected page and scrape the data.

With PageCrawl.io and 2captcha.com working together, you can easily bypass CAPTCHA blocks and collect the data you need without any hassle. Whether you're a business looking to collect market data or a web scraping enthusiast looking to build your own database, PageCrawl.io has you covered.

Enabling 2Captcha.com integration

First, create an account on 2captcha.com and obtain your API key. Then, in PageCrawl.io, navigate to the settings section and enter your 2captcha API key.

Please note that the integration with 2captcha.com is only available for Enterprise plan owners. This premium feature provides even more robust data collection capabilities and allows you to bypass CAPTCHA blocks with ease.

Once you have set up your 2captcha API key in PageCrawl.io, you're ready to start tracking pages protected with CAPTCHA. Simply add new page to track and configure it to your needs, and PageCrawl.io will handle the rest.


Get Started with PageCrawl.io Software

Track a New Page