Help Center

Topic: Troubleshooting


Common Problems and Solutions for Page Loading Issues

Help Center TroubleshootingLast updated: 27 September, 2023

There may be various reasons why a page fails to open. This guide describes the most common problems and suggests solutions to help you overcome these issues.

Timeout

A timeout occurs when the page takes too long to respond. This may be a temporary issue with the page, or the page may be loading very slowly. Timeout limits vary depending on your plan:

  • Free plan: 45 seconds
  • Standard plan: 90 seconds
  • Enterprise plan: 180 seconds

To avoid timeouts please consider subscribing to a paid plan or upgrading your plan.

Page blocked

Some pages may use site protection features to block scrapers and website tracking tools like PageCrawl.io. Different pages may use different blocking mechanisms, but here are the most common ones:

  • Access Restricted to Specific Countries Page may be configured to only allow visitors from a specific country.

    • Solution: Specify a proxy location from a country that is not blocked. If you cannot find an available proxy, consider purchasing a proxy service for a specific country and configuring custom proxy in PageCrawl.io.
  • Proxy Location blocked The website may block the IP address of the proxy server PageCrawl.io is using.

401 or 403 Error

Most often indicates that PageCrawl.io Bot was not allowed to access the website. Use "Residential proxy pool" to avoid being blocked.

404 Page Not Found

In most cases this error indicates that page is no longer available to view. You should check and update the page URL.

Page Unreachable

The page can't be opened. In most cases website is down or the website in only reachable from a specific country

Site Protected with CAPTCHA

Pages may use CAPTCHA to protect the website from bots. To bypass this, you can use a service like 2Captcha which will use human workers to solve the captcha for you. PageCrawl.io has an integration with 2Captcha (you must be subscribed to Enterprise plan) you can sign up for and configure the API token generated from 2Captcha.

Unknown Error

In some cases there could be an unexpected error that causes pagecrawl.io bot to fail to check the page for changes. In case this error does not go away after a while, please contact support to notify us about the problem so we could prioritize the issue.


Get Started with PageCrawl.io Software

Track a New Page