Skip to main content

Exclude Broken Link Checking by URL

Prevent the Broken Link Checker from Checking some Links

Updated this week

Are you getting erroneous Broken Link messages?

Is your site coming back with Broken Link URLs that are indeed working?

Our broken link checker is blocked by a variety of different sites, for example:

  • social media sites

  • .gov sites

  • some .org sites

Remove a URL from Broken Link Checking for the Whole Account

Account administrators can configure the Broken Link Checker to ignore links from a particular URL for every site in the account. This is particularly handy for URLs such as facebook.com.

To ignore a link, follow these steps:

  1. Select Settings in the top-right navigation menu.

  2. Select Links under Account Checks Settings in the left-hand menu.

  3. Under the Ignored Links tab you will find tabs for Text Links or Regular Expression.

    1. Use Ignored Links to add a particular URL, or a domain name. For example: https://www.drupal.org/project/dubbot, or https://www.drupal.org/project/body_node_id_class.
      We know both of these links work, but the drupal.org website does not allow our crawler access to check the links.

    2. Use the Regular Expression tab to enter URLs using regex to skip URLs that may have different subdomains, like facebook.com or linkedin.com. To exclude any links that contain https://xxx.facebook.com/xxx (with xxx being anything) you would need to enter the following regular expression:
      https?:\/\/(www\.)?facebook\.com\/(.*)

Commonly Used Broken Link Exclusions

  • https?:\/\/(www\.)?twitter\.com\/(.*)

  • https?:\/\/(www\.)?tiktok\.com\/(.*)

  • https?:\/\/(www\.)?X\.com\/(.*)

  • https?:\/\/(www\.)?facebook\.com\/(.*)

More Broken Link Exclusions Articles

If you have questions, please contact our DubBot Support team via email at help@dubbot.com or via the blue chat bubble in the lower right corner of your screen. We are here to help!

Did this answer your question?