deepcrawl.com
robots.txt

Robots Exclusion Standard data for deepcrawl.com

Resource Scan

Scan Details

Site Domain deepcrawl.com
Base Domain deepcrawl.com
Scan Status Ok
Last Scan2025-09-24T10:28:34+00:00
Next Scan 2025-10-24T10:28:34+00:00

Last Scan

Scanned2025-09-24T10:28:34+00:00
URL https://deepcrawl.com/robots.txt
Redirect https://www.lumar.io/robots.txt
Redirect Domain www.lumar.io
Redirect Base lumar.io
Domain IPs 23.185.0.4
Redirect IPs 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Response IP 23.185.0.4
Found Yes
Hash 77092f910a6169d26cd26d3cb53059cc6148813e9580535afda06bc3afeb1e84
SimHash d9499c58c392

Groups

*

Rule Path
Disallow /wp-json/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /trackback/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /*.pdf
Disallow */attachment/
Disallow */feed/
Allow /wp-includes/js/
Allow /wp-includes/images/
Allow /wp-includes/css/
Disallow /?s=*
Disallow /search/*
Disallow /collection/

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/

Other Records

Field Value
sitemap https://www.lumar.io/sitemap_index.xml

Comments

  • Search results
  • Other
  • WP Import Export Rule