deepcrawl.com
robots.txt
Robots Exclusion Standard data for deepcrawl.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | deepcrawl.com |
Base Domain | deepcrawl.com |
Scan Status | Ok |
Last Scan | 2025-09-24T10:28:34+00:00 |
Next Scan | 2025-10-24T10:28:34+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-24T10:28:34+00:00 |
URL | https://deepcrawl.com/robots.txt |
Redirect | https://www.lumar.io/robots.txt |
Redirect Domain | www.lumar.io |
Redirect Base | lumar.io |
Domain IPs | 23.185.0.4 |
Redirect IPs | 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4 |
Response IP | 23.185.0.4 |
Found | Yes |
Hash | 77092f910a6169d26cd26d3cb53059cc6148813e9580535afda06bc3afeb1e84 |
SimHash | d9499c58c392 |
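
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. As a minimal sketch under that assumption, a content fingerprint for change detection between scans could be computed as follows. The function name `robots_fingerprint` and the sample body are hypothetical and are not taken from the scan.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used because the reported Hash is 64 hex
    characters long; the report itself does not state the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies from two scans; any byte-level change alters the digest.
previous = robots_fingerprint(b"User-agent: *\nDisallow: /wp-admin/\n")
current = robots_fingerprint(b"User-agent: *\nDisallow: /wp-admin/\nDisallow: /search/*\n")
print("changed" if previous != current else "unchanged")
```

An exact digest like this only signals that something changed. The separate SimHash field suggests the scanner also keeps a similarity hash, which this sketch does not attempt to reproduce.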
Groups
*
Rule | Path |
---|---|
Disallow | /wp-json/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /trackback/ |
Disallow | /wp-login.php |
Disallow | /wp-register.php |
Disallow | |
Disallow | */attachment/ |
Disallow | */feed/ |
Allow | /wp-includes/js/ |
Allow | /wp-includes/images/ |
Allow | /wp-includes/css/ |
Disallow | /?s=* |
Disallow | /search/* |
Disallow | /collection/ |
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wp-import-export-lite/ |
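
The Allow entries under /wp-includes/ only override the broader Disallow when a crawler applies longest-match precedence, as RFC 9309 describes. The following is a minimal sketch of that evaluation, not the scanner's or any particular crawler's implementation. It assumes the two `*` groups are merged, that `*` matches any run of characters, that a trailing `$` anchors the end of the path, and that the empty Disallow matches nothing. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules copied from the two "*" groups above, merged into one list
# (assumption: groups naming the same user agent are combined).
RULES = [
    ("disallow", "/wp-json/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/trackback/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", ""),
    ("disallow", "*/attachment/"),
    ("disallow", "*/feed/"),
    ("allow", "/wp-includes/js/"),
    ("allow", "/wp-includes/images/"),
    ("allow", "/wp-includes/css/"),
    ("disallow", "/?s=*"),
    ("disallow", "/search/*"),
    ("disallow", "/collection/"),
    ("disallow", "/wp-content/uploads/wp-import-export-lite/"),
]


def pattern_to_regex(path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a prefix-matching regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; on a length tie, Allow wins."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, path in rules:
        if not path:
            continue  # an empty rule matches nothing
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len, best_allow = len(path), allow
    return best_allow


for candidate in ("/wp-includes/js/jquery.js", "/wp-includes/version.php",
                  "/blog/feed/", "/?s=crawl", "/blog/"):
    print(candidate, is_allowed(candidate))
```

Under these assumptions, `/wp-includes/js/jquery.js` is allowed because the 16-character Allow path is longer than the 13-character Disallow path, while `/wp-includes/version.php` stays blocked. A crawler that evaluates rules in file order instead would reach the Disallow first and block both.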
Other Records
Field | Value |
---|---|
sitemap | https://www.lumar.io/sitemap_index.xml |