ideascv.com
robots.txt

Robots Exclusion Standard data for ideascv.com

Resource Scan

Scan Details

Site Domain ideascv.com
Base Domain ideascv.com
Scan Status Ok
Last Scan2025-09-17T16:56:18+00:00
Next Scan 2025-09-24T16:56:18+00:00

Last Scan

Scanned2025-09-17T16:56:18+00:00
URL https://ideascv.com/robots.txt
Domain IPs 2001:8d8:100f:f000::241, 217.160.0.122
Response IP 217.160.0.122
Found Yes
Hash adf8bd2eb5f27c3213cbd92eb871d1c1642c83f45a115d99c85c0d35f0393831
SimHash c344586a63b0

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Allow /wp-includes/js/
Allow /wp-includes/images/
Disallow /trackback/
Disallow /wp-login.php
Disallow /wp-register.php