scrapywar.com
robots.txt
Robots Exclusion Standard data for scrapywar.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | scrapywar.com |
Base Domain | scrapywar.com |
Scan Status | Ok |
Last Scan | 5/28/2025, 1:40:02 PM |
Next Scan | 6/4/2025, 1:40:02 PM |
Last Scan
Field | Value |
---|---|
Scanned | 5/28/2025, 1:40:02 PM |
URL | https://scrapywar.com/robots.txt |
Domain IPs | 104.21.58.154, 172.67.161.121, 2606:4700:3033::ac43:a179, 2606:4700:3037::6815:3a9a |
Response IP | 172.67.161.121 |
Found | Yes |
Hash | f94d3bb021e7bec7f7218554e6450de3a09588bdb9ebeae12e3379b5d96f5f1f |
SimHash | e2b858488c03 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /wp-admin/admin-ajax.php |
Allow | /wp-content/uploads/* |
Allow | /wp-content/*.js |
Allow | /wp-content/*.css |
Allow | /wp-includes/*.js |
Allow | /wp-includes/*.css |
Allow | /feed/%24/ |
Disallow | /wp-admin/ |
Disallow | /cgi-bin/ |
Disallow | /wp-content/plugins/ |
Disallow | /wp-content/themes/ |
Disallow | /wp-includes/ |
Disallow | /*/attachment/ |
Disallow | /tag/*/page/ |
Disallow | /tag/*/feed/ |
Disallow | /page/ |
Disallow | /comments/ |
Disallow | /xmlrpc.php |
Disallow | /?attachment_id* |
Disallow | /?s=%2F |
Disallow | /search/ |
Disallow | /trackback/ |
Disallow | /*trackback* |
Disallow | /*/trackback/ |
Disallow | /feed/ |
Disallow | /comments/feed/ |
Disallow | /*/feed/%24/ |
Disallow | /*/feed/rss/%24/ |
Disallow | /*/trackback/%24/ |
Disallow | /*/*/feed/%24/ |
Disallow | /*/*/feed/rss/%24/ |
Disallow | /*/*/trackback/%24/ |
Disallow | /*/*/*/feed/%24/ |
Disallow | /*/*/*/feed/rss/%24/ |
Disallow | /*/*/*/trackback/%24/ |
Disallow | /*/*/data%3Atext/%24/ |
Disallow | /*/data%3Atext/%24/ |
Disallow | /data%3Atext/%24/ |
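Because several Allow and Disallow paths in this group overlap (for example `/wp-admin/` and `/wp-admin/admin-ajax.php`), it can help to see how a crawler might resolve them. The sketch below is a minimal illustration, assuming the longest-match precedence described in RFC 9309, with `*` as a wildcard and an Allow rule winning a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, only a subset of the rules listed above is included, and the percent-encoded `%24` entries are left out because it is not clear from the scan data how they appeared in the original file. Real crawlers may behave differently.

```python
import re

# Subset of the rules from the "*" group above, copied as (rule, path) pairs.
RULES = [
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Allow", "/wp-content/uploads/*"),
    ("Allow", "/wp-includes/*.js"),
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/wp-content/plugins/"),
    ("Disallow", "/wp-includes/"),
    ("Disallow", "/search/"),
]


def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the URL path.
    """
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the given rules.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "Allow"):
                best_len = length
                allowed = kind == "Allow"
    return allowed


if __name__ == "__main__":
    for candidate in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                      "/wp-includes/js/jquery.js", "/search/foo", "/about/"):
        print(candidate, is_allowed(candidate))
```

Under these assumptions, the more specific Allow for `/wp-admin/admin-ajax.php` overrides the broader Disallow for `/wp-admin/`, while other paths under `/wp-admin/` stay blocked.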
Other Records
Field | Value |
---|---|
sitemap | https://scrapywar.com/sitemap_index.xml |