impoca.com
robots.txt
Robots Exclusion Standard data for impoca.com
Resource Scan
Scan Details
Site Domain | impoca.com |
Base Domain | impoca.com |
Scan Status | Ok |
Last Scan | 2024-11-12T11:43:10+00:00 |
Next Scan | 2024-11-19T11:43:10+00:00 |
Last Scan
Scanned | 2024-11-12T11:43:10+00:00 |
URL | https://impoca.com/robots.txt |
Domain IPs | 104.21.54.57, 172.67.168.24, 2606:4700:3030::ac43:a818, 2606:4700:3035::6815:3639 |
Response IP | 172.67.168.24 |
Found | Yes |
Hash | a38f1307b2f0c1c1c951a30f986e405d16592f6711ee1f489fff03e52199c5bd |
SimHash | 4f605b48fa93 |
Groups
*
Rule | Path |
---|---|
Disallow | /*preview%3Dtrue |
Disallow | /cgi-bin/ |
Disallow | /album/ |
Disallow | /blog/search/ |
Disallow | /blog/archive/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /wp-content/upgrade/ |
Disallow | /wp-login.php |
Disallow | /wp-register.php |
Disallow | /xmlrpc.php |
Disallow | /*/feed/ |
Disallow | /*?mobile=* |
Disallow | /*?ref=* |
Disallow | /*?s=* |
Other Records
Field | Value |
---|---|
sitemap | https://impoca.com/sitemap_index.xml |