wideinfo.org
robots.txt
Robots Exclusion Standard data for wideinfo.org
Resource Scan
Scan Details
Site Domain | wideinfo.org |
Base Domain | wideinfo.org |
Scan Status | Ok |
Last Scan | 2024-10-25T12:18:31+00:00 |
Next Scan | 2024-11-01T12:18:31+00:00 |
Last Scan
Scanned | 2024-10-25T12:18:31+00:00 |
URL | https://wideinfo.org/robots.txt |
Domain IPs | 172.66.40.254, 172.66.43.2, 2606:4700:3108::ac42:28fe, 2606:4700:3108::ac42:2b02 |
Response IP | 172.66.43.2 |
Found | Yes |
Hash | b303007a1b96e869c1f5b32b769c6f1b00ac0db5fcab88701ef51b3d67967303 |
SimHash | 691c4fcdc2b3 |
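The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch under that assumption. The function name `robots_fingerprint` and the sample body are hypothetical.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw response bytes. The scan report
    does not confirm which algorithm or input it actually uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by wideinfo.org.
sample_body = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = robots_fingerprint(sample_body)
print(len(digest), digest)
```

Comparing digests from two scans is a cheap way to tell whether the file changed byte for byte. The SimHash field presumably serves the complementary purpose of detecting near-duplicates.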
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /readme.html |
Disallow | /comments/feed/ |
Disallow | /trackback/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Disallow | /wp-content/plugins/ |
Disallow | /feed/ |
Disallow | */feed/ |
Disallow | */? |
Disallow | /? |
Allow | /wp-content/uploads/ |
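The rules in the `*` group include wildcard patterns (`*/feed/`, `*/?`), which Python's standard `urllib.robotparser` does not interpret. The sketch below evaluates the listed rules with a small hand-written matcher, assuming the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The report does not say how any particular crawler applies these rules, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html"),
    ("disallow", "/comments/feed/"),
    ("disallow", "/trackback/"),
    ("disallow", "/index.php"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/feed/"),
    ("disallow", "*/feed/"),
    ("disallow", "*/?"),
    ("disallow", "/?"),
    ("allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match semantics: the longest matching pattern
    decides, Allow wins a tie, and no match at all means allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path (prefix match).
        if pattern_to_regex(pattern).match(path):
            if len(pattern) > best_len or (
                len(pattern) == best_len and kind == "allow"
            ):
                best_len = len(pattern)
                allowed = kind == "allow"
    return allowed


for candidate in ["/about/", "/wp-admin/", "/blog/feed/",
                  "/?s=test", "/wp-content/uploads/a.jpg"]:
    print(candidate, is_allowed(candidate))
```

Under this convention, a path such as `/wp-content/uploads/feed/` is allowed because the Allow pattern is longer than `*/feed/`. A crawler that applies rules in file order instead would block it.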
Other Records
Field | Value |
---|---|
sitemap | http://feeds.feedburner.com/wideinfoorg |