wideinfo.org
robots.txt

Robots Exclusion Standard data for wideinfo.org

Resource Scan

Scan Details

Site Domain wideinfo.org
Base Domain wideinfo.org
Scan Status Ok
Last Scan 2024-10-25T12:18:31+00:00
Next Scan 2024-11-01T12:18:31+00:00

Last Scan

Scanned 2024-10-25T12:18:31+00:00
URL https://wideinfo.org/robots.txt
Domain IPs 172.66.40.254, 172.66.43.2, 2606:4700:3108::ac42:28fe, 2606:4700:3108::ac42:2b02
Response IP 172.66.43.2
Found Yes
Hash b303007a1b96e869c1f5b32b769c6f1b00ac0db5fcab88701ef51b3d67967303
SimHash 691c4fcdc2b3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /readme.html
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /wp-content/plugins/
Disallow /feed/
Disallow */feed/
Disallow */?
Disallow /?
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

googlebot

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /
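
The groups above can be checked against sample paths with a short script. What follows is a minimal sketch, not the scanner's own logic: it assumes the longest-match evaluation described in RFC 9309 (the matching rule with the longest pattern wins, ties go to Allow, no match means allowed) and the common "*" wildcard extension. The names STAR_GROUP, GOOGLEBOT_GROUP, pattern_to_regex and is_allowed are hypothetical helpers introduced for illustration, and the sample paths are invented.

```python
import re

# Rules transcribed from the "*" group listed above.
STAR_GROUP = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html"),
    ("disallow", "/comments/feed/"),
    ("disallow", "/trackback/"),
    ("disallow", "/index.php"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/feed/"),
    ("disallow", "*/feed/"),
    ("disallow", "*/?"),
    ("disallow", "/?"),
    ("allow", "/wp-content/uploads/"),
]

# Rules transcribed from the "googlebot" group listed above.
GOOGLEBOT_GROUP = [("allow", "/wp-content/uploads/")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(rules, path):
    """Assumed longest-match evaluation: longest matching pattern wins,
    ties go to allow, and a path that matches no rule is allowed."""
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for sample in ["/wp-admin/options.php", "/blog/feed/", "/?s=query",
                   "/wp-content/uploads/2024/photo.jpg", "/about/"]:
        print(sample, is_allowed(STAR_GROUP, sample))
```

Under these assumptions the "*" group blocks /wp-admin/, feed URLs at any depth, and any path with a query string directly after a slash, while /wp-content/uploads/ stays open. A crawler that matches a named group such as googlebot would follow only that group, which contains no Disallow rule, so is_allowed(GOOGLEBOT_GROUP, "/wp-admin/") returns True. Real crawlers may evaluate wildcards and precedence differently.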

Other Records

Field Value
sitemap http://feeds.feedburner.com/wideinfoorg