1639039923.rsc.cdn77.org
robots.txt

Robots Exclusion Standard data for 1639039923.rsc.cdn77.org

Resource Scan

Scan Details

Site Domain 1639039923.rsc.cdn77.org
Base Domain 1639039923.rsc.cdn77.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-14T20:03:11+00:00
Next Scan 2024-12-13T20:03:11+00:00

Last Successful Scan

Scanned2023-02-18T21:43:42+00:00
URL https://1639039923.rsc.cdn77.org/robots.txt
Domain IPs 143.244.33.161, 143.244.33.172, 143.244.33.174, 143.244.33.177, 2a02:6ea0:d100::12, 2a02:6ea0:d100::13, 2a02:6ea0:d100::14, 2a02:6ea0:d100::15, 2a02:6ea0:d100::16, 2a02:6ea0:d100::17, 2a02:6ea0:d100::20, 2a02:6ea0:d10c::1, 89.187.162.133, 89.187.162.136, 89.187.162.143, 89.187.163.85
Response IP 143.244.33.172
Found Yes
Hash c258a2c036bd6c350ee166335c544396db604713911137a5a5e7ba07d9e324d6
SimHash 92840e8d6371

Groups

mediapartners-google

Rule Path
Allow /

grapeshot

Rule Path
Allow /

*

Rule Path
Disallow /print/
Disallow /p/
Disallow /email/
Disallow /stats/
Disallow /ads/
Disallow /googads/
Disallow /amznads/
Disallow /404.html
Disallow /422.html
Disallow /500.html
Disallow /503.html
Disallow /javascripts/
Disallow /stylesheets/
Disallow /ssl/
Disallow /system/
Disallow /printfriendly.js

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /