marginweb.com
robots.txt

Robots Exclusion Standard data for marginweb.com

Resource Scan

Scan Details

Site Domain marginweb.com
Base Domain marginweb.com
Scan Status Ok
Last Scan2024-09-29T08:29:18+00:00
Next Scan 2024-10-06T08:29:18+00:00

Last Scan

Scanned2024-09-29T08:29:18+00:00
URL https://marginweb.com/robots.txt
Redirect https://www.marginweb.com/robots.txt
Redirect Domain www.marginweb.com
Redirect Base marginweb.com
Domain IPs 13.251.96.10, 2406:da18:b3d:e201::64, 2406:da18:b3d:e202::64, 52.74.166.77
Redirect IPs 18.139.194.139, 2406:da18:880:3802::c8, 2406:da18:b3d:e201::64, 52.74.166.77
Response IP 18.139.194.139
Found Yes
Hash a011b53ba9b980a3226ecbbb2ec54e8f80b8966ab32c67d1707b9278335c9223
SimHash e956dbd7cf7f

Groups

*

Rule Path
Disallow /404
Disallow /cgu
Allow /

yahoo pipes 1.0

Rule Path
Disallow /

urlespion

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.marginweb.com/sitemap.xml

Comments

  • beware, the sections below WILL NOT INHERIT from the above!
  • http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=40360
  • disallow adsense bot, as we no longer do adsense.
  • User-agent: Mediapartners-Google
  • Disallow: /
  • Yahoo bot is evil.
  • User-agent: Slurp
  • Disallow: /
  • Yahoo Pipes is for feeds not web pages.
  • This isn't really an image
  • User-agent: Googlebot-Image
  • Disallow: /*/ivc/*
  • Disallow: /users/flair/