Robots Exclusion Standard data for puttingourheadstogether.com

Resource Scan

Scan Details

Site Domain puttingourheadstogether.com
Base Domain puttingourheadstogether.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-09T21:53:12+00:00
Next Scan 2025-12-08T21:53:12+00:00

Last Successful Scan

Scanned 2023-04-30T23:54:44+00:00
URL https://www.puttingourheadstogether.com/robots.txt
Domain IPs 104.21.45.203, 172.67.218.188, 2606:4700:3030::ac43:dabc, 2606:4700:3036::6815:2dcb
Response IP 172.67.218.188
Found Yes
Hash 8af4ab3880ea12909ce44ce1f3ec271e979cc1ab52ce4bf9381514191b69af07
SimHash 6328c230cbf1

Groups

*

Rule Path
Disallow /t/trackback
Disallow /t/comments
Disallow /t/stats
Disallow /t/app
Disallow /.m/

*

Rule Path
Disallow /*.html?cid=*
Disallow /*/comments/page/*
Disallow /*/comments/atom.xml
Disallow /*/comments/rss.xml
Disallow /*/comments/index.rdf

googlebot-mobile

Rule Path
Allow /.m/
Disallow /

y!j-srd

Rule Path
Allow /.m/
Disallow /

y!j-mbs

Rule Path
Allow /.m/
Disallow /

active cache request

Rule Path
Disallow *

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

gsa-crawler

Rule Path
Disallow /

twitterbot

Rule Path
Disallow
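
The groups above combine Allow and Disallow rules, `*` wildcards, and one Disallow with an empty path. As a rough illustration of how a crawler might evaluate them, the following Python sketch applies the longest-match convention described in RFC 9309: the matching rule with the longest path wins, an Allow wins a tie, and an empty path matches nothing. The function names `pattern_to_regex` and `is_allowed` are hypothetical, and real crawlers may differ in details, so treat this as a sketch under those assumptions rather than the behavior of any particular bot.

```python
import re

def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL path (the convention in RFC 9309).
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, url_path: str) -> bool:
    """Return True if url_path may be fetched under one group's rules.

    rules is a list of (kind, pattern) pairs, kind being "allow" or
    "disallow". The longest matching pattern wins; on a tie, allow wins.
    """
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(url_path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = (kind == "allow")
    return allowed

# Groups copied from the scan above.
googlebot_mobile = [("allow", "/.m/"), ("disallow", "/")]
wildcard_group = [
    ("disallow", "/*.html?cid=*"),
    ("disallow", "/*/comments/page/*"),
]
twitterbot = [("disallow", "")]
active_cache_request = [("disallow", "*")]
```

Under these assumptions, `is_allowed(googlebot_mobile, "/.m/index.html")` is true because `/.m/` is a longer match than `/`, while any path outside `/.m/` is blocked for that group. The twitterbot group blocks nothing, since its only Disallow has an empty path. The example paths used here are made up for illustration and were not part of the scan.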

Comments

  • block against duplicate content
  • block MSIE from abusing cache request