thistothat.com
robots.txt

Robots Exclusion Standard data for thistothat.com

Resource Scan

Scan Details

Site Domain thistothat.com
Base Domain thistothat.com
Scan Status Ok
Last Scan2026-01-24T04:57:19+00:00
Next Scan 2026-01-31T04:57:19+00:00

Last Scan

Scanned2026-01-24T04:57:19+00:00
URL https://thistothat.com/robots.txt
Domain IPs 69.27.100.2
Response IP 69.27.100.2
Found Yes
Hash 80e423e6af6cea1254e25a0a7d69bf7ba7fa6429e9b66e23308f8c0ab37dba21
SimHash 3f552282de90

Groups

*

Rule Path
Disallow /glue/avail.shtml
Disallow /glue/cost.shtml
Disallow /glue/footer_fr.shtml
Disallow /glue/footer.shtml
Disallow /glue/link.shtml
Disallow /glue/related.shtml
Disallow /glue/time.shtml
Disallow /glue/top1.shtml
Disallow /glue/toxic.shtml
Disallow /js

gptbot

Rule Path
Disallow /

Comments

  • Alphabetical order
  • Google search console checker somehow found that subpage and said bad things about it
  • Don't block /glue/ because that will block all subpages