Robots Exclusion Standard data for tibbatts.com

Resource Scan

Scan Details

Site Domain tibbatts.com
Base Domain tibbatts.com
Scan Status Ok
Last Scan 2025-11-03T03:30:27+00:00
Next Scan 2025-12-03T03:30:27+00:00

Last Scan

Scanned 2025-11-03T03:30:27+00:00
URL https://tibbatts.com/robots.txt
Domain IPs 217.199.187.194
Response IP 217.199.187.194
Found Yes
Hash 69851bdd0c07d92e958aea1e10b24066a9c56d37dabd6022729a8e34ccfd21ee
SimHash 6c208a12e78b

Groups

googlebot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

bingbot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

duckduckbot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

slurp

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

openai

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

specificfeeds

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

crawler

Rule Path
Disallow /

spider

Rule Path
Disallow /

bot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow /?s=
Disallow /*?replytocom
Disallow /*.php$
Allow /wp-admin/admin-ajax.php
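
The `*` group mixes plain prefixes, `*` wildcards, a `$` end anchor, and one Allow exception, so the outcome for a given URL depends on rule precedence. The sketch below is one way a crawler might evaluate these rules, assuming the longest-match convention from RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and do not come from the scan; real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group above, as (kind, path pattern) pairs.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/trackback/"),
    ("disallow", "/feed/"),
    ("disallow", "/comments/"),
    ("disallow", "/?s="),
    ("disallow", "/*?replytocom"),
    ("disallow", "/*.php$"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is treated as a literal prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    Assumption: longest matching pattern wins; on equal length, allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow
```

Under these assumptions, `/wp-admin/admin-ajax.php` is fetchable because the Allow pattern is longer than both `/wp-admin/` and `/*.php$`, while `/wp-admin/options.php` and `/index.php` are blocked.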

Other Records

Field Value
crawl-delay 10
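
`crawl-delay` is not part of RFC 9309, and support varies by crawler (Googlebot, for one, ignores it). For a client that chooses to honour it, the sketch below shows one way to space requests at least 10 seconds apart. The class name `CrawlDelay` and its injectable `clock` and `sleep` parameters are assumptions made for illustration and testability, not anything defined by the scan.

```python
import time


class CrawlDelay:
    """Enforce a minimum gap between successive requests to one host."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self._delay = delay_seconds
        self._clock = clock      # injectable so the logic can be tested
        self._sleep = sleep
        self._last = None        # time of the previous request, if any

    def wait(self):
        """Block until at least `delay_seconds` have passed since the last call."""
        if self._last is not None:
            remaining = self._delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
        self._last = self._clock()


# Usage: call limiter.wait() immediately before each request to the host.
limiter = CrawlDelay(10)
```

The first call returns immediately; later calls sleep only for whatever part of the 10 seconds has not already elapsed.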

Comments

  • Allow major search engines
  • Disallow everything for bad / irrelevant bots
  • Generic catch-all block list for scrapers and email harvesters
  • Default rules for all other crawlers
  • Crawl-delay to stop hammering
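
The comments above describe three tiers: named search engines with light restrictions, named crawlers that are blocked outright, and a default group for everyone else. A minimal sketch of how a crawler might work out which tier applies to it follows, assuming exact, case-insensitive matching on the product token with a fallback to `*`. The function `tier_for` and the tier labels are hypothetical. Parsers that match user-agent values by substring could resolve the generic `bot`, `spider`, and `crawler` groups differently.

```python
# Group names as listed in the scan, split by the policy they carry.
SEARCH_ENGINES = {"googlebot", "bingbot", "duckduckbot", "slurp"}
BLOCKED = {
    "ahrefsbot", "semrushbot", "mj12bot", "openai", "gptbot", "claudebot",
    "perplexitybot", "feedburner", "specificfeeds", "archive.org_bot",
    "baiduspider", "yandex", "scrapy", "crawler", "spider", "bot",
}


def tier_for(product_token):
    """Return the policy tier for a crawler's product token.

    Assumption: exact, case-insensitive token match; anything unlisted
    falls through to the '*' group.
    """
    token = product_token.lower()
    if token in SEARCH_ENGINES:
        return "search-engine"   # only /wp-admin/ restricted
    if token in BLOCKED:
        return "blocked"         # Disallow /
    return "default"             # rules from the '*' group
```

For example, `tier_for("Googlebot")` gives `"search-engine"`, `tier_for("GPTBot")` gives `"blocked"`, and an unlisted token such as `"ExampleFetcher"` gives `"default"`.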