standard-democrat.com
robots.txt

Robots Exclusion Standard data for standard-democrat.com

Resource Scan

Scan Details

Site Domain standard-democrat.com
Base Domain standard-democrat.com
Scan Status Ok
Last Scan 2025-04-05T00:05:15+00:00
Next Scan 2025-04-12T00:05:15+00:00

Last Scan

Scanned 2025-04-05T00:05:15+00:00
URL https://standard-democrat.com/robots.txt
Redirect https://www.standard-democrat.com/robots.txt
Redirect Domain www.standard-democrat.com
Redirect Base standard-democrat.com
Domain IPs 76.76.21.21
Redirect IPs 3.165.102.123, 3.165.102.23, 3.165.102.43, 3.165.102.71
Response IP 3.165.102.123
Found Yes
Hash cb8c9cc8938cbbaf2149d36a2ce5913903511a55501ceebd260fcb37189b3082
SimHash 5910c950e0d0
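
A content hash like the one above is typically used to tell whether the file changed between scans. The report does not say which algorithm produces the Hash field; its 64 hexadecimal characters are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. `robots_digest` is a hypothetical helper name, and the SimHash value is not reproduced here because its scheme is not described.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    name the algorithm it uses for its Hash field.
    """
    return hashlib.sha256(body).hexdigest()


# Two fetches with identical bodies yield the same digest, so a changed
# digest between scans signals that the file was edited.
previous = robots_digest(b"User-agent: *\nAllow: /\n")
current = robots_digest(b"User-agent: *\nDisallow: /\n")
changed = previous != current
```

Comparing digests only detects that something changed; a similarity hash such as SimHash is the usual choice when the size of the change matters.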

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Allow /
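
The effect of the groups above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch: it rebuilds an approximate robots.txt from the listed groups (the exact text, ordering, and casing of the live file are assumptions) and asks whether a crawler may fetch a path. `build_robots_txt` and `is_allowed` are hypothetical helper names.

```python
from urllib.robotparser import RobotFileParser

# User agents listed under "Groups" with the rule "Disallow /".
BLOCKED_AGENTS = [
    "gptbot", "google-extended", "ccbot", "claudebot", "anthropic-ai",
    "chatgpt-user", "perplexitybot", "applebot-extended", "youbot",
    "bytespider",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the scanned groups."""
    groups = [f"User-agent: {agent}\nDisallow: /\n" for agent in BLOCKED_AGENTS]
    # Catch-all group: every other crawler is allowed everywhere.
    groups.append("User-agent: *\nAllow: /\n")
    # Joining with a newline leaves a blank line between groups.
    return "\n".join(groups)


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the rebuilt rules let user_agent fetch path."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser.can_fetch(user_agent, path)
```

For example, `is_allowed("GPTBot", "/")` evaluates to `False`, while `is_allowed("Googlebot", "/news/")` falls through to the `*` group and evaluates to `True`. Agent matching in `urllib.robotparser` is case-insensitive.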

Other Records

Field Value
sitemap https://www.standard-democrat.com/sitemap.xml

Comments

  • Block specific AI crawlers
  • Allow all other crawlers
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
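
A warning like this one can be reproduced with a small check that compares each field name against a known set. The sketch below is an assumption about how such a check might work, not the scanner's actual logic. The known set (`user-agent`, `allow`, `disallow`, `sitemap`) is chosen for illustration, and the `Host` value in the sample is hypothetical because the report does not show the real one.

```python
# Illustrative set of recognized field names (an assumption).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognized field names in order of first appearance."""
    found: list[str] = []
    for raw in robots_txt.splitlines():
        # Drop comments, then skip lines without a "field: value" shape.
        line = raw.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical sample; the Host value is a placeholder.
sample = "User-agent: *\nAllow: /\n# Host\nHost: www.example.com\n"
flagged = unknown_fields(sample)
```

Parsers that follow the Robots Exclusion Protocol generally ignore fields they do not recognize, so an unknown field is usually harmless to crawling but has no effect.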