plainchicken.com
robots.txt

Robots Exclusion Standard data for plainchicken.com

Resource Scan

Scan Details

Site Domain plainchicken.com
Base Domain plainchicken.com
Scan Status Ok
Last Scan2024-11-13T21:04:05+00:00
Next Scan 2024-11-20T21:04:05+00:00

Last Scan

Scanned2024-11-13T21:04:05+00:00
URL https://plainchicken.com/robots.txt
Domain IPs 104.21.42.212, 172.67.166.78, 2606:4700:3035::6815:2ad4, 2606:4700:3037::ac43:a64e
Response IP 104.21.42.212
Found Yes
Hash d2a9d099a8bb0b7d1d7cd16f25c7fac6c4a46f25d5b788a1cb9939b203ca39ac
SimHash 710cd940a5b2

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /?s=*
Disallow /page/*/?s=
Disallow /search/

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.plainchicken.com/sitemap_index.xml

Warnings

  • 1 invalid line.