geminiworktops.com
robots.txt

Robots Exclusion Standard data for geminiworktops.com

Resource Scan

Scan Details

Site Domain geminiworktops.com
Base Domain geminiworktops.com
Scan Status Ok
Last Scan2026-03-31T07:59:08+00:00
Next Scan 2026-04-30T07:59:08+00:00

Last Scan

Scanned2026-03-31T07:59:08+00:00
URL https://geminiworktops.com/robots.txt
Redirect https://www.geminiworktops.com/robots.txt
Redirect Domain www.geminiworktops.com
Redirect Base geminiworktops.com
Domain IPs 172.66.40.123, 172.66.43.133, 2606:4700:3108::ac42:287b, 2606:4700:3108::ac42:2b85
Redirect IPs 172.66.40.123, 172.66.43.133, 2606:4700:3108::ac42:287b, 2606:4700:3108::ac42:2b85
Response IP 172.66.43.133
Found Yes
Hash 57266eac822859d8d4929249b278eaaf2c0394d15236d5742a467b1c447d2e48
SimHash 233c1912a493

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.geminiworktops.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.geminiworktops.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • AI Crawlers - Explicitly Welcome