skinn.be
robots.txt

Robots Exclusion Standard data for skinn.be

Resource Scan

Scan Details

Site Domain skinn.be
Base Domain skinn.be
Scan Status Ok
Last Scan2025-10-14T11:03:14+00:00
Next Scan 2025-10-21T11:03:14+00:00

Last Scan

Scanned2025-10-14T11:03:14+00:00
URL https://skinn.be/robots.txt
Redirect https://www.skinn.agency/robots.txt
Redirect Domain www.skinn.agency
Redirect Base skinn.agency
Domain IPs 176.62.167.177, 2a00:1c98:1000:1031:0:2:5a51:dab4
Redirect IPs 188.208.37.82, 2a00:1c98:1000:12c1:0:3:e3a0:29b1
Response IP 188.208.37.82
Found Yes
Hash 07584948043e0d90453ccb426fed5627c5d537983b404965256c74b9db475316
SimHash 6194992a4f3e

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.skinn.agency/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.skinn.agency/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
  • Disallow Perplexity bot, as there's no benefit to allowing it to index your site