Robots Exclusion Standard data for silesia24.pl

Resource Scan

Scan Details

Site Domain silesia24.pl
Base Domain silesia24.pl
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-09-17T04:08:53+00:00
Next Scan 2024-12-16T04:08:53+00:00

Last Successful Scan

Scanned 2024-06-21T19:47:15+00:00
URL https://silesia24.pl/robots.txt
Domain IPs 94.154.117.227
Response IP 94.154.117.227
Found Yes
Hash 10bee4230a56e11ec852a7b79776f35f4c47e9e30b6277ca96fa11e850e8881c
SimHash 61541d2a479a

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /*singleDay%3D*
Disallow /*startDate%3D*
Disallow /*szukaj?q=*
Disallow /*?tx_news_pi1*
Disallow /*dns%3D*
Disallow /*mode%3D*
Disallow /*author%3D*
Disallow /*?amp*
Disallow /*type%3D*
Disallow /*?mdrv=*
Disallow /*?cHash=*
Disallow /*?function=*
Disallow /*tx_mdnewsauthor*
Disallow /*wyniki-wyszukiwania*
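
Many of the paths above use the `*` wildcard. As a rough illustration of how such a pattern can be tested against a URL path, here is a minimal Python sketch. It assumes the common convention that `*` matches any run of characters, that a trailing `$` anchors the end, and that a rule otherwise matches as a prefix. The function name `rule_matches` and the sample URLs are hypothetical, and this is not the matcher used by any particular crawler.

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path (query included).

    Assumptions: '*' matches any sequence of characters, a trailing '$'
    anchors the end of the path, and everything else is compared literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start, so an unanchored rule acts as a prefix match.
    return re.match(regex, path, flags=re.DOTALL) is not None

# Hypothetical paths checked against rules from the table above.
print(rule_matches("/cpresources/", "/cpresources/app.js"))
print(rule_matches("/*szukaj?q=*", "/szukaj?q=katowice"))
print(rule_matches("/cache/", "/wiadomosci/cache/"))
```

Several rules contain `%3D`, the percent-encoded form of `=`. The sketch compares strings literally, so `/*singleDay%3D*` only matches a URL in which the `=` actually appears encoded. A real crawler may normalize percent-encoding before comparing, so its behavior can differ.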

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /
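
The file defines one general group (`*`) and three groups for named crawlers. A hedged Python sketch of how a crawler might pick the group that applies to it follows. It assumes the user-agent product token is compared case-insensitively and that `*` is the fallback when no named group matches. The `GROUPS` dictionary and `select_group` are illustrative names, and the `*` entry is abbreviated to the first four rules from the scan.

```python
# Groups from the scan; the '*' list is abbreviated for illustration.
GROUPS = {
    "*": ["/cpresources/", "/vendor/", "/.env", "/cache/"],
    "gptbot": ["/"],
    "google-extended": ["/"],
    "perplexitybot": ["/"],
}

def select_group(product_token: str, groups: dict = GROUPS) -> list:
    """Return the Disallow paths for the group that applies to a crawler.

    Assumption: a named group wins over '*', and tokens are compared
    case-insensitively. An unknown crawler falls back to the '*' group.
    """
    token = product_token.lower()
    return groups.get(token, groups.get("*", []))

print(select_group("GPTBot"))     # named group: everything is disallowed
print(select_group("Googlebot"))  # no named group: falls back to '*'
```

Under these assumptions, a crawler identifying as GPTBot, Google-Extended, or PerplexityBot is barred from the whole site, while every other crawler is only subject to the path rules in the `*` group.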

Other Records

Field Value
sitemap https://silesia24.pl/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://silesia24.pl/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
  • Disallow Perplexity bot, as there's no benefit to allowing it to index your site