publiq.be
robots.txt

Robots Exclusion Standard data for publiq.be

Resource Scan

Scan Details

Site Domain	publiq.be
Base Domain	publiq.be
Scan Status	Ok
Last Scan	2025-10-26T01:13:22+00:00
Next Scan	2025-11-02T01:13:22+00:00

Last Scan

Scanned	2025-10-26T01:13:22+00:00
URL	https://publiq.be/robots.txt
Redirect	https://www.publiq.be/robots.txt
Redirect Domain	www.publiq.be
Redirect Base	publiq.be
Domain IPs	5.134.4.28
Redirect IPs	5.134.4.28
Response IP	5.134.4.28
Found	Yes
Hash	63df641c7769ec4b2e09ec1e86ba5eeb5c4b990801ab24004ba83c2c4813d560
SimHash	211c9922cf36

Groups

*

Rule	Path
Disallow	/cpresources/
Disallow	/vendor/
Disallow	/.env
Disallow	/cache/

gptbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/
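
The groups above can be checked programmatically. The following is a minimal sketch, assuming the live file corresponds to the reconstruction below: the directive order, spacing, and placement of the sitemap line are assumptions pieced together from the scan tables, not a copy of the file itself. The helper names `build_parser` and `is_allowed` and the agent name `ExampleBot` are hypothetical, introduced only for illustration. It uses Python's standard `urllib.robotparser` module.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's Groups and Other Records tables.
# Assumption: the real file may order or space these lines differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: perplexitybot
Disallow: /

Sitemap: https://www.publiq.be/nl/sitemaps-1-sitemap.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text offline, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the reconstructed rules."""
    return build_parser().can_fetch(agent, url)


if __name__ == "__main__":
    # Hypothetical agent with no dedicated group: falls back to the "*" group.
    print(is_allowed("ExampleBot", "https://www.publiq.be/nl/"))
    print(is_allowed("ExampleBot", "https://www.publiq.be/vendor/autoload.php"))
    # Agents with their own group are blocked from the whole site.
    print(is_allowed("GPTBot", "https://www.publiq.be/nl/"))
```

User-agent matching in `urllib.robotparser` is case-insensitive, so the lowercase names shown in the scan also cover agents that identify themselves as `GPTBot` or `PerplexityBot`.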

Other Records

Field	Value
sitemap	https://www.publiq.be/nl/sitemaps-1-sitemap.xml

Comments

robots.txt for https://www.publiq.be/
live - don't allow web crawlers to index cpresources/ or vendor/
Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
Disallow Perplexity bot, as there's no benefit to allowing it to index your site
