artpub.com
robots.txt

Robots Exclusion Standard data for artpub.com

Resource Scan

Scan Details

Site Domain artpub.com
Base Domain artpub.com
Scan Status Ok
Last Scan 2025-10-04T01:46:34+00:00
Next Scan 2025-10-11T01:46:34+00:00

Last Scan

Scanned 2025-10-04T01:46:34+00:00
URL https://www.artpub.com/robots.txt
Redirect https://artpub.nl/robots.txt
Redirect Domain artpub.nl
Redirect Base artpub.nl
Domain IPs 141.105.127.148
Redirect IPs 141.105.127.148
Response IP 141.105.127.148
Found Yes
Hash 0fef3f65898c09f7f4e432d53c7598a118fd180ac9af13b4a5aff8098f6ca3f6
SimHash 41631c76f5df
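
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch, assuming it is SHA-256 over the raw response body, a freshly fetched copy of the file could be compared against the reported value as follows. The function names `body_digest` and `is_unchanged` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "0fef3f65898c09f7f4e432d53c7598a118fd180ac9af13b4a5aff8098f6ca3f6"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report hashes the raw, undecoded body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Return True if the body's digest equals the reported hash."""
    return body_digest(body) == reported.lower()
```

If the assumption does not hold (a different algorithm, or normalisation before hashing), the comparison will simply report a mismatch, so a mismatch alone does not prove the file changed.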

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow *groupon*
Disallow /zoeken?*
Disallow /helpdesk?*
Disallow */workshops?*
Disallow */brochure-aanvragen-groepsuitjes?*
Disallow *?category=*
Disallow /offerte?*
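
Several of the rules above rely on the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and every pattern is matched from the start of the path including the query string. The helper names and the example paths are hypothetical; real crawlers may differ in details. Because the group contains only Disallow rules, no Allow precedence is modelled.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/cpresources/",
    "/vendor/",
    "/.env",
    "*groupon*",
    "/zoeken?*",
    "/helpdesk?*",
    "*/workshops?*",
    "*/brochure-aanvragen-groepsuitjes?*",
    "*?category=*",
    "/offerte?*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*', a trailing '$' anchors the end, and all other
    characters (including '?') are treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, rules=DISALLOW):
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, giving prefix-style matching.
    return any(pattern_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, `/zoeken?q=schilderen` is blocked while the bare `/zoeken` page is not, since the rule requires a literal `?`. Likewise `*?category=*` only matches when `category` is the first query parameter; a path such as `/aanbod?page=1&category=x` is not matched by it.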

Other Records

Field Value
sitemap https://artpub.nl/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://artpub.nl/
  • live - don't allow web crawlers to index cpresources/ or vendor/