tjerklangelaar.nl
robots.txt

Robots Exclusion Standard data for tjerklangelaar.nl

Resource Scan

Scan Details

Site Domain tjerklangelaar.nl
Base Domain tjerklangelaar.nl
Scan Status Ok
Last Scan2024-09-25T09:38:39+00:00
Next Scan 2024-10-02T09:38:39+00:00

Last Scan

Scanned2024-09-25T09:38:39+00:00
URL https://tjerklangelaar.nl/robots.txt
Domain IPs 85.10.132.201
Response IP 85.10.132.201
Found Yes
Hash ee873eb5386defe99e3f1c434d3d790961f166f6f9f1aff1f8e212882a983eda
SimHash c04a5b854414

Groups

googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler

Rule Path
Disallow

*

Rule Path
Disallow /
Disallow /.settings
Disallow /cgi-bin
Disallow /gallery
Disallow /images
Disallow /javascript
Disallow /modules
Disallow /php
Disallow /phpSitemapNG
Disallow /rss
Disallow /stylesheets
Disallow /tmp

Other Records

Field Value
sitemap http://www.tjerklangelaar.nl/sitemap.xml