tjerklangelaar.nl
robots.txt
Robots Exclusion Standard data for tjerklangelaar.nl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | tjerklangelaar.nl |
Base Domain | tjerklangelaar.nl |
Scan Status | Ok |
Last Scan | 2024-10-30T10:40:30+00:00 |
Next Scan | 2024-11-06T10:40:30+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-30T10:40:30+00:00 |
URL | https://tjerklangelaar.nl/robots.txt |
Domain IPs | 85.10.132.201 |
Response IP | 85.10.132.201 |
Found | Yes |
Hash | ee873eb5386defe99e3f1c434d3d790961f166f6f9f1aff1f8e212882a983eda |
SimHash | c04a5b854414 |
Groups
googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler
Rule | Path |
---|---|
Disallow | (empty) |
*
Rule | Path |
---|---|
Disallow | / |
Disallow | /.settings |
Disallow | /cgi-bin |
Disallow | /gallery |
Disallow | /images |
Disallow | /javascript |
Disallow | /modules |
Disallow | /php |
Disallow | /phpSitemapNG |
Disallow | /rss |
Disallow | /stylesheets |
Disallow | /tmp |
Other Records
Field | Value |
---|---|
sitemap | http://www.tjerklangelaar.nl/sitemap.xml |
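The groups and records above can be checked with a small sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from this scan report, not a copy of the live file, so its ordering, comments, and whitespace are assumptions. The helper `build_parser` and the agent name `examplebot` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's Groups and Other Records sections.
# Assumption: the live file may differ in ordering, comments, and whitespace.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: googlebot-image
User-agent: mediapartners-google
User-agent: msnbot
User-agent: msnbot-media
User-agent: slurp
User-agent: yahoo-blogs
User-agent: yahoo-mmcrawler
Disallow:

User-agent: *
Disallow: /
Disallow: /.settings
Disallow: /cgi-bin
Disallow: /gallery
Disallow: /images
Disallow: /javascript
Disallow: /modules
Disallow: /php
Disallow: /phpSitemapNG
Disallow: /rss
Disallow: /stylesheets
Disallow: /tmp

Sitemap: http://www.tjerklangelaar.nl/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://tjerklangelaar.nl"

# A named crawler falls in the first group, whose empty Disallow blocks nothing.
print(parser.can_fetch("googlebot", BASE + "/gallery"))

# Any other agent falls through to the "*" group, where "Disallow: /" blocks everything.
# "examplebot" is a made-up agent name.
print(parser.can_fetch("examplebot", BASE + "/"))

# Sitemap records are independent of groups (site_maps() needs Python 3.8+).
print(parser.site_maps())
```

Under this reconstruction, the eight named crawlers may fetch any path, while every other agent is blocked from the whole site. The more specific `Disallow` lines in the `*` group are redundant next to `Disallow: /`.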