webwereld.nl
robots.txt
Robots Exclusion Standard data for webwereld.nl
Resource Scan
Scan Details
Site Domain | webwereld.nl |
Base Domain | webwereld.nl |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a server error. |
Last Scan | 2024-03-26T14:17:47+00:00 |
Next Scan | 2024-06-24T14:17:47+00:00 |
Last Successful Scan
Scanned | 2023-03-03T09:22:29+00:00 |
URL | https://webwereld.nl/robots.txt |
Domain IPs | 104.26.14.225, 104.26.15.225, 172.67.71.106, 2606:4700:20::681a:ee1, 2606:4700:20::681a:fe1, 2606:4700:20::ac43:476a |
Response IP | 104.26.14.225 |
Found | Yes |
Hash | 0715a8f93a4aa19887d447f77e996f23aa7bea618d6ecd7d63ba1d492474a13c |
SimHash | e80a5c628714 |
Groups
*
Rule | Path |
---|---|
Disallow | /resources/ |
Disallow | /config/ |
Disallow | /handlers/ |
Disallow | /includes/ |
Disallow | /interceptors/ |
Disallow | /layouts/ |
Disallow | /logs/ |
Disallow | /model/ |
Disallow | /search/ |
Disallow | /plugins/ |
Disallow | /fusionreactor/ |
Disallow | /cp.html |
Disallow | /widget/ |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
Other Records
Field | Value |
---|---|
sitemap | https://webwereld.nl/sitemap-index.xml |
sitemap | https://webwereld.nl/sitemap.xml |