Robots Exclusion Standard data for cleve.nl
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | cleve.nl |
| Base Domain | cleve.nl |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2025-10-18T16:45:54+00:00 |
| Next Scan | 2025-11-17T16:45:54+00:00 |
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2025-09-12T03:26:06+00:00 |
| URL | https://cleve.nl/robots.txt |
| Domain IPs | 212.125.139.3 |
| Response IP | 212.125.139.3 |
| Found | Yes |
| Hash | 699c0f158110c17ee7c880b0d40509f96cd10844ca713b5a7389748d74a1e8ab |
| SimHash | 280048008ab0 |
Groups
*
| Rule | Path |
|---|---|
| Allow | /wp-admin/admin-ajax.php | 
| Allow | /wp-content/uploads/ | 
| Allow | /post-sitemap.xml | 
| Allow | /page-sitemap.xml | 
| Disallow | /wp-admin/ | 
| Disallow | /refer/ | 
| Disallow | /*.js$ | 
| Disallow | /*.css$ | 
| Disallow | /*.php$ | 
| Disallow | /*?p=*& | 
| Disallow | /*?SID= | 
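
The rules above mix plain path prefixes with `*` wildcards and `$` end anchors, so their combined effect is not obvious at a glance. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data; real crawlers may differ in details such as percent-encoding normalization, which is omitted here.

```python
import re

# Rules copied from the Groups table for user-agent "*".
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/post-sitemap.xml"),
    ("allow", "/page-sitemap.xml"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/refer/"),
    ("disallow", "/*.js$"),
    ("disallow", "/*.css$"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?p=*&"),
    ("disallow", "/*?SID="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/wp-admin/admin-ajax.php",
                   "/wp-admin/options.php",
                   "/wp-content/uploads/script.js",
                   "/?p=12&preview=true"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because its Allow rule is longer than both `/wp-admin/` and `/*.php$`, while a `.js` file under `/wp-content/uploads/` is allowed because the uploads prefix is longer than `/*.js$`.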
Other Records
| Field | Value | 
|---|---|
| sitemap | https://cleve.nl/sitemap_index.xml | 