cleve.nl
robots.txt

Robots Exclusion Standard data for cleve.nl

Resource Scan

Scan Details

Site Domain	cleve.nl
Base Domain	cleve.nl
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-10-18T16:45:54+00:00
Next Scan	2025-11-17T16:45:54+00:00

Last Successful Scan

Scanned	2025-09-12T03:26:06+00:00
URL	https://cleve.nl/robots.txt
Domain IPs	212.125.139.3
Response IP	212.125.139.3
Found	Yes
Hash	699c0f158110c17ee7c880b0d40509f96cd10844ca713b5a7389748d74a1e8ab
SimHash	280048008ab0

Groups

*

Rule	Path
Allow	/wp-admin/admin-ajax.php
Allow	/wp-content/uploads/
Allow	/post-sitemap.xml
Allow	/page-sitemap.xml
Disallow	/wp-admin/
Disallow	/refer/
Disallow	/*.js$
Disallow	/*.css$
Disallow	/*.php$
Disallow	/*?p=*&
Disallow	/*?SID=
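The `*` group mixes Allow and Disallow rules whose patterns overlap, so the outcome for a given path depends on rule precedence. The sketch below is illustrative and not part of the scan output. It assumes the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie), with `*` as a wildcard and `$` as an end anchor. Individual crawlers may behave differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules of the "*" group, copied from the listing above.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/post-sitemap.xml"),
    ("allow", "/page-sitemap.xml"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/refer/"),
    ("disallow", "/*.js$"),
    ("disallow", "/*.css$"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?p=*&"),
    ("disallow", "/*?SID="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's beginning.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow
```

Under these assumptions, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than both `/wp-admin/` and `/*.php$`, while `is_allowed("/wp-admin/options.php")` is false.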

msie 10.0

Rule	Path
Disallow	/

censysinspect/1.1

Rule	Path
Disallow	/

mj12bot/v1.4.8

Rule	Path
Disallow

Other Records

Field	Value
sitemap	https://cleve.nl/sitemap_index.xml

Comments

Paths (no clean URLs)
