theuselesswebindex.com
robots.txt

Robots Exclusion Standard data for theuselesswebindex.com

Resource Scan

Scan Details

Site Domain theuselesswebindex.com
Base Domain theuselesswebindex.com
Scan Status Ok
Last Scan2026-01-20T04:48:03+00:00
Next Scan 2026-01-27T04:48:03+00:00

Last Scan

Scanned2026-01-20T04:48:03+00:00
URL https://theuselesswebindex.com/robots.txt
Redirect https://www.theuselesswebindex.com/robots.txt
Redirect Domain www.theuselesswebindex.com
Redirect Base theuselesswebindex.com
Domain IPs 85.13.150.230
Redirect IPs 85.13.150.230
Response IP 85.13.150.230
Found Yes
Hash 94b837286e8a38e219a26a1c2d602bf5c9adc6ea7567685aaa6a038d9055663a
SimHash e20c87049f93

Groups

*

Rule Path
Disallow /static/
Allow /static/layoutbilder/
Allow /static/websites/

emailcollector

Rule Path
Disallow /

gagarobot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

vscooter

Rule Path
Disallow /

roverbot*

Rule Path
Disallow /

mirago

Rule Path
Disallow /

psbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcapture*

Rule Path
Disallow /

websauger*

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

webwhacker*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

net attache*

Rule Path
Disallow /

webreaper*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

httrack*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.theuselesswebindex.com/sitemap.xml
sitemap https://www.theuselesswebindex.com/most-useless-websites.xml