Robots Exclusion Standard data for s5h.net

Resource Scan

Scan Details

Site Domain s5h.net
Base Domain s5h.net
Scan Status Ok
Last Scan 2024-09-20T06:10:26+00:00
Next Scan 2024-09-27T06:10:26+00:00

Last Scan

Scanned 2024-09-20T06:10:26+00:00
URL https://s5h.net/robots.txt
Redirect http://www.usenix.org.uk/robots.txt
Redirect Domain www.usenix.org.uk
Redirect Base usenix.org.uk
Domain IPs 2001:ba8:1f1:f1cb::2, 85.119.82.99
Redirect IPs 2001:ba8:1f1:f1cb::2, 85.119.82.99
Response IP 85.119.82.99
Found Yes
Hash cfa3ccb1948f58f499763432e081391f16c4e0ef0f52de330efcbe461bc426c0
SimHash 985018e68510

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /royaljelly
Disallow /royaljelly*
Disallow /content/royaljelly
Disallow /content/royaljelly*
Disallow /testsite/royaljelly
Disallow /testsite/royaljelly*
Disallow /testsite/homepage*
Disallow /content/homepage*

googlebot

Rule Path
Disallow /royaljelly
Disallow /royaljelly*
Disallow /content/royaljelly
Disallow /content/royaljelly*
Disallow /testsite/royaljelly
Disallow /testsite/royaljelly*
Disallow /testsite/homepage*
Disallow /content/homepage*
Allow /

mediapartners-google

Rule Path
Allow /

Other Records

Field Value
sitemap http://www.usenix.org.uk/seo/sitemap.xml.gz
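To illustrate how the groups listed above might be evaluated, here is a minimal sketch using Python's standard-library urllib.robotparser. The robots.txt text in it is an assumed reconstruction of the googlebot and mediapartners-google groups and the sitemap record from this report; the raw file is not shown here, so its exact layout, field casing, and ordering are guesses. The helper name allowed and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of two groups and the sitemap record from the
# report above. The raw robots.txt is not part of the report, so this
# text is illustrative only.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow: /royaljelly
Disallow: /royaljelly*
Disallow: /content/royaljelly
Disallow: /content/royaljelly*
Disallow: /testsite/royaljelly
Disallow: /testsite/royaljelly*
Disallow: /testsite/homepage*
Disallow: /content/homepage*
Allow: /

User-agent: mediapartners-google
Allow: /

Sitemap: http://www.usenix.org.uk/seo/sitemap.xml.gz
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: ask the parser whether agent may fetch path."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    samples = [
        ("googlebot", "/royaljelly/page"),
        ("googlebot", "/content/royaljelly"),
        ("googlebot", "/index.html"),
        ("mediapartners-google", "/royaljelly"),
    ]
    for agent, path in samples:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

Two caveats on this sketch. The standard-library parser applies the first rule whose path is a literal prefix of the URL and does not appear to expand * wildcards, so the rules ending in * may behave differently here than in a wildcard-aware crawler. The report also lists two separate * groups; how duplicate groups are combined depends on the parser, so the sketch only queries the two named agents.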