horlogetime.com
robots.txt

Robots Exclusion Standard data for horlogetime.com

Resource Scan

Scan Details

Site Domain horlogetime.com
Base Domain horlogetime.com
Scan Status Ok
Last Scan 2025-05-21T16:24:41+00:00
Next Scan 2025-05-28T16:24:41+00:00

Last Scan

Scanned 2025-05-21T16:24:41+00:00
URL https://horlogetime.com/robots.txt
Redirect https://novelspot.net/robots.txt
Redirect Domain novelspot.net
Redirect Base novelspot.net
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Redirect IPs 104.18.10.170, 104.18.11.170, 2606:4700::6812:aaa, 2606:4700::6812:baa
Response IP 104.18.11.170
Found Yes
Hash 0053e05738754000de6dc7e59d486744ffc1b6b783dcf120ee69c0b89cb7e100
SimHash be4b78464911

Groups

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
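
Example: checking the groups above

The groups listed above can be exercised with a short sketch using Python's standard urllib.robotparser. This is an illustration under stated assumptions, not the site's actual file: the robots.txt text is rebuilt from the lowercase group names and rules shown in this report (the original casing, comments, and line order are unknown), and "examplebot" is a hypothetical crawler name used only for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Group names copied from the report above (lowercased as shown there).
BLOCKED_AGENTS = [
    "teleport", "teleportpro", "emailcollector", "emailsiphon",
    "webbandit", "webzip", "webreaper", "webstripper",
    "web downloader", "webcopier", "offline explorer pro",
    "offline explorer", "httrack website copier", "offline commander",
    "leech", "websnake", "blackwidow", "http weazel",
]


def build_robots_txt(blocked_agents):
    """Rebuild an approximate robots.txt from the reported groups.

    Assumption: one group per agent with a single 'Disallow: /' rule,
    followed by the catch-all '*' group from the report.
    """
    lines = []
    for agent in blocked_agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines += [
        "User-agent: *",
        "Disallow: /wp-admin/",
        "Disallow: /wp-includes/",
    ]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS).splitlines())

# A listed offline downloader is refused everywhere on the site.
print(parser.can_fetch("webzip", "https://horlogetime.com/"))

# Any other crawler falls through to the '*' group (hypothetical name).
print(parser.can_fetch("examplebot", "https://horlogetime.com/wp-admin/"))
print(parser.can_fetch("examplebot", "https://horlogetime.com/some-page"))
```

Note that robots.txt is advisory: the rules only affect crawlers that choose to read and honor them. The standard-library parser matches user-agent names by case-insensitive substring, so other parsers may resolve overlapping names such as teleport and teleportpro differently.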