hortweek.com
robots.txt

Robots Exclusion Standard data for hortweek.com

Resource Scan

Scan Details

Site Domain hortweek.com
Base Domain hortweek.com
Scan Status Ok
Last Scan2025-08-08T07:47:40+00:00
Next Scan 2025-09-07T07:47:40+00:00

Last Scan

Scanned2025-08-08T07:47:40+00:00
URL https://hortweek.com/robots.txt
Redirect https://www.hortweek.com/robots.txt
Redirect Domain www.hortweek.com
Redirect Base hortweek.com
Domain IPs 104.21.75.192, 172.67.180.226, 2606:4700:3033::ac43:b4e2, 2606:4700:3035::6815:4bc0
Redirect IPs 104.21.75.192, 172.67.180.226, 2606:4700:3033::ac43:b4e2, 2606:4700:3035::6815:4bc0
Response IP 104.21.75.192
Found Yes
Hash 0ce2b8684586d817f96042f0a06a9b2191923ccf5b301d24ea6678cf849e54fd
SimHash e81c5c4045b1

Groups

*

Rule Path
Disallow /search/
Disallow /download-latest-landscape-project-leads/landscape/article/1353252
Disallow /landscape-project-leads-archive/landscape/article/1353251
Disallow /login?
Disallow /rulesforcommenting/
Disallow /PAGE/*
Disallow /page/*
Disallow /register/?

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.hortweek.com/newsmap.xml
sitemap https://www.hortweek.com/sitemap.xml