whtop.com
robots.txt

Robots Exclusion Standard data for whtop.com

Resource Scan

Scan Details

Site Domain whtop.com
Base Domain whtop.com
Scan Status Ok
Last Scan 2025-04-16T02:49:04+00:00
Next Scan 2025-04-23T02:49:04+00:00

Last Scan

Scanned 2025-04-16T02:49:04+00:00
URL https://whtop.com/robots.txt
Domain IPs 104.21.40.7, 172.67.173.118
Response IP 172.67.173.118
Found Yes
Hash 124a50a2b592f4d211ac576ae94de1bfdbdce578734aa3f56c39b31b0e744873
SimHash c104947344e2

Groups

*

Rule Path
Disallow /photos2.acenter/*
Disallow /photos2.aright/*
Disallow /message.err403
Disallow /*/message.err403
Disallow /message.err400
Disallow /*/message.err400
Disallow /utils.votes/*
Disallow /*/utils.votes/*
Disallow /rss.*
Disallow /directory/search/*
Disallow /news/search/*
Disallow /*/directory/search/*
Disallow /*/news/search/*
Disallow /reviews.add/*
Disallow /*/reviews.add/*
Disallow /*/*.htm
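
To illustrate how the wildcard rules in this group might be evaluated, here is a minimal sketch. It assumes the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and rules otherwise match as prefixes. The function names `pattern_to_regex` and `is_disallowed` are hypothetical and are not part of the scan tool. Because the group contains no Allow rules, any matching Disallow rule is treated as blocking the path.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/photos2.acenter/*",
    "/photos2.aright/*",
    "/message.err403",
    "/*/message.err403",
    "/message.err400",
    "/*/message.err400",
    "/utils.votes/*",
    "/*/utils.votes/*",
    "/rss.*",
    "/directory/search/*",
    "/news/search/*",
    "/*/directory/search/*",
    "/*/news/search/*",
    "/reviews.add/*",
    "/*/reviews.add/*",
    "/*/*.htm",
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    everything else is matched literally.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path, rules=DISALLOW):
    """Return True if any Disallow rule matches url_path from its start."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(rule).match(url_path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/rss.xml", "/en/reviews.add/123", "/en/page.htm",
                      "/page.htm", "/directory/"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/*/*.htm` only blocks paths with at least two slash-separated segments, so a top-level `/page.htm` is not matched by it, while `/en/page.htm` is.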

Other Records

Field Value
sitemap https://www.whtop.com/sitemap.xml

Warnings

  • 1 invalid line.