manh.com
robots.txt
Robots Exclusion Standard data for manh.com
Resource Scan
Scan Details
Site Domain | manh.com |
Base Domain | manh.com |
Scan Status | Ok |
Last Scan | 2024-04-23T22:42:07+00:00 |
Next Scan | 2024-05-23T22:42:07+00:00 |
Last Scan
Scanned | 2024-04-23T22:42:07+00:00 |
URL | https://manh.com/robots.txt |
Redirect | https://www.manh.com/robots.txt |
Redirect Domain | www.manh.com |
Redirect Base | manh.com |
Domain IPs | 217.114.94.2 |
Redirect IPs | 104.18.41.115, 172.64.146.141, 2606:4700:4400::6812:2973, 2606:4700:4400::ac40:928d |
Response IP | 104.18.41.115 |
Found | Yes |
Hash | c163707af2c0be21b6d749f7e2ab72634e27eebeed37107d20dc9e34d55a14c3 |
SimHash | 80004ac1c310 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search?query=* |
Disallow | /*?ite= |
Disallow | /CMP/ |
Disallow | /staging/ |
Disallow | /staging |
Disallow | /Recycle-Bin/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.manh.com/sitemap.xml |