hotsheet.com
robots.txt

Robots Exclusion Standard data for hotsheet.com

Resource Scan

Scan Details

Site Domain hotsheet.com
Base Domain hotsheet.com
Scan Status Ok
Last Scan 2024-11-16T11:40:13+00:00
Next Scan 2024-11-23T11:40:13+00:00

Last Scan

Scanned 2024-11-16T11:40:13+00:00
URL https://hotsheet.com/robots.txt
Redirect https://www.hotsheet.com/robots.txt
Redirect Domain www.hotsheet.com
Redirect Base hotsheet.com
Domain IPs 104.130.26.115
Redirect IPs 104.130.26.115
Response IP 104.130.26.115
Found Yes
Hash 4b4ff7211f6f2582669a27a5525c19e42fd4e3fb63e9ab64aabad6945dc768a2
SimHash fd0d95450590

Groups

*

Rule Path
Disallow /ads/
Disallow /cache/
Disallow /images/
Disallow /dmoz/
Disallow /label/
Disallow /link/
Disallow /local/
Disallow /wap/
Disallow /magpierss/
Disallow /index_22.php
Disallow /index_33.php
Disallow /add-spot.php
Disallow /add-video.php
Disallow /add-post.php
Disallow /news202*.php$
Disallow /cat*.php$
Disallow /bld*.php$
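
The last three rules use the wildcard characters `*` and `$`. As a rough illustration of how a crawler might evaluate the group above, here is a minimal sketch that assumes the common interpretation (as described in RFC 9309): each rule is matched from the start of the path, `*` matches any run of characters, and a trailing `$` anchors the match to the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and are not part of the scan data. Because this group has no Allow rules, the sketch does not need longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/ads/", "/cache/", "/images/", "/dmoz/", "/label/", "/link/",
    "/local/", "/wap/", "/magpierss/", "/index_22.php", "/index_33.php",
    "/add-spot.php", "/add-video.php", "/add-post.php",
    "/news202*.php$", "/cat*.php$", "/bld*.php$",
]


def rule_to_regex(rule):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: "*" matches any sequence of characters and a trailing
    "$" anchors the pattern to the end of the path.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path, rules=DISALLOW):
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/ads/banner.gif", "/cat12.php", "/cat12.php?x=1", "/index.php"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, plain rules such as `/ads/` block everything beneath that prefix, while the anchored rules such as `/cat*.php$` block only paths that end in `.php`, so the same path with a query string appended is not matched.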

Other Records

Field Value
sitemap https://www.hotsheet.com/sitemap.xml