hcamag.com
robots.txt

Robots Exclusion Standard data for hcamag.com

Resource Scan

Scan Details

Site Domain hcamag.com
Base Domain hcamag.com
Scan Status Ok
Last Scan2024-11-15T04:32:19+00:00
Next Scan 2024-11-22T04:32:19+00:00

Last Scan

Scanned2024-11-15T04:32:19+00:00
URL https://hcamag.com/robots.txt
Redirect https://www.hcamag.com/robots.txt
Redirect Domain www.hcamag.com
Redirect Base hcamag.com
Domain IPs 104.18.16.186, 104.18.17.186, 2606:4700::6812:10ba, 2606:4700::6812:11ba
Redirect IPs 104.18.16.186, 104.18.17.186, 2606:4700::6812:10ba, 2606:4700::6812:11ba
Response IP 104.18.16.186
Found Yes
Hash 35e88609da87249c18336ebd28272c1bc985fa8e3d0c188cc2e7b8890413774a
SimHash 61251427d7d1

Groups

twitterbot

Rule Path
Disallow

*

Rule Path
Disallow /au/business-news/
Disallow /us/business-news/
Disallow /ca/business-news/
Disallow /nz/business-news/
Disallow /asia/business-news/
Disallow /1042886/
Disallow /cdn-cgi/
Disallow *__hstc%3D

Other Records

Field Value
sitemap https://www.hcamag.com/sitemap.xml