happy-headlines.com
robots.txt

Robots Exclusion Standard data for happy-headlines.com

Resource Scan

Scan Details

Site Domain happy-headlines.com
Base Domain happy-headlines.com
Scan Status Ok
Last Scan2024-11-14T03:35:15+00:00
Next Scan 2024-11-21T03:35:15+00:00

Last Scan

Scanned2024-11-14T03:35:15+00:00
URL https://happy-headlines.com/robots.txt
Redirect https://www.happy-headlines.com/robots.txt
Redirect Domain www.happy-headlines.com
Redirect Base happy-headlines.com
Domain IPs 75.2.70.75, 99.83.190.102
Redirect IPs 52.197.0.54, 52.199.221.217, 54.178.223.218
Response IP 54.178.223.218
Found Yes
Hash eb7e9fb656b70f63ffe219512153516dbee24268f3d3dc89e4acf63ccfb4b7a9
SimHash 5554de15411a

Groups

*

Rule Path
Disallow /carousel/
Disallow /change-log/
Disallow /checkout/
Disallow /image-licensing/
Disallow /instructions-guidelines/
Disallow /order-confirmation/
Disallow /paypal-checkout/
Disallow /sign-in/
Disallow /sign-up/
Disallow /style-guide/
Disallow /product/
Disallow /places/
Disallow /traveltellers/

Other Records

Field Value
sitemap https://www.happy-headlines.com/sitemap.xml